Download Jar. Libraryless. Click here for Pure Java version (5449L/37K).
!7 p { S query = "gramophone"; S userAgent = "Mozilla/5.0 (Windows NT 6.1; Win64; x64; rv:53.0) Gecko/20100101 Firefox/53.0"; S html = loadPageWithUserAgent("http://google.com/search?q=" + urlencode(query) + "&lr=lang_en&hl=en", userAgent); S url = first(loadPage_responseHeaders->get("Location")); if (url != null) html = loadPageWithUserAgent(url, userAgent); //print(html); pnlStruct(loadPage_responseHeaders!); // Every h3 is a search result L<S> htmlTok = htmlTok(html); LL<S> h3s = findContainerTagDeep(htmlTok, "h3"); pnlStruct(h3s); for (L<S> tok : h3s) { L<S> linkTok = first(findContainerTag(tok, "a")); if (empty(linkTok)) continue; S link = tagGet(second(linkTok), "href"); S text = join(dropTags(contentsOfContainerTag(linkTok))); L<S> sub = subList(htmlTok, magicIndexOfSubList(htmlTok, tok)+l(tok)-1); S desc = trim(htmldecode(dropTags(join(first(findContainerTagWithParams(sub, "span", "class" := "st")))))); print("Link: " + link); print(" Text: " + text); print(" Desc: " + desc); } }
download show line numbers debug dex old transpilations
Travelled to 13 computer(s): aoiabmzegqzx, bhatertpkbcr, cbybwowwnfue, cfunsshuasjs, gwrvuhgaqvyk, ishqpsrjomds, lpdgvwnxivlt, mqqgnosmbjvj, pyentgdyhuwx, pzhvpgtvlbxg, tslmcundralx, tvejysmllsmz, vouqrxazstgt
No comments. add comment
| Snippet ID: | #1011236 |
| Snippet name: | Scrape Google Spike [OK] |
| Eternal ID of this version: | #1011236/19 |
| Text MD5: | a9c63f65d196a19c0e501e4037e3a861 |
| Transpilation MD5: | d9873086d1d0da7681f357242c2010b2 |
| Author: | stefan |
| Category: | javax / networking |
| Type: | JavaX source code (desktop) |
| Public (visible to everyone): | Yes |
| Archived (hidden from active list): | No |
| Created/modified: | 2017-11-14 09:06:41 |
| Source code size: | 1113 bytes / 30 lines |
| Pitched / IR pitched: | No / No |
| Views / Downloads: | 812 / 1780 |
| Version history: | 18 change(s) |
| Referenced in: | #1011241 - quickGoogle - returns pairs of (link, text) |