1 | // tok must come from htmlcoarsetok |
2 | // returns all container tags found (including content) as CNC |
3 | // honors nested tags correctly |
4 | static LL<S> findContainerOrEmptyTag(L<S> tok, S tag) {
|
5 | new LL<S> l; |
6 | for (int i = 1; i < l(tok); i += 2) |
7 | if (isEmptyTag(tok.get(i), tag)) |
8 | l.add(tok.subList(i-1, i+1)); |
9 | else if (isOpeningTag(tok.get(i), tag)) {
|
10 | int j, level = 1; |
11 | for (j = i+2; j < tok.size(); j += 2) |
12 | if (isOpeningTag(tok.get(j), tag)) |
13 | ++level; |
14 | else if (isTag(tok.get(j), "/" + tag)) {
|
15 | --level; |
16 | if (level == 0) {
|
17 | l.add(subList(tok, i-1, j+2)); // actual CNC |
18 | break; |
19 | } |
20 | } |
21 | i = j; |
22 | } |
23 | ret l; |
24 | } |
25 | |
26 | static LL<S> findContainerOrEmptyTag(S html, S tag) {
|
27 | ret findContainerOrEmptyTag(htmlTok(html), tag); |
28 | } |
Began life as a copy of #1002309
download show line numbers debug dex old transpilations
Travelled to 14 computer(s): aoiabmzegqzx, bhatertpkbcr, cbybwowwnfue, cfunsshuasjs, ddnzoavkxhuk, gwrvuhgaqvyk, ishqpsrjomds, lpdgvwnxivlt, mqqgnosmbjvj, pyentgdyhuwx, pzhvpgtvlbxg, tslmcundralx, tvejysmllsmz, vouqrxazstgt
No comments. add comment
| Snippet ID: | #1005097 |
| Snippet name: | findContainerOrEmptyTag - HTML/XML parsing |
| Eternal ID of this version: | #1005097/3 |
| Text MD5: | 176a163bf36f402b2db1dbd44cbf8ae4 |
| Author: | stefan |
| Category: | javax |
| Type: | JavaX fragment (include) |
| Public (visible to everyone): | Yes |
| Archived (hidden from active list): | No |
| Created/modified: | 2021-07-24 03:13:06 |
| Source code size: | 847 bytes / 28 lines |
| Pitched / IR pitched: | No / No |
| Views / Downloads: | 820 / 819 |
| Version history: | 2 change(s) |
| Referenced in: | [show references] |