Not logged in.  Login/Logout/Register | List snippets | | Create snippet | Upload image | Upload data

26
LINES

< > BotCompany Repo | #1002309 - findContainerTag - HTML/XML parsing, fixed tag name

JavaX fragment (include)

// tok must come from htmlTok
// returns all container tags found (including content) as CNC
// should be OK for both HTML and XML
static LL<S> findContainerTag(L<S> tok, S tag) {
  new LL<S> l;
  for (int i = 1; i < l(tok); i += 2)
    if (isOpeningTag(tok.get(i), tag)) {
      int j, level = 1;
      for (j = i+2; j < tok.size(); j += 2)
        if (isOpeningTag(tok.get(j), tag))
          ++level;
        else if (isTag(tok.get(j), "/" + tag)) {
          --level;
          if (level == 0) {
            l.add(tok.subList(i-1, j+2)); // actual CNC
            break;
          }
        }
      i = j;
    }
  ret l;
}

static LL<S> findContainerTag(S html, S tag) {
  ret findContainerTag(htmlTok(html), tag);
}

download  show line numbers  debug dex   

Travelled to 11 computer(s): aoiabmzegqzx, bhatertpkbcr, cbybwowwnfue, cfunsshuasjs, gwrvuhgaqvyk, ishqpsrjomds, lpdgvwnxivlt, mqqgnosmbjvj, pzhvpgtvlbxg, tslmcundralx, tvejysmllsmz

No comments. add comment

Snippet ID: #1002309
Snippet name: findContainerTag - HTML/XML parsing, fixed tag name
Eternal ID of this version: #1002309/4
Text MD5: e9fb0e70c32881e56e083e92fe7c1ce4
Author: stefan
Category: javax
Type: JavaX fragment (include)
Public (visible to everyone): Yes
Archived (hidden from active list): No
Created/modified: 2017-10-19 19:46:06
Source code size: 747 bytes / 26 lines
Pitched / IR pitched: No / No
Views / Downloads: 513 / 633
Version history: 3 change(s)
Referenced in: [show references]

Formerly at http://tinybrain.de/1002309 & http://1002309.tinybrain.de