// TODO: process CDATA, scripts
static List<String> htmlcoarsetok(String s) {
  List<String> tok = new ArrayList<String>();
  int l = s == null ? 0 : s.length();

  int i = 0;
  while (i < l) {
    int j = i;
    char c;

    // scan for non-tags
    while (j < l) {
      if (s.charAt(j) != '<')
        // regular character
        ++j;
      else if (s.substring(j, Math.min(j+4, l)).equals("<!--")) {
        // HTML comment
        j = j+4;
        do
          ++j;
        while (j < l && !s.substring(j, Math.min(j+3, l)).equals("-->"));
        j = Math.min(j+3, l);
      } else
        // it's a tag
        break;
    }

    tok.add(s.substring(i, j));
    i = j;
    if (i >= l) break;
    c = s.charAt(i);

    // scan for tags
    if (c == '<') {
      ++j;
      while (j < l && s.charAt(j) != '>') ++j; // TODO: strings?
      if (j < l) ++j;
    }

    tok.add(s.substring(i, j));
    i = j;
  }

  if ((tok.size() % 2) == 0) tok.add("");
  return tok;
}
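To illustrate the token layout the function produces, here is a minimal sketch that wraps an unchanged copy of it in a plain Java class. The class name `HtmlCoarseTokDemo` and the sample inputs are assumptions made for this example; they are not part of the snippet. The result always has an odd number of entries: even indices hold text (possibly empty, with HTML comments kept inside the text), odd indices hold tags.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical wrapper class, used only for this illustration.
class HtmlCoarseTokDemo {

    // Copy of the snippet's function, unchanged in behavior.
    static List<String> htmlcoarsetok(String s) {
        List<String> tok = new ArrayList<String>();
        int l = s == null ? 0 : s.length();

        int i = 0;
        while (i < l) {
            int j = i;

            // scan for non-tags (plain text and HTML comments)
            while (j < l) {
                if (s.charAt(j) != '<') {
                    ++j; // regular character
                } else if (s.substring(j, Math.min(j + 4, l)).equals("<!--")) {
                    // HTML comment: skip ahead to the closing "-->"
                    j = j + 4;
                    do {
                        ++j;
                    } while (j < l && !s.substring(j, Math.min(j + 3, l)).equals("-->"));
                    j = Math.min(j + 3, l);
                } else {
                    break; // it's a tag
                }
            }

            tok.add(s.substring(i, j));
            i = j;
            if (i >= l) break;

            // scan for the tag, up to and including '>'
            if (s.charAt(i) == '<') {
                ++j;
                while (j < l && s.charAt(j) != '>') ++j;
                if (j < l) ++j;
            }

            tok.add(s.substring(i, j));
            i = j;
        }

        // pad so the list always ends with a text token
        if ((tok.size() % 2) == 0) tok.add("");
        return tok;
    }

    public static void main(String[] args) {
        // Tokens: "Hello ", "<b>", "world", "</b>", "!"
        System.out.println(htmlcoarsetok("Hello <b>world</b>!"));
        // Tokens: "", "<p>", "x", "</p>", "" (empty text at both ends)
        System.out.println(htmlcoarsetok("<p>x</p>"));
        // Tokens: "a<!-- c -->b", "<i>", "" (the comment stays in the text token)
        System.out.println(htmlcoarsetok("a<!-- c -->b<i>"));
    }
}
```

One behavior worth knowing when using it: the comment scan advances one character before it first checks for `-->`, so an empty comment such as `<!---->` is not closed at that point and the scan continues to the next `-->` or to the end of the input.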
Began life as a copy of #1000670
Travelled to 4 computer(s): bhatertpkbcr, mqqgnosmbjvj, pyentgdyhuwx, vouqrxazstgt
Snippet ID: #1030282
Snippet name: htmlcoarsetok [backup]
Eternal ID of this version: #1030282/1
Text MD5: 48de0706a35ab5a5353b803994f8d124
Author: stefan
Category: (none)
Type: JavaX fragment (include)
Public (visible to everyone): Yes
Archived (hidden from active list): No
Created/modified: 2020-11-25 14:43:59
Source code size: 1013 bytes / 45 lines
Pitched / IR pitched: No / No
Views / Downloads: 207 / 227