1 | static WithChoices<LS> greedySplitIntoWordsCI_withChoices(S s, Cl<S> words) {
|
2 | TreeSet<S> wordsSet = asCISet(words); |
3 | int i = 0, last = 0; |
4 | new LS out; |
5 | while (i < l(s)) {
|
6 | int longest = l(longestPrefixInCISet(substring(s, i), wordsSet)); |
7 | for (int j = l(s); j > i; j--) |
8 | if (contains(wordsSet, substring(s, i, j))) {
|
9 | S word = prefixesInCISet(, wordsSet); |
10 | if (nempty(word)) {
|
11 | addIfNempty(out, substring(s, last, i)); |
12 | out.add(substring(s, i, i+l(word)); |
13 | last = i = i+l(word); |
14 | } else ++i; |
15 | } |
16 | addIfNempty(out, substring(s, last, i)); |
17 | ret out; |
18 | } |
Began life as a copy of #1028147
download show line numbers debug dex old transpilations
Travelled to 7 computer(s): bhatertpkbcr, mqqgnosmbjvj, pyentgdyhuwx, pzhvpgtvlbxg, tvejysmllsmz, vouqrxazstgt, xrpafgyirdlv
No comments. add comment
| Snippet ID: | #1028149 |
| Snippet name: | greedySplitIntoWordsCI_withChoices [dev.] |
| Eternal ID of this version: | #1028149/1 |
| Text MD5: | e9e13080f56c1700c4766d807c17a270 |
| Author: | stefan |
| Category: | javax / stefan's os / nlp |
| Type: | JavaX fragment (include) |
| Public (visible to everyone): | Yes |
| Archived (hidden from active list): | No |
| Created/modified: | 2020-05-23 16:42:44 |
| Source code size: | 612 bytes / 18 lines |
| Pitched / IR pitched: | No / No |
| Views / Downloads: | 476 / 455 |
| Referenced in: | [show references] |