Skip to content

Commit

Permalink
do the thing
Browse files Browse the repository at this point in the history
  • Loading branch information
mi2ebi committed Dec 6, 2023
1 parent 8665d1a commit 05bce59
Show file tree
Hide file tree
Showing 2 changed files with 7 additions and 1 deletion.
6 changes: 6 additions & 0 deletions min/data-errors.txt
Original file line number Diff line number Diff line change
Expand Up @@ -66,6 +66,7 @@ benre: Unrecognized part of speech
benzo: Unrecognized part of speech
be'oi: Unrecognized part of speech
bergu: Unrecognized part of speech
berlìn: Not a word
besto: Unrecognized part of speech
betka: Unrecognized part of speech
be'ucu'i: Unrecognized part of speech
Expand Down Expand Up @@ -199,6 +200,7 @@ ci'o'au: Unrecognized part of speech
ci'oi: Unrecognized part of speech
ciroi: Unrecognized part of speech
cisni: Unrecognized part of speech
ciztolnei: Not a word
ciztolnei: Unrecognized part of speech
clafi: Unrecognized part of speech
claxu bu: Unrecognized part of speech
Expand Down Expand Up @@ -473,6 +475,7 @@ gau'i'o: Unrecognized part of speech
gau'o: Unrecognized part of speech
ga'u re'o: Unrecognized part of speech
ga'u re'o: Bad selma'o or rafsi list
gaurkihu: Not a word
ge'ai: Unrecognized part of speech
ge'ei: Unrecognized part of speech
gei'ai: Unrecognized part of speech
Expand Down Expand Up @@ -907,6 +910,7 @@ mapra: Unrecognized part of speech
maska: Unrecognized part of speech
matce: Unrecognized part of speech
matxe: Unrecognized part of speech
ma;u: Not a word
ma;u: Unrecognized part of speech
mau'a: Unrecognized part of speech
mau'au: Unrecognized part of speech
Expand Down Expand Up @@ -1079,6 +1083,7 @@ nu bu: Unrecognized part of speech
nudle: Unrecognized part of speech
nu'ecu'i: Unrecognized part of speech
nu'enai: Unrecognized part of speech
nuhempcr: Not a word
nuktu: Unrecognized part of speech
nulpa: Unrecognized part of speech
numje: Unrecognized part of speech
Expand Down Expand Up @@ -1846,6 +1851,7 @@ vu'oi: Unrecognized part of speech
vy.bu: Unrecognized part of speech
vybu: Unrecognized part of speech
vy'y: Unrecognized part of speech
xa!: Not a word
xa!: Unrecognized part of speech
xa'a: Unrecognized part of speech
xa'a'a: Unrecognized part of speech
Expand Down
2 changes: 1 addition & 1 deletion scripts/Dataparse.java
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,7 @@ public static void main(String[] args) throws Exception {
String[] rafsilist = null;
ArrayList<String> defns = new ArrayList<>();
ArrayList<String> notes = new ArrayList<>();
if (word.matches("[^a-g'i-pr-vxyz., ]|[cfkpstx][bdgjvz]|[bdgjvz][cfkpstx]|[cjsz]{2}|[ck]x|x[ck]|mz|nt[cs]|nd[jz]")) {
if (word.matches(".*([^a-g'i-pr-vxyzA-GI-PR-VXYZ., ]|[cfkpstx][bdgjvz]|[bdgjvz][cfkpstx]|[cjsz]{2}|[ck]x|x[ck]|mz|nt[cs]|nd[jz]).*")) {
errors.add(word + ": Not a word");
}
if (wordline.length < 4) {
Expand Down

0 comments on commit 05bce59

Please sign in to comment.