Skip to content

Commit

Permalink
Modifier Letter Apostrophe was missing
Browse files Browse the repository at this point in the history
Add additional letter from dialect transcriptions.
  • Loading branch information
rueter committed Nov 16, 2024
1 parent ceacf70 commit fc1d29e
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions tools/tokenisers/tokeniser-disamb-gt-desc.pmscript
Original file line number Diff line number Diff line change
Expand Up @@ -66,6 +66,7 @@ Define alphabet "a-z" !! * lower-case ASCII
|"A-Z" !! * upper-case ASCII
|Lst({àáâãāăȧäåǎȁȃąæǽǣèéêēĕėëěȅȇȩęìíîĩīīĭi̇ïǐįȉȋɨòóôõōŏȯöőǒȍȏơǫɵøǭǿœùúûũūŭüůűǔȕȗưųʉýŷȳÿƴɏÀÁÂÃĀĂȦÄÅǍȀȂĄÆǼǢÈÉÊĒĔĖËĚȄȆȨĘÌÍÎĨĪĪĬİÏǏĮȈȊƗÒÓÔÕŌŎȮÖŐǑȌȎƠǪƟØǬǾŒÙÚÛŨŪŬÜŮŰǓȔȖƯŲɄÝŶȲŸƳɎšžčđðíŋňŧñńŠŽČĐÐÍŊŇŦÑ})
!! * select extended latin symbols
| Lst({ʼĺšń·e͔i͔t́śźлüš́āžƞǵñv́h́źēīūōǟd́})
| "0-9" !! ASCII digits
| Lst({_§°}) !! * select symbols
!! * Combining diacritics as individual symbols,
Expand Down

0 comments on commit fc1d29e

Please sign in to comment.