fix setting pause symbol for non-kana symbol #8
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Maybe this is more of a problem with the dictionary...
njd_set_pronunciation
setsread
,pron
and other features for symbols with 0 mora size. Specifically, non-kana symbols will be set as読点
.In the following example,
~
is incorrectly parsed as名詞
using MeCab and naist-jdic (whereas it should be助詞
).Since its mora size is 0, its
read
,pron
are set to、
andpos
is set to記号
. Consequently, its features would be the following, which is weird.So I think
pos_group
,ctype
andcform
should also be modified and its features become: