Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Two versions of the same Chinese character #17

Open
jayeuijunglee opened this issue Mar 26, 2021 · 0 comments
Open

Two versions of the same Chinese character #17

jayeuijunglee opened this issue Mar 26, 2021 · 0 comments

Comments

@jayeuijunglee
Copy link

jayeuijunglee commented Mar 26, 2021

Hi. It seems that the same Chinese character can have two versions, which look slightly different and also have different unicode values. And only one version is recognized as hanja.

For example, 李 has two versions, unicode 674e and unicode f9e1. Only the first version passes as hanja:
image

My guess, from looking at 李, 金, 宅, is that all unicode values f900-fa60 in the unicode tables (http://www.tamasoft.co.jp/en/general-info/unicode.html) suffer the same problem.

Would it be possible to include unicode values f900-fa60 to be recognized by hanja?

Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant