-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CV2-5247-support-yake-keyword-extraction-for-chinese #112
CV2-5247-support-yake-keyword-extraction-for-chinese #112
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you add a test with a text example that shows this working? Ideally would be a chunk of non space delimited Chinese text that is badly parsed without jieba and then returns appropriate keywords when processed with jieba.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
seems like a really cool library!
One extremely minor note otherwise good to go! |
…or language == 'zh' or language == 'zh-TW'` to `if language[:2]=="zh"`
Description
using jieba to segment mandarin text
Reference: CV2-5247 and CV2-4909
How has this been tested?
Has it been tested locally? Are there automated tests?
locally. will add automated tests as well.
Are there any external dependencies?
Are there changes required in sysops terraform for this feature or fix?
Have you considered secure coding practices when writing this code?
Please list any security concerns that may be relevant.