Is the OntoNotes5.0 dataset marked by Spacy? #42
Comments
Hi, what do you mean by "marked by spaCy"?
Hello. I downloaded your dataset from Hugging Face and noticed that "wasn't" appears in the tokens as "was" and "n't". That is the same as spaCy's word segmentation, which is why I am asking.
I want to label my own data in the same format as your dataset.
(Sent by email in reply to Asahi Ushio's message of 2022-12-05 19:45 on asahi417/tner, Issue #42.)
I am not sure whether we used the spaCy tokenizer to process the data, but judging from what you describe, I think I did.
Thanks a lot for your answer!
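For anyone who wants to prepare their own text with the same segmentation discussed above, here is a minimal sketch in plain Python. It only reproduces the "was" + "n't" split mentioned in this thread (a Penn Treebank-style convention) and is not the preprocessing actually used for the dataset, which the thread does not confirm. The helper names `split_contractions` and `to_record`, the field names `tokens` and `tags`, and the placeholder `"O"` label are all assumptions for illustration; check the dataset card for the real schema.

```python
import re

# Assumption: only the "n't" contraction is handled, Penn Treebank style.
# A word directly followed by "n't" is cut before the "n", so "wasn't"
# becomes "was" + "n't". Other punctuation is split off one character at a time.
_TOKEN = re.compile(r"\w+(?=n't\b)|n't\b|\w+|[^\w\s]")


def split_contractions(text):
    """Hypothetical helper: split text into tokens, separating "n't"."""
    return _TOKEN.findall(text)


def to_record(text):
    """Hypothetical helper: build one example with a placeholder tag per token.

    The keys "tokens" and "tags" and the "O" label are assumptions, not the
    confirmed schema of the dataset.
    """
    tokens = split_contractions(text)
    return {"tokens": tokens, "tags": ["O"] * len(tokens)}


if __name__ == "__main__":
    print(split_contractions("It wasn't there."))
    # ['It', 'was', "n't", 'there', '.']
```

If spaCy is already part of your pipeline, using its tokenizer directly is probably the simpler route. The sketch above is only meant to show what the split looks like and how the tokens line up one-to-one with the tags.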