Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

wrong output of ArkrefClient - coreference resolver #249

Open
oferbr opened this issue Aug 5, 2013 · 0 comments
Open

wrong output of ArkrefClient - coreference resolver #249

oferbr opened this issue Aug 5, 2013 · 0 comments
Assignees

Comments

@oferbr
Copy link
Contributor

oferbr commented Aug 5, 2013

eu.excitementproject.eop.lap.biu.en.coreference.arkref.ArkrefClient seems to output redundant (and maybe just wrong) mentions, returned by getArkrefOutput() (and not by get getArkrefDockedOutput(), which is computed separately, and seems to work fine).

For instance, run eu.excitementproject.eop.lap.biu.en.coreference.arkref.ArkrefCoreferenceResolversTests, test1 (the output is down here). The problem is in TREE COREFERENCE INFORMATION. For instance, group #10 has a mention that is only a comma, which doesn't make sense, and also is not part of the docked output. Another example is in group #12, which has a mention that is only the word "and", and again this doesn't make sense and is not part of the docked output.

** NOTE - when this is solved, the coreference functionality must be added to eu.excitementproject.eop.lap.biu.uima.CasTreeConverter (and its tests in CasTreeConverterTester, CasTreeConverterTest). Otherwise, BIUTEE cannot use coreference at all.

----------------------- output of ArkrefCoreferenceResolversTests.test1 -----------------
TEXT:
Prince George of Cambridge (George Alexander Louis, born 22 July 2013) is the only child of Prince William, Duke of Cambridge, and his wife Catherine, Duchess of Cambridge. He is the only grandchild of Charles, Prince of Wales, and is third in line to succeed his great-grandmother, Queen Elizabeth II, after his grandfather and father.

TEXT TREES:
(ROOT / (null)[(null)] / (null) / (null))
(10 / born[bear] / VBD / (null))
(2 / George[george] / NNP / nsubj)
(1 / Prince[prince] / NNP / nn)
(3 / of[of] / IN / prep)
(8 / Louis[louis] / NNP / pobj)
(4 / Cambridge[cambridge] / NNP / nn)
(5 / -LRB-[-lrb-] / -LRB- / punct)
(6 / George[george] / NNP / nn)
(7 / Alexander[alexander] / NNP / nn)
(9 / ,[,] / , / punct)
(11 / 22[22] / CD / dobj)
(12 / July[july] / NNP / tmod)
(13 / 2013[2013] / CD / num)
(14 / -RRB-[-rrb-] / -RRB- / punct)
(18 / child[child] / NN / dobj)
(15 / is[be] / VBZ / cop)
(16 / the[the] / DT / det)
(17 / only[only] / JJ / amod)
(19 / of[of] / IN / prep)
(21 / William[william] / NNP / pobj)
(20 / Prince[prince] / NNP / nn)
(22 / ,[,] / , / punct)
(23 / Duke[duke] / NNP / conj)
(24 / of[of] / IN / prep)
(25 / Cambridge[cambridge] / NNP / pobj)
(26 / ,[,] / , / punct)
(27 / and[and] / CC / cc)
(29 / wife[wife] / NN / conj)
(28 / his[his] / PRP$ / poss)
(32 / Duchess[duchess] / NNP / appos)
(30 / Catherine[catherine] / NNP / nn)
(31 / ,[,] / , / punct)
(33 / of[of] / IN / prep)
(34 / Cambridge[cambridge] / NNP / pobj)
(35 / .[.] / . / punct)
(ROOT / (null)[(null)] / (null) / (null))
(5 / grandchild[grandchild] / NN / (null))
(1 / He[he] / PRP / nsubj)
(2 / is[be] / VBZ / cop)
(3 / the[the] / DT / det)
(4 / only[only] / JJ / amod)
(6 / of[of] / IN / prep)
(7 / Charles[charles] / NNP / pobj)
(8 / ,[,] / , / punct)
(9 / Prince[prince] / NNP / appos)
(10 / of[of] / IN / prep)
(11 / Wales[wales] / NNP / pobj)
(12 / ,[,] / , / punct)
(13 / and[and] / CC / cc)
(15 / third[third] / JJ / conj)
(14 / is[be] / VBZ / cop)
(16 / in[in] / IN / prep)
(17 / line[line] / NN / pobj)
(19 / succeed[succeed] / VB / infmod)
(18 / to[to] / TO / aux)
(21 / great-grandmother[great-grandmother] / NN / dobj)
(20 / his[his] / PRP$ / poss)
(22 / ,[,] / , / punct)
(25 / II[ii] / NNP / appos)
(23 / Queen[queen] / NNP / nn)
(24 / Elizabeth[elizabeth] / NNP / nn)
(26 / ,[,] / , / punct)
(27 / after[after] / IN / prep)
(29 / grandfather[grandfather] / NN / pobj)
(28 / his[his] / PRP$ / poss)
(30 / and[and] / CC / cc)
(31 / father[father] / NN / conj)
(32 / .[.] / . / punct)

TREE COREFERENCE INFORMATION:
TreeCoreferenceInformation - coref groups and their members:
group #1:
* (1 / Prince[prince] / NNP / nn)
* (21 / William[william] / NNP / pobj)
(20 / Prince[prince] / NNP / nn)
(22 / ,[,] / , / punct)
(23 / Duke[duke] / NNP / conj)
(24 / of[of] / IN / prep)
(25 / Cambridge[cambridge] / NNP / pobj)
(26 / ,[,] / , / punct)
(27 / and[and] / CC / cc)
(29 / wife[wife] / NN / conj)
(28 / his[his] / PRP$ / poss)
(32 / Duchess[duchess] / NNP / appos)
(30 / Catherine[catherine] / NNP / nn)
(31 / ,[,] / , / punct)
(33 / of[of] / IN / prep)
(34 / Cambridge[cambridge] / NNP / pobj)
group #2:
* (10 / born[bear] / VBD / (null))
(2 / George[george] / NNP / nsubj)
(1 / Prince[prince] / NNP / nn)
(3 / of[of] / IN / prep)
(8 / Louis[louis] / NNP / pobj)
(4 / Cambridge[cambridge] / NNP / nn)
(5 / -LRB-[-lrb-] / -LRB- / punct)
(6 / George[george] / NNP / nn)
(7 / Alexander[alexander] / NNP / nn)
(9 / ,[,] / , / punct)
(11 / 22[22] / CD / dobj)
(12 / July[july] / NNP / tmod)
(13 / 2013[2013] / CD / num)
(14 / -RRB-[-rrb-] / -RRB- / punct)
(18 / child[child] / NN / dobj)
(15 / is[be] / VBZ / cop)
(16 / the[the] / DT / det)
(17 / only[only] / JJ / amod)
(19 / of[of] / IN / prep)
(21 / William[william] / NNP / pobj)
(20 / Prince[prince] / NNP / nn)
(22 / ,[,] / , / punct)
(23 / Duke[duke] / NNP / conj)
(24 / of[of] / IN / prep)
(25 / Cambridge[cambridge] / NNP / pobj)
(26 / ,[,] / , / punct)
(27 / and[and] / CC / cc)
(29 / wife[wife] / NN / conj)
(28 / his[his] / PRP$ / poss)
(32 / Duchess[duchess] / NNP / appos)
(30 / Catherine[catherine] / NNP / nn)
(31 / ,[,] / , / punct)
(33 / of[of] / IN / prep)
(34 / Cambridge[cambridge] / NNP / pobj)
(35 / .[.] / . / punct)
group #3:
* (3 / of[of] / IN / prep)
(8 / Louis[louis] / NNP / pobj)
(4 / Cambridge[cambridge] / NNP / nn)
(5 / -LRB-[-lrb-] / -LRB- / punct)
(6 / George[george] / NNP / nn)
(7 / Alexander[alexander] / NNP / nn)
* (9 / ,[,] / , / punct)
* (29 / wife[wife] / NN / conj)
(28 / his[his] / PRP$ / poss)
(32 / Duchess[duchess] / NNP / appos)
(30 / Catherine[catherine] / NNP / nn)
(31 / ,[,] / , / punct)
(33 / of[of] / IN / prep)
(34 / Cambridge[cambridge] / NNP / pobj)
group #4:
* (4 / Cambridge[cambridge] / NNP / nn)
group #5:
* (5 / -LRB-[-lrb-] / -LRB- / punct)
* (14 / -RRB-[-rrb-] / -RRB- / punct)
* (18 / child[child] / NN / dobj)
(15 / is[be] / VBZ / cop)
(16 / the[the] / DT / det)
(17 / only[only] / JJ / amod)
(19 / of[of] / IN / prep)
(21 / William[william] / NNP / pobj)
(20 / Prince[prince] / NNP / nn)
(22 / ,[,] / , / punct)
(23 / Duke[duke] / NNP / conj)
(24 / of[of] / IN / prep)
(25 / Cambridge[cambridge] / NNP / pobj)
(26 / ,[,] / , / punct)
(27 / and[and] / CC / cc)
(29 / wife[wife] / NN / conj)
(28 / his[his] / PRP$ / poss)
(32 / Duchess[duchess] / NNP / appos)
(30 / Catherine[catherine] / NNP / nn)
(31 / ,[,] / , / punct)
(33 / of[of] / IN / prep)
(34 / Cambridge[cambridge] / NNP / pobj)
* (5 / grandchild[grandchild] / NN / (null))
(1 / He[he] / PRP / nsubj)
(2 / is[be] / VBZ / cop)
(3 / the[the] / DT / det)
(4 / only[only] / JJ / amod)
(6 / of[of] / IN / prep)
(7 / Charles[charles] / NNP / pobj)
(8 / ,[,] / , / punct)
(9 / Prince[prince] / NNP / appos)
(10 / of[of] / IN / prep)
(11 / Wales[wales] / NNP / pobj)
(12 / ,[,] / , / punct)
(13 / and[and] / CC / cc)
(15 / third[third] / JJ / conj)
(14 / is[be] / VBZ / cop)
(16 / in[in] / IN / prep)
(17 / line[line] / NN / pobj)
(19 / succeed[succeed] / VB / infmod)
(18 / to[to] / TO / aux)
(21 / great-grandmother[great-grandmother] / NN / dobj)
(20 / his[his] / PRP$ / poss)
(22 / ,[,] / , / punct)
(25 / II[ii] / NNP / appos)
(23 / Queen[queen] / NNP / nn)
(24 / Elizabeth[elizabeth] / NNP / nn)
(26 / ,[,] / , / punct)
(27 / after[after] / IN / prep)
(29 / grandfather[grandfather] / NN / pobj)
(28 / his[his] / PRP$ / poss)
(30 / and[and] / CC / cc)
(31 / father[father] / NN / conj)
(32 / .[.] / . / punct)
group #6:
* (11 / 22[22] / CD / dobj)
group #7:
* (12 / July[july] / NNP / tmod)
(13 / 2013[2013] / CD / num)
group #8:
* (23 / Duke[duke] / NNP / conj)
(24 / of[of] / IN / prep)
(25 / Cambridge[cambridge] / NNP / pobj)
* (28 / his[his] / PRP$ / poss)
group #9:
* (6 / of[of] / IN / prep)
(7 / Charles[charles] / NNP / pobj)
(8 / ,[,] / , / punct)
(9 / Prince[prince] / NNP / appos)
(10 / of[of] / IN / prep)
(11 / Wales[wales] / NNP / pobj)
group #10:
* (8 / ,[,] / , / punct)
* (21 / great-grandmother[great-grandmother] / NN / dobj)
(20 / his[his] / PRP$ / poss)
(22 / ,[,] / , / punct)
(25 / II[ii] / NNP / appos)
(23 / Queen[queen] / NNP / nn)
(24 / Elizabeth[elizabeth] / NNP / nn)
(26 / ,[,] / , / punct)
group #11:
* (11 / Wales[wales] / NNP / pobj)
group #12:
* (13 / and[and] / CC / cc)
* (29 / grandfather[grandfather] / NN / pobj)
(28 / his[his] / PRP$ / poss)
(30 / and[and] / CC / cc)
(31 / father[father] / NN / conj)
group #13:
* (17 / line[line] / NN / pobj)
(19 / succeed[succeed] / VB / infmod)
(18 / to[to] / TO / aux)
(21 / great-grandmother[great-grandmother] / NN / dobj)
(20 / his[his] / PRP$ / poss)
(22 / ,[,] / , / punct)
(25 / II[ii] / NNP / appos)
(23 / Queen[queen] / NNP / nn)
(24 / Elizabeth[elizabeth] / NNP / nn)
(26 / ,[,] / , / punct)

DOCKED COREFERENCE INFORMATION:
[DockedMention(tag=19_21, span=<260..302>, text=his great-grandmother, Queen Elizabeth II,), DockedMention(tag=19_21, span=<283..301>, text=Queen Elizabeth II)]
[DockedMention(tag=1_6_14_15_20_23, span=<0..70>, text=Prince George of Cambridge (George Alexander Louis, born 22 July 2013)), DockedMention(tag=1_6_14_15_20_23, span=<74..171>, text=the only child of Prince William, Duke of Cambridge, and his wife Catherine, Duchess of Cambridge), DockedMention(tag=1_6_14_15_20_23, span=<173..175>, text=He), DockedMention(tag=1_6_14_15_20_23, span=<179..226>, text=the only grandchild of Charles, Prince of Wales), DockedMention(tag=1_6_14_15_20_23, span=<260..263>, text=his), DockedMention(tag=1_6_14_15_20_23, span=<309..312>, text=his)]
[DockedMention(tag=8_11, span=<108..125>, text=Duke of Cambridge), DockedMention(tag=8_11, span=<131..134>, text=his)]
[DockedMention(tag=10_12, span=<131..171>, text=his wife Catherine, Duchess of Cambridge), DockedMention(tag=10_12, span=<135..139>, text=wife)]
[DockedMention(tag=2_9_13, span=<17..26>, text=Cambridge), DockedMention(tag=2_9_13, span=<116..125>, text=Cambridge), DockedMention(tag=2_9_13, span=<162..171>, text=Cambridge)]

@ghost ghost assigned asher-stern Aug 5, 2013
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants