mwe: já não #98

livyreal · 2016-11-30T16:27:01Z

"já não" está tageado errado. não deveria ser ADV e a relação entre eles deveria ser "mwe".

aqui então temos que pegar todos os "já não" que são MWE e mudar a pós de "não NOUN" para "não ADV", a MWEPOS geral é ADV; a deprel de "não" aponta para "já" por mwe e a deprel de já aponta para a mesma coisa que "não" apontava" pela relação "neg".

Como era:

7	já	já	ADP	PRP|@ADVL>	_	8	case	_	MWE:já=não
8	não	não	NOUN	N|M|S|@P<	Gender=Masc|Number=Sing	9	nmod	_	_

Como fica

7	já	já	ADP	_	_	9	neg	_	MWE:já=não MWEPOS=ADV
8	não	não	ADV	_	_	7	mwe	_	_

@concordam, @vcvpaiva e @claudiafreitas ?

The text was updated successfully, but these errors were encountered:

vcvpaiva · 2016-11-30T23:03:49Z

concordo @livyreal, exceto que como @fcbr vai te dizer tem que ser MWE:já_não MWEPOS=ADV em vez de já=não MWEPOS=ADV.

fcbr · 2016-12-12T17:22:19Z

@livyreal neg foi removido da v2, entao como fica a sua regra?

livyreal · 2016-12-12T17:38:01Z

CP911-6

como está:

9	já	já	ADP	PRP|@ADVL>	_	10	case	_	MWE=já_não
10	não	não	NOUN	N|M|S|@P<	Gender=Masc|Number=Sing	12	nmod	_	_

como fica:

9	já	já	ADP	_	_	12	advmod	_	MWE=já_não MWEPOS=ADV
10	não	não	ADV	_	_	10	fixed	       _	_

fcbr · 2016-12-12T18:37:22Z

@livyreal este e' um exemplo generico? Todos os "ja's" devem se conectar via advmod?

livyreal · 2016-12-12T18:58:47Z

sim.

arademaker · 2016-12-12T19:44:53Z

@livyreal seu exemplo está errado token 10 está apontando para o próprio token 10.

livyreal · 2016-12-12T19:46:37Z

obrigada, @arademaker!

como está:

9	já	já	ADP	PRP|@ADVL>	_	10	case	_	MWE=já_não
10	não	não	NOUN	N|M|S|@P<	Gender=Masc|Number=Sing	12	nmod	_	_

como fica:

9	já	já	ADP	_	_	12	advmod	_	MWE=já_não MWEPOS=ADV
10	não	não	ADV	_	_	9	fixed	       _	_

arademaker · 2016-12-12T20:46:49Z

@livyreal features ficam nulas mesmo? Não lembro por que estamos insistindo em manter no campo misc estes valores MWE e MWEPOS. E o campo misc não pode ter espaço, se formos manter isso, preicsamos de um separador, talvez o mesmo | usado no campo feats.

livyreal · 2016-12-12T21:05:34Z

não há nada nas mudanças de V2 que indicam uma nova marcação de quais são as expressões MWE então mantemos o span da mwe e também o general POS tag, como era em V1. Já era assim, mas nosso corpus estava incompleto.

De fato tem a feature Polarity=Neg, que é a grande mudança de V1 pra V2. Lançaram tantas features novas que eu estou perdida ainda.

Finalmente deve ficar:

9	já	já	ADP	_	_	12	advmod	_	MWE=já_não|MWEPOS=ADV
10	não	não	ADV	_	Polarity=Neg	9	fixed	       _	_

arademaker · 2016-12-12T21:11:34Z

@livyreal não entendi seu argumento!! Talvez seja melhor falarmos. Podemos pelas dependências recuperar a MWE original sim, não perdemos nada ao jogar fora estes valores MWE e MWEPOS do campo misc, eu acho.

livyreal · 2016-12-12T21:23:18Z

MWEPOS é o campo mais importante de todos... pq é ele que é relevante para a deprel que vai ligar a MWE ao resto da sentença! Se for uma mwe NOUN, ela pode participar da deprel nmod, se for uma mwe ADV, ela terá que participar de uma deprel advmod. No caso do "ou seja", "ou" é ADP, "seja" é verbo, mas "ou seja" é CCONJ. Isto é muito mais importante do que a POS de cada uma das palavras internamente.

Não dá pra jogar fora não. a MWE pós até daria mas não vejo pq já que as guidelines dizem isto e nós estamos com o corpus todo correto neste sentido.

arademaker · 2016-12-12T21:59:19Z

@livyreal por mais que eu goste da idéia de ter um sistema de tipos nas relações que pudessem garantir que uma dada relação pede determinadas POS tags nos seus extremos, nada nas documentações ou no script validate.py sugere que este tipo de validação seja feita.

Não temos os valores de MWE consistentes não, dado exatamente o caso #23. Acho que manter coisas que não são pedidas e podem ser recuperadas é tornar o trabalho de manutenção mais trabalhoso do que o necessário.

livyreal · 2016-12-15T12:18:35Z

só pra constar, depois do comentário acima do @arademaker , escrevi para a lista de UD (MWEPOS é o assunto do e-mail) e discutimos extensivamente como e onde marcar MWEPOS e MWEspan. MWEspan é redundante, mas MWEPOS é necessário (ainda que um parser comum não vá considerar estas expressões). já que temos todas as MWEspan no corpus e falta algumas MWEPOS, decidimos por manter as duas, já que a informação reducante está correta.

arademaker · 2016-12-15T14:46:19Z

@livyreal sim, documentar é bom. Correções: não temos um valor MWEspan apenas MWE. Não é nossa prioridade mas acho que antes do release final deveríamos remover este valor, não decidimos que vai ficar assim não, só adiamos a conversa, sem consenso. Sobre parser comum, escrevendo assim me pareceu que vc teria a expectativa de algum parser usar esta informação, acho que nenhum parser irá usar o campo MISC para aprender algo, este foi o comentário do Joakim.

vcvpaiva · 2017-02-05T00:10:01Z

gostaria de re-abrir esse issue pra perguntar: perdemos a marcacao de negativity entao completamente?

fcbr · 2017-02-05T00:44:12Z

A negativity foi transferida para a feature Polarity=Neg do token nao.

arademaker · 2017-02-06T09:45:09Z

podemos fechar?

vcvpaiva · 2017-02-07T19:30:10Z

@fcbr and @livyreal Polarity=Neg is not enough to characterize negation, I hope you do understand that this is worse that the relation "neg", which was not enough either, but it was more informative.

arademaker · 2017-02-07T19:37:31Z

@vcvpaiva do you have concrete examples? The v2 UD spec removed the relation 'neg', it was not our decision. But we can argue if we understand the concrete cases that justify the limits of polarity alone for characterize negations

vcvpaiva · 2017-02-07T20:52:25Z

claro que eu sei que a decisao foi deles @arademaker. veja no corpus SICK

1 Nobody nobody NOUN NN _ 3 nsubj _ PRP|?|?
2 is be VERB VBZ _ 3 aux _ VBZ|02604760-v|Entity+
3 playing play VERB VBG _ 0 ROOT _ VBG|01719302-v|DramaticActing+
4 ping ping NOUN NN _ 5 nn _ NN|07389569-n|RadiatingSound+
5 pong pong NOUN NN _ 3 dobj _ JJ|?|?

there is nothing indicating a negation, but that is there, right?

Now, the difference of having a word marked as Polarity=Neg versus a relation, is that the relation at least gives you some indication of where the scope of your negation is, while a word marked, does not.

But NO, there is no reason to take this to the UD guys, who know this fairly well.
As Chris Manning likes to repeat, if you want to move UDs in one of the directions that it's trying to accomplish(which we are, in the direction of semantics), it is very easy.
The difficulty is keeping all the other constraints that he's trying to keep (the universality, the easy of annotation, the respect to traditional grammar, etc. ) on the air.

arademaker · 2017-02-28T15:23:06Z

Now, the difference of having a word marked as Polarity=Neg versus a relation, is that the relation at least gives you some indication of where the scope of your negation is, while a word marked, does not.

The scope of the negation can be the subtree of the given node. Not true that using Polarity you don't have any clue about the scope.

livyreal mentioned this issue Nov 30, 2016

negative PARTicles #71

Closed

fcbr self-assigned this Dec 12, 2016

livyreal added a implementar automatic decidido labels Dec 12, 2016

fcbr pushed a commit that referenced this issue Dec 14, 2016

adding tentative code for issue #98

7a14247

fcbr closed this as completed in 6dc908f Dec 15, 2016

vcvpaiva reopened this Feb 5, 2017

arademaker closed this as completed Feb 6, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mwe: já não #98

mwe: já não #98

livyreal commented Nov 30, 2016 •

edited

Loading

vcvpaiva commented Nov 30, 2016

fcbr commented Dec 12, 2016

livyreal commented Dec 12, 2016

fcbr commented Dec 12, 2016

livyreal commented Dec 12, 2016

arademaker commented Dec 12, 2016

livyreal commented Dec 12, 2016

arademaker commented Dec 12, 2016

livyreal commented Dec 12, 2016 •

edited

Loading

arademaker commented Dec 12, 2016

livyreal commented Dec 12, 2016

arademaker commented Dec 12, 2016 •

edited

Loading

livyreal commented Dec 15, 2016

arademaker commented Dec 15, 2016 •

edited

Loading

vcvpaiva commented Feb 5, 2017

fcbr commented Feb 5, 2017

arademaker commented Feb 6, 2017

vcvpaiva commented Feb 7, 2017

arademaker commented Feb 7, 2017

vcvpaiva commented Feb 7, 2017

arademaker commented Feb 28, 2017

mwe: já não #98

mwe: já não #98

Comments

livyreal commented Nov 30, 2016 • edited Loading

vcvpaiva commented Nov 30, 2016

fcbr commented Dec 12, 2016

livyreal commented Dec 12, 2016

fcbr commented Dec 12, 2016

livyreal commented Dec 12, 2016

arademaker commented Dec 12, 2016

livyreal commented Dec 12, 2016

arademaker commented Dec 12, 2016

livyreal commented Dec 12, 2016 • edited Loading

arademaker commented Dec 12, 2016

livyreal commented Dec 12, 2016

arademaker commented Dec 12, 2016 • edited Loading

livyreal commented Dec 15, 2016

arademaker commented Dec 15, 2016 • edited Loading

vcvpaiva commented Feb 5, 2017

fcbr commented Feb 5, 2017

arademaker commented Feb 6, 2017

vcvpaiva commented Feb 7, 2017

arademaker commented Feb 7, 2017

vcvpaiva commented Feb 7, 2017

arademaker commented Feb 28, 2017

livyreal commented Nov 30, 2016 •

edited

Loading

livyreal commented Dec 12, 2016 •

edited

Loading

arademaker commented Dec 12, 2016 •

edited

Loading

arademaker commented Dec 15, 2016 •

edited

Loading