Syntactic nuclei in dependency parsing - A multilingual exploration
Publikation: Bidrag til bog/antologi/rapport › Konferencebidrag i proceedings › Forskning › fagfællebedømt
Standard models for syntactic dependency parsing take words to be the elementary units that enter into dependency relations. In this paper, we investigate whether there are any benefits from enriching these models with the more abstract notion of nucleus proposed by Tesnière. We do this by showing how the concept of nucleus can be defined in the framework of Universal Dependencies and how we can use composition functions to make a transition-based dependency parser aware of this concept. Experiments on 12 languages show that nucleus composition gives small but significant improvements in parsing accuracy. Further analysis reveals that the improvement mainly concerns a small number of dependency relations, including nominal modifiers, relations of coordination, main predicates, and direct objects.
Originalsprog | Engelsk |
---|---|
Titel | EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference |
Antal sider | 12 |
Forlag | Association for Computational Linguistics (ACL) |
Publikationsdato | 2021 |
Sider | 1376-1387 |
ISBN (Elektronisk) | 9781954085022 |
Status | Udgivet - 2021 |
Eksternt udgivet | Ja |
Begivenhed | 16th Conference of the European Chapter of the Associationfor Computational Linguistics, EACL 2021 - Virtual, Online Varighed: 19 apr. 2021 → 23 apr. 2021 |
Konference
Konference | 16th Conference of the European Chapter of the Associationfor Computational Linguistics, EACL 2021 |
---|---|
By | Virtual, Online |
Periode | 19/04/2021 → 23/04/2021 |
Sponsor | Babelscape, Bloomberg Engineering, Facebook AI, Grammarly, LegalForce |
Navn | EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference |
---|
Bibliografisk note
Publisher Copyright:
© 2021 Association for Computational Linguistics
ID: 366045841