Morphosyntactic structure of terms in Basque for automatic terminology extraction

  1. Urízar Enbeitia, Rubén
  2. Ezeiza Ramos, Nerea
  3. Alegría Loinaz, Iñaki
Libro:
Proceedings of the Ninth EURALEX International Congress, EURALEX 2000: Stuttgart, Germany, August 8th - 12th, 2000
  1. Heid, Ulrich (ed. lit.)
  2. Evert, Stefan (ed. lit.)
  3. Lehmann, Egbert (ed. lit.)
  4. Rohrer, Christian (ed. lit.)

Editorial: Stuttgart : Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart, 2000

Año de publicación: 2000

Páginas: 373-381

Congreso: EURALEX. International Congress (9. 2000. Stuttgart)

Tipo: Aportación congreso

Resumen

This paper describes the morphosyntactic patterns of technical terms in Basque and presents an architecture for a term-extracting tool. As Basque is a highly inflected agglutinative language, part-of-speech information is not enough to define term patterns. The use of morphological and syntactic information is essential to reduce considerably the number of structures. For example, a noun, an adverbial, a postpositive adjectival, the nominal form of a verb and even a determiner in the genitive case may work as a prepositive adjective; however, they all share the same syntactic function. Therefore, for the termextracting tool to perform properly, the texts must be morphosyntactically analysed and disambiguated. Then a shallow syntactic parser will identify the previously described patterns.