Construcción de un corpus etiquetado sintácticamente para el euskera

  1. Aduriz, Itziar
  2. Aldezabal Roteta, Izaskun
  3. Aranzabe Urruzola, María Jesús
  4. Arrieta Kortajarena, Bertol
  5. Arriola Egurrola, José María
  6. Atutxa Salazar, Aitziber
  7. Díaz de Ilarraza Sánchez, Arantza
  8. Gojenola Galletebeitia, Koldobika
  9. Oronoz Anchordoqui, Maite
  10. Sarasola Gabiola, Kepa
Revue:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Année de publication: 2002

Titre de la publication: XVII Congreso de la SEPLN. Universidad de Valladolid, 11-13 septiembre 2002

Número: 29

Pages: 5-11

Type: Article

D'autres publications dans: Procesamiento del lenguaje natural

Résumé

The aim of this work is the construction of a syntactically annotated treebank for Basque. In this paper we present first, the basis of the annotation. After examining several options we chose the scheme presented in (Carrol et al., 1998). It follows the EAGLES standards and it is based on the idea of adding to each sentence in the corpus a series of grammatical relations specifying the dependencies between modifiers and their nucleus. After the formalism has been presented, we will describe the problems we have found and the decisions we have taken to solve them. Next we present an example showing the application of the scheme to an initial corpus. Finally, we present the main conclusions about the applicability to Basque and future work.