Construcción de un corpus etiquetado sintácticamente para el euskera

  1. Aduriz, Itziar
  2. Aldezabal Roteta, Izaskun
  3. Aranzabe Urruzola, María Jesús
  4. Arrieta Kortajarena, Bertol
  5. Arriola Egurrola, José María
  6. Atutxa Salazar, Aitziber
  7. Díaz de Ilarraza Sánchez, Arantza
  8. Gojenola Galletebeitia, Koldobika
  9. Oronoz Anchordoqui, Maite
  10. Sarasola Gabiola, Kepa
Journal:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Publication date: 2002

Issue title: XVII Congreso de la SEPLN. Universidad de Valladolid, 11-13 septiembre 2002

Number: 29

Pages: 5-11

Type: Article

Abstract

The aim of this work is the construction of a syntactically annotated treebank for Basque. We first present the basis of the annotation. After examining several options, we chose the scheme presented in (Carroll et al., 1998), which follows the EAGLES standards and is based on the idea of adding to each sentence in the corpus a series of grammatical relations that specify the dependencies between modifiers and their heads. After presenting the formalism, we describe the problems we found and the decisions we took to solve them. Next, we give an example showing the application of the scheme to an initial corpus. Finally, we present the main conclusions about the scheme's applicability to Basque and outline future work.
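
As a rough illustration of what "adding a series of grammatical relations to each sentence" can look like in practice, the following Python sketch stores a sentence together with a list of head-dependent relations. Everything in it is an assumption made for illustration: the class names, the relation labels `ncsubj` and `dobj`, and the analysis of the sample sentence are hypothetical and are not taken from the paper's corpus or its guidelines.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class GrammaticalRelation:
    """One relation linking a dependent (modifier) to its head."""
    label: str       # illustrative relation name, e.g. "ncsubj"
    head: str        # the governing word
    dependent: str   # the word that depends on the head

    def __str__(self) -> str:
        return f"{self.label}({self.head}, {self.dependent})"


@dataclass
class AnnotatedSentence:
    """A tokenized sentence plus the relations attached to it."""
    tokens: list[str]
    relations: list[GrammaticalRelation] = field(default_factory=list)

    def add(self, label: str, head: str, dependent: str) -> None:
        # Reject relations that mention words not present in the sentence.
        for word in (head, dependent):
            if word not in self.tokens:
                raise ValueError(f"{word!r} is not a token of this sentence")
        self.relations.append(GrammaticalRelation(label, head, dependent))

    def dependents_of(self, head: str) -> list[str]:
        # All words annotated as depending on the given head.
        return [r.dependent for r in self.relations if r.head == head]


def build_example() -> AnnotatedSentence:
    # Hypothetical annotation of "gizonak liburua erosi du"
    # ("the man has bought the book"); the auxiliary "du" is
    # deliberately left unannotated in this sketch.
    sentence = AnnotatedSentence(["gizonak", "liburua", "erosi", "du"])
    sentence.add("ncsubj", "erosi", "gizonak")  # assumed subject relation
    sentence.add("dobj", "erosi", "liburua")    # assumed object relation
    return sentence


if __name__ == "__main__":
    for relation in build_example().relations:
        print(relation)
```

Keeping the relations in a flat list beside the tokens, rather than as a nested tree, reflects the abstract's description of relations being added to each sentence; a real treebank would most likely reference token positions instead of word forms so that repeated words stay unambiguous.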