Matxin-Informatika: versión del traductor Matxin adaptada al dominio de la informática

  1. Alegría Loinaz, Iñaki
  2. Cabezón, Unai
  3. Labaka Intxauspe, Gorka
  4. Mayor Martinez, Aingeru
  5. Sarasola Gabiola, Kepa
Journal:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Year of publication: 2011

Issue: 47

Pages: 322-324

Type: Article

Abstract

We present Matxin-Informatika, a new version of the Matxin translation system (Spanish to Basque) that has been adapted to the computer science domain using a bilingual corpus and lexical resources. This version is being used in a manual post-editing task, with the aim of further improving the translator by means of statistical post-editing.

Bibliographic References

  • Alegria I., Arregi X., Díaz de Ilarraza A., Labaka G., Lersundi M., Mayor A., Sarasola K. 2008. Strategies for sustainable MT for Basque: incremental design, reusability, standardization and open-source. Proceedings of the IJCNLP-08, pp. 235-243. Hyderabad, India.
  • Labaka, G. 2010. EUSMT: Incorporating Linguistic Information into SMT for a Morphologically Rich Language. Its use in SMT-RBMT-EBMT hybridation. PhD Thesis. (UPV-EHU). Donostia.
  • Mayor, A. 2007. Matxin: Erregeletan oinarritutako itzulpen automatikoko sistema baten eraikuntza estaldura handiko baliabide linguistikoak berrerabiliz. PhD Thesis. (UPV-EHU). Donostia.
  • Simard, M., Ueffing, N., Isabelle, P., and Kuhn, R. 2007. Rule-based translation with statistical phrase-based post-editing. Proceedings of the Second Workshop on Statistical Machine Translation, pp. 203-206. Prague, Czech Republic.
  • Och, F. J., Ney, H. 2003. A Systematic Comparison of Various Statistical Alignment Models. Computational Linguistics, volume 29, number 1, pp. 19-51.