NewsReader project

  1. Rodrigo Agerri
  2. Eneko Agirre
  3. Itziar Aldabe
  4. Begoña Altuna
  5. Zuhaitz Beloki
  6. Egoitz Laparra
  7. Maddalen López de Lacalle
  8. German Rigau
  9. Aitor Soroa
  10. Rubén Urizar
Journal:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Year of publication: 2014

Issue: 53

Pages: 155-158

Type: Article

Abstract

The European project NewsReader develops advanced technology to process daily news streams in four languages, extracting what happened, when and where it happened, and who was involved. NewsReader reads massive amounts of news coming from thousands of sources. It compares the results across sources to complement information and to determine where the different sources disagree. Furthermore, it merges current news with previous news, creating a long-term history rather than separate events. The results accumulate over time, producing an extremely large knowledge base that is visualized using new techniques to provide more comprehensive access.
