PUBLICATION | 2008 Mining Massive Data Sets for Security The NATO Advanced Study Institute (ASI) on Mining Massive Data Sets for Security, held in Villa Cagnola, Gazzada (Italy) from 10 to 21 September 2007, brought...
PUBLICATION | 2008 Real-Time News Event Extraction for Global Crisis Monitoring This paper presents a real-time news event extraction system developed by the Joint Research Centre of the European Commission. It is capable of accurately and efficiently extracting violent and disaster...
PUBLICATION | 2009 Automatic Construction of Multilingual Name Dictionaries This chapter is a contribution to the forthcoming book 'Learning Machine Translation', MIT Press, to be published in 2008. ABSTRACT: Machine Translation and other Natural Language Processing systems...
PUBLICATION | 2010 Enhancing N-Gram-based Summary Evaluation Using Information Content and a Taxonomy In this paper we propose a novel information-theoretic metric for automatic summary evaluation when model summaries are available as in the setting of the AESOP task of the Update Summarization track...
PUBLICATION | 2008 Text Mining from the Web for Medical Intelligence Global medical and epidemic surveillance is an essential function of Public Health agencies, whose mandate is to protect the public from major health threats. To perform this function effectively one...
PUBLICATION | 2011 Expanding a multilingual media monitoring and information extraction tool to a new language: Swahili The Europe Media Monitor (EMM) family of applications is a set of multilingual tools that gather, cluster and classify news, currently in fifty languages, and that extract named entities and quotations...
PUBLICATION | 2011 Building a Multilingual Named Entity-Annotated Corpus Using Annotation Projection As developers of a highly multilingual named entity recognition (NER) system, we face an evaluation resource bottleneck problem: we need evaluation data in many languages...
PUBLICATION | 2010 Automatic Expansion of a Social Network Using Sentiment Analysis In this paper we present an approach for automatic learning of a signed social network from online news articles. The vertices in this network represent people and the edges are labeled with the polarity...
PUBLICATION | 2011 Frontex Real-time News Event Extraction Framework An ever-growing amount of information relevant for early detection of certain threats can be extracted from on-line news. This has led to the emergence of news mining tools to help analysts...