Information Extraction Agent

Abstract: Abstract

with the explosion of the World Wide Web, a wealth of data on many different subjects has become available online. Because of the magnitude of information currently available analyzing it and understanding is an immense task that has motivated much research efforts.

Some scientific branches have evolved to help summarize, analyze and understand the huge quantity of data. Examples of such branches include data and web mining, text summarization, information retrieval and information extraction, among others.

Information extraction, the field on which this thesis is focused, aims to extract certain information from unstructured or semi structured text and put it in a predefined form or a database, in order to facilitate information finding.

In the presented work, a tool that augments each paragraph in a document or publication with metadata describing that paragraph was developed. The class of documents targeted by this work is those which cover similar concepts within some given domain the augmentation process is based on knowledge from the domain, which is acquired dynamically from indexed documents. The search process described in this work results only in paragraph(s) which match the entered search criteria, being returned. Moreover the search engine of the tool exists as a web service to give the search facility to any other agent or any search system, to use the search engine of the tool regardless of the platform or operating system.


Publication year 2005
Availability location المعمل المركزى للنظم الزراعية الخبيرة - 6 ش النور - نادى الصيد- الدقى - الجيزة
Availability number
Organization Name
Country Egypt
Author(s) from ARC
Agris Categories Documentation and information
AGROVOC
TERMS
Extraction. Information processing. Information retrieval.
Publication Type Master Thesis