(Publisher of Peer Reviewed Open Access Journals)

International Journal of Advanced Computer Research (IJACR)

ISSN (Print):2249-7277    ISSN (Online):2277-7970
Volume-3 Issue-9 March-2013
Full-Text PDF
Paper Title : Extracting Person Name, Date and Place from Text Documents Using LEX Tool
Author Name : Roohi Sharma
Abstract :

This paper contains the details of how one can extract person name, date and place from a text document using finite state automata and LEX tool. If we search a text document for some important information manually, the process is slow, tedious and error prone. The regular expressions are used to parse textual data to match patterns and extract variables. The lexical analyzer is used in this research, which scans the input program character by character and groups them together to form tokens. This paper describes a technique to perform identification and extraction of information by using LEX tool. It finds the names, date and places that appear in machine-readable text document. Regular expressions through which required information is extracted are also discussed.

Keywords : Regular Expressions, Finite State Automata, Information Extraction, Pattern Matching, Lexical Analyzer.
Cite this article : Roohi Sharma, " Extracting Person Name, Date and Place from Text Documents Using LEX Tool " , International Journal of Advanced Computer Research (IJACR), Volume-3, Issue-9, March-2013 ,pp.26-29.