NCJRS Abstract

The document referenced below is part of the NCJRS Virtual Library collection. To conduct further searches of the collection, visit the Virtual Library. See the Obtain Documents page for direction on how to access resources online, via mail, through interlibrary loans, or in a local library.


NCJ Number: 217681
Title: Software System for Information Extraction in Criminal Justice Information Systems
Author(s): Tianhao Wu; Stephen V. Zanias; William M. Pottenger
Corporate Author: Lehigh University
United States of America
Date Published: 2006
Page Count: 178
Sponsoring Agency: Lehigh University
Bethlehem, PA 18015
National Institute of Justice (NIJ)
Washington, DC 20531
National Institute of Justice/NCJRS
Rockville, MD 20849
NCJRS Photocopy Services
Rockville, MD 20849-6000
Grant Number: 2003-IJ-CX-K003
Sale Source: National Institute of Justice/NCJRS
Box 6000
Rockville, MD 20849
United States of America

NCJRS Photocopy Services
Box 6000
Rockville, MD 20849-6000
United States of America
Document: PDF
Type: Program/Project Description
Format: Document
Language: English
Country: United States of America
Annotation: This federally supported report provides extensive description and background on information extraction (IE) and reviews several commercial IE systems.
Abstract: The purpose of this project was to build an information extraction system that automatically extracts features from textual data commonly used by law enforcement agencies. Such information, although highly useful in criminal investigations, is often not stored in a database in relational form. This project’s technology is capable of automatically extracting such information from the source text and entering it into a fielded, relational database. The extracted information can thus be readily retrieved and compared with other database records using modern computer-based information retrieval systems. The technique used significantly shortens the time needed to train an information extraction system of this nature. This approach enables the extraction of such features for use in everyday search and retrieval applications such as suspect identification. The system will also provide input to advanced text mining algorithms for pattern detection. Such algorithms can be used, for example, to map modus operandi to physical descriptions of criminal suspects. Based on information extraction technology, Lehigh has developed a software system named the BPD_IE System (Bethlehem Police Department Information Extraction System) that automatically extracts key items of information from narrative textual data and links unsolved criminal cases to solved cases, providing investigators with valuable leads. The technology automatically obtains modus operandi and physical descriptions from these textual documents. This data is then stored in fielded, relational databases, which can be easily searched. Figures, references, and appendixes A-B
Main Term(s): Information processing
Index Term(s): Computer generated reports; Computer software; Computers; Data collections; Information collection; Information dissemination; NIJ final report; NIJ grant-related documents; Science and Technology; Technical evolution