pt.tumba.cage
Class ExtractAbbrev

java.lang.Object
  extended by pt.tumba.cage.ExtractAbbrev

public class ExtractAbbrev
extends java.lang.Object

A simple algorithm for extracting of abbreviations and their definitions from text. Abbreviations (short forms) are extracted from the input file, and those abbreviations for which a definition (long form) is found are printed out, along with that definition, one per line. A file consisting of short-form/long-form pairs (separated by ":") can be specified in tandem with the -testlist option for the purposes of evaluating the algorithm.

Author:
Bruno Martins
See Also:
A Simple Algorithm for Identifying Abbreviation Definitions in Biomedical Text A.S. Schwartz, M.A. Hearst; Pacific Symposium on Biocomputing 8:451-462(2003) for a detailed description of the algorithm.

Constructor Summary
ExtractAbbrev()
           
 
Method Summary
static void main(java.lang.String[] args)
          Main method .
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ExtractAbbrev

public ExtractAbbrev()
Method Detail

main

public static void main(java.lang.String[] args)
Main method .

Parameters:
argv - The command line input, tokenized.