Class ExtractAbbrev

java.lang.Object
  extended by pt.tumba.cage.ExtractAbbrev

public class ExtractAbbrev
extends java.lang.Object

A simple algorithm for extracting abbreviations and their definitions from text. Abbreviations (short forms) are extracted from the input file; each abbreviation for which a definition (long form) is found is printed together with that definition, one pair per line. To evaluate the algorithm, a file of short-form/long-form pairs (separated by ":") can be supplied with the -testlist option.
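The matching step behind this kind of extraction can be illustrated with a short sketch. The code below follows the right-to-left character matching described in the Schwartz and Hearst paper cited under See Also; it is not taken from ExtractAbbrev itself, and the class name AbbrevMatchSketch and method name findBestLongForm are hypothetical names chosen for this illustration. Given a candidate short form and the text preceding it, the sketch returns the shortest trailing phrase that contains the short form's characters in order, or null when no such phrase exists.

```java
// Illustrative sketch only; not the actual source of pt.tumba.cage.ExtractAbbrev.
// AbbrevMatchSketch and findBestLongForm are hypothetical names.
final class AbbrevMatchSketch {

    /**
     * Scans both strings from right to left, matching each letter or digit of
     * the short form against the candidate text. The first character of the
     * short form must match at the start of a word. Returns the matched long
     * form, or null if the characters cannot all be matched.
     */
    static String findBestLongForm(String shortForm, String candidate) {
        int lIndex = candidate.length() - 1;
        for (int sIndex = shortForm.length() - 1; sIndex >= 0; sIndex--) {
            char curr = Character.toLowerCase(shortForm.charAt(sIndex));
            if (!Character.isLetterOrDigit(curr)) {
                continue; // punctuation in the short form is ignored
            }
            // Move left until the character matches; for the first character
            // of the short form, also require a word boundary before it.
            while ((lIndex >= 0
                        && Character.toLowerCase(candidate.charAt(lIndex)) != curr)
                    || (sIndex == 0 && lIndex > 0
                        && Character.isLetterOrDigit(candidate.charAt(lIndex - 1)))) {
                lIndex--;
            }
            if (lIndex < 0) {
                return null; // ran out of text before matching every character
            }
            lIndex--;
        }
        // Cut the long form at the start of the word containing the first match.
        int start = candidate.lastIndexOf(" ", lIndex) + 1;
        return candidate.substring(start);
    }

    public static void main(String[] args) {
        // prints "heat shock protein"
        System.out.println(findBestLongForm("HSP", "the heat shock protein"));
    }
}
```

In this sketch the caller is assumed to have already located a candidate short form (for example, text inside parentheses) and the words preceding it; how ExtractAbbrev selects those candidates is not shown here.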

Author:
Bruno Martins
See Also:
A.S. Schwartz and M.A. Hearst, "A Simple Algorithm for Identifying Abbreviation Definitions in Biomedical Text", Pacific Symposium on Biocomputing 8:451-462 (2003), for a detailed description of the algorithm.

Constructor Summary
ExtractAbbrev()
Method Summary
static void main(java.lang.String[] args)
          Main method.
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Constructor Detail


public ExtractAbbrev()
Method Detail


public static void main(java.lang.String[] args)
Main method.

Parameters:
args - The command line input, tokenized.