Extracting named entities (names, places, dates, and other words and phrases that establish
the meaning of a body of text) is critical to software systems that process large amounts
of unstructured data coming from sources such as email, document files, and the Web.
The classification algorithm for marking web pages with their corresponding
geographical scopes, based on a probabilistic graphical model of geographical concepts.
A probabilistic graphical model of geographical concepts, which is the basis of
the algorithm for classifying web pages according to their geographical scopes.