Package sofie.parsing

Class Summary
Parser Class Parser This class is part of the SOFIE system (http://mpii.de/yago-naga/sofie).
Token Class Token This class is part of the SOFIE system (http://mpii.de/yago-naga/sofie).
Token.AnyName Superclass for tokens that are considered in patterns
Token.CanonicProperName Disambiguated proper names
Token.CommonIndividual Superclass for common individuals
Token.Company Companies
Token.Date Normalized dates
Token.Functional Tokens that serve purely functional purposes
Token.Literal Literal tokens
Token.Number Numbers
Token.Person Persons
Token.ProperName Named entities
Token.Punctuation Punctuation tokens
Token.Repeat A token that places the title token left of itself
Token.SemiCanonicProperName Semi-Disambiguated proper names (like wikipedia links)
Token.Separator Tokens that cannot be part of a pattern
Token.StartCommentToken Token that starts a comment (e.g.
Token.StopWord Stop word tokens
Token.Title Token that shall be repeated before repeat tokens
Token.URL URLs
Token.USState US States
Token.WikiLink non-proper-name Wikipedia links
Token.Word Normal word tokens
Tokenizer Class Tokenizer This class is part of the SOFIE system (http://mpii.de/yago-naga/sofie).
Tokenizer.HTMLTokenizer Tokenizer for HTML
Tokenizer.PatternRunner A pattern runner is an object that finds one specific pattern in a string.
Tokenizer.WikiTokenizer Tokenizer for Wikipedia
 

Enum Summary
Token.NameType Types of a common name