Char |
Decodes, encodes and normalizes UNICODE, UTF8, HTML and URI/URL strings |
DateParser |
Parses and normalizes different date formats (e.g. "January 5th, 2000" or "500 BC") |
Name |
Provides primitive heuristics to recognize and parse person names and organization names |
NounGroup |
Splits a noun group (given by a String) into its modifiers and its head |
NumberParser |
Parses and normalizes complex number expressions (e.g. "10 million meters") |
NumberFormatter |
A simple number formatter |
PlingStemmer |
Stems an English noun to singular. Knows nearly all exceptions. |
RegularExpression |
Parses a Regular Expression and converts it to an automaton, allows to invert it |
ArrayQueue |
Implements a simple non-blocking queue |
BitVector |
Implements a bit vector, i.e. a list of bits, a set of small integers |
CombinedIterator |
Combines multiple iterators to one iterator |
CompressedString |
Compresses a String in a potentially lossy way to 7, 6 or less bits per character (instead of 16) |
DirectedGraph |
Implements a directed graph wit ancestor finding |
FilteredIterator |
Implements an iterator that allows filtering out certain elements |
FinalMap |
Provides a nicer constructor for a TreeMap |
FinalSet |
Provides a very simple container implementation with zero overhead |
Frequency Vector |
Provides recall and precision measures on bags of words, including fuzzy recall and fuzzy precision, and Wilson Interval computation |
Immutable |
Wraps a list or a set so that it becomes immutable. |
IntSet |
implements a set of small integers as a bit vector with constant time access |
IterableForIterator |
Wraps an iterator so that it can be used in a for-each-loop |
IterableForEnumeration |
Wraps an untyped enumeration into a typed iterator |
MappedIterator |
implements an iterator that maps each element by a function before yielding it |
Pair |
For the simple datatype Pair |
PeekIterator |
An Iterator that can look ahead (peek) one element |
SmallStack |
Implements a fast stack for int, long, double |
SparseVector |
Represents a Sparse Vector, i.e. a vector that has only few non-zero entries. Implements k-means |
SmallStack |
Represents an efficient stack for primitive datatypes (int, long, double, boolean) |
SVMModel |
Implements an SVM-light-Model |
Tree |
For the simple datatype Tree |
Trie |
implements a trie (an efficient set of strings based on prefixes) |
UndirectedGraph |
implements an undirected graph |
Visitor |
For the common visitor design pattern |
Visitable |
For the common visitor design pattern |
CSVFile |
Writing to a comma-separated file (CSV file). |
CSVLines |
Can iterate through the columns of a comma-separated file (CSV file). |
DeepFileSet |
Represents a set of files as given by a wildcard string. Can recurse subfolders. |
FigureProducer |
Produces Latex tables and JPG plots for table data |
FileLines |
Provides an iterator over the lines in a file |
FileSet |
Represents a set of files as given by a wildcard string. Does not include folders, is not case-sensitive. |
HTMLReader |
Reads characters from an HTML-file |
MatchReader |
Provides an iterator over Regular Expression matches in a file |
SimpleInputStreamReader |
Reads characters from a file, regardless of the encoding |
SimpleOutputStreamWriter |
Writes character to a file, regardless of the encoding (see here for a problem description) |
UTF8Reader |
Reads characters from an UTF8-encoded file |
UTF8Writer |
Writes UTF8-encoded characters to a file |