Text Segmentation attempts to determine boundaries between topics in long documents by dividing the document text into text segments. Each segment comprises a consecutive sequence of sentences or paragraphs sharing a coherent topic.
All MorphAdorner segmenters must implement the {@link edu.northwestern.at.utils.corpuslinguistics.textsegmenter.TextSegmenter} interface. The {@link edu.northwestern.at.utils.corpuslinguistics.textsegmenter.TextSegmenterFactory} provides the mechanism for instantiating a default or specified instance of a segmenter implementation.
MorphAdorner includes two text segmenters, both based upon code written by Freddy Choi.