|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
Parser-Interface of the Web search engine.
The parser reads any given text or HTML input (represented by the Reader
) and extracts
all words and links from that source.
The result must be returned as Object that implements the ISDocumentInterface
String
,
HTMLEditorKit
,
StringTokenizer
Method Summary | |
boolean |
isStopword(String who)
Decides whether the given token is claimed as stopword or not. |
ISDocumentInterface |
parse(Reader input)
Performs the input analysis. |
String |
stem(String who)
Applies the Porter stemming algorithm and returns the resulting word stem. |
Method Detail |
public ISDocumentInterface parse(Reader input)
input
- the input of the parser (e.g., text file or HTTP connection), represented by the Reader
public boolean isStopword(String who)
who
- The String to be checked.
public String stem(String who)
who
- The word to be stemmed.
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |