java.lang.Object
  ISSearch.ISCrawler
The Crawler class of the Web search engine. This class is used to start and stop the Crawler, to reset the engine and to control crawling parameters.
See Also: Runnable, Thread, InetAddress, URL, HttpURLConnection, InputStreamReader, BufferedReader, Exception

| Field Summary |
| Fields inherited from interface ISSearch.ISCrawlerInterface |
| RUNNING, STOPPED |
| Constructor Summary | |
| ISCrawler() | Creates a new instance of ISCrawler. |
| Method Summary | |
| void | addLink(URL link) | Adds a new link to the URL queue if the link has not yet been visited. |
| URL | getBest() | Returns the best candidate to be visited next. |
| int | getCrawlingDepth() | Returns the current maximum allowed crawling depth. |
| ISDocumentInterface | getCurrentDocument() | Returns the last document visited by the crawler. |
| URL | getCurrentURL() | Returns the last URL visited by the crawler. |
| int | getMaxQueueSize() | Returns the maximum allowed size of the URL queue. |
| int | getQueueSize() | Returns the current size of the URL queue. |
| int | getState() | Returns the current state of the crawler. |
| boolean | isVisited(URL doc) | Checks whether the URL of the given document has already been visited by the crawler. |
| void | reset() | Resets the crawler. |
| void | run() | When an object implementing interface Runnable is used to create a thread, starting the thread causes the object's run method to be called in that separately executing thread. |
| void | setCrawlingDepth(int depth) | Sets the maximum allowed crawling depth. |
| void | setQueueMaxSize(int m) | Sets the maximum allowed size of the URL queue. |
| void | start() | Starts the thread of the crawler and changes the engine state to RUNNING. |
| void | stop() | Stops the crawler. |
| Methods inherited from class java.lang.Object |
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
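
The queue-related methods in the summary above (addLink, getBest, isVisited, getQueueSize, setQueueMaxSize) can be illustrated with a small stand-in. This is only a sketch under stated assumptions, not the ISCrawler source: the class name UrlQueueSketch, the url helper, the first-in-first-out choice of "best" candidate, and the rule that a full queue silently drops new links are all hypothetical. Visited URLs are tracked by their string form because URL.equals may perform host name resolution.

```java
import java.net.MalformedURLException;
import java.net.URI;
import java.net.URL;
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashSet;
import java.util.Set;

// Hypothetical stand-in for the URL queue; NOT the ISCrawler implementation.
class UrlQueueSketch {
    private final Deque<URL> queue = new ArrayDeque<>();
    // Keyed by string form: URL.equals()/hashCode() may trigger DNS lookups.
    private final Set<String> visited = new HashSet<>();
    private int maxQueueSize;

    UrlQueueSketch(int maxQueueSize) {
        this.maxQueueSize = maxQueueSize;
    }

    // Queues the link unless it was already visited or the queue is full
    // (dropping on a full queue is an assumption of this sketch).
    synchronized void addLink(URL link) {
        if (isVisited(link) || queue.size() >= maxQueueSize) {
            return;
        }
        queue.addLast(link);
    }

    // Assumed policy: the oldest queued link is the "best" candidate.
    // Marks the returned link as visited; returns null if the queue is empty.
    synchronized URL getBest() {
        URL next;
        while ((next = queue.pollFirst()) != null) {
            // Skip entries that became visited while waiting in the queue.
            if (visited.add(next.toExternalForm())) {
                return next;
            }
        }
        return null;
    }

    synchronized boolean isVisited(URL doc) {
        return visited.contains(doc.toExternalForm());
    }

    synchronized int getQueueSize() {
        return queue.size();
    }

    synchronized int getMaxQueueSize() {
        return maxQueueSize;
    }

    synchronized void setQueueMaxSize(int m) {
        maxQueueSize = m;
    }

    // Hypothetical helper: builds a URL without a checked exception.
    static URL url(String s) {
        try {
            return URI.create(s).toURL();
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException(e);
        }
    }
}
```

In this sketch a link handed out by getBest counts as visited from that moment on, so a later addLink call with the same URL is ignored.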
| Constructor Detail |
public ISCrawler()
| Method Detail |
public void addLink(URL link)
Specified by: addLink in interface ISCrawlerInterface
Parameters: link - The URL link representation of the new target

public URL getBest()
Specified by: getBest in interface ISCrawlerInterface
Returns: null if the queue is empty.

public int getCrawlingDepth()
Specified by: getCrawlingDepth in interface ISCrawlerInterface

public ISDocumentInterface getCurrentDocument()
Specified by: getCurrentDocument in interface ISCrawlerInterface

public URL getCurrentURL()
Specified by: getCurrentURL in interface ISCrawlerInterface

public int getMaxQueueSize()
Specified by: getMaxQueueSize in interface ISCrawlerInterface

public int getQueueSize()
Specified by: getQueueSize in interface ISCrawlerInterface

public int getState()
The possible states are RUNNING and STOPPED.
Specified by: getState in interface ISCrawlerInterface
Returns: RUNNING or STOPPED

public boolean isVisited(URL doc)
Specified by: isVisited in interface ISCrawlerInterface
Returns: true if the engine was able to recognize the given URL as already visited, false otherwise.

public void setCrawlingDepth(int depth)
Specified by: setCrawlingDepth in interface ISCrawlerInterface
Parameters: depth - The maximum allowed crawling depth.

public void setQueueMaxSize(int m)
Specified by: setQueueMaxSize in interface ISCrawlerInterface
Parameters: m - The maximum allowed queue size

public void start()
Starts the thread of the crawler and changes the engine state to RUNNING.
Specified by: start in interface ISCrawlerInterface

public void stop()
Stops the crawler and changes the engine state to STOPPED.
Specified by: stop in interface ISCrawlerInterface

public void reset()
STOPPED,
Specified by: reset in interface ISCrawlerInterface

public void run()
Description copied from interface: Runnable
When an object implementing interface Runnable is used to create a thread, starting the thread causes the object's run method to be called in that separately executing thread.
The general contract of the method run is that it may take any action whatsoever.
Specified by: run in interface Runnable
See Also: Thread.run()
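
The interplay of start(), stop(), getState() and run() can be sketched as follows. This is an illustration under assumptions, not the actual ISCrawler code: the class name CrawlerLifecycleSketch, the numeric values of RUNNING and STOPPED (the real constants are inherited from ISCrawlerInterface and their values are not shown on this page), the step counter, and the choice to interrupt and join the worker thread in stop() are all hypothetical.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical lifecycle sketch; NOT the ISCrawler implementation.
class CrawlerLifecycleSketch implements Runnable {
    // Assumed values; the real constants come from ISCrawlerInterface.
    static final int STOPPED = 0;
    static final int RUNNING = 1;

    private volatile int state = STOPPED;
    private Thread worker;
    private final AtomicInteger steps = new AtomicInteger();

    // Starts the worker thread and switches the state to RUNNING.
    synchronized void start() {
        if (state == RUNNING) {
            return; // already running, nothing to do
        }
        state = RUNNING;
        worker = new Thread(this, "crawler-sketch");
        worker.start();
    }

    // Switches the state to STOPPED and waits for the worker to finish.
    void stop() {
        Thread t;
        synchronized (this) {
            state = STOPPED;
            t = worker;
            worker = null;
        }
        if (t != null) {
            t.interrupt();
            try {
                t.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
    }

    int getState() {
        return state;
    }

    int getSteps() {
        return steps.get();
    }

    // Called on the worker thread; loops until the state leaves RUNNING.
    @Override
    public void run() {
        while (state == RUNNING && !Thread.currentThread().isInterrupted()) {
            steps.incrementAndGet(); // placeholder for visiting one URL
            try {
                Thread.sleep(10);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // leave the loop promptly
            }
        }
    }
}
```

A caller would invoke start() once, poll getState() as needed, and call stop() to end crawling; run() is meant to be invoked by the thread, not directly.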