org.semanticdesktop.aperture.crawler.web
Class WebCrawlerFactory

java.lang.Object
  extended by org.semanticdesktop.aperture.crawler.web.WebCrawlerFactory
All Implemented Interfaces:
CrawlerFactory

public class WebCrawlerFactory
extends Object
implements CrawlerFactory


Constructor Summary
WebCrawlerFactory()
           
 
Method Summary
 Crawler getCrawler(DataSource source)
          Return a Crawler that can crawl the specified DataSource.
 LinkExtractorRegistry getLinkExtractorRegistry()
           
 MimeTypeIdentifier getMimeTypeIdentifier()
           
 Set getSupportedTypes()
          Returns a set of URIs indicating the DataSource types that are supported by the Crawler implementation provided by this CrawlerFactory.
 void setLinkExtractorRegistry(LinkExtractorRegistry linkExtractorRegistry)
           
 void setMimeTypeIdentifier(MimeTypeIdentifier mimeTypeIdentifier)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

WebCrawlerFactory

public WebCrawlerFactory()
Method Detail

getLinkExtractorRegistry

public LinkExtractorRegistry getLinkExtractorRegistry()

setLinkExtractorRegistry

public void setLinkExtractorRegistry(LinkExtractorRegistry linkExtractorRegistry)

getMimeTypeIdentifier

public MimeTypeIdentifier getMimeTypeIdentifier()

setMimeTypeIdentifier

public void setMimeTypeIdentifier(MimeTypeIdentifier mimeTypeIdentifier)

getSupportedTypes

public Set getSupportedTypes()
Description copied from interface: CrawlerFactory
Returns a set of URIs indicating the DataSource types that are supported by the Crawler implementation provided by this CrawlerFactory.

Specified by:
getSupportedTypes in interface CrawlerFactory
Returns:
A Set of DataSource type URIs.

getCrawler

public Crawler getCrawler(DataSource source)
Description copied from interface: CrawlerFactory
Return a Crawler that can crawl the specified DataSource. The getType method of this DataSource should return a URI that is contained in the supported types set of this CrawlerFactory.

Specified by:
getCrawler in interface CrawlerFactory
Parameters:
source - The DataSource for which a Crawler needs to be generated.
Returns:
A Crawler whose getDataSource method returns the specified DataSource.


Copyright © 2010 Aperture Development Team. All Rights Reserved.