org.semanticdesktop.aperture.crawler.web
Class WebCrawlerFactory
java.lang.Object
org.semanticdesktop.aperture.crawler.web.WebCrawlerFactory
- All Implemented Interfaces:
- CrawlerFactory
public class WebCrawlerFactory
- extends Object
- implements CrawlerFactory
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
WebCrawlerFactory
public WebCrawlerFactory()
getLinkExtractorRegistry
public LinkExtractorRegistry getLinkExtractorRegistry()
setLinkExtractorRegistry
public void setLinkExtractorRegistry(LinkExtractorRegistry linkExtractorRegistry)
getMimeTypeIdentifier
public MimeTypeIdentifier getMimeTypeIdentifier()
setMimeTypeIdentifier
public void setMimeTypeIdentifier(MimeTypeIdentifier mimeTypeIdentifier)
getSupportedTypes
public Set getSupportedTypes()
- Description copied from interface:
CrawlerFactory
- Returns a set of URIs indicating the DataSource types that are supported by the Crawler
implementation provided by this CrawlerFactory.
- Specified by:
getSupportedTypes
in interface CrawlerFactory
- Returns:
- A Set of DataSource type URIs.
getCrawler
public Crawler getCrawler(DataSource source)
- Description copied from interface:
CrawlerFactory
- Return a Crawler that can crawl the specified DataSource. The getType method of this DataSource
should return a URI that is contained in the supported types set of this CrawlerFactory.
- Specified by:
getCrawler
in interface CrawlerFactory
- Parameters:
source
- The DataSource for which a Crawler needs to be generated.
- Returns:
- A Crawler whose getDataSource method returns the specified DataSource.
Copyright © 2010 Aperture Development Team. All Rights Reserved.