java.lang.Object
org.semanticdesktop.aperture.subcrawler.base.AbstractSubCrawler
org.semanticdesktop.aperture.subcrawler.vcard.VcardSubCrawler
public class VcardSubCrawler
A SubCrawler implementation working with VCard documents.
Known issues:
URIs for VCARDS
This crawler uses the following conventions to generate URIs:
Constructor Summary | |
---|---|
VcardSubCrawler()
|
Method Summary | |
---|---|
String |
getUriPrefix()
Returns the prefix used when generating URIs. |
void |
stopSubCrawler()
Stops a running crawl as fast as possible. |
void |
subCrawl(URI id,
InputStream stream,
SubCrawlerHandler handler,
DataSource dataSource,
AccessData accessData,
Charset charset,
String mimeType,
RDFContainer parentMetadata)
Starts crawling the given stream and reports the encountered DataObjects to the given SubCrawlerHandler. |
Methods inherited from class org.semanticdesktop.aperture.subcrawler.base.AbstractSubCrawler |
---|
createChildUri, getDataObject |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public VcardSubCrawler()
Method Detail |
---|
public void subCrawl(URI id, InputStream stream, SubCrawlerHandler handler, DataSource dataSource, AccessData accessData, Charset charset, String mimeType, RDFContainer parentMetadata) throws SubCrawlerException
SubCrawler
Parameters:
id - the URI identifying the object (e.g. a file or web page) from which the stream was obtained. This URI is treated as the URI of the parent object; all objects encountered in the stream are considered to be contained within the parent object. (optional; the implementation may use this URI or the one returned by the RDFContainer.getDescribedUri() method of the parentMetadata)
stream - the stream to be crawled. (obligatory)
handler - the crawler handler that is to receive the notifications from the SubCrawler. (obligatory)
dataSource - the data source that will be returned by the DataObject.getDataSource() method of the returned data objects. Some implementations may require that this reference is not null and that it contains some particular information.
accessData - the AccessData used to determine whether the encountered objects are to be returned as new, modified, unmodified or deleted. Information about new or modified objects is stored within it for use in future crawls. This parameter may be null if this functionality is not desired, in which case all DataObjects will be reported as new. (optional)
charset - the charset in which the input stream is encoded. (optional)
mimeType - the MIME type of the passed stream. (optional)
parentMetadata - the 'parent' RDFContainer that will contain the metadata about the top-level entity in the stream. A SubCrawler may (in some cases) limit itself to augmenting the metadata in this RDFContainer without delivering any additional DataObjects. (obligatory)
Throws:
SubCrawlerException - if any of the obligatory parameters is null or if any error occurred during the crawling process
See Also:
SubCrawler.subCrawl(URI, InputStream, SubCrawlerHandler, DataSource, AccessData, Charset, String, RDFContainer)
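The change-detection behaviour described for the accessData parameter can be illustrated with a small, self-contained sketch. It does not use the Aperture API: ChangeTrackingSketch, FingerprintStore, Status, classify, finishCrawl and report are hypothetical names invented here, and the string fingerprints stand in for whatever information a real AccessData implementation records. The sketch only shows the idea of remembering objects between crawls and of reporting everything as new when no store is supplied.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

/** Hypothetical illustration of AccessData-style change detection; not the Aperture API. */
final class ChangeTrackingSketch {

    enum Status { NEW, MODIFIED, UNMODIFIED }

    /** Hypothetical store that remembers one fingerprint per object id between crawls. */
    static final class FingerprintStore {
        private final Map<String, String> known = new HashMap<>();
        private final Set<String> touched = new HashSet<>();

        /** Records the object and says how it compares with the previous crawl. */
        Status classify(String id, String fingerprint) {
            touched.add(id);
            String previous = known.put(id, fingerprint);
            if (previous == null) {
                return Status.NEW;
            }
            return previous.equals(fingerprint) ? Status.UNMODIFIED : Status.MODIFIED;
        }

        /** Returns ids remembered from earlier crawls but not seen in this one, and forgets them. */
        Set<String> finishCrawl() {
            Set<String> deleted = new HashSet<>(known.keySet());
            deleted.removeAll(touched);
            known.keySet().removeAll(deleted);
            touched.clear();
            return deleted;
        }
    }

    /** With a null store every object is reported as new, mirroring the parameter description. */
    static Status report(FingerprintStore store, String id, String fingerprint) {
        return store == null ? Status.NEW : store.classify(id, fingerprint);
    }

    public static void main(String[] args) {
        FingerprintStore store = new FingerprintStore();
        System.out.println(report(store, "card-1", "v1")); // first crawl
        store.finishCrawl();
        System.out.println(report(store, "card-1", "v2")); // second crawl, content changed
        System.out.println(report(null, "card-1", "v2"));  // no store supplied
    }
}
```

Keeping the "touched" set separate from the remembered fingerprints is one simple way to derive deletions at the end of a crawl; a real implementation may track this differently.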
public void stopSubCrawler()
SubCrawler
SubCrawler.stopSubCrawler()
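Stopping a crawl "as fast as possible" is commonly done with a flag that the crawling loop checks between entries. The sketch below shows that general pattern only; it is an assumption about how such a stop could work, not VcardSubCrawler's actual implementation, and StoppableLoopSketch, stop and process are hypothetical names.

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.Consumer;

/** Hypothetical illustration of a cooperative stop flag; not the Aperture implementation. */
final class StoppableLoopSketch {

    // volatile so that a stop request made on another thread is seen by the crawling thread
    private volatile boolean stopRequested = false;

    /** Asks the running loop to finish before starting the next entry. */
    void stop() {
        stopRequested = true;
    }

    /** Hands each entry to the handler until done or stopped; returns how many were handled. */
    int process(List<String> entries, Consumer<String> handler) {
        int handled = 0;
        for (String entry : entries) {
            if (stopRequested) {
                break; // leave the remaining entries unprocessed
            }
            handler.accept(entry);
            handled++;
        }
        return handled;
    }

    public static void main(String[] args) {
        StoppableLoopSketch crawler = new StoppableLoopSketch();
        List<String> cards = Arrays.asList("a", "b", "c");
        // The handler requests a stop while handling the second entry.
        int handled = crawler.process(cards, card -> {
            if (card.equals("b")) {
                crawler.stop();
            }
        });
        System.out.println("handled " + handled + " of " + cards.size());
    }
}
```

The flag is checked once per entry, so the entry in progress always completes; checking more often would stop sooner at the cost of more complicated cleanup.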
public String getUriPrefix()
AbstractSubCrawler
See the SubCrawler class for more details.
getUriPrefix in class AbstractSubCrawler