| 
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.semanticdesktop.aperture.subcrawler.base.AbstractSubCrawler
org.semanticdesktop.aperture.subcrawler.mime.MimeSubCrawler
public class MimeSubCrawler
A SubCrawer implementation for message/rfc822-style messages.
It is basically a thin wrapper around the DataObjectFactory.
| Constructor Summary | |
|---|---|
MimeSubCrawler()
 | 
|
| Method Summary | |
|---|---|
 MessageDataObject | 
createDataObject(URI dataObjectId,
                 DataSource dataSource,
                 RDFContainer metadata,
                 javax.mail.internet.MimeMessage msg)
Creates a message data object for the given parameters  | 
 DataObject | 
getDataObject(URI parentUri,
              String path,
              InputStream stream,
              DataSource dataSource,
              Charset charset,
              String mimeType,
              RDFContainerFactory factory)
Get a DataObject from the specified stream with the given path.  | 
 javax.mail.internet.MimeMessage | 
getLastMimeMessage()
Returns the last MimeMessage that was created internally by the subCrawl method.  | 
 InputStream | 
getPartStream(javax.mail.Part part)
Returns an input stream with the part content.  | 
 String | 
getUriPrefix()
Returns the prefix used when generating uris.  | 
 void | 
stopSubCrawler()
Stops a running crawl as fast as possible.  | 
 void | 
subCrawl(URI id,
         InputStream stream,
         SubCrawlerHandler handler,
         DataSource dataSource,
         AccessData accessData,
         Charset charset,
         String mimeType,
         RDFContainer parentMetadata)
Starts crawling the given stream and to report the encountered DataObjects to the given SubCrawlerHandler.  | 
| Methods inherited from class org.semanticdesktop.aperture.subcrawler.base.AbstractSubCrawler | 
|---|
createChildUri | 
| Methods inherited from class java.lang.Object | 
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait | 
| Constructor Detail | 
|---|
public MimeSubCrawler()
| Method Detail | 
|---|
public void subCrawl(URI id,
                     InputStream stream,
                     SubCrawlerHandler handler,
                     DataSource dataSource,
                     AccessData accessData,
                     Charset charset,
                     String mimeType,
                     RDFContainer parentMetadata)
              throws SubCrawlerException
SubCrawler
subCrawl in interface SubCrawlerid - the URI identifying the object (e.g. a file or web page) from which the stream was obtained.
            This URI is treated as the URI of the parent object, all objects encountered in the stream
            are considered to be contained within the parent object. (optional, the implementation may
            use this uri or the one returned from the RDFContainer.getDescribedUri() method of
            the parentMetadata)stream - the stream to be crawled. (obligatory)handler - The crawler handler that is to receive the notifications from the SubCrawler
            (obligatory)dataSource - the data source that will be returned by the DataObject.getDataSource()
            method of the returned data objects. Some implementations may require that this reference is
            not null and that it contains some particular informationaccessData - the AccessData used to determine if the encountered objects are to be returned as
            new, modified, unmodified or deleted. Information about new or modified objects is stored
            within for use in future crawls. This parameter may be null if this functionality is not
            desired, in which case all DataObjects will be reported as new. (optional)charset - the charset in which the input stream is encoded (optional).mimeType - the MIME type of the passed stream (optional).parentMetadata - The 'parent' RDFContainer, that will contain the metadata about the top-level
            entity in the stream. A SubCrawler may (in some cases) limit itself to augmenting the
            metadata in this RDFContainer without delivering any additional DataObjects. (obligatory)
SubCrawlerException - if any of the obligatory parameters is null or if any error during the
             crawling process occuredpublic void stopSubCrawler()
SubCrawler
stopSubCrawler in interface SubCrawlerpublic javax.mail.internet.MimeMessage getLastMimeMessage()
public DataObject getDataObject(URI parentUri,
                                String path,
                                InputStream stream,
                                DataSource dataSource,
                                Charset charset,
                                String mimeType,
                                RDFContainerFactory factory)
                         throws SubCrawlerException,
                                PathNotFoundException
SubCrawler
getDataObject in interface SubCrawlergetDataObject in class AbstractSubCrawlerparentUri - the URI of the parent object where the path will be looked forpath - the path of the requested resourcestream - the stream that contains the resourcedataSource - data source that will be returned by the DataObject.getDataSource() method of
            the returned data object. Some implementations may require that this reference is not null
            and that it contains some particular informationcharset - the charset in which the input stream is encoded (optional).mimeType - the MIME type of the passed stream (optional).factory - An RDFContainerFactory that delivers the RDFContainer to which the metadata of the
            DataObject should be added. The provided RDFContainer can later be retrieved as the
            DataObject's metadata container.
SubCrawlerException - if any I/O error occurs
PathNotFoundException - if the requested path is not foundpublic String getUriPrefix()
AbstractSubCrawlerSubCrawler class for
 more details.
getUriPrefix in class AbstractSubCrawler
public InputStream getPartStream(javax.mail.Part part)
                          throws javax.mail.MessagingException,
                                 IOException
DataObjectFactory.PartStreamFactoryPart.getInputStream() method, designed to allow for customization of the returned input
 stream.
getPartStream in interface DataObjectFactory.PartStreamFactoryjavax.mail.MessagingException
IOException
public MessageDataObject createDataObject(URI dataObjectId,
                                          DataSource dataSource,
                                          RDFContainer metadata,
                                          javax.mail.internet.MimeMessage msg)
                                   throws javax.mail.MessagingException
DataObjectFactory.PartStreamFactory
createDataObject in interface DataObjectFactory.PartStreamFactoryjavax.mail.MessagingException
  | 
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||