|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.semanticdesktop.aperture.extractor.mime.MimeExtractor
MimeSubCrawler
instead.
public class MimeExtractor
An Extractor implementation for message/rfc822-style messages.
The main purpose of this class is to process mails that are stored as .eml-files, as mails originating from a mail server are typically already completely processed (i.e., a structured DataObject is returned rather than a FileDataObject whose stream has to be processed).
Furthermore, this class can also handle web archives in MHTML style (.mht files), as created by Internet Explorer and Mozilla/Firefox (using the Mozilla Archive Format plugin), which have a similar MIME structure.
Only typical body parts are processed during full-text extraction, (binary) attachments are not handled.
Constructor Summary | |
---|---|
MimeExtractor()
Deprecated. |
Method Summary | |
---|---|
void |
extract(URI id,
InputStream stream,
Charset charset,
String mimeType,
RDFContainer result)
Deprecated. Extracts full-text and metadata from the specified binary stream and stores the extracted information as RDF statements in the specified RDFContainer. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public MimeExtractor()
Method Detail |
---|
public void extract(URI id, InputStream stream, Charset charset, String mimeType, RDFContainer result) throws ExtractorException
Extractor
The specified InputStream is expected to already use some kind of buffering so that the Extractors are not required to internally buffer bytes to improve performance.
extract
in interface Extractor
id
- the URI identifying the object (e.g. a file or web page) from which the stream was obtained.
The generated statements should describe this URI.stream
- the InputStream delivering the raw bytes.charset
- the charset in which the inputstream is encoded (optional).mimeType
- the MIME type of the passed stream (optional).result
- the container in which this Extractor can put its created RDF statements.
ExtractorException
- in case of any error during the extraction process.
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |