org.semanticdesktop.aperture.extractor.microsoft.util
Interface PoiUtil.TextExtractor
- Enclosing class:
- PoiUtil
public static interface PoiUtil.TextExtractor
A TextExtractor is a delegate that extracts the full-text from an MS Office document using a
POIFSFileSystem. Implementations typically support specific MS Office document types, such as Word,
Excel and PowerPoint.
Method Summary |
String |
getText(org.apache.poi.poifs.filesystem.DirectoryNode dir,
org.apache.poi.poifs.filesystem.POIFSFileSystem fs)
Extract the full-text from an MS Office document. |
getText
String getText(org.apache.poi.poifs.filesystem.DirectoryNode dir,
org.apache.poi.poifs.filesystem.POIFSFileSystem fs)
throws IOException
- Extract the full-text from an MS Office document.
- Parameters:
dirNode
- The directory node providing structural access to the MS Office document.fs
- The POIFSFileSystem containing the MS Office document
- Returns:
- A String containing the full-text of the document.
- Throws:
IOException
- whenever access to the directory node caused an IOException.
Copyright © 2010 Aperture Development Team. All Rights Reserved.