org.semanticdesktop.aperture.extractor.microsoft.util
Interface PoiUtil.TextExtractor

Enclosing class:
PoiUtil

public static interface PoiUtil.TextExtractor

A TextExtractor is a delegate that extracts the full-text from an MS Office document using a POIFSFileSystem. Implementations typically support specific MS Office document types, such as Word, Excel and PowerPoint.


Method Summary
 String getText(org.apache.poi.poifs.filesystem.DirectoryNode dir, org.apache.poi.poifs.filesystem.POIFSFileSystem fs)
          Extract the full-text from an MS Office document.
 

Method Detail

getText

String getText(org.apache.poi.poifs.filesystem.DirectoryNode dir,
               org.apache.poi.poifs.filesystem.POIFSFileSystem fs)
               throws IOException
Extract the full-text from an MS Office document.

Parameters:
dirNode - The directory node providing structural access to the MS Office document.
fs - The POIFSFileSystem containing the MS Office document
Returns:
A String containing the full-text of the document.
Throws:
IOException - whenever access to the directory node caused an IOException.


Copyright © 2010 Aperture Development Team. All Rights Reserved.