|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.apache.poi.POITextExtractor
public abstract class POITextExtractor
Common Parent for Text Extractors of POI Documents. You will typically find the implementation of a given format's text extractor under org.apache.poi.[format].extractor .
ExcelExtractor,
PowerPointExtractor,
VisioTextExtractor,
WordExtractor| Field Summary | |
|---|---|
protected POIDocument |
document
The POIDocument that's open |
| Constructor Summary | |
|---|---|
|
POITextExtractor(POIDocument document)
Creates a new text extractor for the given document |
protected |
POITextExtractor(POITextExtractor otherExtractor)
Creates a new text extractor, using the same document as another text extractor. |
| Method Summary | |
|---|---|
abstract POITextExtractor |
getMetadataTextExtractor()
Returns another text extractor, which is able to output the textual content of the document metadata / properties, such as author and title. |
abstract java.lang.String |
getText()
Retrieves all the text from the document. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
protected POIDocument document
| Constructor Detail |
|---|
public POITextExtractor(POIDocument document)
protected POITextExtractor(POITextExtractor otherExtractor)
| Method Detail |
|---|
public abstract java.lang.String getText()
public abstract POITextExtractor getMetadataTextExtractor()
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||