|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.htmlparser.visitors.NodeVisitor org.semanticdesktop.aperture.helper.html.HtmlParserUtil.ContentExtractor
public static class HtmlParserUtil.ContentExtractor
A NodeVisitor specialization that is able to start all over with interpreting parsing events.
Constructor Summary | |
---|---|
HtmlParserUtil.ContentExtractor()
|
Method Summary | |
---|---|
String |
getAuthor()
Return the extracted author, if any. |
String |
getDescription()
Return the extracted description, if any. |
Iterator |
getKeywords()
Return the extracted meta keywords, if any. |
String |
getText()
Return the extracted full-text, if any. |
String |
getTitle()
Return the extracted title, if any. |
void |
reset()
Remove all extracted information so that the ContentExtractor can be used anew. |
void |
visitEndTag(org.htmlparser.Tag tag)
|
void |
visitStringNode(org.htmlparser.Text node)
|
void |
visitTag(org.htmlparser.Tag tag)
|
Methods inherited from class org.htmlparser.visitors.NodeVisitor |
---|
beginParsing, finishedParsing, shouldRecurseChildren, shouldRecurseSelf, visitRemarkNode |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public HtmlParserUtil.ContentExtractor()
Method Detail |
---|
public void reset()
public String getText()
public Iterator getKeywords()
public String getTitle()
public String getAuthor()
public String getDescription()
public void visitStringNode(org.htmlparser.Text node)
visitStringNode
in class org.htmlparser.visitors.NodeVisitor
public void visitTag(org.htmlparser.Tag tag)
visitTag
in class org.htmlparser.visitors.NodeVisitor
public void visitEndTag(org.htmlparser.Tag tag)
visitEndTag
in class org.htmlparser.visitors.NodeVisitor
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |