Class MetaParseState
java.lang.Object
  org.apache.manifoldcf.connectorcommon.fuzzyml.CharacterReceiver
    org.apache.manifoldcf.connectorcommon.fuzzyml.SingleCharacterReceiver
      org.apache.manifoldcf.connectorcommon.fuzzyml.TagParseState
        org.apache.manifoldcf.connectorcommon.fuzzyml.HTMLParseState
          org.apache.manifoldcf.crawler.connectors.webcrawler.ScriptParseState
            org.apache.manifoldcf.crawler.connectors.webcrawler.MetaParseState
Direct Known Subclasses:
LinkParseState
public class MetaParseState extends ScriptParseState
This class recognizes and interprets all meta tags.
Field Summary
Fields
Modifier and Type            Field      Description
protected IMetaTagHandler    handler
Fields inherited from class org.apache.manifoldcf.crawler.connectors.webcrawler.ScriptParseState
scriptParseState, SCRIPTPARSESTATE_INSCRIPT, SCRIPTPARSESTATE_NORMAL
Fields inherited from class org.apache.manifoldcf.connectorcommon.fuzzyml.TagParseState
accumBuffer, ampBuffer, bTagDepth, currentAttrList, currentAttrName, currentAttrNameBuffer, currentState, currentTagName, currentTagNameBuffer, currentValueBuffer, inAmpersand, mapLookup, TAGPARSESTATE_IN_ATTR_LOOKING_FOR_VALUE, TAGPARSESTATE_IN_ATTR_NAME, TAGPARSESTATE_IN_ATTR_VALUE, TAGPARSESTATE_IN_BANG_TOKEN, TAGPARSESTATE_IN_BRACKET_TOKEN, TAGPARSESTATE_IN_CDATA_BODY, TAGPARSESTATE_IN_COMMENT, TAGPARSESTATE_IN_DOUBLE_QUOTES_ATTR_VALUE, TAGPARSESTATE_IN_END_TAG_NAME, TAGPARSESTATE_IN_QTAG_ATTR_LOOKING_FOR_VALUE, TAGPARSESTATE_IN_QTAG_ATTR_NAME, TAGPARSESTATE_IN_QTAG_ATTR_VALUE, TAGPARSESTATE_IN_QTAG_DOUBLE_QUOTES_ATTR_VALUE, TAGPARSESTATE_IN_QTAG_NAME, TAGPARSESTATE_IN_QTAG_SAW_QUESTION, TAGPARSESTATE_IN_QTAG_SINGLE_QUOTES_ATTR_VALUE, TAGPARSESTATE_IN_QTAG_UNQUOTED_ATTR_VALUE, TAGPARSESTATE_IN_SINGLE_QUOTES_ATTR_VALUE, TAGPARSESTATE_IN_TAG_NAME, TAGPARSESTATE_IN_TAG_SAW_SLASH, TAGPARSESTATE_IN_UNQUOTED_ATTR_VALUE, TAGPARSESTATE_IN_UNQUOTED_ATTR_VALUE_SAW_SLASH, TAGPARSESTATE_NEED_FINAL_BRACKET, TAGPARSESTATE_NORMAL, TAGPARSESTATE_SAWCOMMENTDASH, TAGPARSESTATE_SAWDASH, TAGPARSESTATE_SAWEXCLAMATION, TAGPARSESTATE_SAWLEFTANGLE, TAGPARSESTATE_SAWRIGHTBRACKET, TAGPARSESTATE_SAWSECONDCOMMENTDASH, TAGPARSESTATE_SAWSECONDRIGHTBRACKET
Constructor Summary
Constructors
Constructor                                Description
MetaParseState(IMetaTagHandler handler)
Method Summary
Modifier and Type    Method                                                                                                      Description
protected boolean    noteNonscriptTag(java.lang.String tagName, java.util.Map<java.lang.String,java.lang.String> attributes)
Methods inherited from class org.apache.manifoldcf.crawler.connectors.webcrawler.ScriptParseState
acceptNewTag, noteNonscriptEndTag, noteTag, noteTagEnd
Methods inherited from class org.apache.manifoldcf.connectorcommon.fuzzyml.HTMLParseState
noteBTag, noteBTagToken, noteEndBTag, noteEndEscaped, noteEndTag, noteEscaped, noteEscapedCharacter, noteQTag, noteTag
Methods inherited from class org.apache.manifoldcf.connectorcommon.fuzzyml.TagParseState
attributeDecode, dealWithCharacter, dumpValues, isPunctuation, isWhitespace, mapChunk, newBuffer, noteNormalCharacter, outputAmpBuffer
Methods inherited from class org.apache.manifoldcf.connectorcommon.fuzzyml.SingleCharacterReceiver
dealWithCharacters, dealWithRemainder
Field Detail
handler
protected IMetaTagHandler handler
Constructor Detail
MetaParseState
public MetaParseState(IMetaTagHandler handler)
Method Detail
noteNonscriptTag
protected boolean noteNonscriptTag(java.lang.String tagName, java.util.Map<java.lang.String,java.lang.String> attributes) throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
Overrides:
noteNonscriptTag in class ScriptParseState
Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException
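The signatures above suggest a handler pattern: the parse state holds an IMetaTagHandler and overrides noteNonscriptTag to react to tags found outside script blocks. The following is a minimal, self-contained sketch of that pattern only. It does not use the ManifoldCF classes: MetaHandler, its noteMetaTag method, and SketchParseState are hypothetical stand-ins, and both the forwarding logic and the meaning of the boolean return value are assumptions, since this page does not document them.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class MetaTagSketch {

  /** Hypothetical stand-in for IMetaTagHandler; the real interface is not shown on this page. */
  interface MetaHandler {
    void noteMetaTag(Map<String, String> attributes);
  }

  /** Hypothetical stand-in for MetaParseState, reduced to the one overridden method. */
  static class SketchParseState {
    protected final MetaHandler handler;

    SketchParseState(MetaHandler handler) {
      this.handler = handler;
    }

    // Assumption: only tags named "meta" (case-insensitive) are forwarded to the handler.
    // The return value's meaning is not documented here, so the sketch always returns false.
    protected boolean noteNonscriptTag(String tagName, Map<String, String> attributes) {
      if ("meta".equalsIgnoreCase(tagName)) {
        handler.noteMetaTag(attributes);
      }
      return false;
    }
  }

  /** Feeds three tags to the sketch and returns the "name" attribute of each meta tag seen. */
  public static List<String> collectMetaNames() {
    List<String> names = new ArrayList<>();
    SketchParseState state = new SketchParseState(attrs -> names.add(attrs.get("name")));

    Map<String, String> robots = new HashMap<>();
    robots.put("name", "robots");
    robots.put("content", "noindex");
    state.noteNonscriptTag("meta", robots);

    Map<String, String> link = new HashMap<>();
    link.put("href", "http://example.com/");
    state.noteNonscriptTag("a", link); // not a meta tag, so the handler is not called

    Map<String, String> description = new HashMap<>();
    description.put("name", "description");
    state.noteNonscriptTag("META", description);

    return names;
  }

  public static void main(String[] args) {
    System.out.println(collectMetaNames()); // prints [robots, description]
  }
}
```

In real use the tag name and attribute map would come from the inherited character-level parser rather than from direct calls as in this sketch.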