Package org.apache.nutch.parse.zip
Class ZipParser
java.lang.Object
    org.apache.nutch.parse.zip.ZipParser
All Implemented Interfaces:
Configurable, Parser, Pluggable
public class ZipParser extends Object implements Parser
ZipParser class, based on the MSPowerPointParser class by Stephan Strittmatter. Nutch parse plugin for zip files (content type: application/zip).
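To make "parse plugin for zip files" concrete, the following is a minimal sketch of walking an in-memory archive with the standard java.util.zip classes and collecting the text of each entry. It is not Nutch's actual implementation: the class ZipTextSketch and the methods extractText and sampleZip are hypothetical names, and treating every entry as UTF-8 text is an assumption made for brevity.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical helper, not part of Nutch: walks a zip archive held in memory.
class ZipTextSketch {

  // Returns entry name -> entry bytes decoded as UTF-8 (an assumption), in archive order.
  static Map<String, String> extractText(byte[] zipBytes) {
    Map<String, String> textByEntry = new LinkedHashMap<>();
    try (ZipInputStream zin = new ZipInputStream(new ByteArrayInputStream(zipBytes))) {
      ZipEntry entry;
      while ((entry = zin.getNextEntry()) != null) {
        if (entry.isDirectory()) {
          continue; // directory entries carry no content
        }
        // readAllBytes() stops at the end of the current entry, not the archive
        byte[] raw = zin.readAllBytes();
        textByEntry.put(entry.getName(), new String(raw, StandardCharsets.UTF_8));
      }
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
    return textByEntry;
  }

  // Builds a tiny two-entry archive in memory so the sketch runs without any files.
  static byte[] sampleZip() {
    ByteArrayOutputStream buffer = new ByteArrayOutputStream();
    try (ZipOutputStream zout = new ZipOutputStream(buffer)) {
      zout.putNextEntry(new ZipEntry("a.txt"));
      zout.write("hello".getBytes(StandardCharsets.UTF_8));
      zout.closeEntry();
      zout.putNextEntry(new ZipEntry("docs/b.txt"));
      zout.write("world".getBytes(StandardCharsets.UTF_8));
      zout.closeEntry();
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
    return buffer.toByteArray();
  }

  public static void main(String[] args) {
    extractText(sampleZip())
        .forEach((name, text) -> System.out.println(name + " -> " + text));
  }
}
```

A real plugin would receive the archive bytes from the fetched Content and would likely dispatch each entry to a type-specific parser rather than decode it as text.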
Field Summary
Fields inherited from interface org.apache.nutch.parse.Parser
X_POINT_ID
Constructor Summary
Constructor    Description
ZipParser()    Creates a new instance of ZipParser
Method Summary
Modifier and Type    Method                         Description
Configuration        getConf()
ParseResult          getParse(Content content)      This method parses the given content and returns a map of <key, parse> pairs.
static void          main(String[] args)
void                 setConf(Configuration conf)
Method Detail
getParse
public ParseResult getParse(Content content)
Description copied from interface: Parser
This method parses the given content and returns a map of <key, parse> pairs. Parse instances will be persisted under the given key.
Note: Meta-redirects should be followed only when they come from the original URL. That is:
Assume the fetcher is in parsing mode and is currently processing foo.bar.com/redirect.html. If this URL contains a meta redirect to another URL, the fetcher should follow the redirect only if the map contains an entry of the form <"foo.bar.com/redirect.html", Parse with a ParseStatus indicating the redirect>.
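The redirect rule in the note above can be sketched as a lookup keyed by the URL being processed. This is an illustration only: RedirectRuleSketch, Outcome, and shouldFollowRedirect are hypothetical stand-ins, and a plain Map replaces the real ParseResult, Parse, and ParseStatus types.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-ins; the real Nutch types are ParseResult, Parse and ParseStatus.
class RedirectRuleSketch {

  // Simplified parse outcome: records only whether a meta-redirect was found.
  static final class Outcome {
    final boolean redirect;
    final String target; // null when there is no redirect

    Outcome(boolean redirect, String target) {
      this.redirect = redirect;
      this.target = target;
    }
  }

  // Follow a redirect only when the entry keyed by the URL being processed
  // reports one; redirects found under any other key are ignored.
  static boolean shouldFollowRedirect(String fetchedUrl, Map<String, Outcome> results) {
    Outcome own = results.get(fetchedUrl);
    return own != null && own.redirect;
  }

  public static void main(String[] args) {
    Map<String, Outcome> results = new HashMap<>();
    results.put("foo.bar.com/redirect.html", new Outcome(true, "foo.bar.com/target.html"));
    results.put("foo.bar.com/other.html", new Outcome(false, null));

    System.out.println(shouldFollowRedirect("foo.bar.com/redirect.html", results));
    System.out.println(shouldFollowRedirect("foo.bar.com/other.html", results));
  }
}
```

For a zip archive this matters because one fetched URL can yield several <key, parse> pairs; only the pair keyed by the original URL decides whether the fetcher follows a redirect.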
setConf
public void setConf(Configuration conf)
Specified by:
setConf in interface Configurable
getConf
public Configuration getConf()
Specified by:
getConf in interface Configurable
main
public static void main(String[] args) throws IOException
Throws:
IOException