Class PrefixURLFilter
- java.lang.Object
-
- org.apache.nutch.urlfilter.prefix.PrefixURLFilter
-
- All Implemented Interfaces:
Configurable,URLFilter,Pluggable
public class PrefixURLFilter extends Object implements URLFilter
Filters URLs based on a file of URL prefixes. The file is named by (1) property "urlfilter.prefix.file" in ./conf/nutch-default.xml, or (2) the attribute "file" in plugin.xml of this plugin.The format of this file is one URL prefix per line.
-
-
Field Summary
-
Fields inherited from interface org.apache.nutch.net.URLFilter
X_POINT_ID
-
-
Constructor Summary
Constructors Constructor Description PrefixURLFilter()PrefixURLFilter(String stringRules)
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description Stringfilter(String url)Interface for a filter that transforms a URL: it can pass the original URL through or "delete" the URL by returning nullConfigurationgetConf()static voidmain(String[] args)voidsetConf(Configuration conf)
-
-
-
Constructor Detail
-
PrefixURLFilter
public PrefixURLFilter() throws IOException- Throws:
IOException
-
PrefixURLFilter
public PrefixURLFilter(String stringRules) throws IOException
- Throws:
IOException
-
-
Method Detail
-
filter
public String filter(String url)
Description copied from interface:URLFilterInterface for a filter that transforms a URL: it can pass the original URL through or "delete" the URL by returning null
-
main
public static void main(String[] args) throws IOException
- Throws:
IOException
-
setConf
public void setConf(Configuration conf)
- Specified by:
setConfin interfaceConfigurable
-
getConf
public Configuration getConf()
- Specified by:
getConfin interfaceConfigurable
-
-