Package org.apache.nutch.parse
Class ParseImpl
- java.lang.Object
-
- org.apache.nutch.parse.ParseImpl
-
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description ParseDatagetData()Other data extracted from the page.StringgetText()The textual content of the page.booleanisCanonical()Indicates if the parse is coming from a url or a sub-urlstatic ParseImplread(DataInput in)voidreadFields(DataInput in)voidwrite(DataOutput out)
-
-
-
Method Detail
-
getText
public String getText()
Description copied from interface:ParseThe textual content of the page. This is indexed, searched, and used when generating snippets.
-
getData
public ParseData getData()
Description copied from interface:ParseOther data extracted from the page.
-
isCanonical
public boolean isCanonical()
Description copied from interface:ParseIndicates if the parse is coming from a url or a sub-url- Specified by:
isCanonicalin interfaceParse- Returns:
- true if canonical, false otherwise
-
write
public final void write(DataOutput out) throws IOException
- Specified by:
writein interfaceWritable- Throws:
IOException
-
readFields
public void readFields(DataInput in) throws IOException
- Specified by:
readFieldsin interfaceWritable- Throws:
IOException
-
read
public static ParseImpl read(DataInput in) throws IOException
- Throws:
IOException
-
-