Package org.apache.nutch.parse.headings
Parse filter to extract headings (h1, h2, etc.) from DOM parse tree.
Class Summary
| Class | Description |
|---|---|
| HeadingsParseFilter | HtmlParseFilter to retrieve h1 and h2 values from the DOM. |
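
To illustrate the general idea of pulling heading values out of a DOM tree, here is a minimal sketch that uses only the JDK's `org.w3c.dom` and `javax.xml.parsers` APIs. It is not the Nutch implementation: the class name `HeadingsSketch` and the methods `extractHeadings` and `parse` are hypothetical, and the sketch assumes well-formed XHTML input, whereas a real parse filter would receive an already-built DOM from the HTML parser.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical illustration only; not part of org.apache.nutch.parse.headings.
public class HeadingsSketch {

    /** Collects the trimmed text of every element whose tag name is in {@code tags}. */
    public static Map<String, List<String>> extractHeadings(Node root, String... tags) {
        Map<String, List<String>> found = new LinkedHashMap<>();
        for (String tag : tags) {
            found.put(tag.toLowerCase(Locale.ROOT), new ArrayList<>());
        }
        collect(root, found);
        return found;
    }

    // Depth-first walk: record matching elements, then descend into children.
    private static void collect(Node node, Map<String, List<String>> found) {
        if (node.getNodeType() == Node.ELEMENT_NODE) {
            List<String> values = found.get(node.getNodeName().toLowerCase(Locale.ROOT));
            if (values != null) {
                String text = node.getTextContent().trim();
                if (!text.isEmpty()) {
                    values.add(text);
                }
            }
        }
        NodeList children = node.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            collect(children.item(i), found);
        }
    }

    /** Assumption: the input is well-formed XHTML, so the JDK XML parser can read it. */
    public static Document parse(String xhtml) throws Exception {
        DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
        return builder.parse(new InputSource(new StringReader(xhtml)));
    }

    public static void main(String[] args) throws Exception {
        String page = "<html><body><h1>Title</h1><p>text</p>"
                + "<h2>First</h2><h2>Second</h2></body></html>";
        // prints {h1=[Title], h2=[First, Second]}
        System.out.println(extractHeadings(parse(page), "h1", "h2"));
    }
}
```

Passing the tag names as arguments mirrors the package description's "h1, h2, etc."; which headings the actual filter extracts, and how that is configured, is not stated on this page.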