FileDocCategorySizeDatePackage
HTMLParser.javaAPI DocApache Lucene 2.2.02026Sat Jun 16 22:20:58 BST 2007org.apache.lucene.benchmark.byTask.feeds

HTMLParser

public interface HTMLParser
HTML Parsing Interfacew for test purposes

Fields Summary
Constructors Summary
Methods Summary
public DocDataparse(java.lang.String name, java.util.Date date, java.io.Reader reader, java.text.DateFormat dateFormat)
Parse the input Reader and return DocData. A provided name or date is used for the result, otherwise an attempt is made to set them from the parsed data.

param
dateFormat date formatter to use for extracting the date.
param
name name of the result doc data. If null, attempt to set by parsed data.
param
date date of the result doc data. If null, attempt to set by parsed data.
param
reader of html text to parse.
return
Parsed doc data.
throws
IOException
throws
InterruptedException

public DocDataparse(java.lang.String name, java.util.Date date, java.lang.StringBuffer inputText, java.text.DateFormat dateFormat)
Parse the inputText and return DocData.

param
inputText the html text to parse.
see
#parse(String, Date, Reader, DateFormat)