Name: HTML Table Reader
Author: Kevin Neal
Version: 0.5
Created: Apr 2008
Modified: July 2008
Purpose: Extracts data from a table on a web page. One record is added to the data stream for each row in a table, except the first row which is used to identify property names. All records will have a property 'HTML Table Index' to denote which table they came from.
Requirements: Pipeline Pilot 6.1 or later
Reporting Collection
O/S: any
Limitations: Does not handle a table that has a title. It views the title as the property name for the first column.
Keyword: generator read html table reader pilotscript regular expression regex unmerge
Contents: HTML Table Reader.xml
HTML Table Reader Example.xml
Installation:
1. Unzip the archive.
2. Open the HTML Table Viewer Example protocol with Pipeline Pilot.
3. Run the protocol.