HTML Table Reader

Name: HTML Table Reader

Author: Kevin Neal

Version: 0.5

Created: Apr 2008

Modified: July 2008

Purpose: Extracts data from the tables on a web page. One record is added to the data stream for each row of a table, except the first row, which supplies the property names. Every record carries a property 'HTML Table Index' that identifies the table it came from.
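
The row-to-record behaviour described under Purpose can be sketched outside Pipeline Pilot as follows. This is only an illustration in Python using the standard-library HTML parser, not the component's own PilotScript or regular expressions. The names TableRowCollector and read_html_tables are hypothetical, the 1-based table index is an assumption, and nested tables are not handled.

```python
from html.parser import HTMLParser


class TableRowCollector(HTMLParser):
    """Collects the cell text of every row, grouped by table (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.tables = []   # one list of rows per <table> seen
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def _flush_cell(self):
        # Store the finished cell text on the current row, if a cell is open.
        if self._cell is not None and self._row is not None:
            self._row.append("".join(self._cell).strip())
        self._cell = None

    def _flush_row(self):
        # Store the finished row on the most recent table, if a row is open.
        self._flush_cell()
        if self._row is not None and self.tables:
            self.tables[-1].append(self._row)
        self._row = None

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._flush_row()
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._flush_row()   # tolerate a missing </tr>
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._flush_cell()  # tolerate a missing </td> or </th>
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._flush_cell()
        elif tag in ("tr", "table"):
            self._flush_row()


def read_html_tables(html):
    """Return one dict per data row; the first row of each table names the properties."""
    parser = TableRowCollector()
    parser.feed(html)
    parser.close()
    records = []
    # Assumption: tables are numbered from 1 in document order.
    for index, rows in enumerate(parser.tables, start=1):
        if not rows:
            continue
        names = rows[0]
        for row in rows[1:]:
            record = dict(zip(names, row))
            record["HTML Table Index"] = index
            records.append(record)
    return records
```

Calling read_html_tables on a page with two tables would yield the data rows of both, each tagged with the index of its table. All values stay as strings in this sketch.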

Requirements: Pipeline Pilot 6.1 or later
Reporting Collection

O/S: any

Limitations: Does not handle a table that has a title. The title is treated as the property name for the first column.
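
One possible way to work around this limitation is to drop a leading title row before the property names are read. The sketch below is in Python and is not something the component does. The function name drop_title_row is hypothetical, and the heuristic (a first row with a single cell followed by a row with several cells) is an assumption that will not suit every table.

```python
def drop_title_row(rows):
    """Return rows without a leading single-cell title row, if one is detected.

    rows is a list of rows, each row a list of cell strings.
    """
    # Heuristic (an assumption): a title spans the table as one cell,
    # while the real header row beneath it has more than one cell.
    if len(rows) >= 2 and len(rows[0]) == 1 and len(rows[1]) > 1:
        return rows[1:]
    return rows
```

A genuine single-column table is left unchanged, because its second row also has only one cell.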

Keyword: generator read html table reader pilotscript regular expression regex unmerge

Contents: HTML Table Reader.xml
HTML Table Reader Example.xml

Installation:

1. Unzip the archive.
2. Open the HTML Table Reader Example protocol with Pipeline Pilot.
3. Run the protocol.