Removing Zero-Variance Properties from the Data Stream

Name: Removing Zero-Variance Properties from the Data Stream

Author: Andrei Caracoti

Version: 2.0

Created: 10/2003

Modified: 7/2007 - Components and help text updated. Input changed from an SD file to a delimited text file.

Purpose: This protocol demonstrates how the “R Remove Zero-Variance Properties” component removes properties whose values do not vary from the data stream. The protocol reads a data file and adds two properties with no variance in value, property_a and property_b, to the data stream. The first data record is then filtered out of the data stream, and a small variance is added to property_b. The first record is merged back into the data stream, and all records are passed through the “R Remove Zero-Variance Properties” component. The property that still has no variance, property_a, is removed from the data stream, while property_b, which now has a small variance, remains.
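
The data flow described above can be sketched outside Pipeline Pilot in a few lines of Python. This is only an illustration of the idea, not the component's implementation: the sample records, the remove_zero_variance helper, the 0.001 offset, and the choice to apply the offset to the records that remain after the first one is split off are all assumptions made for the example. The test for "no variance" used here is simply that every record holds the same value; the R component may use a different criterion.

```python
def remove_zero_variance(records):
    """Drop every property that holds the same value in all records.

    Hypothetical helper: "zero variance" is taken to mean that a
    property has exactly one distinct value across the records.
    """
    if not records:
        return []
    constant = {
        key for key in records[0]
        if len({record[key] for record in records}) == 1
    }
    return [
        {key: value for key, value in record.items() if key not in constant}
        for record in records
    ]


# Made-up stand-in for the records read from the delimited text file.
records = [
    {"id": 1, "weight": 10.0},
    {"id": 2, "weight": 12.5},
    {"id": 3, "weight": 9.8},
]

# Add two properties with no variance to every record.
for record in records:
    record["property_a"] = 1.0
    record["property_b"] = 1.0

# Split off the first record, then perturb property_b on the rest
# (assumption: the small variance is applied to the remaining records).
first, rest = records[0], records[1:]
for record in rest:
    record["property_b"] += 0.001

# Merge the first record back and remove the constant properties.
merged = [first] + rest
cleaned = remove_zero_variance(merged)

print(sorted(cleaned[0]))  # ['id', 'property_b', 'weight']
```

In this sketch property_a is dropped because it is 1.0 in every record, while property_b survives because the first record still holds 1.0 and the others hold 1.001.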

Requirements: Pipeline Pilot 6.1.1 (collections: R Statistics)

O/S: PP Server: Windows and Linux
PP Client: Windows

Limitations: None

Keyword(s): R Remove Zero-Variance Properties

Contents: Removing Zero-Variance Properties from Data Stream.xml

Installation: Drag the protocol into the Pipeline Pilot client workspace, or drag and drop it onto one of the Pipeline Pilot client explorer tabs to import it directly into the protocol database.