Food and Drug Administration RSS News Feeds MDS Map & Clustering

Name: FDA RSS News Feeds MDS Map & Clustering
Author: Stephane Vellay
Version: 1.1
Created: 04/2009

Purpose: This is an example of how to handle RSS Feeds with the Text Analytics collection.

  • Reads Food and Drug Administration RSS News Feeds
  • Calculate Text Descriptors from Title and Abstract
  • Generate document similarity matrix
  • Clusters documents (AGNES)
  • Reduces dimensions using Multi Dimensional Scaling
  • Reports a MDS Map and a detailed sortable table


This protocol uses the Data Connector reporting component to link the MDS map and the table. You can select some document to highlight them in both. You also have access to the original document by clicking on the chart or the title in the sortable table.

Keyword: Text Analytics Integration Interactive Reporting FDA Food and Drug Administration RSS News Feeds MultiDimensional Scaling AGNES Clustering R

Requirements:

  • Pipeline Pilot 7.5 & Collection Update 1
  • Text Analytics Collection
  • R-Statistics Collection
  • Reporting Collection
  • /!\\ The Pipeline Pilot Server needs to access the Internet


Installation:

  • (Optional) If not already there, create a folder named "Text Analytics" under "Protocols/Web Services"
  • Drag and drop the .xml file in the Explorer window of your Pipeline Pilot client
  • Run from the webport & enjoy