Extract Training Sets and Batch QSAR

Name: Extract Training Sets and Batch QSAR

Author: Steven Muskal

Version: 1.0

Created: 8/2007

Modified: N/A

Purpose: These protocols will extract training sets and build Bayesian QSAR models using excerpted data from Eidogen-Sertanty’s Kinase Knowledgebase (KKB). Data are provided for the following Kinase targets: ABL, SRC, and AURKA. Larger data sets can be requested from info@eidogen-sertanty.com.
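As a rough illustration of what "extracting training sets" means here, the sketch below groups tab-delimited records into one set per kinase target. It is not the protocol's implementation. The column names (ID, Target, Active), the function name split_by_target, and the example rows are all assumptions made up for illustration; the actual layout of Q207-Excerpt_KB_SAR.txt may differ.

```python
import csv
import io
from collections import defaultdict


def split_by_target(tsv_text, target_column="Target"):
    """Group tab-delimited rows into one list per target name.

    The column name "Target" is an assumption, not the documented
    layout of the KKB excerpt files.
    """
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    sets = defaultdict(list)
    for row in reader:
        sets[row[target_column]].append(row)
    return dict(sets)


# Made-up rows for illustration only; these are not KKB data.
example = "ID\tTarget\tActive\nm1\tABL\t1\nm2\tSRC\t0\nm3\tABL\t0\n"

training_sets = split_by_target(example)
for target, rows in training_sets.items():
    print(target, len(rows))
```

Each resulting set would then be the input for one per-target model, which is the role the Batch-QSAR protocol plays in this package.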

Requirements: Pipeline Pilot 6.1.1 (collections: Chemistry, Data Modeling)

O/S: PP Server: Windows and Linux
PP Client: Windows

Limitations: None

Keyword: Kinase, QSAR, Bayesian Modeling, Training Set Extraction

Contents: KKB-Excerpt-ExtractTrainingSets.xml
KKB-Excerpt-Batch-QSAR.xml
Q207-Excerpt_KB_SAR.sdf
Q207-Excerpt_KB_SAR.txt

Installation:

1. Drag the protocols into the Pipeline Pilot client workspace, or drag and drop them into one of the Pipeline Pilot client explorer tabs to import them directly into the protocol database.

2. In “Global variables” of KKB-Excerpt-ExtractTrainingSets.xml, set the “inputDir” variable to the location of the Q207-Excerpt_KB_SAR.* files.

3. Run the ExtractTrainingSets protocol first, then run the Batch-QSAR protocol.
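Before running the protocols, it can help to confirm that the data files are actually in the folder that inputDir points to. The following is a minimal sketch in Python, run outside Pipeline Pilot. The file names come from the Contents list above; the function name missing_data_files and the example directory are hypothetical.

```python
from pathlib import Path

# File names are taken from the Contents list of this README.
DATA_FILES = ["Q207-Excerpt_KB_SAR.sdf", "Q207-Excerpt_KB_SAR.txt"]


def missing_data_files(input_dir):
    """Return the names of the data files not found in input_dir."""
    folder = Path(input_dir)
    return [name for name in DATA_FILES if not (folder / name).is_file()]


if __name__ == "__main__":
    # Hypothetical location; use the value you set for inputDir.
    input_dir = "C:/data/KKB-Excerpt"
    missing = missing_data_files(input_dir)
    if missing:
        print("Missing from inputDir:", ", ".join(missing))
    else:
        print("All data files found.")
```

An empty result means both files are in place and the ExtractTrainingSets protocol should be able to read them.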

Please Note – Large data sets for three Kinase targets (ABL, SRC, and AURKA) have been exported from Eidogen-Sertanty’s Kinase Knowledgebase (KKB) (http://www.eidogen-sertanty.com/products_kinasekb.html) and are provided in the Q207-Excerpt_KB_SAR.* data files. Please reference Eidogen-Sertanty’s KKB database when (re)using these data.