Name: NCBI Online BLAST
Author: Scott Markel
Version: 2.0
Created: 02/2005
Modified: 08/2007 (This Readme text)
Purpose: Run Blast on-line on the NCBI server
Descriuption: These component runs blastall online at NCBI to generate BLAST results that are added to the sequence data record. A hit list data record is output for each input query sequence. The hits can be extracted from the hit list for individual
manipulation using the Extract BLAST Hits component. Sequences with no BLAST hits are routed to the fail port. There are parameters for the database name and for the expectation value.
blastn compares incoming nucleotide sequences with a nucleotide database, resulting in nucleotide alignments.
blastp compares incoming protein sequences with a protein database, resulting in protein alignments.
blastx compares incoming nucleotide sequences with a protein database, resulting in protein alignments.
tblastn compares incoming protein sequences with a nucleotide database, resulting in protein alignments.
tblastx compares incoming nucleotide sequences with a nucleotide database, resulting in protein alignments.
The NCBI Online BLASTn component supports the following nucleotide databases:
- nr All GenBank+RefSeq Nucleotides+EMBL+DDBJ+PDB sequences (but no EST, STS, GSS, or phase 0, 1 or 2 HTGS sequences). No longer "non-redundant"
- est Database of GenBank+EMBL+DDBJ sequences from EST Divisions
- est_human Human subset of GenBank+EMBL+DDBJ sequences from EST Divisions
- est_mouse Mouse subset of GenBank+EMBL+DDBJ sequences from EST Divisions
- est_others Non-Mouse, non-Human sequences of GenBank+EMBL+DDBJ sequences from EST Divisions
- gss Genome Survey Sequence, includes single-pass genomic data, exon-trapped sequences, and Alu PCR sequences
- htgs Unfinished High Throughput Genomic Sequences: phases 0, 1 and 2 (finished, phase 3 HTG sequences are in nr)
- pat Nucleotides from the Patent division of GenBank
- pdb Sequences derived from the 3-dimensional structure from Brookhaven Protein Data Bank
- month All new or revised GenBank+EMBL+DDBJ+PDB sequences released in the last 30 days
- alu Select Alu repeats from REPBASE, suitable for masking Alu repeats from query sequences
- dbsts Database of GenBank+EMBL+DDBJ sequences from STS Divisions
- chromosome Searches Complete Genomes, Complete Chromosome, or contigs form the NCBI Reference Sequence project
Requirements: Pipeline Pilot 6.1.1
O/S: PP Server Windows and Linux
PP client Windows
Limitations: None
Keywords: NCBI, blast, DNA, RNA, Protein, alignment
Contents: Components:
NCBI Online BLASTn.xml
NCBI Online BLASTp.xml
NCBI Online BLASTx.xml
NCBI Online tBLASTn.xml
NCBI Online tBLASTx.xml
Installation: Drag and drop the components on the component area.