Home     Genomes     Genome Browser     Blat     Tables     PCR     Session    FAQ     Help  
  Schema for Human Proteins - Human Proteins Mapped by Chained tBLASTn
  Database: bosTau4    Primary Table: blastHg18KG    Row Count: 89,273
Format description: Summary info about a patSpace alignment
fieldexampleSQL type info description
bin 585smallint unsigned range Indexing field to speed chromosome range queries.
matches 147int unsigned range Number of bases that match that aren't repeats
misMatches 79int unsigned range Number of bases that don't match
repMatches 0int unsigned range Number of bases that match but are part of repeats
nCount 0int unsigned range Number of 'N' bases
qNumInsert 0int unsigned range Number of inserts in query
qBaseInsert 0int unsigned range Number of bases inserted in query
tNumInsert 0int unsigned range Number of inserts in target
tBaseInsert 0int unsigned range Number of bases inserted in target
strand +-char(2) values + or - for strand. First character query, second target (optional)
qName NM_001288varchar(255) values Query sequence name
qSize 240int unsigned range Query sequence size
qStart 12int unsigned range Alignment start position in query
qEnd 240int unsigned range Alignment end position in query
tName chr1varchar(255) values Target sequence name
tSize 161106243int unsigned range Target sequence size
tStart 103398int unsigned range Alignment start position in target
tEnd 117528int unsigned range Alignment end position in target
blockCount 5int unsigned range Number of blocks in alignment
blockSizes 36,42,36,58,54,longblob   Size of each block
qStarts 12,48,90,128,186,longblob   Start of each block in query.
tStarts 160988715,160989738,1609904...longblob   Start of each block in target.

  Sample Rows
 
binmatchesmisMatchesrepMatchesnCountqNumInsertqBaseInserttNumInserttBaseInsertstrandqNameqSizeqStartqEndtNametSizetStarttEndblockCountblockSizesqStartstStarts
58514779000000+-NM_00128824012240chr1161106243103398117528536,42,36,58,54,12,48,90,128,186,160988715,160989738,160990434,160991704,161002683,
58514779000000+-X8768924012240chr1161106243103398117528536,42,36,58,54,12,48,90,128,186,160988715,160989738,160990434,160991704,161002683,
7342720200656475+-AF4484397040704chr11611062431033981582591667,25,3,40,6,4,53,75,35,42,52,36,42,36,60,53,0,67,93,101,149,174,191,254,329,364,406,476,512,554,591,651,160947984,160948242,160948317,160948326,160948446,160948464,160948476,160948635,160948869,160948977,160949109,160988715,16098973 ...
7342720200656475+-NM_0532777040704chr11611062431033981582591667,25,3,40,6,4,53,75,35,42,52,36,42,36,60,53,0,67,93,101,149,174,191,254,329,364,406,476,512,554,591,651,160947984,160948242,160948317,160948326,160948446,160948464,160948476,160948635,160948869,160948977,160949109,160988715,16098973 ...
58517250000000+-NM_01692925121248chr1161106243103401117528536,41,34,59,52,21,57,100,137,196,160988715,160989738,160990437,160991704,161002686,
58517649000000+-NM_01394325324251chr1161106243103401117528536,43,35,59,52,24,60,103,140,199,160988715,160989738,160990437,160991704,161002686,
58515074000000+-BC00536724719245chr1161106243103404117528535,42,36,60,51,19,55,97,134,194,160988715,160989738,160990434,160991701,161002686,
58515074000000+-NM_00128924719245chr1161106243103404117528535,42,36,60,51,19,55,97,134,194,160988715,160989738,160990434,160991701,161002686,
58513337000000+-AK07514420521196chr1161106243114362117528436,41,34,59,21,57,100,137,160988715,160989738,160990437,160991704,
5866266001314165++BC1119842014135chr1161106243157288157837169,13,8,5,6,2,5,4,5,11,7,7,14,10,5,17,4,13,26,34,39,45,47,52,56,61,75,82,89,103,113,118,157288,157324,157384,157414,157432,157471,157507,157525,157540,157570,157603,157642,157672,157717,157765,157786,

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.


  Human Proteins (blastHg18KG) Track Description
 

Description

This track contains tBLASTn alignments of the peptides from the predicted and known genes identified in the hg18 UCSC Genes track.

Methods

First, the predicted proteins from the human Known Genes track were aligned with the human genome using the blat program to discover exon boundaries. Next, the amino acid sequences that make up each exon were aligned with the cow sequence using the tBLASTn program. Finally, the putative cow exons were chained together using an organism-specific maximum gap size but no gap penalty. The single best exon chains extending over more than 60% of the query protein were included. Exon chains that extended over 60% of the query and matched at least 60% of the protein's amino acids were also included.

Credits

tBLASTn is part of the NCBI Blast tool set. For more information on Blast, see Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990 Oct 5;215(3):403-410.

Blat was written by Jim Kent. The remaining utilities used to produce this track were written by Jim Kent or Brian Raney.