Search to RefSeqBP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-008166
(973 letters)
Database: RefSeq49_BP.fasta
33,088 sequences; 17,681,374 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|NP_001068815.1| THAP domain-containing protein 3 [Bos taurus]. 254 8e-68
Alignment gi|NP_001029820.1| THAP domain-containing protein 1 [Bos taurus]. 87 3e-17
Alignment gi|NP_001095670.1| THAP domain-containing protein 8 [Bos taurus]. 62 6e-10
Alignment gi|NP_001033758.1| THAP domain-containing protein 4 [Bos taurus]. 56 4e-08
Alignment gi|NP_001091935.1| THAP domain-containing protein 7 [Bos taurus]. 51 2e-06
Alignment gi|XP_002688436.1| PREDICTED: THAP domain containing 9 [Bos tau... 51 2e-06
Alignment gi|XP_874067.2| PREDICTED: THAP domain containing 6 [Bos taurus]. 51 2e-06
Alignment gi|NP_001096760.1| THAP domain-containing protein 6 [Bos taurus]. 50 4e-06
>ref|NP_001068815.1| THAP domain-containing protein 3 [Bos taurus].
Length = 239
Score = 254 bits (649), Expect = 8e-68
Identities = 128/180 (71%), Positives = 135/180 (75%), Gaps = 1/180 (0%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
MPKSCAARQCCNRYS+RRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR
Sbjct: 1 MPKSCAARQCCNRYSNRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 60
Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIG-AASEKGEVHPETGPSEC 424
PECFSAFGNRKNLKHNAVPT F FQGPPQLVRE DP G+ G A S + +V PETG EC
Sbjct: 61 PECFSAFGNRKNLKHNAVPTVFAFQGPPQLVRENTDPTGRSGDATSGERKVLPETGSGEC 120
Query: 425 GPGRKRDTALEVRQPPPDAGGPRAQGQPGRTGAVAXXXXXXXXXXXXXRPPSTQPPDHSY 604
G GRK DT +EV Q PP+ GG AQ P T + R TQP DHSY
Sbjct: 121 GLGRKMDTTVEVLQLPPEVGGLGAQ-VPPHTPETSGVPGQPASPPELKRRLPTQPSDHSY 179
>ref|NP_001029820.1| THAP domain-containing protein 1 [Bos taurus].
Length = 213
Score = 86.7 bits (213), Expect = 3e-17
Identities = 42/100 (42%), Positives = 60/100 (60%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
M +SC+A C NRY + K ++FH+FP +RP L K+W + R +F+P +++ ICSEHF
Sbjct: 1 MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKKWEAAVRRKNFKPTKYSSICSEHFT 59
Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQ 367
P+CF N K LK +AVPT F P + +P Q
Sbjct: 60 PDCFKRECNNKLLKEDAVPTIFLCTEPHDKKEDLLEPQEQ 99
>ref|NP_001095670.1| THAP domain-containing protein 8 [Bos taurus].
Length = 257
Score = 62.4 bits (150), Expect = 6e-10
Identities = 49/166 (29%), Positives = 71/166 (42%), Gaps = 15/166 (9%)
Frame = +2
Query: 68 MPKSCAARQCCN---RYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSE 238
MPK C A C N + + + ++F++FP L+ W+ ++GR + P H +CSE
Sbjct: 1 MPKYCLAPNCSNTAGQLGADNRPVSFYKFPLKDGPRLQAWLRHMGREHWVPSCHQHLCSE 60
Query: 239 HFRPECFSAFGNRKNLKHNAVPTEFTFQGPPQ-------LVRETADPDGQIGAASEKGEV 397
HF P CF + L+ +AVP+ F+ P Q + P Q + G V
Sbjct: 61 HFAPSCFQWRWGVRYLRPDAVPSIFSRVPPAQRQQSSRSTEKPVVPPPLQSTPSLASGPV 120
Query: 398 H-----PETGPSECGPGRKRDTALEVRQPPPDAGGPRAQGQPGRTG 520
P +G E P T L + QP P P A Q R G
Sbjct: 121 QLLVLGPASGAPE-APATVFLTPLSL-QPAPAGPRPGASAQHPRAG 164
>ref|NP_001033758.1| THAP domain-containing protein 4 [Bos taurus].
Length = 570
Score = 56.2 bits (134), Expect = 4e-08
Identities = 29/81 (35%), Positives = 45/81 (55%), Gaps = 3/81 (3%)
Frame = +2
Query: 80 CAARQCCNRYSSRRKQ-LTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFRPEC 256
CAA C NR K+ ++FHRFP + L +W+ + R ++ P +++ +CSEHF +
Sbjct: 5 CAAANCSNRQGKGEKRAVSFHRFPLKDSKRLMQWLKAVQRDNWTPTKYSFLCSEHFTKDS 64
Query: 257 FS--AFGNRKNLKHNAVPTEF 313
FS + LK AVP+ F
Sbjct: 65 FSKRLEDQHRLLKPTAVPSIF 85
>ref|NP_001091935.1| THAP domain-containing protein 7 [Bos taurus].
Length = 308
Score = 50.8 bits (120), Expect = 2e-06
Identities = 31/93 (33%), Positives = 45/93 (48%), Gaps = 11/93 (11%)
Frame = +2
Query: 68 MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNI------GRGDFEP-KQH 220
MP+ C+A CC R + +R + ++FHR P W+ N G+G ++P ++
Sbjct: 1 MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY 60
Query: 221 TVICSEHFRPECFSAFG--NRKNLKHNAVPTEF 313
CS+HF CF G LK AVPT F
Sbjct: 61 IYFCSKHFEENCFELVGISGYHRLKEGAVPTIF 93
>ref|XP_002688436.1| PREDICTED: THAP domain containing 9 [Bos taurus].
Length = 899
Score = 50.8 bits (120), Expect = 2e-06
Identities = 30/88 (34%), Positives = 48/88 (54%), Gaps = 8/88 (9%)
Frame = +2
Query: 68 MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNIGRGD------FEPKQHT 223
M +SC+A C R + SR + L+FH+FP + K W+ + R D + P
Sbjct: 1 MTRSCSAVGCSTRDTVLSRERGLSFHQFPTDTIQRSK-WIRAVNRMDPRSKKIWIPGPGA 59
Query: 224 VICSEHFRPECFSAFGNRKNLKHNAVPT 307
++CS+HF+ F ++G R+ LK AVP+
Sbjct: 60 MLCSKHFQESDFESYGIRRKLKKGAVPS 87
>ref|XP_874067.2| PREDICTED: THAP domain containing 6 [Bos taurus].
Length = 899
Score = 50.8 bits (120), Expect = 2e-06
Identities = 30/88 (34%), Positives = 48/88 (54%), Gaps = 8/88 (9%)
Frame = +2
Query: 68 MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNIGRGD------FEPKQHT 223
M +SC+A C R + SR + L+FH+FP + K W+ + R D + P
Sbjct: 1 MTRSCSAVGCSTRDTVLSRERGLSFHQFPTDTIQRSK-WIRAVNRMDPRSKKIWIPGPGA 59
Query: 224 VICSEHFRPECFSAFGNRKNLKHNAVPT 307
++CS+HF+ F ++G R+ LK AVP+
Sbjct: 60 MLCSKHFQESDFESYGIRRKLKKGAVPS 87
>ref|NP_001096760.1| THAP domain-containing protein 6 [Bos taurus].
Length = 180
Score = 49.7 bits (117), Expect = 4e-06
Identities = 33/90 (36%), Positives = 46/90 (51%), Gaps = 8/90 (8%)
Frame = +2
Query: 68 MPKSCAARQCCNRY--SSRRKQLTFHRFPFSRPELLKEWVLNIGR------GDFEPKQHT 223
M K C+A C +R +S+ K LTFH FP + ++WVL + R G +EPK+
Sbjct: 1 MVKCCSAVGCASRCLPNSKLKGLTFHVFPTDE-NIKRKWVLAMKRLDVNAAGIWEPKKGD 59
Query: 224 VICSEHFRPECFSAFGNRKNLKHNAVPTEF 313
V+CS HF+ F LK VP+ F
Sbjct: 60 VLCSRHFKKTDFDRSTPNIKLKPGVVPSIF 89
Database: RefSeq49_BP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 17,681,374
Number of sequences in database: 33,088
Lambda K H
0.312 0.131 0.418
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33088
Number of Hits to DB: 40,689,968
Number of extensions: 1411151
Number of successful extensions: 10931
Number of sequences better than 1.0e-05: 8
Number of HSP's gapped: 10634
Number of HSP's successfully gapped: 8
Length of query: 324
Length of database: 17,681,374
Length adjustment: 102
Effective length of query: 222
Effective length of database: 14,306,398
Effective search space: 3176020356
Effective search space used: 3176020356
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.4 bits)
S2: 34 (17.7 bits)
Search to RefSeqCP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-008166
(973 letters)
Database: RefSeq49_CP.fasta
33,336 sequences; 18,874,504 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|XP_849390.1| PREDICTED: similar to THAP domain protein 3 iso... 218 5e-57
Alignment gi|XP_858681.1| PREDICTED: similar to THAP domain protein 3 iso... 201 9e-52
Alignment gi|XP_848435.1| PREDICTED: similar to THAP domain protein 1 iso... 89 5e-18
Alignment gi|XP_532789.1| PREDICTED: similar to THAP domain protein 1 iso... 86 4e-17
Alignment gi|XP_541680.2| PREDICTED: similar to CLIP-170-related protein ... 65 1e-10
Alignment gi|XP_539521.2| PREDICTED: similar to THAP domain containing 5 ... 62 8e-10
Alignment gi|XP_544956.2| PREDICTED: similar to THAP domain containing 9 ... 52 1e-06
Alignment gi|XP_543564.2| PREDICTED: similar to THAP domain protein 7 [Ca... 51 2e-06
Alignment gi|XP_855909.1| PREDICTED: similar to THAP domain containing 6 ... 50 3e-06
Alignment gi|XP_544933.1| PREDICTED: similar to THAP domain containing 6 ... 49 7e-06
>ref|XP_849390.1| PREDICTED: similar to THAP domain protein 3 isoform 1 [Canis
familiaris].
Length = 225
Score = 218 bits (556), Expect = 5e-57
Identities = 127/234 (54%), Positives = 133/234 (56%), Gaps = 2/234 (0%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGR +F+PKQHTVICSEHFR
Sbjct: 1 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRANFKPKQHTVICSEHFR 60
Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADP-DGQIGAASEKG-EVHPETGPSE 421
PECFSAFGNRKNLK NAVPT F FQ QL RE ADP G S K V E P+E
Sbjct: 61 PECFSAFGNRKNLKQNAVPTVFAFQDATQLARENADPAGGDTNVDSHKEVSVASEVVPAE 120
Query: 422 CGPGRKRDTALEVRQPPPDAGGPRAQGQPGRTGAVAXXXXXXXXXXXXXRPPSTQPPDHS 601
CG GRK + ALEV PP A GP Q P R P Q DHS
Sbjct: 121 CGWGRKLEAALEVL--PPMASGPAEQVVPRRLQGT-------QAPAQQASPSPAQTSDHS 171
Query: 602 YXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMASHLQACRAGSRPEQQS 763
Y M+S L+A RAG PE QS
Sbjct: 172 YALLDLDALKKKLFLTLKENEKLRKRLKAQRLVIRRMSSRLRAHRAGPPPEPQS 225
>ref|XP_858681.1| PREDICTED: similar to THAP domain protein 3 isoform 2 [Canis
familiaris].
Length = 203
Score = 201 bits (511), Expect = 9e-52
Identities = 117/232 (50%), Positives = 124/232 (53%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGR +F+PKQHTVICSEHFR
Sbjct: 1 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRANFKPKQHTVICSEHFR 60
Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASEKGEVHPETGPSECG 427
PECFSAFGNRKNLK NAVPT F FQ Q+ E P+ECG
Sbjct: 61 PECFSAFGNRKNLKQNAVPTVFAFQDATQVASEVV--------------------PAECG 100
Query: 428 PGRKRDTALEVRQPPPDAGGPRAQGQPGRTGAVAXXXXXXXXXXXXXRPPSTQPPDHSYX 607
GRK + ALEV PP A GP Q P R P Q DHSY
Sbjct: 101 WGRKLEAALEVL--PPMASGPAEQVVPRRLQGT-------QAPAQQASPSPAQTSDHSYA 151
Query: 608 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMASHLQACRAGSRPEQQS 763
M+S L+A RAG PE QS
Sbjct: 152 LLDLDALKKKLFLTLKENEKLRKRLKAQRLVIRRMSSRLRAHRAGPPPEPQS 203
>ref|XP_848435.1| PREDICTED: similar to THAP domain protein 1 isoform 2 [Canis
familiaris].
Length = 213
Score = 89.4 bits (220), Expect = 5e-18
Identities = 43/101 (42%), Positives = 61/101 (60%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
M +SC+A C NRY + K ++FH+FP +RP L K+W + R +F+P +++ ICSEHF
Sbjct: 1 MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKKWEAAVRRKNFKPTKYSSICSEHFT 59
Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQI 370
P+CF N K LK NAVPT F P + +P Q+
Sbjct: 60 PDCFKRECNNKLLKENAVPTIFLCTEPHDKKEDLLEPQEQL 100
>ref|XP_532789.1| PREDICTED: similar to THAP domain protein 1 isoform 1 [Canis
familiaris].
Length = 178
Score = 86.3 bits (212), Expect = 4e-17
Identities = 41/87 (47%), Positives = 56/87 (64%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
M +SC+A C NRY + K ++FH+FP +RP L K+W + R +F+P +++ ICSEHF
Sbjct: 1 MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKKWEAAVRRKNFKPTKYSSICSEHFT 59
Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGP 328
P+CF N K LK NAVPT F P
Sbjct: 60 PDCFKRECNNKLLKENAVPTIFLCTEP 86
>ref|XP_541680.2| PREDICTED: similar to CLIP-170-related protein [Canis familiaris].
Length = 871
Score = 64.7 bits (156), Expect = 1e-10
Identities = 40/146 (27%), Positives = 66/146 (45%), Gaps = 3/146 (2%)
Frame = +2
Query: 68 MPKSCAARQCCN---RYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSE 238
MPK C A C N R + + ++F++FP L+ W+ ++G D+ P H +CSE
Sbjct: 1 MPKYCRAPNCSNTAGRLGADNRPVSFYKFPLKDGPRLQAWLRHMGHEDWVPSCHHHLCSE 60
Query: 239 HFRPECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASEKGEVHPETGPS 418
HF P CF + L+ +AVP+ F+ P + + + + + + S PE P
Sbjct: 61 HFTPSCFQWRWGVRYLRPDAVPSIFSPAPPAKRQQSSRSTEKPVESPSS-----PEAMPL 115
Query: 419 ECGPGRKRDTALEVRQPPPDAGGPRA 496
P + + +GGP A
Sbjct: 116 SPDPTVSASGPMHLAVLGSASGGPEA 141
>ref|XP_539521.2| PREDICTED: similar to THAP domain containing 5 [Canis familiaris].
Length = 394
Score = 62.0 bits (149), Expect = 8e-10
Identities = 30/85 (35%), Positives = 50/85 (58%), Gaps = 2/85 (2%)
Frame = +2
Query: 68 MPKSCAARQCCNRY--SSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEH 241
MP+ CAA C NR +S+ ++L+F+ FP E L++W+ N+ R + P ++ +CS+H
Sbjct: 1 MPRYCAAICCKNRRGRNSKDRKLSFYPFPLHDKERLEKWLKNMKRDSWVPSKYQFLCSDH 60
Query: 242 FRPECFSAFGNRKNLKHNAVPTEFT 316
F P+ + LK A+PT F+
Sbjct: 61 FTPDSLDIRWGIRYLKQTAIPTIFS 85
>ref|XP_544956.2| PREDICTED: similar to THAP domain containing 9 [Canis familiaris].
Length = 902
Score = 51.6 bits (122), Expect = 1e-06
Identities = 30/88 (34%), Positives = 48/88 (54%), Gaps = 8/88 (9%)
Frame = +2
Query: 68 MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNIGRGD------FEPKQHT 223
M +SC+A C R + SR + L+FH+FP + K W+ + R D + P
Sbjct: 1 MTRSCSAVGCSTRDTVLSRERGLSFHQFPTDTIQRSK-WIRAVNRVDPRSKKIWIPGPGA 59
Query: 224 VICSEHFRPECFSAFGNRKNLKHNAVPT 307
++CS+HF+ F ++G R+ LK AVP+
Sbjct: 60 ILCSKHFQESDFESYGIRRKLKKGAVPS 87
>ref|XP_543564.2| PREDICTED: similar to THAP domain protein 7 [Canis familiaris].
Length = 313
Score = 50.8 bits (120), Expect = 2e-06
Identities = 31/93 (33%), Positives = 45/93 (48%), Gaps = 11/93 (11%)
Frame = +2
Query: 68 MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNI------GRGDFEP-KQH 220
MP+ C+A CC R + +R + ++FHR P W+ N G+G ++P ++
Sbjct: 1 MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY 60
Query: 221 TVICSEHFRPECFSAFG--NRKNLKHNAVPTEF 313
CS+HF CF G LK AVPT F
Sbjct: 61 IYFCSKHFEENCFELVGISGYHRLKEGAVPTIF 93
>ref|XP_855909.1| PREDICTED: similar to THAP domain containing 6 isoform 4 [Canis
familiaris].
Length = 180
Score = 50.1 bits (118), Expect = 3e-06
Identities = 35/101 (34%), Positives = 51/101 (50%), Gaps = 8/101 (7%)
Frame = +2
Query: 68 MPKSCAARQCCNRY--SSRRKQLTFHRFPFSRPELLKEWVLNIGR------GDFEPKQHT 223
M K C+A C +R +S+ K LTFH FP + ++WVL + R G +EPK+
Sbjct: 1 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDE-NVKRKWVLAMKRLDVNAAGIWEPKKGD 59
Query: 224 VICSEHFRPECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRE 346
V+CS HF+ F LK +P+ F PP ++E
Sbjct: 60 VLCSRHFKKTDFDRSTPNIKLKPGVIPSIF---DPPSHLQE 97
>ref|XP_544933.1| PREDICTED: similar to THAP domain containing 6 isoform 1 [Canis
familiaris].
Length = 222
Score = 48.9 bits (115), Expect = 7e-06
Identities = 32/90 (35%), Positives = 46/90 (51%), Gaps = 8/90 (8%)
Frame = +2
Query: 68 MPKSCAARQCCNRY--SSRRKQLTFHRFPFSRPELLKEWVLNIGR------GDFEPKQHT 223
M K C+A C +R +S+ K LTFH FP + ++WVL + R G +EPK+
Sbjct: 1 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDE-NVKRKWVLAMKRLDVNAAGIWEPKKGD 59
Query: 224 VICSEHFRPECFSAFGNRKNLKHNAVPTEF 313
V+CS HF+ F LK +P+ F
Sbjct: 60 VLCSRHFKKTDFDRSTPNIKLKPGVIPSIF 89
Database: RefSeq49_CP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 18,874,504
Number of sequences in database: 33,336
Lambda K H
0.312 0.131 0.418
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33336
Number of Hits to DB: 42,059,688
Number of extensions: 1398302
Number of successful extensions: 11398
Number of sequences better than 1.0e-05: 11
Number of HSP's gapped: 11064
Number of HSP's successfully gapped: 11
Length of query: 324
Length of database: 18,874,504
Length adjustment: 103
Effective length of query: 221
Effective length of database: 15,440,896
Effective search space: 3412438016
Effective search space used: 3412438016
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.4 bits)
S2: 34 (17.7 bits)
Search to RefSeqSP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-008166
(973 letters)
Database: RefSeq49_SP.fasta
24,897 sequences; 11,343,932 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|XP_003359917.1| PREDICTED: THAP domain-containing protein 1-... 87 1e-17
Alignment gi|XP_003126431.1| PREDICTED: LOW QUALITY PROTEIN: THAP domain-... 77 2e-14
Alignment gi|XP_003134826.1| PREDICTED: THAP domain-containing protein 5-... 60 2e-09
Alignment gi|XP_003127122.1| PREDICTED: THAP domain-containing protein 8-... 59 3e-09
Alignment gi|XP_003133046.1| PREDICTED: THAP domain-containing protein 7-... 52 5e-07
Alignment gi|XP_003129402.1| PREDICTED: THAP domain-containing protein 9 ... 50 1e-06
>ref|XP_003359917.1| PREDICTED: THAP domain-containing protein 1-like [Sus scrofa].
Length = 213
Score = 87.4 bits (215), Expect = 1e-17
Identities = 42/101 (41%), Positives = 61/101 (60%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
M +SC+A C NRY + K ++FH+FP +RP L K+W + R +F+P +++ ICSEHF
Sbjct: 1 MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKKWEAAVRRKNFKPTKYSSICSEHFT 59
Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQI 370
P+CF N K LK +AVPT F P + +P Q+
Sbjct: 60 PDCFKRECNNKLLKEDAVPTIFLCTEPHDKKEDLLEPQEQL 100
>ref|XP_003126431.1| PREDICTED: LOW QUALITY PROTEIN: THAP domain-containing protein
2-like [Sus scrofa].
Length = 227
Score = 76.6 bits (187), Expect = 2e-14
Identities = 37/84 (44%), Positives = 50/84 (59%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
MP +CAA C Y+ + ++FHRFP P+ KEWV + R +F P +HT +CS+HF
Sbjct: 1 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE 58
Query: 248 PECFSAFGNRKNLKHNAVPTEFTF 319
CF G + LK +AVPT F F
Sbjct: 59 ASCFDLTGQTRRLKMDAVPTIFDF 82
>ref|XP_003134826.1| PREDICTED: THAP domain-containing protein 5-like [Sus scrofa].
Length = 395
Score = 59.7 bits (143), Expect = 2e-09
Identities = 29/85 (34%), Positives = 47/85 (55%), Gaps = 2/85 (2%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRR--KQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEH 241
MP+ CAA C NR ++L+F+ FP E L++W+ N+ R + P ++ +CS+H
Sbjct: 1 MPRYCAAICCKNRRGRNNTDRKLSFYPFPLHDKERLEKWLKNMKRDSWVPSKYQFLCSDH 60
Query: 242 FRPECFSAFGNRKNLKHNAVPTEFT 316
F P+ + LK A+PT F+
Sbjct: 61 FTPDSLDIRWGIRYLKQTAIPTIFS 85
>ref|XP_003127122.1| PREDICTED: THAP domain-containing protein 8-like [Sus scrofa].
Length = 269
Score = 59.3 bits (142), Expect = 3e-09
Identities = 48/162 (29%), Positives = 67/162 (41%), Gaps = 20/162 (12%)
Frame = +2
Query: 68 MPKSCAARQCCN---RYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSE 238
MP+ C A C N R + + ++F++FP L+ W+ ++G + P H +CSE
Sbjct: 1 MPRYCRAPNCSNTAGRLGADHRPVSFYKFPLKDGPRLQAWLRHMGLEHWVPSCHQHLCSE 60
Query: 239 HFRPECFSAFGNRKNLKHNAVPTEF---TFQG-------------PPQLVRETADPDGQI 370
HF P CF + L+ +AVP+ F TF PP T+ G
Sbjct: 61 HFAPSCFQWRWGVRYLRPDAVPSIFSPATFTERQENCGSTEKPVMPPPPPEATSLFPGSA 120
Query: 371 GAASEKGEVH-PETGPSECGPGRKRDTALEVRQPPPDAGGPR 493
G A G V GP+ GP L PP GPR
Sbjct: 121 GPA--PGPVRLVVLGPASGGPEAPATVILTPLPLPPVPAGPR 160
>ref|XP_003133046.1| PREDICTED: THAP domain-containing protein 7-like isoform 1 [Sus
scrofa].
Length = 313
Score = 52.0 bits (123), Expect = 5e-07
Identities = 46/158 (29%), Positives = 63/158 (39%), Gaps = 20/158 (12%)
Frame = +2
Query: 68 MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNI------GRGDFEP-KQH 220
MP+ C+A CC R + +R + ++FHR P W+ N G+G ++P ++
Sbjct: 1 MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY 60
Query: 221 TVICSEHFRPECFSAFG--NRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASEKGE 394
CS+HF CF G LK AVPT F+ +L R A KG
Sbjct: 61 IYFCSKHFEENCFELVGISGYHRLKEGAVPT--IFESFSKLRR----------TAKTKGH 108
Query: 395 VHPETGP---------SECGPGRKRDTALEVRQPPPDA 481
+P P C GR T PPP A
Sbjct: 109 SYPPGPPDVSRLRRCRKRCSEGRGPTTPF---SPPPPA 143
>ref|XP_003129402.1| PREDICTED: THAP domain-containing protein 9 [Sus scrofa].
Length = 903
Score = 50.4 bits (119), Expect = 1e-06
Identities = 29/88 (32%), Positives = 48/88 (54%), Gaps = 8/88 (9%)
Frame = +2
Query: 68 MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNIGRGD------FEPKQHT 223
M +SC+A C R + SR + L+FH+FP + +W+ + R D + P
Sbjct: 1 MTRSCSAVGCSTRDTVLSRERGLSFHQFPTDTIQR-SQWIRAVNRMDPRSKKIWIPGPGA 59
Query: 224 VICSEHFRPECFSAFGNRKNLKHNAVPT 307
++CS+HF+ F ++G R+ LK AVP+
Sbjct: 60 MLCSKHFQESDFESYGIRRKLKKGAVPS 87
Database: RefSeq49_SP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 11,343,932
Number of sequences in database: 24,897
Lambda K H
0.312 0.131 0.418
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 24897
Number of Hits to DB: 26,904,719
Number of extensions: 996415
Number of successful extensions: 7540
Number of sequences better than 1.0e-05: 6
Number of HSP's gapped: 7311
Number of HSP's successfully gapped: 6
Length of query: 324
Length of database: 11,343,932
Length adjustment: 99
Effective length of query: 225
Effective length of database: 8,879,129
Effective search space: 1997804025
Effective search space used: 1997804025
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.4 bits)
S2: 34 (17.7 bits)
Search to RefSeqMP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-008166
(973 letters)
Database: RefSeq49_MP.fasta
30,036 sequences; 15,617,559 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|NP_780361.2| THAP domain-containing protein 3 isoform 1 [Mus... 200 1e-51
Alignment gi|NP_001139401.1| THAP domain-containing protein 3 isoform 2 [... 182 3e-46
Alignment gi|NP_950243.1| THAP domain-containing protein 1 [Mus musculus]. 90 2e-18
Alignment gi|NP_080056.1| THAP domain-containing protein 2 [Mus musculus]. 76 4e-14
Alignment gi|NP_080196.3| THAP domain-containing protein 4 [Mus musculus]. 56 5e-08
Alignment gi|NP_081185.1| THAP domain-containing protein 7 [Mus musculus]. 51 2e-06
>ref|NP_780361.2| THAP domain-containing protein 3 isoform 1 [Mus musculus].
Length = 218
Score = 200 bits (509), Expect = 1e-51
Identities = 109/180 (60%), Positives = 111/180 (61%), Gaps = 1/180 (0%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELL+EWVLNIGR DF+PKQHTVICSEHFR
Sbjct: 1 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLREWVLNIGRADFKPKQHTVICSEHFR 60
Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASEKGEVHPETGPSECG 427
PECFSAFGNRKNLKHNAVPT F FQ P EV PE G
Sbjct: 61 PECFSAFGNRKNLKHNAVPTVFAFQNPT--------------------EVCPEVGAGGDS 100
Query: 428 PGRKRDTALEVRQPPPDAGGPRAQGQPGRTGAVA-XXXXXXXXXXXXXRPPSTQPPDHSY 604
GR DT LE QPP GP Q P R A RP QP DHSY
Sbjct: 101 SGRNMDTTLEELQPPTPE-GPVQQVLPDREAMEATEAAGLPASPLGLKRPLPGQPSDHSY 159
>ref|NP_001139401.1| THAP domain-containing protein 3 isoform 2 [Mus musculus].
Length = 184
Score = 182 bits (462), Expect = 3e-46
Identities = 85/106 (80%), Positives = 92/106 (86%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELL+EWVLNIGR DF+PKQHTVICSEHFR
Sbjct: 1 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLREWVLNIGRADFKPKQHTVICSEHFR 60
Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASE 385
PECFSAFGNRKNLKHNAVPT F FQ P +++ PD + A+E
Sbjct: 61 PECFSAFGNRKNLKHNAVPTVFAFQNPTEVL-----PDREAMEATE 101
>ref|NP_950243.1| THAP domain-containing protein 1 [Mus musculus].
Length = 210
Score = 90.1 bits (222), Expect = 2e-18
Identities = 51/135 (37%), Positives = 73/135 (54%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
M +SC+A C NRY + K ++FH+FP +RP L K+W + R +F+P +++ ICSEHF
Sbjct: 1 MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKQWEAAVKRKNFKPTKYSSICSEHFT 59
Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASEKGEVHPETGPSECG 427
P+CF N K LK NAVPT F + P + + D + Q E PS
Sbjct: 60 PDCFKRECNNKLLKENAVPTIFLYIEPHE---KKEDLESQ------------EQLPSPSP 104
Query: 428 PGRKRDTALEVRQPP 472
P + D A+ + PP
Sbjct: 105 PASQVDAAIGLLMPP 119
>ref|NP_080056.1| THAP domain-containing protein 2 [Mus musculus].
Length = 217
Score = 75.9 bits (185), Expect = 4e-14
Identities = 37/84 (44%), Positives = 50/84 (59%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
MP +CAA C Y+ + ++FHRFP P+ KEWV + R +F P +HT +CS+HF
Sbjct: 1 MPTNCAAAGCAATYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE 58
Query: 248 PECFSAFGNRKNLKHNAVPTEFTF 319
CF G + LK +AVPT F F
Sbjct: 59 ASCFDLTGQTRRLKMDAVPTIFDF 82
>ref|NP_080196.3| THAP domain-containing protein 4 [Mus musculus].
Length = 569
Score = 55.8 bits (133), Expect = 5e-08
Identities = 40/145 (27%), Positives = 63/145 (43%), Gaps = 11/145 (7%)
Frame = +2
Query: 80 CAARQCCNRYSSRRKQ-LTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFRPEC 256
CAA C NR K+ ++FHRFP + L +W+ + R ++ P +++ +CSEHF +
Sbjct: 5 CAAVNCSNRQGKGEKRAVSFHRFPLKDSKRLIQWLKAVQRDNWTPTKYSFLCSEHFTKDS 64
Query: 257 FS--AFGNRKNLKHNAVPTEFTFQGPPQLV--------RETADPDGQIGAASEKGEVHPE 406
FS + LK AVP+ F + + TA G A + KG +
Sbjct: 65 FSKRLEDQHRLLKPTAVPSIFHLSEKKRGAGGHGHARRKTTAAMRGHTSAETGKGTIGSS 124
Query: 407 TGPSECGPGRKRDTALEVRQPPPDA 481
S+ + L+ P DA
Sbjct: 125 LSSSDNLMAKPESRKLKRASPQDDA 149
>ref|NP_081185.1| THAP domain-containing protein 7 [Mus musculus].
Length = 309
Score = 50.8 bits (120), Expect = 2e-06
Identities = 31/93 (33%), Positives = 45/93 (48%), Gaps = 11/93 (11%)
Frame = +2
Query: 68 MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNI------GRGDFEP-KQH 220
MP+ C+A CC R + +R + ++FHR P W+ N G+G ++P ++
Sbjct: 1 MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPTSEY 60
Query: 221 TVICSEHFRPECFSAFG--NRKNLKHNAVPTEF 313
CS+HF CF G LK AVPT F
Sbjct: 61 IYFCSKHFEENCFELVGISGYHRLKEGAVPTIF 93
Database: RefSeq49_MP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 15,617,559
Number of sequences in database: 30,036
Lambda K H
0.312 0.131 0.418
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 30036
Number of Hits to DB: 34,059,676
Number of extensions: 1079143
Number of successful extensions: 7505
Number of sequences better than 1.0e-05: 6
Number of HSP's gapped: 7312
Number of HSP's successfully gapped: 6
Length of query: 324
Length of database: 15,617,559
Length adjustment: 102
Effective length of query: 222
Effective length of database: 12,553,887
Effective search space: 2786962914
Effective search space used: 2786962914
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.4 bits)
S2: 34 (17.7 bits)
Search to RefSeqHP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-008166
(973 letters)
Database: RefSeq49_HP.fasta
32,964 sequences; 18,297,164 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|NP_001182682.1| THAP domain-containing protein 3 isoform 3 [... 237 1e-62
Alignment gi|NP_001182681.1| THAP domain-containing protein 3 isoform 1 [... 233 2e-61
Alignment gi|NP_612359.2| THAP domain-containing protein 3 isoform 2 [Hom... 221 6e-58
Alignment gi|NP_060575.1| THAP domain-containing protein 1 isoform 1 [Hom... 91 2e-18
Alignment gi|NP_113623.1| THAP domain-containing protein 2 [Homo sapiens]. 77 3e-14
Alignment gi|NP_689871.1| THAP domain-containing protein 8 [Homo sapiens]. 62 1e-09
Alignment gi|NP_001123947.1| THAP domain-containing protein 5 isoform 1 [... 61 1e-09
Alignment gi|NP_057047.4| THAP domain-containing protein 4 isoform 1 [Hom... 55 1e-07
Alignment gi|NP_001008695.1| THAP domain-containing protein 7 isoform 2 [... 54 2e-07
Alignment gi|NP_085050.2| THAP domain-containing protein 7 isoform 1 [Hom... 54 2e-07
>ref|NP_001182682.1| THAP domain-containing protein 3 isoform 3 [Homo sapiens].
Length = 239
Score = 237 bits (604), Expect = 1e-62
Identities = 132/240 (55%), Positives = 144/240 (60%), Gaps = 8/240 (3%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRG+F+PKQHTVICSEHFR
Sbjct: 1 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR 60
Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIG--AASEKGEVHPETGPSE 421
PECFSAFGNRKNLKHNAVPT F FQ P Q VRE DP + G ++S+K +V PE G E
Sbjct: 61 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKVLPEAGAGE 120
Query: 422 CGPGRKRDTALEVRQPPPDAGGPRAQGQPGRTGAVAXXXXXXXXXXXXXRPPSTQPPDHS 601
PGR DTALE Q PP+A G Q P R A R P+ QP DHS
Sbjct: 121 DSPGRNMDTALEELQLPPNAEGHVKQVSPRRPQA-TEAVGRPTGPAGLRRTPNKQPSDHS 179
Query: 602 YXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMASHLQACR------AGSRPEQQS 763
Y M+S L+AC+ A PEQQS
Sbjct: 180 YALLDLDSLKKKLFLTLKENEKLRKRLQAQRLVMRRMSSRLRACKGHQGLQARLGPEQQS 239
>ref|NP_001182681.1| THAP domain-containing protein 3 isoform 1 [Homo sapiens].
Length = 238
Score = 233 bits (594), Expect = 2e-61
Identities = 132/240 (55%), Positives = 144/240 (60%), Gaps = 8/240 (3%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRG+F+PKQHTVICSEHFR
Sbjct: 1 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR 60
Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIG--AASEKGEVHPETGPSE 421
PECFSAFGNRKNLKHNAVPT F FQ P Q VRE DP + G ++S+K +V PE G E
Sbjct: 61 PECFSAFGNRKNLKHNAVPTVFAFQDPTQ-VRENTDPASERGNASSSQKEKVLPEAGAGE 119
Query: 422 CGPGRKRDTALEVRQPPPDAGGPRAQGQPGRTGAVAXXXXXXXXXXXXXRPPSTQPPDHS 601
PGR DTALE Q PP+A G Q P R A R P+ QP DHS
Sbjct: 120 DSPGRNMDTALEELQLPPNAEGHVKQVSPRRPQA-TEAVGRPTGPAGLRRTPNKQPSDHS 178
Query: 602 YXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMASHLQACR------AGSRPEQQS 763
Y M+S L+AC+ A PEQQS
Sbjct: 179 YALLDLDSLKKKLFLTLKENEKLRKRLQAQRLVMRRMSSRLRACKGHQGLQARLGPEQQS 238
>ref|NP_612359.2| THAP domain-containing protein 3 isoform 2 [Homo sapiens].
Length = 175
Score = 221 bits (564), Expect = 6e-58
Identities = 109/149 (73%), Positives = 115/149 (77%), Gaps = 9/149 (6%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRG+F+PKQHTVICSEHFR
Sbjct: 1 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR 60
Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASE---------KGEVH 400
PECFSAFGNRKNLKHNAVPT F FQ P Q VRE DP + G AS + +V
Sbjct: 61 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKTSPCRSQVL 120
Query: 401 PETGPSECGPGRKRDTALEVRQPPPDAGG 487
PE G E PGR DTALE Q PP+A G
Sbjct: 121 PEAGAGEDSPGRNMDTALEELQLPPNAEG 149
>ref|NP_060575.1| THAP domain-containing protein 1 isoform 1 [Homo sapiens].
Length = 213
Score = 90.9 bits (224), Expect = 2e-18
Identities = 44/101 (43%), Positives = 61/101 (60%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
M +SC+A C NRY + K ++FH+FP +RP L KEW + R +F+P +++ ICSEHF
Sbjct: 1 MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFT 59
Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQI 370
P+CF N K LK NAVPT F P + +P Q+
Sbjct: 60 PDCFKRECNNKLLKENAVPTIFLCTEPHDKKEDLLEPQEQL 100
>ref|NP_113623.1| THAP domain-containing protein 2 [Homo sapiens].
Length = 228
Score = 76.6 bits (187), Expect = 3e-14
Identities = 37/84 (44%), Positives = 50/84 (59%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
MP +CAA C Y+ + ++FHRFP P+ KEWV + R +F P +HT +CS+HF
Sbjct: 1 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE 58
Query: 248 PECFSAFGNRKNLKHNAVPTEFTF 319
CF G + LK +AVPT F F
Sbjct: 59 ASCFDLTGQTRRLKMDAVPTIFDF 82
>ref|NP_689871.1| THAP domain-containing protein 8 [Homo sapiens].
Length = 274
Score = 61.6 bits (148), Expect = 1e-09
Identities = 32/91 (35%), Positives = 50/91 (54%), Gaps = 3/91 (3%)
Frame = +2
Query: 68 MPKSCAARQCCN---RYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSE 238
MPK C A C N R + + ++F++FP L+ W+ ++G + P H +CSE
Sbjct: 1 MPKYCRAPNCSNTAGRLGADNRPVSFYKFPLKDGPRLQAWLQHMGCEHWVPSCHQHLCSE 60
Query: 239 HFRPECFSAFGNRKNLKHNAVPTEFTFQGPP 331
HF P CF + L+ +AVP+ F+ +GPP
Sbjct: 61 HFTPSCFQWRWGVRYLRPDAVPSIFS-RGPP 90
>ref|NP_001123947.1| THAP domain-containing protein 5 isoform 1 [Homo sapiens].
Length = 395
Score = 61.2 bits (147), Expect = 1e-09
Identities = 31/85 (36%), Positives = 47/85 (55%), Gaps = 2/85 (2%)
Frame = +2
Query: 68 MPKSCAARQCCNRYSSRRK--QLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEH 241
MP+ CAA C NR K +L+F+ FP E L++W+ N+ R + P ++ +CS+H
Sbjct: 1 MPRYCAAICCKNRRGRNNKDRKLSFYPFPLHDKERLEKWLKNMKRDSWVPSKYQFLCSDH 60
Query: 242 FRPECFSAFGNRKNLKHNAVPTEFT 316
F P+ + LK AVPT F+
Sbjct: 61 FTPDSLDIRWGIRYLKQTAVPTIFS 85
>ref|NP_057047.4| THAP domain-containing protein 4 isoform 1 [Homo sapiens].
Length = 577
Score = 54.7 bits (130), Expect = 1e-07
Identities = 29/81 (35%), Positives = 45/81 (55%), Gaps = 3/81 (3%)
Frame = +2
Query: 80 CAARQCCNRYSSRRKQ-LTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFRPEC 256
CAA C NR K+ ++FHRFP + L +W+ + R ++ P +++ +CSEHF +
Sbjct: 5 CAAVNCSNRQGKGEKRAVSFHRFPLKDSKRLIQWLKAVQRDNWTPTKYSFLCSEHFTKDS 64
Query: 257 FS--AFGNRKNLKHNAVPTEF 313
FS + LK AVP+ F
Sbjct: 65 FSKRLEDQHRLLKPTAVPSIF 85
>ref|NP_001008695.1| THAP domain-containing protein 7 isoform 2 [Homo sapiens].
Length = 309
Score = 53.9 bits (128), Expect = 2e-07
Identities = 48/159 (30%), Positives = 67/159 (42%), Gaps = 21/159 (13%)
Frame = +2
Query: 68 MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNI------GRGDFEP-KQH 220
MP+ C+A CC R + +R + ++FHR P W+ N G+G ++P ++
Sbjct: 1 MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY 60
Query: 221 TVICSEHFRPECFSAFG--NRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASEKGE 394
CS+HF +CF G LK AVPT F+ +L R T KG
Sbjct: 61 IYFCSKHFEEDCFELVGISGYHRLKEGAVPT--IFESFSKLRRTT----------KTKGH 108
Query: 395 VHPETGPSE----------CGPGRKRDTALEVRQPPPDA 481
+P GP+E C GR T PPP A
Sbjct: 109 SYP-PGPAEVSRLRRCRKRCSEGRGPTTPF---SPPPPA 143
>ref|NP_085050.2| THAP domain-containing protein 7 isoform 1 [Homo sapiens].
Length = 309
Score = 53.9 bits (128), Expect = 2e-07
Identities = 48/159 (30%), Positives = 67/159 (42%), Gaps = 21/159 (13%)
Frame = +2
Query: 68 MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNI------GRGDFEP-KQH 220
MP+ C+A CC R + +R + ++FHR P W+ N G+G ++P ++
Sbjct: 1 MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY 60
Query: 221 TVICSEHFRPECFSAFG--NRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASEKGE 394
CS+HF +CF G LK AVPT F+ +L R T KG
Sbjct: 61 IYFCSKHFEEDCFELVGISGYHRLKEGAVPT--IFESFSKLRRTT----------KTKGH 108
Query: 395 VHPETGPSE----------CGPGRKRDTALEVRQPPPDA 481
+P GP+E C GR T PPP A
Sbjct: 109 SYP-PGPAEVSRLRRCRKRCSEGRGPTTPF---SPPPPA 143
Database: RefSeq49_HP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 18,297,164
Number of sequences in database: 32,964
Lambda K H
0.312 0.131 0.418
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 32964
Number of Hits to DB: 40,889,015
Number of extensions: 1331994
Number of successful extensions: 10365
Number of sequences better than 1.0e-05: 12
Number of HSP's gapped: 10074
Number of HSP's successfully gapped: 12
Length of query: 324
Length of database: 18,297,164
Length adjustment: 103
Effective length of query: 221
Effective length of database: 14,901,872
Effective search space: 3293313712
Effective search space used: 3293313712
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.4 bits)
S2: 34 (17.7 bits)
Search to Sscrofa10_2
BLASTN 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-008166
(973 letters)
Database: Sscrofa_10.2.fasta
4582 sequences; 2,808,509,378 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Sscrofa_Chr06 751 0.0
>Sscrofa_Chr06
|| Length = 157765593
Score = 751 bits (379), Expect = 0.0
Identities = 399/403 (99%), Gaps = 2/403 (0%)
Strand = Plus / Minus
Query: 568 aagaccgccgtccacgcagccaccggaccacagctacgctcttctggacttggacgcctt 627
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62380639 aagaccgccgtccacgcagccaccggaccacagctacgctcttctggacttggacgcctt 62380580
Query: 628 aaaaaagaaactcttcctgactctgcgggagaacgagaggctccgcaggcgcttgcgggc 687
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62380579 aaaaaagaaactcttcctgactctgcgggagaacgagaggctccgcaggcgcttgcgggc 62380520
Query: 688 ccagaggctggtgatgcggaggatggccagccacctccaagcctgccgggccgggtcgcg 747
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62380519 ccagaggctggtgatgcggaggatggccagccacctccaagcctgccgggccgggtcgcg 62380460
Query: 748 gccagagcagcagagctgagcctgacagccgggcttcctccccggagcccctggccctgg 807
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62380459 gccagagcagcagagctgagcctgacagccgggcttcctccccggagcccctggccctgg 62380400
Query: 808 cgtcgccaagagcccagcgctgcggccgcgggtgaaggtcagcggcatcgggcatggagg 867
||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62380399 cgtcgccaagagcccagagctgcggccgcgggtgaaggtcagcggcatcgggcatggagg 62380340
Query: 868 gggcagggccgggagcaaacccccagggtccccgctgtcccccc-acacaagctgggcca 926
||||||||||||||||| |||||||||||||||||||||||||| |||||||||||||||
Sbjct: 62380339 gggcagggccgggagcagacccccagggtccccgctgtccccccaacacaagctgggcca 62380280
Query: 927 gcgtctggggaacaccccaag-cacacgggcaggtgctcccaa 968
||||||||||||||||||||| |||||||||||||||||||||
Sbjct: 62380279 gcgtctggggaacaccccaagccacacgggcaggtgctcccaa 62380237
Score = 385 bits (194), Expect = e-104
Identities = 194/194 (100%)
Strand = Plus / Minus
Query: 141 ggttcccgttcagccgcccagagctgctaaaggaatgggtgctgaacatcggccgaggcg 200
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62384343 ggttcccgttcagccgcccagagctgctaaaggaatgggtgctgaacatcggccgaggcg 62384284
Query: 201 acttcgagcccaagcagcacacggtcatctgctcggagcacttccgccccgagtgcttca 260
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62384283 acttcgagcccaagcagcacacggtcatctgctcggagcacttccgccccgagtgcttca 62384224
Query: 261 gcgcctttgggaaccgcaagaacctgaagcacaacgcggtgcccacggagttcaccttcc 320
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62384223 gcgcctttgggaaccgcaagaacctgaagcacaacgcggtgcccacggagttcaccttcc 62384164
Query: 321 aggggcccccgcag 334
||||||||||||||
Sbjct: 62384163 aggggcccccgcag 62384150
Score = 214 bits (108), Expect = 7e-53
Identities = 108/108 (100%)
Strand = Plus / Minus
Query: 393 aggtccaccctgagacggggcccagcgagtgtggcccggggaggaagagggacacagcac 452
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62380912 aggtccaccctgagacggggcccagcgagtgtggcccggggaggaagagggacacagcac 62380853
Query: 453 tcgaggtgcggcagccgcccccggacgccgggggcccccgagcacagg 500
||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62380852 tcgaggtgcggcagccgcccccggacgccgggggcccccgagcacagg 62380805
Score = 131 bits (66), Expect = 8e-28
Identities = 66/66 (100%)
Strand = Plus / Minus
Query: 331 gcagctggtgcgggagacggctgaccctgatggccagatcggggccgcctctgagaaggg 390
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62381206 gcagctggtgcgggagacggctgaccctgatggccagatcggggccgcctctgagaaggg 62381147
Query: 391 agaggt 396
||||||
Sbjct: 62381146 agaggt 62381141
Score = 69.9 bits (35), Expect = 2e-09
Identities = 35/35 (100%)
Strand = Plus / Minus
Query: 497 cagggccagcctggcagaacgggagccgtggccaa 531
|||||||||||||||||||||||||||||||||||
Sbjct: 62380710 cagggccagcctggcagaacgggagccgtggccaa 62380676
Score = 63.9 bits (32), Expect = 2e-07
Identities = 32/32 (100%)
Strand = Plus / Minus
Query: 111 gcagtcgcaggaagcagctcaccttccaccgg 142
||||||||||||||||||||||||||||||||
Sbjct: 62387114 gcagtcgcaggaagcagctcaccttccaccgg 62387083
Database: Sscrofa_10.2.fasta
Posted date: Nov 16, 2011 10:34 AM
Number of letters in database: 2,808,509,378
Number of sequences in database: 4582
Lambda K H
1.37 0.711 1.31
Gapped
Lambda K H
1.37 0.711 1.31
Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 5, Extension: 2
Number of Sequences: 4582
Number of Hits to DB: 20,702,678
Number of extensions: 147
Number of successful extensions: 147
Number of sequences better than 1.0e-05: 1
Number of HSP's gapped: 145
Number of HSP's successfully gapped: 6
Length of query: 973
Length of database: 2,808,509,378
Length adjustment: 21
Effective length of query: 952
Effective length of database: 2,808,413,156
Effective search space: 2673609324512
Effective search space used: 2673609324512
X1: 11 (21.8 bits)
X2: 15 (29.7 bits)
X3: 50 (99.1 bits)
S1: 18 (36.2 bits)
S2: 29 (58.0 bits)