Animal-Genome cDNA 20110601C-008166


Search to RefSeqBP_Rel49

BLASTX 2.2.24 [Aug-08-2010]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= 20110601C-008166
         (973 letters)

Database: RefSeq49_BP.fasta 
           33,088 sequences; 17,681,374 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Alignment   gi|NP_001068815.1| THAP domain-containing protein 3 [Bos taurus].    254   8e-68
Alignment   gi|NP_001029820.1| THAP domain-containing protein 1 [Bos taurus].     87   3e-17
Alignment   gi|NP_001095670.1| THAP domain-containing protein 8 [Bos taurus].     62   6e-10
Alignment   gi|NP_001033758.1| THAP domain-containing protein 4 [Bos taurus].     56   4e-08
Alignment   gi|NP_001091935.1| THAP domain-containing protein 7 [Bos taurus].     51   2e-06
Alignment   gi|XP_002688436.1| PREDICTED: THAP domain containing 9 [Bos tau...    51   2e-06
Alignment   gi|XP_874067.2| PREDICTED: THAP domain containing 6 [Bos taurus].     51   2e-06
Alignment   gi|NP_001096760.1| THAP domain-containing protein 6 [Bos taurus].     50   4e-06

>ref|NP_001068815.1| THAP domain-containing protein 3 [Bos taurus].
          Length = 239

 Score =  254 bits (649), Expect = 8e-68
 Identities = 128/180 (71%), Positives = 135/180 (75%), Gaps = 1/180 (0%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           MPKSCAARQCCNRYS+RRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR
Sbjct: 1   MPKSCAARQCCNRYSNRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 60

Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIG-AASEKGEVHPETGPSEC 424
           PECFSAFGNRKNLKHNAVPT F FQGPPQLVRE  DP G+ G A S + +V PETG  EC
Sbjct: 61  PECFSAFGNRKNLKHNAVPTVFAFQGPPQLVRENTDPTGRSGDATSGERKVLPETGSGEC 120

Query: 425 GPGRKRDTALEVRQPPPDAGGPRAQGQPGRTGAVAXXXXXXXXXXXXXRPPSTQPPDHSY 604
           G GRK DT +EV Q PP+ GG  AQ  P  T   +             R   TQP DHSY
Sbjct: 121 GLGRKMDTTVEVLQLPPEVGGLGAQ-VPPHTPETSGVPGQPASPPELKRRLPTQPSDHSY 179


>ref|NP_001029820.1| THAP domain-containing protein 1 [Bos taurus].
          Length = 213

 Score = 86.7 bits (213), Expect = 3e-17
 Identities = 42/100 (42%), Positives = 60/100 (60%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           M +SC+A  C NRY  + K ++FH+FP +RP L K+W   + R +F+P +++ ICSEHF 
Sbjct: 1   MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKKWEAAVRRKNFKPTKYSSICSEHFT 59

Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQ 367
           P+CF    N K LK +AVPT F    P     +  +P  Q
Sbjct: 60  PDCFKRECNNKLLKEDAVPTIFLCTEPHDKKEDLLEPQEQ 99


>ref|NP_001095670.1| THAP domain-containing protein 8 [Bos taurus].
          Length = 257

 Score = 62.4 bits (150), Expect = 6e-10
 Identities = 49/166 (29%), Positives = 71/166 (42%), Gaps = 15/166 (9%)
 Frame = +2

Query: 68  MPKSCAARQCCN---RYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSE 238
           MPK C A  C N   +  +  + ++F++FP      L+ W+ ++GR  + P  H  +CSE
Sbjct: 1   MPKYCLAPNCSNTAGQLGADNRPVSFYKFPLKDGPRLQAWLRHMGREHWVPSCHQHLCSE 60

Query: 239 HFRPECFSAFGNRKNLKHNAVPTEFTFQGPPQ-------LVRETADPDGQIGAASEKGEV 397
           HF P CF      + L+ +AVP+ F+   P Q         +    P  Q   +   G V
Sbjct: 61  HFAPSCFQWRWGVRYLRPDAVPSIFSRVPPAQRQQSSRSTEKPVVPPPLQSTPSLASGPV 120

Query: 398 H-----PETGPSECGPGRKRDTALEVRQPPPDAGGPRAQGQPGRTG 520
                 P +G  E  P     T L + QP P    P A  Q  R G
Sbjct: 121 QLLVLGPASGAPE-APATVFLTPLSL-QPAPAGPRPGASAQHPRAG 164


>ref|NP_001033758.1| THAP domain-containing protein 4 [Bos taurus].
          Length = 570

 Score = 56.2 bits (134), Expect = 4e-08
 Identities = 29/81 (35%), Positives = 45/81 (55%), Gaps = 3/81 (3%)
 Frame = +2

Query: 80  CAARQCCNRYSSRRKQ-LTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFRPEC 256
           CAA  C NR     K+ ++FHRFP    + L +W+  + R ++ P +++ +CSEHF  + 
Sbjct: 5   CAAANCSNRQGKGEKRAVSFHRFPLKDSKRLMQWLKAVQRDNWTPTKYSFLCSEHFTKDS 64

Query: 257 FS--AFGNRKNLKHNAVPTEF 313
           FS       + LK  AVP+ F
Sbjct: 65  FSKRLEDQHRLLKPTAVPSIF 85


>ref|NP_001091935.1| THAP domain-containing protein 7 [Bos taurus].
          Length = 308

 Score = 50.8 bits (120), Expect = 2e-06
 Identities = 31/93 (33%), Positives = 45/93 (48%), Gaps = 11/93 (11%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNI------GRGDFEP-KQH 220
           MP+ C+A  CC R +  +R + ++FHR P         W+ N       G+G ++P  ++
Sbjct: 1   MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY 60

Query: 221 TVICSEHFRPECFSAFG--NRKNLKHNAVPTEF 313
              CS+HF   CF   G      LK  AVPT F
Sbjct: 61  IYFCSKHFEENCFELVGISGYHRLKEGAVPTIF 93


>ref|XP_002688436.1| PREDICTED: THAP domain containing 9 [Bos taurus].
          Length = 899

 Score = 50.8 bits (120), Expect = 2e-06
 Identities = 30/88 (34%), Positives = 48/88 (54%), Gaps = 8/88 (9%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNIGRGD------FEPKQHT 223
           M +SC+A  C  R +  SR + L+FH+FP    +  K W+  + R D      + P    
Sbjct: 1   MTRSCSAVGCSTRDTVLSRERGLSFHQFPTDTIQRSK-WIRAVNRMDPRSKKIWIPGPGA 59

Query: 224 VICSEHFRPECFSAFGNRKNLKHNAVPT 307
           ++CS+HF+   F ++G R+ LK  AVP+
Sbjct: 60  MLCSKHFQESDFESYGIRRKLKKGAVPS 87


>ref|XP_874067.2| PREDICTED: THAP domain containing 6 [Bos taurus].
          Length = 899

 Score = 50.8 bits (120), Expect = 2e-06
 Identities = 30/88 (34%), Positives = 48/88 (54%), Gaps = 8/88 (9%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNIGRGD------FEPKQHT 223
           M +SC+A  C  R +  SR + L+FH+FP    +  K W+  + R D      + P    
Sbjct: 1   MTRSCSAVGCSTRDTVLSRERGLSFHQFPTDTIQRSK-WIRAVNRMDPRSKKIWIPGPGA 59

Query: 224 VICSEHFRPECFSAFGNRKNLKHNAVPT 307
           ++CS+HF+   F ++G R+ LK  AVP+
Sbjct: 60  MLCSKHFQESDFESYGIRRKLKKGAVPS 87


>ref|NP_001096760.1| THAP domain-containing protein 6 [Bos taurus].
          Length = 180

 Score = 49.7 bits (117), Expect = 4e-06
 Identities = 33/90 (36%), Positives = 46/90 (51%), Gaps = 8/90 (8%)
 Frame = +2

Query: 68  MPKSCAARQCCNRY--SSRRKQLTFHRFPFSRPELLKEWVLNIGR------GDFEPKQHT 223
           M K C+A  C +R   +S+ K LTFH FP     + ++WVL + R      G +EPK+  
Sbjct: 1   MVKCCSAVGCASRCLPNSKLKGLTFHVFPTDE-NIKRKWVLAMKRLDVNAAGIWEPKKGD 59

Query: 224 VICSEHFRPECFSAFGNRKNLKHNAVPTEF 313
           V+CS HF+   F        LK   VP+ F
Sbjct: 60  VLCSRHFKKTDFDRSTPNIKLKPGVVPSIF 89


  Database: RefSeq49_BP.fasta
    Posted date:  Oct 17, 2011  1:42 PM
  Number of letters in database: 17,681,374
  Number of sequences in database:  33,088
  
Lambda     K      H
   0.312    0.131    0.418 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33088
Number of Hits to DB: 40,689,968
Number of extensions: 1411151
Number of successful extensions: 10931
Number of sequences better than 1.0e-05: 8
Number of HSP's gapped: 10634
Number of HSP's successfully gapped: 8
Length of query: 324
Length of database: 17,681,374
Length adjustment: 102
Effective length of query: 222
Effective length of database: 14,306,398
Effective search space: 3176020356
Effective search space used: 3176020356
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.4 bits)
S2: 34 (17.7 bits)
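
Note on the statistics block above: the effective lengths and the effective search space follow from the reported query length, database size, and length adjustment. The sketch below is a minimal check and not part of the BLAST output. It assumes the length adjustment is subtracted once from the translated query and once per database sequence, which reproduces the figures printed for RefSeq49_BP.fasta. The function names are hypothetical. The expect value is approximated with the standard bit-score relation E = m * n * 2^(-S'), so it agrees with the reported E values only up to the rounding of the bit score.

```python
def effective_search_space(query_len, db_letters, db_seqs, length_adjustment):
    """Return (effective query length, effective database length, search space).

    Assumption: the length adjustment is removed once from the query and
    once from every database sequence.
    """
    eff_query = query_len - length_adjustment
    eff_db = db_letters - db_seqs * length_adjustment
    return eff_query, eff_db, eff_query * eff_db


def expect_from_bits(bit_score, search_space):
    """Approximate E value from a bit score: E = search_space * 2**(-bit_score)."""
    return search_space * 2.0 ** (-bit_score)


# Values copied from the RefSeq49_BP.fasta statistics block.
eff_q, eff_db, space = effective_search_space(
    query_len=324, db_letters=17_681_374, db_seqs=33_088, length_adjustment=102
)
print(eff_q, eff_db, space)  # 222 14306398 3176020356

# Second hit (NP_001029820.1): 86.7 bits, reported Expect = 3e-17.
print(f"{expect_from_bits(86.7, space):.1e}")
```

The same check can be repeated for the other databases by substituting the values from their own statistics blocks.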

Search to RefSeqCP_Rel49

BLASTX 2.2.24 [Aug-08-2010]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= 20110601C-008166
         (973 letters)

Database: RefSeq49_CP.fasta 
           33,336 sequences; 18,874,504 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Alignment   gi|XP_849390.1| PREDICTED: similar to THAP domain protein 3 iso...   218   5e-57
Alignment   gi|XP_858681.1| PREDICTED: similar to THAP domain protein 3 iso...   201   9e-52
Alignment   gi|XP_848435.1| PREDICTED: similar to THAP domain protein 1 iso...    89   5e-18
Alignment   gi|XP_532789.1| PREDICTED: similar to THAP domain protein 1 iso...    86   4e-17
Alignment   gi|XP_541680.2| PREDICTED: similar to CLIP-170-related protein ...    65   1e-10
Alignment   gi|XP_539521.2| PREDICTED: similar to THAP domain containing 5 ...    62   8e-10
Alignment   gi|XP_544956.2| PREDICTED: similar to THAP domain containing 9 ...    52   1e-06
Alignment   gi|XP_543564.2| PREDICTED: similar to THAP domain protein 7 [Ca...    51   2e-06
Alignment   gi|XP_855909.1| PREDICTED: similar to THAP domain containing 6 ...    50   3e-06
Alignment   gi|XP_544933.1| PREDICTED: similar to THAP domain containing 6 ...    49   7e-06

>ref|XP_849390.1| PREDICTED: similar to THAP domain protein 3 isoform 1 [Canis
           familiaris].
          Length = 225

 Score =  218 bits (556), Expect = 5e-57
 Identities = 127/234 (54%), Positives = 133/234 (56%), Gaps = 2/234 (0%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGR +F+PKQHTVICSEHFR
Sbjct: 1   MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRANFKPKQHTVICSEHFR 60

Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADP-DGQIGAASEKG-EVHPETGPSE 421
           PECFSAFGNRKNLK NAVPT F FQ   QL RE ADP  G     S K   V  E  P+E
Sbjct: 61  PECFSAFGNRKNLKQNAVPTVFAFQDATQLARENADPAGGDTNVDSHKEVSVASEVVPAE 120

Query: 422 CGPGRKRDTALEVRQPPPDAGGPRAQGQPGRTGAVAXXXXXXXXXXXXXRPPSTQPPDHS 601
           CG GRK + ALEV   PP A GP  Q  P R                   P   Q  DHS
Sbjct: 121 CGWGRKLEAALEVL--PPMASGPAEQVVPRRLQGT-------QAPAQQASPSPAQTSDHS 171

Query: 602 YXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMASHLQACRAGSRPEQQS 763
           Y                                   M+S L+A RAG  PE QS
Sbjct: 172 YALLDLDALKKKLFLTLKENEKLRKRLKAQRLVIRRMSSRLRAHRAGPPPEPQS 225


>ref|XP_858681.1| PREDICTED: similar to THAP domain protein 3 isoform 2 [Canis
           familiaris].
          Length = 203

 Score =  201 bits (511), Expect = 9e-52
 Identities = 117/232 (50%), Positives = 124/232 (53%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGR +F+PKQHTVICSEHFR
Sbjct: 1   MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRANFKPKQHTVICSEHFR 60

Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASEKGEVHPETGPSECG 427
           PECFSAFGNRKNLK NAVPT F FQ   Q+  E                      P+ECG
Sbjct: 61  PECFSAFGNRKNLKQNAVPTVFAFQDATQVASEVV--------------------PAECG 100

Query: 428 PGRKRDTALEVRQPPPDAGGPRAQGQPGRTGAVAXXXXXXXXXXXXXRPPSTQPPDHSYX 607
            GRK + ALEV   PP A GP  Q  P R                   P   Q  DHSY 
Sbjct: 101 WGRKLEAALEVL--PPMASGPAEQVVPRRLQGT-------QAPAQQASPSPAQTSDHSYA 151

Query: 608 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMASHLQACRAGSRPEQQS 763
                                             M+S L+A RAG  PE QS
Sbjct: 152 LLDLDALKKKLFLTLKENEKLRKRLKAQRLVIRRMSSRLRAHRAGPPPEPQS 203


>ref|XP_848435.1| PREDICTED: similar to THAP domain protein 1 isoform 2 [Canis
           familiaris].
          Length = 213

 Score = 89.4 bits (220), Expect = 5e-18
 Identities = 43/101 (42%), Positives = 61/101 (60%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           M +SC+A  C NRY  + K ++FH+FP +RP L K+W   + R +F+P +++ ICSEHF 
Sbjct: 1   MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKKWEAAVRRKNFKPTKYSSICSEHFT 59

Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQI 370
           P+CF    N K LK NAVPT F    P     +  +P  Q+
Sbjct: 60  PDCFKRECNNKLLKENAVPTIFLCTEPHDKKEDLLEPQEQL 100


>ref|XP_532789.1| PREDICTED: similar to THAP domain protein 1 isoform 1 [Canis
           familiaris].
          Length = 178

 Score = 86.3 bits (212), Expect = 4e-17
 Identities = 41/87 (47%), Positives = 56/87 (64%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           M +SC+A  C NRY  + K ++FH+FP +RP L K+W   + R +F+P +++ ICSEHF 
Sbjct: 1   MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKKWEAAVRRKNFKPTKYSSICSEHFT 59

Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGP 328
           P+CF    N K LK NAVPT F    P
Sbjct: 60  PDCFKRECNNKLLKENAVPTIFLCTEP 86


>ref|XP_541680.2| PREDICTED: similar to CLIP-170-related protein [Canis familiaris].
          Length = 871

 Score = 64.7 bits (156), Expect = 1e-10
 Identities = 40/146 (27%), Positives = 66/146 (45%), Gaps = 3/146 (2%)
 Frame = +2

Query: 68  MPKSCAARQCCN---RYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSE 238
           MPK C A  C N   R  +  + ++F++FP      L+ W+ ++G  D+ P  H  +CSE
Sbjct: 1   MPKYCRAPNCSNTAGRLGADNRPVSFYKFPLKDGPRLQAWLRHMGHEDWVPSCHHHLCSE 60

Query: 239 HFRPECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASEKGEVHPETGPS 418
           HF P CF      + L+ +AVP+ F+   P +  + +   +  + + S      PE  P 
Sbjct: 61  HFTPSCFQWRWGVRYLRPDAVPSIFSPAPPAKRQQSSRSTEKPVESPSS-----PEAMPL 115

Query: 419 ECGPGRKRDTALEVRQPPPDAGGPRA 496
              P       + +      +GGP A
Sbjct: 116 SPDPTVSASGPMHLAVLGSASGGPEA 141


>ref|XP_539521.2| PREDICTED: similar to THAP domain containing 5 [Canis familiaris].
          Length = 394

 Score = 62.0 bits (149), Expect = 8e-10
 Identities = 30/85 (35%), Positives = 50/85 (58%), Gaps = 2/85 (2%)
 Frame = +2

Query: 68  MPKSCAARQCCNRY--SSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEH 241
           MP+ CAA  C NR   +S+ ++L+F+ FP    E L++W+ N+ R  + P ++  +CS+H
Sbjct: 1   MPRYCAAICCKNRRGRNSKDRKLSFYPFPLHDKERLEKWLKNMKRDSWVPSKYQFLCSDH 60

Query: 242 FRPECFSAFGNRKNLKHNAVPTEFT 316
           F P+        + LK  A+PT F+
Sbjct: 61  FTPDSLDIRWGIRYLKQTAIPTIFS 85


>ref|XP_544956.2| PREDICTED: similar to THAP domain containing 9 [Canis familiaris].
          Length = 902

 Score = 51.6 bits (122), Expect = 1e-06
 Identities = 30/88 (34%), Positives = 48/88 (54%), Gaps = 8/88 (9%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNIGRGD------FEPKQHT 223
           M +SC+A  C  R +  SR + L+FH+FP    +  K W+  + R D      + P    
Sbjct: 1   MTRSCSAVGCSTRDTVLSRERGLSFHQFPTDTIQRSK-WIRAVNRVDPRSKKIWIPGPGA 59

Query: 224 VICSEHFRPECFSAFGNRKNLKHNAVPT 307
           ++CS+HF+   F ++G R+ LK  AVP+
Sbjct: 60  ILCSKHFQESDFESYGIRRKLKKGAVPS 87


>ref|XP_543564.2| PREDICTED: similar to THAP domain protein 7 [Canis familiaris].
          Length = 313

 Score = 50.8 bits (120), Expect = 2e-06
 Identities = 31/93 (33%), Positives = 45/93 (48%), Gaps = 11/93 (11%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNI------GRGDFEP-KQH 220
           MP+ C+A  CC R +  +R + ++FHR P         W+ N       G+G ++P  ++
Sbjct: 1   MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY 60

Query: 221 TVICSEHFRPECFSAFG--NRKNLKHNAVPTEF 313
              CS+HF   CF   G      LK  AVPT F
Sbjct: 61  IYFCSKHFEENCFELVGISGYHRLKEGAVPTIF 93


>ref|XP_855909.1| PREDICTED: similar to THAP domain containing 6 isoform 4 [Canis
           familiaris].
          Length = 180

 Score = 50.1 bits (118), Expect = 3e-06
 Identities = 35/101 (34%), Positives = 51/101 (50%), Gaps = 8/101 (7%)
 Frame = +2

Query: 68  MPKSCAARQCCNRY--SSRRKQLTFHRFPFSRPELLKEWVLNIGR------GDFEPKQHT 223
           M K C+A  C +R   +S+ K LTFH FP     + ++WVL + R      G +EPK+  
Sbjct: 1   MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDE-NVKRKWVLAMKRLDVNAAGIWEPKKGD 59

Query: 224 VICSEHFRPECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRE 346
           V+CS HF+   F        LK   +P+ F    PP  ++E
Sbjct: 60  VLCSRHFKKTDFDRSTPNIKLKPGVIPSIF---DPPSHLQE 97


>ref|XP_544933.1| PREDICTED: similar to THAP domain containing 6 isoform 1 [Canis
           familiaris].
          Length = 222

 Score = 48.9 bits (115), Expect = 7e-06
 Identities = 32/90 (35%), Positives = 46/90 (51%), Gaps = 8/90 (8%)
 Frame = +2

Query: 68  MPKSCAARQCCNRY--SSRRKQLTFHRFPFSRPELLKEWVLNIGR------GDFEPKQHT 223
           M K C+A  C +R   +S+ K LTFH FP     + ++WVL + R      G +EPK+  
Sbjct: 1   MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDE-NVKRKWVLAMKRLDVNAAGIWEPKKGD 59

Query: 224 VICSEHFRPECFSAFGNRKNLKHNAVPTEF 313
           V+CS HF+   F        LK   +P+ F
Sbjct: 60  VLCSRHFKKTDFDRSTPNIKLKPGVIPSIF 89


  Database: RefSeq49_CP.fasta
    Posted date:  Oct 17, 2011  1:42 PM
  Number of letters in database: 18,874,504
  Number of sequences in database:  33,336
  
Lambda     K      H
   0.312    0.131    0.418 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33336
Number of Hits to DB: 42,059,688
Number of extensions: 1398302
Number of successful extensions: 11398
Number of sequences better than 1.0e-05: 11
Number of HSP's gapped: 11064
Number of HSP's successfully gapped: 11
Length of query: 324
Length of database: 18,874,504
Length adjustment: 103
Effective length of query: 221
Effective length of database: 15,440,896
Effective search space: 3412438016
Effective search space used: 3412438016
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.4 bits)
S2: 34 (17.7 bits)

Search to RefSeqSP_Rel49

BLASTX 2.2.24 [Aug-08-2010]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= 20110601C-008166
         (973 letters)

Database: RefSeq49_SP.fasta 
           24,897 sequences; 11,343,932 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Alignment   gi|XP_003359917.1| PREDICTED: THAP domain-containing protein 1-...    87   1e-17
Alignment   gi|XP_003126431.1| PREDICTED: LOW QUALITY PROTEIN: THAP domain-...    77   2e-14
Alignment   gi|XP_003134826.1| PREDICTED: THAP domain-containing protein 5-...    60   2e-09
Alignment   gi|XP_003127122.1| PREDICTED: THAP domain-containing protein 8-...    59   3e-09
Alignment   gi|XP_003133046.1| PREDICTED: THAP domain-containing protein 7-...    52   5e-07
Alignment   gi|XP_003129402.1| PREDICTED: THAP domain-containing protein 9 ...    50   1e-06

>ref|XP_003359917.1| PREDICTED: THAP domain-containing protein 1-like [Sus scrofa].
          Length = 213

 Score = 87.4 bits (215), Expect = 1e-17
 Identities = 42/101 (41%), Positives = 61/101 (60%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           M +SC+A  C NRY  + K ++FH+FP +RP L K+W   + R +F+P +++ ICSEHF 
Sbjct: 1   MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKKWEAAVRRKNFKPTKYSSICSEHFT 59

Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQI 370
           P+CF    N K LK +AVPT F    P     +  +P  Q+
Sbjct: 60  PDCFKRECNNKLLKEDAVPTIFLCTEPHDKKEDLLEPQEQL 100


>ref|XP_003126431.1| PREDICTED: LOW QUALITY PROTEIN: THAP domain-containing protein
           2-like [Sus scrofa].
          Length = 227

 Score = 76.6 bits (187), Expect = 2e-14
 Identities = 37/84 (44%), Positives = 50/84 (59%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           MP +CAA  C   Y+ +   ++FHRFP   P+  KEWV  + R +F P +HT +CS+HF 
Sbjct: 1   MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE 58

Query: 248 PECFSAFGNRKNLKHNAVPTEFTF 319
             CF   G  + LK +AVPT F F
Sbjct: 59  ASCFDLTGQTRRLKMDAVPTIFDF 82


>ref|XP_003134826.1| PREDICTED: THAP domain-containing protein 5-like [Sus scrofa].
          Length = 395

 Score = 59.7 bits (143), Expect = 2e-09
 Identities = 29/85 (34%), Positives = 47/85 (55%), Gaps = 2/85 (2%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRR--KQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEH 241
           MP+ CAA  C NR       ++L+F+ FP    E L++W+ N+ R  + P ++  +CS+H
Sbjct: 1   MPRYCAAICCKNRRGRNNTDRKLSFYPFPLHDKERLEKWLKNMKRDSWVPSKYQFLCSDH 60

Query: 242 FRPECFSAFGNRKNLKHNAVPTEFT 316
           F P+        + LK  A+PT F+
Sbjct: 61  FTPDSLDIRWGIRYLKQTAIPTIFS 85


>ref|XP_003127122.1| PREDICTED: THAP domain-containing protein 8-like [Sus scrofa].
          Length = 269

 Score = 59.3 bits (142), Expect = 3e-09
 Identities = 48/162 (29%), Positives = 67/162 (41%), Gaps = 20/162 (12%)
 Frame = +2

Query: 68  MPKSCAARQCCN---RYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSE 238
           MP+ C A  C N   R  +  + ++F++FP      L+ W+ ++G   + P  H  +CSE
Sbjct: 1   MPRYCRAPNCSNTAGRLGADHRPVSFYKFPLKDGPRLQAWLRHMGLEHWVPSCHQHLCSE 60

Query: 239 HFRPECFSAFGNRKNLKHNAVPTEF---TFQG-------------PPQLVRETADPDGQI 370
           HF P CF      + L+ +AVP+ F   TF               PP     T+   G  
Sbjct: 61  HFAPSCFQWRWGVRYLRPDAVPSIFSPATFTERQENCGSTEKPVMPPPPPEATSLFPGSA 120

Query: 371 GAASEKGEVH-PETGPSECGPGRKRDTALEVRQPPPDAGGPR 493
           G A   G V     GP+  GP       L     PP   GPR
Sbjct: 121 GPA--PGPVRLVVLGPASGGPEAPATVILTPLPLPPVPAGPR 160


>ref|XP_003133046.1| PREDICTED: THAP domain-containing protein 7-like isoform 1 [Sus
           scrofa].
          Length = 313

 Score = 52.0 bits (123), Expect = 5e-07
 Identities = 46/158 (29%), Positives = 63/158 (39%), Gaps = 20/158 (12%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNI------GRGDFEP-KQH 220
           MP+ C+A  CC R +  +R + ++FHR P         W+ N       G+G ++P  ++
Sbjct: 1   MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY 60

Query: 221 TVICSEHFRPECFSAFG--NRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASEKGE 394
              CS+HF   CF   G      LK  AVPT   F+   +L R           A  KG 
Sbjct: 61  IYFCSKHFEENCFELVGISGYHRLKEGAVPT--IFESFSKLRR----------TAKTKGH 108

Query: 395 VHPETGP---------SECGPGRKRDTALEVRQPPPDA 481
            +P   P           C  GR   T      PPP A
Sbjct: 109 SYPPGPPDVSRLRRCRKRCSEGRGPTTPF---SPPPPA 143


>ref|XP_003129402.1| PREDICTED: THAP domain-containing protein 9 [Sus scrofa].
          Length = 903

 Score = 50.4 bits (119), Expect = 1e-06
 Identities = 29/88 (32%), Positives = 48/88 (54%), Gaps = 8/88 (9%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNIGRGD------FEPKQHT 223
           M +SC+A  C  R +  SR + L+FH+FP    +   +W+  + R D      + P    
Sbjct: 1   MTRSCSAVGCSTRDTVLSRERGLSFHQFPTDTIQR-SQWIRAVNRMDPRSKKIWIPGPGA 59

Query: 224 VICSEHFRPECFSAFGNRKNLKHNAVPT 307
           ++CS+HF+   F ++G R+ LK  AVP+
Sbjct: 60  MLCSKHFQESDFESYGIRRKLKKGAVPS 87


  Database: RefSeq49_SP.fasta
    Posted date:  Oct 17, 2011  1:42 PM
  Number of letters in database: 11,343,932
  Number of sequences in database:  24,897
  
Lambda     K      H
   0.312    0.131    0.418 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 24897
Number of Hits to DB: 26,904,719
Number of extensions: 996415
Number of successful extensions: 7540
Number of sequences better than 1.0e-05: 6
Number of HSP's gapped: 7311
Number of HSP's successfully gapped: 6
Length of query: 324
Length of database: 11,343,932
Length adjustment: 99
Effective length of query: 225
Effective length of database: 8,879,129
Effective search space: 1997804025
Effective search space used: 1997804025
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.4 bits)
S2: 34 (17.7 bits)

Search to RefSeqMP_Rel49

BLASTX 2.2.24 [Aug-08-2010]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= 20110601C-008166
         (973 letters)

Database: RefSeq49_MP.fasta 
           30,036 sequences; 15,617,559 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Alignment   gi|NP_780361.2| THAP domain-containing protein 3 isoform 1 [Mus...   200   1e-51
Alignment   gi|NP_001139401.1| THAP domain-containing protein 3 isoform 2 [...   182   3e-46
Alignment   gi|NP_950243.1| THAP domain-containing protein 1 [Mus musculus].      90   2e-18
Alignment   gi|NP_080056.1| THAP domain-containing protein 2 [Mus musculus].      76   4e-14
Alignment   gi|NP_080196.3| THAP domain-containing protein 4 [Mus musculus].      56   5e-08
Alignment   gi|NP_081185.1| THAP domain-containing protein 7 [Mus musculus].      51   2e-06

>ref|NP_780361.2| THAP domain-containing protein 3 isoform 1 [Mus musculus].
          Length = 218

 Score =  200 bits (509), Expect = 1e-51
 Identities = 109/180 (60%), Positives = 111/180 (61%), Gaps = 1/180 (0%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELL+EWVLNIGR DF+PKQHTVICSEHFR
Sbjct: 1   MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLREWVLNIGRADFKPKQHTVICSEHFR 60

Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASEKGEVHPETGPSECG 427
           PECFSAFGNRKNLKHNAVPT F FQ P                     EV PE G     
Sbjct: 61  PECFSAFGNRKNLKHNAVPTVFAFQNPT--------------------EVCPEVGAGGDS 100

Query: 428 PGRKRDTALEVRQPPPDAGGPRAQGQPGRTGAVA-XXXXXXXXXXXXXRPPSTQPPDHSY 604
            GR  DT LE  QPP    GP  Q  P R    A              RP   QP DHSY
Sbjct: 101 SGRNMDTTLEELQPPTPE-GPVQQVLPDREAMEATEAAGLPASPLGLKRPLPGQPSDHSY 159


>ref|NP_001139401.1| THAP domain-containing protein 3 isoform 2 [Mus musculus].
          Length = 184

 Score =  182 bits (462), Expect = 3e-46
 Identities = 85/106 (80%), Positives = 92/106 (86%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELL+EWVLNIGR DF+PKQHTVICSEHFR
Sbjct: 1   MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLREWVLNIGRADFKPKQHTVICSEHFR 60

Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASE 385
           PECFSAFGNRKNLKHNAVPT F FQ P +++     PD +   A+E
Sbjct: 61  PECFSAFGNRKNLKHNAVPTVFAFQNPTEVL-----PDREAMEATE 101


>ref|NP_950243.1| THAP domain-containing protein 1 [Mus musculus].
          Length = 210

 Score = 90.1 bits (222), Expect = 2e-18
 Identities = 51/135 (37%), Positives = 73/135 (54%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           M +SC+A  C NRY  + K ++FH+FP +RP L K+W   + R +F+P +++ ICSEHF 
Sbjct: 1   MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKQWEAAVKRKNFKPTKYSSICSEHFT 59

Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASEKGEVHPETGPSECG 427
           P+CF    N K LK NAVPT F +  P +   +  D + Q            E  PS   
Sbjct: 60  PDCFKRECNNKLLKENAVPTIFLYIEPHE---KKEDLESQ------------EQLPSPSP 104

Query: 428 PGRKRDTALEVRQPP 472
           P  + D A+ +  PP
Sbjct: 105 PASQVDAAIGLLMPP 119


>ref|NP_080056.1| THAP domain-containing protein 2 [Mus musculus].
          Length = 217

 Score = 75.9 bits (185), Expect = 4e-14
 Identities = 37/84 (44%), Positives = 50/84 (59%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           MP +CAA  C   Y+ +   ++FHRFP   P+  KEWV  + R +F P +HT +CS+HF 
Sbjct: 1   MPTNCAAAGCAATYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE 58

Query: 248 PECFSAFGNRKNLKHNAVPTEFTF 319
             CF   G  + LK +AVPT F F
Sbjct: 59  ASCFDLTGQTRRLKMDAVPTIFDF 82


>ref|NP_080196.3| THAP domain-containing protein 4 [Mus musculus].
          Length = 569

 Score = 55.8 bits (133), Expect = 5e-08
 Identities = 40/145 (27%), Positives = 63/145 (43%), Gaps = 11/145 (7%)
 Frame = +2

Query: 80  CAARQCCNRYSSRRKQ-LTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFRPEC 256
           CAA  C NR     K+ ++FHRFP    + L +W+  + R ++ P +++ +CSEHF  + 
Sbjct: 5   CAAVNCSNRQGKGEKRAVSFHRFPLKDSKRLIQWLKAVQRDNWTPTKYSFLCSEHFTKDS 64

Query: 257 FS--AFGNRKNLKHNAVPTEFTFQGPPQLV--------RETADPDGQIGAASEKGEVHPE 406
           FS       + LK  AVP+ F      +          + TA   G   A + KG +   
Sbjct: 65  FSKRLEDQHRLLKPTAVPSIFHLSEKKRGAGGHGHARRKTTAAMRGHTSAETGKGTIGSS 124

Query: 407 TGPSECGPGRKRDTALEVRQPPPDA 481
              S+    +     L+   P  DA
Sbjct: 125 LSSSDNLMAKPESRKLKRASPQDDA 149


>ref|NP_081185.1| THAP domain-containing protein 7 [Mus musculus].
          Length = 309

 Score = 50.8 bits (120), Expect = 2e-06
 Identities = 31/93 (33%), Positives = 45/93 (48%), Gaps = 11/93 (11%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNI------GRGDFEP-KQH 220
           MP+ C+A  CC R +  +R + ++FHR P         W+ N       G+G ++P  ++
Sbjct: 1   MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPTSEY 60

Query: 221 TVICSEHFRPECFSAFG--NRKNLKHNAVPTEF 313
              CS+HF   CF   G      LK  AVPT F
Sbjct: 61  IYFCSKHFEENCFELVGISGYHRLKEGAVPTIF 93


  Database: RefSeq49_MP.fasta
    Posted date:  Oct 17, 2011  1:42 PM
  Number of letters in database: 15,617,559
  Number of sequences in database:  30,036
  
Lambda     K      H
   0.312    0.131    0.418 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 30036
Number of Hits to DB: 34,059,676
Number of extensions: 1079143
Number of successful extensions: 7505
Number of sequences better than 1.0e-05: 6
Number of HSP's gapped: 7312
Number of HSP's successfully gapped: 6
Length of query: 324
Length of database: 15,617,559
Length adjustment: 102
Effective length of query: 222
Effective length of database: 12,553,887
Effective search space: 2786962914
Effective search space used: 2786962914
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.4 bits)
S2: 34 (17.7 bits)

Search to RefSeqHP_Rel49

BLASTX 2.2.24 [Aug-08-2010]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= 20110601C-008166
         (973 letters)

Database: RefSeq49_HP.fasta 
           32,964 sequences; 18,297,164 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Alignment   gi|NP_001182682.1| THAP domain-containing protein 3 isoform 3 [...   237   1e-62
Alignment   gi|NP_001182681.1| THAP domain-containing protein 3 isoform 1 [...   233   2e-61
Alignment   gi|NP_612359.2| THAP domain-containing protein 3 isoform 2 [Hom...   221   6e-58
Alignment   gi|NP_060575.1| THAP domain-containing protein 1 isoform 1 [Hom...    91   2e-18
Alignment   gi|NP_113623.1| THAP domain-containing protein 2 [Homo sapiens].      77   3e-14
Alignment   gi|NP_689871.1| THAP domain-containing protein 8 [Homo sapiens].      62   1e-09
Alignment   gi|NP_001123947.1| THAP domain-containing protein 5 isoform 1 [...    61   1e-09
Alignment   gi|NP_057047.4| THAP domain-containing protein 4 isoform 1 [Hom...    55   1e-07
Alignment   gi|NP_001008695.1| THAP domain-containing protein 7 isoform 2 [...    54   2e-07
Alignment   gi|NP_085050.2| THAP domain-containing protein 7 isoform 1 [Hom...    54   2e-07

>ref|NP_001182682.1| THAP domain-containing protein 3 isoform 3 [Homo sapiens].
          Length = 239

 Score =  237 bits (604), Expect = 1e-62
 Identities = 132/240 (55%), Positives = 144/240 (60%), Gaps = 8/240 (3%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRG+F+PKQHTVICSEHFR
Sbjct: 1   MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR 60

Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIG--AASEKGEVHPETGPSE 421
           PECFSAFGNRKNLKHNAVPT F FQ P Q VRE  DP  + G  ++S+K +V PE G  E
Sbjct: 61  PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKVLPEAGAGE 120

Query: 422 CGPGRKRDTALEVRQPPPDAGGPRAQGQPGRTGAVAXXXXXXXXXXXXXRPPSTQPPDHS 601
             PGR  DTALE  Q PP+A G   Q  P R  A               R P+ QP DHS
Sbjct: 121 DSPGRNMDTALEELQLPPNAEGHVKQVSPRRPQA-TEAVGRPTGPAGLRRTPNKQPSDHS 179

Query: 602 YXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMASHLQACR------AGSRPEQQS 763
           Y                                   M+S L+AC+      A   PEQQS
Sbjct: 180 YALLDLDSLKKKLFLTLKENEKLRKRLQAQRLVMRRMSSRLRACKGHQGLQARLGPEQQS 239


>ref|NP_001182681.1| THAP domain-containing protein 3 isoform 1 [Homo sapiens].
          Length = 238

 Score =  233 bits (594), Expect = 2e-61
 Identities = 132/240 (55%), Positives = 144/240 (60%), Gaps = 8/240 (3%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRG+F+PKQHTVICSEHFR
Sbjct: 1   MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR 60

Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIG--AASEKGEVHPETGPSE 421
           PECFSAFGNRKNLKHNAVPT F FQ P Q VRE  DP  + G  ++S+K +V PE G  E
Sbjct: 61  PECFSAFGNRKNLKHNAVPTVFAFQDPTQ-VRENTDPASERGNASSSQKEKVLPEAGAGE 119

Query: 422 CGPGRKRDTALEVRQPPPDAGGPRAQGQPGRTGAVAXXXXXXXXXXXXXRPPSTQPPDHS 601
             PGR  DTALE  Q PP+A G   Q  P R  A               R P+ QP DHS
Sbjct: 120 DSPGRNMDTALEELQLPPNAEGHVKQVSPRRPQA-TEAVGRPTGPAGLRRTPNKQPSDHS 178

Query: 602 YXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMASHLQACR------AGSRPEQQS 763
           Y                                   M+S L+AC+      A   PEQQS
Sbjct: 179 YALLDLDSLKKKLFLTLKENEKLRKRLQAQRLVMRRMSSRLRACKGHQGLQARLGPEQQS 238


>ref|NP_612359.2| THAP domain-containing protein 3 isoform 2 [Homo sapiens].
          Length = 175

 Score =  221 bits (564), Expect = 6e-58
 Identities = 109/149 (73%), Positives = 115/149 (77%), Gaps = 9/149 (6%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRG+F+PKQHTVICSEHFR
Sbjct: 1   MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR 60

Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASE---------KGEVH 400
           PECFSAFGNRKNLKHNAVPT F FQ P Q VRE  DP  + G AS          + +V 
Sbjct: 61  PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKTSPCRSQVL 120

Query: 401 PETGPSECGPGRKRDTALEVRQPPPDAGG 487
           PE G  E  PGR  DTALE  Q PP+A G
Sbjct: 121 PEAGAGEDSPGRNMDTALEELQLPPNAEG 149


>ref|NP_060575.1| THAP domain-containing protein 1 isoform 1 [Homo sapiens].
          Length = 213

 Score = 90.9 bits (224), Expect = 2e-18
 Identities = 44/101 (43%), Positives = 61/101 (60%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           M +SC+A  C NRY  + K ++FH+FP +RP L KEW   + R +F+P +++ ICSEHF 
Sbjct: 1   MVQSCSAYGCKNRYD-KDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFT 59

Query: 248 PECFSAFGNRKNLKHNAVPTEFTFQGPPQLVRETADPDGQI 370
           P+CF    N K LK NAVPT F    P     +  +P  Q+
Sbjct: 60  PDCFKRECNNKLLKENAVPTIFLCTEPHDKKEDLLEPQEQL 100


>ref|NP_113623.1| THAP domain-containing protein 2 [Homo sapiens].
          Length = 228

 Score = 76.6 bits (187), Expect = 3e-14
 Identities = 37/84 (44%), Positives = 50/84 (59%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFR 247
           MP +CAA  C   Y+ +   ++FHRFP   P+  KEWV  + R +F P +HT +CS+HF 
Sbjct: 1   MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE 58

Query: 248 PECFSAFGNRKNLKHNAVPTEFTF 319
             CF   G  + LK +AVPT F F
Sbjct: 59  ASCFDLTGQTRRLKMDAVPTIFDF 82


>ref|NP_689871.1| THAP domain-containing protein 8 [Homo sapiens].
          Length = 274

 Score = 61.6 bits (148), Expect = 1e-09
 Identities = 32/91 (35%), Positives = 50/91 (54%), Gaps = 3/91 (3%)
 Frame = +2

Query: 68  MPKSCAARQCCN---RYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSE 238
           MPK C A  C N   R  +  + ++F++FP      L+ W+ ++G   + P  H  +CSE
Sbjct: 1   MPKYCRAPNCSNTAGRLGADNRPVSFYKFPLKDGPRLQAWLQHMGCEHWVPSCHQHLCSE 60

Query: 239 HFRPECFSAFGNRKNLKHNAVPTEFTFQGPP 331
           HF P CF      + L+ +AVP+ F+ +GPP
Sbjct: 61  HFTPSCFQWRWGVRYLRPDAVPSIFS-RGPP 90


>ref|NP_001123947.1| THAP domain-containing protein 5 isoform 1 [Homo sapiens].
          Length = 395

 Score = 61.2 bits (147), Expect = 1e-09
 Identities = 31/85 (36%), Positives = 47/85 (55%), Gaps = 2/85 (2%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYSSRRK--QLTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEH 241
           MP+ CAA  C NR     K  +L+F+ FP    E L++W+ N+ R  + P ++  +CS+H
Sbjct: 1   MPRYCAAICCKNRRGRNNKDRKLSFYPFPLHDKERLEKWLKNMKRDSWVPSKYQFLCSDH 60

Query: 242 FRPECFSAFGNRKNLKHNAVPTEFT 316
           F P+        + LK  AVPT F+
Sbjct: 61  FTPDSLDIRWGIRYLKQTAVPTIFS 85


>ref|NP_057047.4| THAP domain-containing protein 4 isoform 1 [Homo sapiens].
          Length = 577

 Score = 54.7 bits (130), Expect = 1e-07
 Identities = 29/81 (35%), Positives = 45/81 (55%), Gaps = 3/81 (3%)
 Frame = +2

Query: 80  CAARQCCNRYSSRRKQ-LTFHRFPFSRPELLKEWVLNIGRGDFEPKQHTVICSEHFRPEC 256
           CAA  C NR     K+ ++FHRFP    + L +W+  + R ++ P +++ +CSEHF  + 
Sbjct: 5   CAAVNCSNRQGKGEKRAVSFHRFPLKDSKRLIQWLKAVQRDNWTPTKYSFLCSEHFTKDS 64

Query: 257 FS--AFGNRKNLKHNAVPTEF 313
           FS       + LK  AVP+ F
Sbjct: 65  FSKRLEDQHRLLKPTAVPSIF 85


>ref|NP_001008695.1| THAP domain-containing protein 7 isoform 2 [Homo sapiens].
          Length = 309

 Score = 53.9 bits (128), Expect = 2e-07
 Identities = 48/159 (30%), Positives = 67/159 (42%), Gaps = 21/159 (13%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNI------GRGDFEP-KQH 220
           MP+ C+A  CC R +  +R + ++FHR P         W+ N       G+G ++P  ++
Sbjct: 1   MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY 60

Query: 221 TVICSEHFRPECFSAFG--NRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASEKGE 394
              CS+HF  +CF   G      LK  AVPT   F+   +L R T            KG 
Sbjct: 61  IYFCSKHFEEDCFELVGISGYHRLKEGAVPT--IFESFSKLRRTT----------KTKGH 108

Query: 395 VHPETGPSE----------CGPGRKRDTALEVRQPPPDA 481
            +P  GP+E          C  GR   T      PPP A
Sbjct: 109 SYP-PGPAEVSRLRRCRKRCSEGRGPTTPF---SPPPPA 143


>ref|NP_085050.2| THAP domain-containing protein 7 isoform 1 [Homo sapiens].
          Length = 309

 Score = 53.9 bits (128), Expect = 2e-07
 Identities = 48/159 (30%), Positives = 67/159 (42%), Gaps = 21/159 (13%)
 Frame = +2

Query: 68  MPKSCAARQCCNRYS--SRRKQLTFHRFPFSRPELLKEWVLNI------GRGDFEP-KQH 220
           MP+ C+A  CC R +  +R + ++FHR P         W+ N       G+G ++P  ++
Sbjct: 1   MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY 60

Query: 221 TVICSEHFRPECFSAFG--NRKNLKHNAVPTEFTFQGPPQLVRETADPDGQIGAASEKGE 394
              CS+HF  +CF   G      LK  AVPT   F+   +L R T            KG 
Sbjct: 61  IYFCSKHFEEDCFELVGISGYHRLKEGAVPT--IFESFSKLRRTT----------KTKGH 108

Query: 395 VHPETGPSE----------CGPGRKRDTALEVRQPPPDA 481
            +P  GP+E          C  GR   T      PPP A
Sbjct: 109 SYP-PGPAEVSRLRRCRKRCSEGRGPTTPF---SPPPPA 143


  Database: RefSeq49_HP.fasta
    Posted date:  Oct 17, 2011  1:42 PM
  Number of letters in database: 18,297,164
  Number of sequences in database:  32,964
  
Lambda     K      H
   0.312    0.131    0.418 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 32964
Number of Hits to DB: 40,889,015
Number of extensions: 1331994
Number of successful extensions: 10365
Number of sequences better than 1.0e-05: 12
Number of HSP's gapped: 10074
Number of HSP's successfully gapped: 12
Length of query: 324
Length of database: 18,297,164
Length adjustment: 103
Effective length of query: 221
Effective length of database: 14,901,872
Effective search space: 3293313712
Effective search space used: 3293313712
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.4 bits)
S2: 34 (17.7 bits)

Search to Sscrofa10_2

BLASTN 2.2.24 [Aug-08-2010]


Query= 20110601C-008166
         (973 letters)

Database: Sscrofa_10.2.fasta 
           4582 sequences; 2,808,509,378 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Sscrofa_Chr06                                                         751   0.0  

>Sscrofa_Chr06 
          Length = 157765593

 Score =  751 bits (379), Expect = 0.0
 Identities = 399/403 (99%), Gaps = 2/403 (0%)
 Strand = Plus / Minus

                                                                            
Query: 568      aagaccgccgtccacgcagccaccggaccacagctacgctcttctggacttggacgcctt 627
                ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62380639 aagaccgccgtccacgcagccaccggaccacagctacgctcttctggacttggacgcctt 62380580

                                                                            
Query: 628      aaaaaagaaactcttcctgactctgcgggagaacgagaggctccgcaggcgcttgcgggc 687
                ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62380579 aaaaaagaaactcttcctgactctgcgggagaacgagaggctccgcaggcgcttgcgggc 62380520

                                                                            
Query: 688      ccagaggctggtgatgcggaggatggccagccacctccaagcctgccgggccgggtcgcg 747
                ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62380519 ccagaggctggtgatgcggaggatggccagccacctccaagcctgccgggccgggtcgcg 62380460

                                                                            
Query: 748      gccagagcagcagagctgagcctgacagccgggcttcctccccggagcccctggccctgg 807
                ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62380459 gccagagcagcagagctgagcctgacagccgggcttcctccccggagcccctggccctgg 62380400

                                                                            
Query: 808      cgtcgccaagagcccagcgctgcggccgcgggtgaaggtcagcggcatcgggcatggagg 867
                ||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62380399 cgtcgccaagagcccagagctgcggccgcgggtgaaggtcagcggcatcgggcatggagg 62380340

                                                                            
Query: 868      gggcagggccgggagcaaacccccagggtccccgctgtcccccc-acacaagctgggcca 926
                ||||||||||||||||| |||||||||||||||||||||||||| |||||||||||||||
Sbjct: 62380339 gggcagggccgggagcagacccccagggtccccgctgtccccccaacacaagctgggcca 62380280

                                                           
Query: 927      gcgtctggggaacaccccaag-cacacgggcaggtgctcccaa 968
                ||||||||||||||||||||| |||||||||||||||||||||
Sbjct: 62380279 gcgtctggggaacaccccaagccacacgggcaggtgctcccaa 62380237



 Score =  385 bits (194), Expect = e-104
 Identities = 194/194 (100%)
 Strand = Plus / Minus

                                                                            
Query: 141      ggttcccgttcagccgcccagagctgctaaaggaatgggtgctgaacatcggccgaggcg 200
                ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62384343 ggttcccgttcagccgcccagagctgctaaaggaatgggtgctgaacatcggccgaggcg 62384284

                                                                            
Query: 201      acttcgagcccaagcagcacacggtcatctgctcggagcacttccgccccgagtgcttca 260
                ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62384283 acttcgagcccaagcagcacacggtcatctgctcggagcacttccgccccgagtgcttca 62384224

                                                                            
Query: 261      gcgcctttgggaaccgcaagaacctgaagcacaacgcggtgcccacggagttcaccttcc 320
                ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62384223 gcgcctttgggaaccgcaagaacctgaagcacaacgcggtgcccacggagttcaccttcc 62384164

                              
Query: 321      aggggcccccgcag 334
                ||||||||||||||
Sbjct: 62384163 aggggcccccgcag 62384150



 Score =  214 bits (108), Expect = 7e-53
 Identities = 108/108 (100%)
 Strand = Plus / Minus

                                                                            
Query: 393      aggtccaccctgagacggggcccagcgagtgtggcccggggaggaagagggacacagcac 452
                ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62380912 aggtccaccctgagacggggcccagcgagtgtggcccggggaggaagagggacacagcac 62380853

                                                                
Query: 453      tcgaggtgcggcagccgcccccggacgccgggggcccccgagcacagg 500
                ||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62380852 tcgaggtgcggcagccgcccccggacgccgggggcccccgagcacagg 62380805



 Score =  131 bits (66), Expect = 8e-28
 Identities = 66/66 (100%)
 Strand = Plus / Minus

                                                                            
Query: 331      gcagctggtgcgggagacggctgaccctgatggccagatcggggccgcctctgagaaggg 390
                ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 62381206 gcagctggtgcgggagacggctgaccctgatggccagatcggggccgcctctgagaaggg 62381147

                      
Query: 391      agaggt 396
                ||||||
Sbjct: 62381146 agaggt 62381141



 Score = 69.9 bits (35), Expect = 2e-09
 Identities = 35/35 (100%)
 Strand = Plus / Minus

                                                   
Query: 497      cagggccagcctggcagaacgggagccgtggccaa 531
                |||||||||||||||||||||||||||||||||||
Sbjct: 62380710 cagggccagcctggcagaacgggagccgtggccaa 62380676



 Score = 63.9 bits (32), Expect = 2e-07
 Identities = 32/32 (100%)
 Strand = Plus / Minus

                                                
Query: 111      gcagtcgcaggaagcagctcaccttccaccgg 142
                ||||||||||||||||||||||||||||||||
Sbjct: 62387114 gcagtcgcaggaagcagctcaccttccaccgg 62387083


  Database: Sscrofa_10.2.fasta
    Posted date:  Nov 16, 2011 10:34 AM
  Number of letters in database: 2,808,509,378
  Number of sequences in database:  4582
  
Lambda     K      H
    1.37    0.711     1.31 

Gapped
Lambda     K      H
    1.37    0.711     1.31 


Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 5, Extension: 2
Number of Sequences: 4582
Number of Hits to DB: 20,702,678
Number of extensions: 147
Number of successful extensions: 147
Number of sequences better than 1.0e-05: 1
Number of HSP's gapped: 145
Number of HSP's successfully gapped: 6
Length of query: 973
Length of database: 2,808,509,378
Length adjustment: 21
Effective length of query: 952
Effective length of database: 2,808,413,156
Effective search space: 2673609324512
Effective search space used: 2673609324512
X1: 11 (21.8 bits)
X2: 15 (29.7 bits)
X3: 50 (99.1 bits)
S1: 18 (36.2 bits)
S2: 29 (58.0 bits)