Animal-Genome cDNA 20070806S-071818


Search against RefSeqBP_Rel44

BLASTX 2.2.24 [Aug-08-2010]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= 20070806S-071818
         (572 letters)

Database: RefSeq44_BP.fasta 
           33,615 sequences; 18,071,151 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Alignment   gi|NP_001071303.1| cathepsin Z precursor [Bos taurus].               249   9e-67
Alignment   gi|NP_001028789.1| dipeptidyl peptidase 1 [Bos taurus].               61   7e-10

>ref|NP_001071303.1| cathepsin Z precursor [Bos taurus].
          Length = 304

 Score =  249 bits (637), Expect = 9e-67
 Identities = 114/147 (77%), Positives = 124/147 (84%)
 Frame = +3

Query: 132 HFRPGCCCYRPLRGDLRTQLGHRTYPRPHEYLSPSDLPWSWDWRNMNGVNYASVTRNQHI 311
           HFRPG  CYRPLRGD  TQLG RTYPRPHEYLSPSDLP SWDWRN+NGVNYASVTRNQHI
Sbjct: 27  HFRPGRGCYRPLRGDRLTQLGRRTYPRPHEYLSPSDLPKSWDWRNVNGVNYASVTRNQHI 86

Query: 312 PQYCCSCWAHGSSCAMADRLYIQRKGAWPSTLLSVLHVIDCGNAGSCDGGDDLPVRAYAL 491
           PQYC SCWAHGS+ AMADR+ I+RKGAWPSTLLSV HVIDCG+AGSC+GG+DLPV  YA 
Sbjct: 87  PQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGDAGSCEGGNDLPVWEYAH 146

Query: 492 RLGLSEETCLYFLLHDQVCDLFNPCGT 572
           R G+ +ETC  +   DQ CD FN CGT
Sbjct: 147 RHGIPDETCNNYQAKDQECDKFNQCGT 173


>ref|NP_001028789.1| dipeptidyl peptidase 1 [Bos taurus].
          Length = 463

 Score = 60.8 bits (146), Expect = 7e-10
 Identities = 40/111 (36%), Positives = 53/111 (47%), Gaps = 2/111 (1%)
 Frame = +3

Query: 240 LPWSWDWRNMNGVNYASVTRNQHIPQYCCSCWAHGSSCAMADRLYIQRKGAWPSTLLSVL 419
           LP SWDWRN++G+N+ +  RNQ     C SC++  S   M  R+ I       + +LS  
Sbjct: 231 LPTSWDWRNVHGINFVTPVRNQ---GSCGSCYSFASMGMMEARIRILTNNT-QTPILSPQ 286

Query: 420 HVIDCGN-AGSCDGG-DDLPVRAYALRLGLSEETCLYFLLHDQVCDLFNPC 566
            V+ C   A  C+GG   L    YA   GL EE C  +   D  C L   C
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEDCFPYTGTDSPCRLKEGC 337


  Database: RefSeq44_BP.fasta
    Posted date:  Nov 15, 2010  4:36 PM
  Number of letters in database: 18,071,151
  Number of sequences in database:  33,615
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33615
Number of Hits to DB: 17,722,360
Number of extensions: 475168
Number of successful extensions: 1591
Number of sequences better than 1.0e-05: 2
Number of HSP's gapped: 1589
Number of HSP's successfully gapped: 2
Length of query: 190
Length of database: 18,071,151
Length adjustment: 97
Effective length of query: 93
Effective length of database: 14,810,496
Effective search space: 1377376128
Effective search space used: 1377376128
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 32 (16.9 bits)
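
The length statistics reported above appear to be related by simple arithmetic: subtracting the length adjustment from the query length, and subtracting it once per database sequence from the database length, reproduces the reported effective lengths, and their product matches the reported effective search space. The Python sketch below checks this for the Bos taurus search. The function name `effective_search_space` is hypothetical, and the sketch assumes no clamping for very short sequences, which BLAST itself may apply.

```python
def effective_search_space(query_len, db_len, num_seqs, length_adjustment):
    """Return (effective query length, effective db length, search space).

    Assumption: the adjustment is subtracted once from the query and once
    per database sequence, with no lower bound applied.
    """
    eff_query = query_len - length_adjustment
    eff_db = db_len - num_seqs * length_adjustment
    return eff_query, eff_db, eff_query * eff_db


# Values copied from the RefSeqBP_Rel44 statistics block.
eff_query, eff_db, space = effective_search_space(
    query_len=190,          # translated query length (572 nt in one frame)
    db_len=18_071_151,      # letters in database
    num_seqs=33_615,        # sequences in database
    length_adjustment=97,
)
print(eff_query, eff_db, space)
```

The same relation can be applied to the other four searches by substituting the database size, sequence count, and length adjustment from each statistics block.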

Search against RefSeqCP_Rel44

BLASTX 2.2.24 [Aug-08-2010]


Query= 20070806S-071818
         (572 letters)

Database: RefSeq44_CP.fasta 
           33,359 sequences; 18,921,627 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Alignment   gi|XP_854795.1| PREDICTED: similar to Cathepsin Z precursor (Ca...   210   6e-55
Alignment   gi|NP_001182763.1| dipeptidyl peptidase 1 [Canis lupus familiar...    69   2e-12
Alignment   gi|XP_535784.2| PREDICTED: similar to Dipeptidyl-peptidase I pr...    65   4e-11

>ref|XP_854795.1| PREDICTED: similar to Cathepsin Z precursor (Cathepsin X)
           (Cathepsin P) [Canis familiaris].
          Length = 375

 Score =  210 bits (535), Expect = 6e-55
 Identities = 98/142 (69%), Positives = 111/142 (78%), Gaps = 7/142 (4%)
 Frame = +3

Query: 168 RGDLRTQLGH-------RTYPRPHEYLSPSDLPWSWDWRNMNGVNYASVTRNQHIPQYCC 326
           R +LR  L H       RTYPRPHEYLSPSDLP SWDWRN+NGVNYAS TRNQHIPQYC 
Sbjct: 103 RRELRRPLEHSPAWWPRRTYPRPHEYLSPSDLPKSWDWRNVNGVNYASATRNQHIPQYCG 162

Query: 327 SCWAHGSSCAMADRLYIQRKGAWPSTLLSVLHVIDCGNAGSCDGGDDLPVRAYALRLGLS 506
           SCWAHGS+ AMADR+ I+RKGAWPSTLLSV HV+DC NAGSC+GG+DLPV +YA   G+ 
Sbjct: 163 SCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCANAGSCEGGNDLPVWSYAHEHGIP 222

Query: 507 EETCLYFLLHDQVCDLFNPCGT 572
           +ETC  +   DQ C+ FN CGT
Sbjct: 223 DETCNNYQAKDQECNKFNQCGT 244


>ref|NP_001182763.1| dipeptidyl peptidase 1 [Canis lupus familiaris].
          Length = 459

 Score = 69.3 bits (168), Expect = 2e-12
 Identities = 48/136 (35%), Positives = 64/136 (47%), Gaps = 11/136 (8%)
 Frame = +3

Query: 174 DLRTQLGHRTYPRP---------HEYLSPSDLPWSWDWRNMNGVNYASVTRNQHIPQYCC 326
           D+ T++G R  PRP         HE +S   LP SWDWRN+ G N+ S  RNQ     C 
Sbjct: 199 DMMTRVGGRKIPRPKPTPLTAEIHEEISR--LPTSWDWRNVRGTNFVSPVRNQ---ASCG 253

Query: 327 SCWAHGSSCAMADRLYIQRKGAWPSTLLSVLHVIDCGN-AGSCDGG-DDLPVRAYALRLG 500
           SC+A  S+  +  R+ I       + +LS   ++ C   A  C+GG   L    YA   G
Sbjct: 254 SCYAFASTAMLEARIRILTNNT-QTPILSPQEIVSCSQYAQGCEGGFPYLIAGKYAQDFG 312

Query: 501 LSEETCLYFLLHDQVC 548
           L EE C  +   D  C
Sbjct: 313 LVEEACFPYAGSDSPC 328


>ref|XP_535784.2| PREDICTED: similar to Dipeptidyl-peptidase I precursor (DPP-I)
           (DPPI) (Cathepsin C) (Cathepsin J) (Dipeptidyl
           transferase), partial [Canis familiaris].
          Length = 481

 Score = 65.1 bits (157), Expect = 4e-11
 Identities = 46/136 (33%), Positives = 62/136 (45%), Gaps = 11/136 (8%)
 Frame = +3

Query: 174 DLRTQLGHRTYPRP---------HEYLSPSDLPWSWDWRNMNGVNYASVTRNQHIPQYCC 326
           D+  + G R  PRP         HE +S   LP SWDWRN+ G N+ S  RNQ     C 
Sbjct: 221 DMMRRAGGRKIPRPKPTPLTAEIHEEISR--LPTSWDWRNVRGTNFVSPVRNQ---ASCG 275

Query: 327 SCWAHGSSCAMADRLYIQRKGAWPSTLLSVLHVIDCGN-AGSCDGG-DDLPVRAYALRLG 500
           SC+A  S+  +  R+ I       + +LS   ++ C   A  C+GG   L    YA   G
Sbjct: 276 SCYAFASTVMLEARIRILTNNT-QTPILSPQEIVSCSQYAQGCEGGFPYLIAGKYAQDFG 334

Query: 501 LSEETCLYFLLHDQVC 548
           L +E C  +   D  C
Sbjct: 335 LVDEACFSYAGSDSPC 350


  Database: RefSeq44_CP.fasta
    Posted date:  Nov 15, 2010  4:36 PM
  Number of letters in database: 18,921,627
  Number of sequences in database:  33,359
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33359
Number of Hits to DB: 17,721,232
Number of extensions: 420157
Number of successful extensions: 1643
Number of sequences better than 1.0e-05: 3
Number of HSP's gapped: 1641
Number of HSP's successfully gapped: 3
Length of query: 190
Length of database: 18,921,627
Length adjustment: 97
Effective length of query: 93
Effective length of database: 15,685,804
Effective search space: 1458779772
Effective search space used: 1458779772
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 32 (16.9 bits)

Search against RefSeqHP_Rel44

BLASTX 2.2.24 [Aug-08-2010]


Query= 20070806S-071818
         (572 letters)

Database: RefSeq44_HP.fasta 
           33,950 sequences; 18,324,212 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Alignment   gi|NP_001327.2| cathepsin Z preproprotein [Homo sapiens].            226   1e-59
Alignment   gi|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein ...    62   3e-10

>ref|NP_001327.2| cathepsin Z preproprotein [Homo sapiens].
          Length = 303

 Score =  226 bits (575), Expect = 1e-59
 Identities = 103/147 (70%), Positives = 118/147 (80%)
 Frame = +3

Query: 132 HFRPGCCCYRPLRGDLRTQLGHRTYPRPHEYLSPSDLPWSWDWRNMNGVNYASVTRNQHI 311
           +FR G  CYRPLRGD    LG  TYPRPHEYLSP+DLP SWDWRN++GVNYAS+TRNQHI
Sbjct: 26  YFRRGQTCYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHI 85

Query: 312 PQYCCSCWAHGSSCAMADRLYIQRKGAWPSTLLSVLHVIDCGNAGSCDGGDDLPVRAYAL 491
           PQYC SCWAH S+ AMADR+ I+RKGAWPSTLLSV +VIDCGNAGSC+GG+DL V  YA 
Sbjct: 86  PQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCGNAGSCEGGNDLSVWDYAH 145

Query: 492 RLGLSEETCLYFLLHDQVCDLFNPCGT 572
           + G+ +ETC  +   DQ CD FN CGT
Sbjct: 146 QHGIPDETCNNYQAKDQECDKFNQCGT 172


>ref|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein [Homo sapiens].
          Length = 463

 Score = 62.4 bits (150), Expect = 3e-10
 Identities = 46/140 (32%), Positives = 64/140 (45%), Gaps = 10/140 (7%)
 Frame = +3

Query: 177 LRTQLGH-RTYPRPHEYLSPSD-------LPWSWDWRNMNGVNYASVTRNQHIPQYCCSC 332
           +R   GH R  PRP      ++       LP SWDWRN++G+N+ S  RNQ     C SC
Sbjct: 202 IRRSGGHSRKIPRPKPAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQ---ASCGSC 258

Query: 333 WAHGSSCAMADRLYIQRKGAWPSTLLSVLHVIDCGN-AGSCDGG-DDLPVRAYALRLGLS 506
           ++  S   +  R+ I    +  + +LS   V+ C   A  C+GG   L    YA   GL 
Sbjct: 259 YSFASMGMLEARIRILTNNS-QTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGLV 317

Query: 507 EETCLYFLLHDQVCDLFNPC 566
           EE C  +   D  C +   C
Sbjct: 318 EEACFPYTGTDSPCKMKEDC 337


  Database: RefSeq44_HP.fasta
    Posted date:  Nov 15, 2010  4:36 PM
  Number of letters in database: 18,324,212
  Number of sequences in database:  33,950
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33950
Number of Hits to DB: 17,515,630
Number of extensions: 415405
Number of successful extensions: 1548
Number of sequences better than 1.0e-05: 2
Number of HSP's gapped: 1548
Number of HSP's successfully gapped: 2
Length of query: 190
Length of database: 18,324,212
Length adjustment: 97
Effective length of query: 93
Effective length of database: 15,031,062
Effective search space: 1397888766
Effective search space used: 1397888766
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 32 (16.9 bits)

Search against RefSeqMP_Rel44

BLASTX 2.2.24 [Aug-08-2010]


Query= 20070806S-071818
         (572 letters)

Database: RefSeq44_MP.fasta 
           29,866 sequences; 15,452,059 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Alignment   gi|NP_071720.1| cathepsin Z preproprotein [Mus musculus].            231   4e-61
Alignment   gi|NP_034112.3| dipeptidyl peptidase 1 preproprotein [Mus muscu...    64   8e-11

>ref|NP_071720.1| cathepsin Z preproprotein [Mus musculus].
          Length = 306

 Score =  231 bits (588), Expect = 4e-61
 Identities = 104/147 (70%), Positives = 119/147 (80%)
 Frame = +3

Query: 132 HFRPGCCCYRPLRGDLRTQLGHRTYPRPHEYLSPSDLPWSWDWRNMNGVNYASVTRNQHI 311
           +FR G  CY P+RGD    LG RTYPRPHEYLSP+DLP +WDWRN+NGVNYASVTRNQHI
Sbjct: 28  YFRSGQTCYHPIRGDQLALLGRRTYPRPHEYLSPADLPKNWDWRNVNGVNYASVTRNQHI 87

Query: 312 PQYCCSCWAHGSSCAMADRLYIQRKGAWPSTLLSVLHVIDCGNAGSCDGGDDLPVRAYAL 491
           PQYC SCWAHGS+ AMADR+ I+RKGAWPS LLSV +VIDCGNAGSC+GG+DLPV  YA 
Sbjct: 88  PQYCGSCWAHGSTSAMADRINIKRKGAWPSILLSVQNVIDCGNAGSCEGGNDLPVWEYAH 147

Query: 492 RLGLSEETCLYFLLHDQVCDLFNPCGT 572
           + G+ +ETC  +   DQ CD FN CGT
Sbjct: 148 KHGIPDETCNNYQAKDQDCDKFNQCGT 174


>ref|NP_034112.3| dipeptidyl peptidase 1 preproprotein [Mus musculus].
          Length = 462

 Score = 63.9 bits (154), Expect = 8e-11
 Identities = 40/106 (37%), Positives = 54/106 (50%), Gaps = 2/106 (1%)
 Frame = +3

Query: 237 DLPWSWDWRNMNGVNYASVTRNQHIPQYCCSCWAHGSSCAMADRLYIQRKGAWPSTLLSV 416
           +LP SWDWRN+ GVNY S  RNQ   + C SC++  S   +  R+ I    +  + +LS 
Sbjct: 229 NLPESWDWRNVQGVNYVSPVRNQ---ESCGSCYSFASMGMLEARIRILTNNS-QTPILSP 284

Query: 417 LHVIDCG-NAGSCDGG-DDLPVRAYALRLGLSEETCLYFLLHDQVC 548
             V+ C   A  CDGG   L    YA   G+ EE+C  +   D  C
Sbjct: 285 QEVVSCSPYAQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDSPC 330


  Database: RefSeq44_MP.fasta
    Posted date:  Nov 15, 2010  4:36 PM
  Number of letters in database: 15,452,059
  Number of sequences in database:  29,866
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 29866
Number of Hits to DB: 14,438,094
Number of extensions: 337015
Number of successful extensions: 1323
Number of sequences better than 1.0e-05: 2
Number of HSP's gapped: 1323
Number of HSP's successfully gapped: 2
Length of query: 190
Length of database: 15,452,059
Length adjustment: 95
Effective length of query: 95
Effective length of database: 12,614,789
Effective search space: 1198404955
Effective search space used: 1198404955
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 32 (16.9 bits)

Search against RefSeqSP_Rel44

BLASTX 2.2.24 [Aug-08-2010]


Query= 20070806S-071818
         (572 letters)

Database: RefSeq44_SP.fasta 
           20,576 sequences; 9,542,844 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Alignment   gi|NP_001116576.1| cathepsin Z [Sus scrofa].                         269   7e-73
Alignment   gi|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus ...    57   4e-09

>ref|NP_001116576.1| cathepsin Z [Sus scrofa].
          Length = 304

 Score =  269 bits (687), Expect = 7e-73
 Identities = 121/147 (82%), Positives = 129/147 (87%)
 Frame = +3

Query: 132 HFRPGCCCYRPLRGDLRTQLGHRTYPRPHEYLSPSDLPWSWDWRNMNGVNYASVTRNQHI 311
           HFRPGC CYRPLRGD RTQLGHRTYPRPHEYLSPSDLP SWDWRN+NGVNYASVTRNQHI
Sbjct: 27  HFRPGCSCYRPLRGDQRTQLGHRTYPRPHEYLSPSDLPRSWDWRNVNGVNYASVTRNQHI 86

Query: 312 PQYCCSCWAHGSSCAMADRLYIQRKGAWPSTLLSVLHVIDCGNAGSCDGGDDLPVRAYAL 491
           PQYC SCWAHGS+ AMADR+ I+RKGAWPSTLLSV HVIDCGNAGSC+GGDDLPV AYA 
Sbjct: 87  PQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNAGSCEGGDDLPVWAYAH 146

Query: 492 RLGLSEETCLYFLLHDQVCDLFNPCGT 572
           R G+ +ETC  +   DQVCD FN CGT
Sbjct: 147 RHGIPDETCNNYQAKDQVCDKFNQCGT 173


>ref|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus scrofa].
          Length = 463

 Score = 57.4 bits (137), Expect = 4e-09
 Identities = 39/111 (35%), Positives = 50/111 (45%), Gaps = 2/111 (1%)
 Frame = +3

Query: 240 LPWSWDWRNMNGVNYASVTRNQHIPQYCCSCWAHGSSCAMADRLYIQRKGAWPSTLLSVL 419
           LP SWDWRN+ G N+ +  RNQ     C SC++  S   M  R+ I       + +LS  
Sbjct: 231 LPASWDWRNVRGTNFVTPVRNQ---ASCGSCYSFASMGMMEARIRILTNNT-QTPILSPQ 286

Query: 420 HVIDCGN-AGSCDGG-DDLPVRAYALRLGLSEETCLYFLLHDQVCDLFNPC 566
            V+ C   A  C GG   L    YA   GL EE C  +   D  C +   C
Sbjct: 287 EVVSCSQYAQGCAGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCTVKEGC 337


  Database: RefSeq44_SP.fasta
    Posted date:  Nov 15, 2010  4:36 PM
  Number of letters in database: 9,542,844
  Number of sequences in database:  20,576
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 20576
Number of Hits to DB: 9,349,027
Number of extensions: 230901
Number of successful extensions: 920
Number of sequences better than 1.0e-05: 2
Number of HSP's gapped: 920
Number of HSP's successfully gapped: 2
Length of query: 190
Length of database: 9,542,844
Length adjustment: 92
Effective length of query: 98
Effective length of database: 7,649,852
Effective search space: 749685496
Effective search space used: 749685496
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 32 (16.9 bits)
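
The Expect values in these reports can be approximated from the bit score and the effective search space using the standard Karlin-Altschul relation, E = (search space) x 2^(-bit score). The Python sketch below applies it to the two Sus scrofa hits. The function name `expect_from_bits` is hypothetical, and because the report prints rounded bit scores, the result should only agree with the reported Expect value approximately.

```python
def expect_from_bits(bit_score, search_space):
    """Approximate E-value from a bit score and an effective search space."""
    return search_space * 2.0 ** (-bit_score)


# Effective search space from the RefSeqSP_Rel44 statistics block.
SUS_SCROFA_SPACE = 749_685_496

# Bit scores copied from the Sus scrofa hit list.
for name, bits in [("cathepsin Z", 269), ("dipeptidyl peptidase 1-like", 57.4)]:
    e_value = expect_from_bits(bits, SUS_SCROFA_SPACE)
    print(f"{name}: {bits} bits, E ~ {e_value:.1e}")
```

Both estimates should land in the same order of magnitude as the reported values (7e-73 and 4e-09).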