Search to RefSeqBP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-000959
(1371 letters)
Database: RefSeq49_BP.fasta
33,088 sequences; 17,681,374 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|NP_001071303.1| cathepsin Z precursor [Bos taurus]. 541 e-154
Alignment gi|NP_001028789.1| dipeptidyl peptidase 1 [Bos taurus]. 110 4e-24
Alignment gi|NP_001028787.1| cathepsin S precursor [Bos taurus]. 97 3e-20
Alignment gi|XP_002694471.1| PREDICTED: cathepsin O preproprotein-like [B... 95 2e-19
Alignment gi|XP_874012.3| PREDICTED: cathepsin O [Bos taurus]. 95 2e-19
Alignment gi|NP_776456.1| cathepsin B precursor [Bos taurus]. 91 3e-18
Alignment gi|NP_001029557.1| pro-cathepsin H precursor [Bos taurus]. 86 6e-17
Alignment gi|NP_001029607.1| cathepsin K [Bos taurus]. 82 1e-15
Alignment gi|NP_001030279.1| tubulointerstitial nephritis antigen [Bos ta... 80 6e-15
Alignment gi|NP_001068884.1| cathepsin F [Bos taurus]. 75 1e-13
>ref|NP_001071303.1| cathepsin Z precursor [Bos taurus].
Length = 304
Score = 541 bits (1393), Expect = e-154
Identities = 244/274 (89%), Positives = 254/274 (92%)
Frame = +2
Query: 167 HFRPGCSCYRPLRGDQRTQLGHRTYPRPHEYLSPSDLPRSWDWRNVNGVNYASVTRNQHI 346
HFRPG CYRPLRGD+ TQLG RTYPRPHEYLSPSDLP+SWDWRNVNGVNYASVTRNQHI
Sbjct: 27 HFRPGRGCYRPLRGDRLTQLGRRTYPRPHEYLSPSDLPKSWDWRNVNGVNYASVTRNQHI 86
Query: 347 PQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNAGSCEGGDDLPVWAYAH 526
PQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCG+AGSCEGG+DLPVW YAH
Sbjct: 87 PQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGDAGSCEGGNDLPVWEYAH 146
Query: 527 RHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAE 706
RHGIPDETCNNYQAKDQ CDKFNQCGTCTEFKECHVI+NYTLWKVGDYGS+SGREKMMAE
Sbjct: 147 RHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSLSGREKMMAE 206
Query: 707 IYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWNVRNSWGE 886
IY NGPISCGIMATEKMSNYTGGIY+EY DQA+INHIVSVAGWGVS G EYW VRNSWGE
Sbjct: 207 IYTNGPISCGIMATEKMSNYTGGIYSEYNDQAFINHIVSVAGWGVSDGMEYWIVRNSWGE 266
Query: 887 PWGERGWMRIVTSTYKDGGGAHYNLASRKTAPPG 988
PWGE GWMRIVTSTYK G GA YNLA ++ G
Sbjct: 267 PWGEHGWMRIVTSTYKGGEGARYNLAIEESCTFG 300
>ref|NP_001028789.1| dipeptidyl peptidase 1 [Bos taurus].
Length = 463
Score = 110 bits (274), Expect = 4e-24
Identities = 74/228 (32%), Positives = 101/228 (44%), Gaps = 10/228 (4%)
Frame = +2
Query: 275 LPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 454
LP SWDWRNV+G+N+ + RNQ CGSC++ S M RI I + +LS Q
Sbjct: 231 LPTSWDWRNVHGINFVTPVRNQGS---CGSCYSFASMGMMEARIRILTNNT-QTPILSPQ 286
Query: 455 HVIDCGN-AGSCEGGDD-LPVWAYAHRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKEC 628
V+ C A CEGG L YA G+ +E C Y D C C F+
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEDCFPYTGTDSPCRLKEGC-----FRYY 341
Query: 629 HVIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGIMATEKMSNYTGGIYAE------Y 790
+Y VG + M E+ GP++ + +Y G+Y +
Sbjct: 342 SSEYHY----VGGFYGGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPF 397
Query: 791 KDQAYINHIVSVAGWGV--SGGTEYWNVRNSWGEPWGERGWMRIVTST 928
NH V + G+G + G +YW V+NSWG WGE G+ RI T
Sbjct: 398 NPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGT 445
>ref|NP_001028787.1| cathepsin S precursor [Bos taurus].
Length = 331
Score = 97.4 bits (241), Expect = 3e-20
Identities = 81/254 (31%), Positives = 116/254 (45%), Gaps = 15/254 (5%)
Frame = +2
Query: 239 YPRPHEYLSPSD--LPRSWDWRN---VNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADR 403
+PR Y S + LP S DWR V V Y CGSCWA + A+ +
Sbjct: 101 WPRNVTYKSDPNQKLPDSMDWREKGCVTEVKYQGA---------CGSCWAFSAVGALEAQ 151
Query: 404 INIKRKGAWPSTLLSVQHVIDC-----GNAGSCEGGDDLPVWAYA-HRHGIPDETCNNYQ 565
+ +K G S LS Q+++DC GN G C GG + Y +GI E Y+
Sbjct: 152 VKLKT-GKLVS--LSAQNLVDCSTAKYGNKG-CNGGFMTEAFQYIIDNNGIDSEASYPYK 207
Query: 566 AKDQVC--DKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAEIYAN-GPISCG 736
A D C D N+ TC+ + E G E+ + E AN GP+S G
Sbjct: 208 AMDGKCQYDVKNRAATCSRYIELPF----------------GSEEALKEAVANKGPVSVG 251
Query: 737 IMATEK-MSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMR 913
I A+ Y G+Y + +NH V V G+G G +YW V+NSWG +G++G++R
Sbjct: 252 IDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIR 311
Query: 914 IVTSTYKDGGGAHY 955
+ ++ G A+Y
Sbjct: 312 MARNSGNHCGIANY 325
>ref|XP_002694471.1| PREDICTED: cathepsin O preproprotein-like [Bos taurus].
Length = 375
Score = 94.7 bits (234), Expect = 2e-19
Identities = 77/234 (32%), Positives = 109/234 (46%), Gaps = 8/234 (3%)
Frame = +2
Query: 239 YPR--PHEYLSPSDL--PRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRI 406
+PR EY S S+L P +DWR+ + V RNQ + CG CWA A+
Sbjct: 146 FPRFPAEEYTSISNLSLPLRFDWRDKHVVTQV---RNQ---KTCGGCWAFSVVGAVESVC 199
Query: 407 NIKRKGAWPSTLLSVQHVIDCGNAG-SCEGGDDLPVWAYAHRHGIPDETCNNYQAKDQVC 583
IK + P +LSVQ VIDC + C GG L + ++ + + Y + Q
Sbjct: 200 AIKGQ---PLEVLSVQQVIDCSYSNYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQN- 255
Query: 584 DKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAE-IYANGPISCGIMATEKMS 760
G C F + H + + D+ SG+E MAE + A GP+ I+ + MS
Sbjct: 256 ------GLCRYFSDSHSGSSIKGYSAYDF---SGQEDKMAEALLALGPL---IVVVDAMS 303
Query: 761 --NYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRI 916
+Y GGI + NH V V G+ +G YW VRNSWG WG G++R+
Sbjct: 304 WQDYLGGIIQHHCSSGEANHAVLVTGFDKTGSIPYWIVRNSWGTSWGIDGYVRV 357
>ref|XP_874012.3| PREDICTED: cathepsin O [Bos taurus].
Length = 384
Score = 94.7 bits (234), Expect = 2e-19
Identities = 77/234 (32%), Positives = 109/234 (46%), Gaps = 8/234 (3%)
Frame = +2
Query: 239 YPR--PHEYLSPSDL--PRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRI 406
+PR EY S S+L P +DWR+ + V RNQ + CG CWA A+
Sbjct: 155 FPRFPAEEYTSISNLSLPLRFDWRDKHVVTQV---RNQ---KTCGGCWAFSVVGAVESVC 208
Query: 407 NIKRKGAWPSTLLSVQHVIDCGNAG-SCEGGDDLPVWAYAHRHGIPDETCNNYQAKDQVC 583
IK + P +LSVQ VIDC + C GG L + ++ + + Y + Q
Sbjct: 209 AIKGQ---PLEVLSVQQVIDCSYSNYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQN- 264
Query: 584 DKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAE-IYANGPISCGIMATEKMS 760
G C F + H + + D+ SG+E MAE + A GP+ I+ + MS
Sbjct: 265 ------GLCRYFSDSHSGSSIKGYSAYDF---SGQEDKMAEALLALGPL---IVVVDAMS 312
Query: 761 --NYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRI 916
+Y GGI + NH V V G+ +G YW VRNSWG WG G++R+
Sbjct: 313 WQDYLGGIIQHHCSSGEANHAVLVTGFDKTGSIPYWIVRNSWGTSWGIDGYVRV 366
>ref|NP_776456.1| cathepsin B precursor [Bos taurus].
Length = 335
Score = 90.5 bits (223), Expect = 3e-18
Identities = 69/258 (26%), Positives = 102/258 (39%), Gaps = 26/258 (10%)
Frame = +2
Query: 224 LGHRTYPRPHEYLSPSDLPRSWD----WRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSA 391
LG P+ + + LP S+D W N + R+Q CGSCWA G+ A
Sbjct: 63 LGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEI---RDQGS---CGSCWAFGAVEA 116
Query: 392 MADRINIKRKGAWPSTLLSVQHVIDC--GNAGS-CEGGDDLPVWAYAHRHGIPDET---- 550
++DRI I G + +S + ++ C G G C GG W + + G+
Sbjct: 117 ISDRICIHSNGR-VNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNS 175
Query: 551 ---CNNYQ---------AKDQVCDKFNQCGTCTEFKECHVIQNYTLWK---VGDYGSVSG 685
C Y C C++ E +Y K Y +
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANN 235
Query: 686 REKMMAEIYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWN 865
+++MAEIY NGP+ Y G+Y + H + + GWGV GT YW
Sbjct: 236 EKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWL 295
Query: 866 VRNSWGEPWGERGWMRIV 919
V NSW WG+ G+ +I+
Sbjct: 296 VGNSWNTDWGDNGFFKIL 313
>ref|NP_001029557.1| pro-cathepsin H precursor [Bos taurus].
Length = 335
Score = 86.3 bits (212), Expect = 6e-17
Identities = 65/220 (29%), Positives = 94/220 (42%), Gaps = 7/220 (3%)
Frame = +2
Query: 278 PRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQH 457
P S DWR N+ + +NQ CGSCW +T A+ + I G P L+ Q
Sbjct: 117 PPSMDWRKKG--NFVTPVKNQGS---CGSCWTFSTTGALESAVAIAT-GKLP--FLAEQQ 168
Query: 458 VIDCG---NAGSCEGGDDLPVWAYA-HRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKE 625
++DC N C+GG + Y + GI E Y+ +D C K+ K+
Sbjct: 169 LVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYRGQDGDC-KYQPSKAIAFVKD 227
Query: 626 CHVIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGIMATEKMSNYTGGIYAE---YKD 796
+ N TL + E M+ + + P+S T Y GIY+ +K
Sbjct: 228 ---VANITL---------NDEEAMVEAVALHNPVSFAFEVTADFMMYRKGIYSSTSCHKT 275
Query: 797 QAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRI 916
+NH V G+G G YW V+NSWG WG +G+ I
Sbjct: 276 PDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKGYFLI 315
>ref|NP_001029607.1| cathepsin K [Bos taurus].
Length = 334
Score = 82.0 bits (201), Expect = 1e-15
Identities = 69/235 (29%), Positives = 106/235 (45%), Gaps = 5/235 (2%)
Frame = +2
Query: 278 PRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQH 457
P S D+R Y + +NQ CGSCWA S A+ ++ K+ G + LS Q+
Sbjct: 121 PDSVDYRKKG---YVTPVKNQG---QCGSCWAFSSVGALEGQLK-KKTGKLLN--LSPQN 171
Query: 458 VIDCGNAGS-CEGGDDLPVWAYAHRH-GIPDETCNNYQAKDQVCDKFNQCGTCTEFKECH 631
++DC + C GG + Y ++ GI E Y +D+ C +N G + +
Sbjct: 172 LVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDENC-MYNPTGKAAKCRGYR 230
Query: 632 VIQNYTLWKVGDYGSVSGREKMMAEIYAN-GPISCGIMAT-EKMSNYTGGIYAEYK-DQA 802
I G EK + A GPIS I A+ Y G+Y + +
Sbjct: 231 EIPE-------------GNEKALKRAVARVGPISVAIDASLTSFQFYRKGVYYDENCNSD 277
Query: 803 YINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRIVTSTYKDGGGAHYNLAS 967
+NH V G+G+ G ++W ++NSWGE WG +G+ I+ + K+ NLAS
Sbjct: 278 NLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGY--ILMARNKNNACGIANLAS 330
>ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus].
Length = 476
Score = 79.7 bits (195), Expect = 6e-15
Identities = 59/213 (27%), Positives = 82/213 (38%), Gaps = 25/213 (11%)
Frame = +2
Query: 356 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDC--GNAGSCEGGDDLPVWAYAHR 529
C + WA + S ADRI I+ +G + + L S Q++I C C G W Y +
Sbjct: 240 CAASWAFSTASVAADRIAIQSQGRYTANL-SPQNLISCCAKKRHGCNSGSVDRAWWYLRK 298
Query: 530 HGIPDETC----------NNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSV 679
G+ C NN A D + T N Y
Sbjct: 299 RGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATTPCPNSIEKSNRIYQCSPPYRVS 358
Query: 680 SGREKMMAEIYANGPISCGIMATEKMSNYTGGIYAEY--------KDQAYINHIVSVAGW 835
S ++M EI NGP+ + E NY GIY K + + H V + GW
Sbjct: 359 SNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGW 418
Query: 836 GVSGGTE-----YWNVRNSWGEPWGERGWMRIV 919
G G + +W NSWG+ WGE G+ RI+
Sbjct: 419 GTLRGAQGQKEKFWIAANSWGKSWGENGYFRIL 451
>ref|NP_001068884.1| cathepsin F [Bos taurus].
Length = 460
Score = 75.1 bits (183), Expect = 1e-13
Identities = 65/219 (29%), Positives = 93/219 (42%), Gaps = 9/219 (4%)
Frame = +2
Query: 278 PRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSV-- 451
P WDWRN V N CGSCWA T + + +KR TLLS+
Sbjct: 248 PPQWDWRNKGAVT------NVKDQGMCGSCWAFSVTGNVEGQWFLKR-----GTLLSLSE 296
Query: 452 QHVIDCGNAG-SCEGGDDLPVWAYAHRH---GIPDETCNNYQAKDQVCDKFNQCGTCTEF 619
Q ++DC +C GG LP AY+ G+ E +Y+ + Q C E
Sbjct: 297 QELLDCDKTDKACLGG--LPSNAYSAIRTLGGLETEDDYSYRGRLQTCS------FSAEK 348
Query: 620 KECHVIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGIMATEKMSNYTGGIYAEYKDQ 799
+ ++ + L K +K+ A + NGP+S I A M Y GI +
Sbjct: 349 AKVYINDSVELSK--------NEQKLAAWLAKNGPVSIAINAFG-MQFYRHGISHPLRPL 399
Query: 800 A---YINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGW 907
I+H V + G+G +W ++NSWG WGE G+
Sbjct: 400 CSPWLIDHAVLLVGYGNRSAIPFWAIKNSWGTDWGEEGY 438
Database: RefSeq49_BP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 17,681,374
Number of sequences in database: 33,088
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33088
Number of Hits to DB: 47,418,818
Number of extensions: 1519457
Number of successful extensions: 7074
Number of sequences better than 1.0e-05: 14
Number of HSP's gapped: 6997
Number of HSP's successfully gapped: 14
Length of query: 457
Length of database: 17,681,374
Length adjustment: 106
Effective length of query: 351
Effective length of database: 14,174,046
Effective search space: 4975090146
Effective search space used: 4975090146
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 35 (18.1 bits)
Search to RefSeqCP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-000959
(1371 letters)
Database: RefSeq49_CP.fasta
33,336 sequences; 18,874,504 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|XP_854795.1| PREDICTED: similar to Cathepsin Z precursor (Ca... 499 e-141
Alignment gi|NP_001182763.1| dipeptidyl peptidase 1 [Canis lupus familiar... 124 2e-28
Alignment gi|XP_535784.2| PREDICTED: similar to Dipeptidyl-peptidase I pr... 119 5e-27
Alignment gi|XP_543203.2| PREDICTED: similar to cathepsin B preproprotein... 94 3e-19
Alignment gi|NP_001002938.1| cathepsin S precursor [Canis lupus familiari... 91 3e-18
Alignment gi|XP_539782.2| PREDICTED: similar to Cathepsin O precursor [Ca... 86 6e-17
Alignment gi|NP_001029168.1| cathepsin K precursor [Canis lupus familiari... 84 4e-16
Alignment gi|XP_535330.2| PREDICTED: similar to P3ECSL [Canis familiaris]. 79 8e-15
Alignment gi|XP_536212.2| PREDICTED: similar to Cathepsin H precursor [Ca... 79 1e-14
Alignment gi|XP_538969.2| PREDICTED: similar to tubulointerstitial nephri... 78 2e-14
>ref|XP_854795.1| PREDICTED: similar to Cathepsin Z precursor (Cathepsin X)
(Cathepsin P) [Canis familiaris].
Length = 375
Score = 499 bits (1285), Expect = e-141
Identities = 221/244 (90%), Positives = 233/244 (95%)
Frame = +2
Query: 233 RTYPRPHEYLSPSDLPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINI 412
RTYPRPHEYLSPSDLP+SWDWRNVNGVNYAS TRNQHIPQYCGSCWAHGSTSAMADRINI
Sbjct: 120 RTYPRPHEYLSPSDLPKSWDWRNVNGVNYASATRNQHIPQYCGSCWAHGSTSAMADRINI 179
Query: 413 KRKGAWPSTLLSVQHVIDCGNAGSCEGGDDLPVWAYAHRHGIPDETCNNYQAKDQVCDKF 592
KRKGAWPSTLLSVQHV+DC NAGSCEGG+DLPVW+YAH HGIPDETCNNYQAKDQ C+KF
Sbjct: 180 KRKGAWPSTLLSVQHVLDCANAGSCEGGNDLPVWSYAHEHGIPDETCNNYQAKDQECNKF 239
Query: 593 NQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGIMATEKMSNYTG 772
NQCGTCTEFKECH IQNYTLW+VGDYGS+SGREKMMAEIYANGPISCGIMATEKM NYTG
Sbjct: 240 NQCGTCTEFKECHAIQNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATEKMVNYTG 299
Query: 773 GIYAEYKDQAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRIVTSTYKDGGGAH 952
GI+AEY++QAYINH++SV GWGVS GTEYW VRNSWGEPWGERGWMRIVTSTYKDG GA
Sbjct: 300 GIHAEYQEQAYINHVISVVGWGVSDGTEYWIVRNSWGEPWGERGWMRIVTSTYKDGKGAS 359
Query: 953 YNLA 964
YNLA
Sbjct: 360 YNLA 363
>ref|NP_001182763.1| dipeptidyl peptidase 1 [Canis lupus familiaris].
Length = 459
Score = 124 bits (311), Expect = 2e-28
Identities = 84/259 (32%), Positives = 114/259 (44%), Gaps = 19/259 (7%)
Frame = +2
Query: 209 DQRTQLGHRTYPRP---------HEYLSPSDLPRSWDWRNVNGVNYASVTRNQHIPQYCG 361
D T++G R PRP HE +S LP SWDWRNV G N+ S RNQ CG
Sbjct: 199 DMMTRVGGRKIPRPKPTPLTAEIHEEISR--LPTSWDWRNVRGTNFVSPVRNQ---ASCG 253
Query: 362 SCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGN-AGSCEGGDD-LPVWAYAHRHG 535
SC+A ST+ + RI I + +LS Q ++ C A CEGG L YA G
Sbjct: 254 SCYAFASTAMLEARIRILTNNT-QTPILSPQEIVSCSQYAQGCEGGFPYLIAGKYAQDFG 312
Query: 536 IPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAEIYA 715
+ +E C Y D C +C + + VG + M E+
Sbjct: 313 LVEEACFPYAGSDSPCKP----------NDCFRYYSSEYYYVGGFYGACNEALMKLELVR 362
Query: 716 NGPISCGIMATEKMSNYTGGIYAE------YKDQAYINHIVSVAGWGV--SGGTEYWNVR 871
+GP++ + +Y GIY + NH V + G+G + G +YW V+
Sbjct: 363 HGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVK 422
Query: 872 NSWGEPWGERGWMRIVTST 928
NSWG WGE G+ RI T
Sbjct: 423 NSWGSRWGEDGYFRIRRGT 441
>ref|XP_535784.2| PREDICTED: similar to Dipeptidyl-peptidase I precursor (DPP-I)
(DPPI) (Cathepsin C) (Cathepsin J) (Dipeptidyl
transferase), partial [Canis familiaris].
Length = 481
Score = 119 bits (299), Expect = 5e-27
Identities = 87/259 (33%), Positives = 115/259 (44%), Gaps = 19/259 (7%)
Frame = +2
Query: 209 DQRTQLGHRTYPRP---------HEYLSPSDLPRSWDWRNVNGVNYASVTRNQHIPQYCG 361
D + G R PRP HE +S LP SWDWRNV G N+ S RNQ CG
Sbjct: 221 DMMRRAGGRKIPRPKPTPLTAEIHEEISR--LPTSWDWRNVRGTNFVSPVRNQ---ASCG 275
Query: 362 SCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGN-AGSCEGG-DDLPVWAYAHRHG 535
SC+A ST + RI I + +LS Q ++ C A CEGG L YA G
Sbjct: 276 SCYAFASTVMLEARIRILTNNT-QTPILSPQEIVSCSQYAQGCEGGFPYLIAGKYAQDFG 334
Query: 536 IPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAEIYA 715
+ DE C +Y D C K N C H + + G YG+ + M E+
Sbjct: 335 LVDEACFSYAGSDSPC-KPNDC--------FHYYSSEYHYVGGFYGACN-EALMKLELVR 384
Query: 716 NGPISCGIMATEKMSNYTGGIYAE------YKDQAYINHIVSVAGWGV--SGGTEYWNVR 871
+GP++ + +Y GIY NH V + G+G + G +YW V+
Sbjct: 385 HGPMAVAFEVYDDFFHYQKGIYYHTGLRDPINPFELTNHAVLLVGYGTDSASGMDYWIVK 444
Query: 872 NSWGEPWGERGWMRIVTST 928
NSWG WGE G+ +I T
Sbjct: 445 NSWGSRWGEDGYFQICRGT 463
>ref|XP_543203.2| PREDICTED: similar to cathepsin B preproprotein [Canis familiaris].
Length = 420
Score = 94.0 bits (232), Expect = 3e-19
Identities = 75/261 (28%), Positives = 104/261 (39%), Gaps = 27/261 (10%)
Frame = +2
Query: 218 TQLGHRTYPRPHEYLSPSDLPRSWD----WRNVNGVNYASVTRNQHIPQYCGSCWAHGST 385
T LG P+ ++ LP S+D W N + R+Q CGSCWA G+
Sbjct: 142 TFLGGPKLPQRVQFAKNLILPESFDAREQWPNCPTIKEI---RDQGS---CGSCWAFGAV 195
Query: 386 SAMADRINIKRKGAWPSTLLSVQHVIDCGN--AGSCEGGDDLPVWAYAHRHGIPDET--- 550
A++DRI I+ G + + + CG+ C GG W + + G+
Sbjct: 196 EAISDRICIRTNGHVNVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYD 255
Query: 551 ----CNNYQ---------AKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYG----SV 679
C Y C C++ E +Y K YG SV
Sbjct: 256 SHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPSYKEDK--HYGCSSYSV 313
Query: 680 SGREK-MMAEIYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTE 856
S EK +MAEIY NGP+ Y G+Y + H V + GWGV GT
Sbjct: 314 SDNEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVEDGTP 373
Query: 857 YWNVRNSWGEPWGERGWMRIV 919
YW V NSW WG+ G+ +I+
Sbjct: 374 YWLVGNSWNTDWGDNGFFKIL 394
>ref|NP_001002938.1| cathepsin S precursor [Canis lupus familiaris].
Length = 331
Score = 90.9 bits (224), Expect = 3e-18
Identities = 75/250 (30%), Positives = 111/250 (44%), Gaps = 10/250 (4%)
Frame = +2
Query: 263 SPSDLPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTL 442
S LP S DWR V + CG+CWA + A+ ++ +K G S
Sbjct: 111 SNQKLPDSVDWREKGCVTEVKYQGS------CGACWAFSAVGALEAQLKLKT-GKLVS-- 161
Query: 443 LSVQHVIDC-----GNAGSCEGGDDLPVWAYA-HRHGIPDETCNNYQAKDQVC--DKFNQ 598
LS Q+++DC GN G C GG + Y +GI E Y+A + C D +
Sbjct: 162 LSAQNLVDCSTEKYGNKG-CNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKR 220
Query: 599 CGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAEIYAN-GPISCGIMATE-KMSNYTG 772
TC+++ E G E + E AN GP+S I A+ Y
Sbjct: 221 AATCSKYTELPF----------------GSEDALKEAVANKGPVSVAIDASHYSFFLYRS 264
Query: 773 GIYAEYKDQAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRIVTSTYKDGGGAH 952
G+Y E +NH V V G+G G +YW V+NSWG +G++G++R+ ++ G H
Sbjct: 265 GVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNS-----GNH 319
Query: 953 YNLASRKTAP 982
+AS + P
Sbjct: 320 CGIASYPSYP 329
>ref|XP_539782.2| PREDICTED: similar to Cathepsin O precursor [Canis familiaris].
Length = 518
Score = 86.3 bits (212), Expect = 6e-17
Identities = 68/218 (31%), Positives = 94/218 (43%), Gaps = 4/218 (1%)
Frame = +2
Query: 275 LPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 454
LP +DWR+ V RNQ Q CG CWA A+ IK K P +SVQ
Sbjct: 303 LPLRFDWRDKRVVTQV---RNQ---QTCGGCWAFSVVGAVESAYAIKGK---PLADISVQ 353
Query: 455 HVIDCG-NAGSCEGGDDLPVWAYAHRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKECH 631
VIDC N C GG L + ++ + + Y K Q G C F +
Sbjct: 354 QVIDCSYNNYGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQN-------GLCHYFSD-- 404
Query: 632 VIQNYTLWKVGDYGSV--SGREKMMAEIYAN-GPISCGIMATEKMSNYTGGIYAEYKDQA 802
+Y+ + + Y + S +E MA++ GP+ + A +Y GGI +
Sbjct: 405 ---SYSGFSIRGYSAYDFSDQEDEMAKVLLTFGPLVVVVDAVS-WQDYLGGIIQHHCSSG 460
Query: 803 YINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRI 916
NH V + G+ G T YW VRNSWG WG G+ +
Sbjct: 461 EANHAVLITGFDKIGSTPYWIVRNSWGSSWGVDGYAHV 498
>ref|NP_001029168.1| cathepsin K precursor [Canis lupus familiaris].
Length = 330
Score = 83.6 bits (205), Expect = 4e-16
Identities = 70/238 (29%), Positives = 108/238 (45%), Gaps = 5/238 (2%)
Frame = +2
Query: 269 SDLPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLS 448
S P S D+R Y + +NQ CGSCWA S A+ ++ K+ G + LS
Sbjct: 114 SRAPDSVDYRKKG---YVTPVKNQG---QCGSCWAFSSVGALEGQLK-KKTGKLLN--LS 164
Query: 449 VQHVIDCGNAGS-CEGGDDLPVWAYAHRH-GIPDETCNNYQAKDQVCDKFNQCGTCTEFK 622
Q+++DC + C GG + Y ++ GI E Y +D+ C +N G + +
Sbjct: 165 PQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESC-MYNPTGKAAKCR 223
Query: 623 ECHVIQNYTLWKVGDYGSVSGREKMMAEIYAN-GPISCGIMAT-EKMSNYTGGIYAEYK- 793
I G EK + A GPIS I A+ Y+ G+Y +
Sbjct: 224 GYREIPE-------------GNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENC 270
Query: 794 DQAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRIVTSTYKDGGGAHYNLAS 967
+ +NH V G+G+ G ++W ++NSWGE WG +G+ I+ + K+ NLAS
Sbjct: 271 NSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGY--ILMARNKNNACGIANLAS 326
>ref|XP_535330.2| PREDICTED: similar to P3ECSL [Canis familiaris].
Length = 550
Score = 79.3 bits (194), Expect = 8e-15
Identities = 58/218 (26%), Positives = 88/218 (40%), Gaps = 30/218 (13%)
Frame = +2
Query: 356 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCG--NAGSCEGGDDLPVWAYAHR 529
C WA + + +DR++I G + +LS Q+++ C N C GG W + R
Sbjct: 309 CAGSWAFSTAAVASDRVSIHSLGHM-TPVLSPQNLLSCDTHNQQGCRGGRLDGAWWFLRR 367
Query: 530 HGIPDETCNNYQAKDQVCDKFNQCGTCTEFKEC---------------HVIQNYTLWKVG 664
G+ + C + ++Q D+ C HV N
Sbjct: 368 RGVVSDHCYPFVGREQ--DEAGPAPRCMMHSRAMGRGKRQATARCPSSHVHANDIYQVTP 425
Query: 665 DYGSVSGREKMMAEIYANGPISCGIMATEKMSNYTGGIYAEY--------KDQAYINHIV 820
Y + +++M E+ NGP+ + E Y GGIY+ + + + H V
Sbjct: 426 AYRLGTNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGRPERYRRHGTHSV 485
Query: 821 SVAGWGVS----GGT-EYWNVRNSWGEPWGERGWMRIV 919
+ GWG G T +YW NSWG WGERG RIV
Sbjct: 486 KITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIV 523
>ref|XP_536212.2| PREDICTED: similar to Cathepsin H precursor [Canis familiaris].
Length = 304
Score = 78.6 bits (192), Expect = 1e-14
Identities = 63/216 (29%), Positives = 90/216 (41%), Gaps = 10/216 (4%)
Frame = +2
Query: 290 DWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSV--QHVI 463
DWR + S +NQ CGSCW +T A+ I IK LLS+ Q ++
Sbjct: 89 DWRKKG--KFVSPVKNQGS---CGSCWTFSTTGALESAIAIKS-----GKLLSLAEQQLV 138
Query: 464 DCG---NAGSCEG-GDDLPVWAYA-HRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKEC 628
DC N C+G G L + Y + GI E Y+ +D C K+ K+
Sbjct: 139 DCAQNFNNHGCQGYGAPLQAFEYIRYNKGIMGEDSYPYKGQDGDC-KYQPSKAIAFVKD- 196
Query: 629 HVIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGIMATEKMSNYTGGIYAE---YKDQ 799
+ N T ++ + M+ + P+S T Y GIY+ +K
Sbjct: 197 --VANIT---------INDEQAMVEAVALYNPVSFAFEVTSDFMMYRKGIYSSTSCHKTP 245
Query: 800 AYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGW 907
+NH V G+G G YW V+NSWG WG G+
Sbjct: 246 DKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGY 281
>ref|XP_538969.2| PREDICTED: similar to tubulointerstitial nephritis antigen [Canis
familiaris].
Length = 476
Score = 78.2 bits (191), Expect = 2e-14
Identities = 59/213 (27%), Positives = 90/213 (42%), Gaps = 25/213 (11%)
Frame = +2
Query: 356 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDC--GNAGSCEGGDDLPVWAYAHR 529
C + WA + S ADRI I+ G + + L S Q++I C N C G W + +
Sbjct: 240 CAASWAFSTASVAADRIAIQSNGRYTANL-SPQNLISCCAKNRHGCNSGSIDRAWWFLRK 298
Query: 530 HGIPDETC----NNYQAKDQVCDKFNQC---GTCTEFKEC--HVIQNYTLWKVGDYGSVS 682
G+ C + A + C ++ G K C ++ ++ +++ VS
Sbjct: 299 RGLVSHACYPLFKDQNATNYGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCSPPYRVS 358
Query: 683 GRE-KMMAEIYANGPISCGIMATEKMSNYTGGIYAEY--------KDQAYINHIVSVAGW 835
E ++M EI NGP+ + E +Y GIY K Q H V + GW
Sbjct: 359 SNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHITRTNEESRKYQKLQTHAVKLTGW 418
Query: 836 GVSGGTE-----YWNVRNSWGEPWGERGWMRIV 919
G G + +W NSWG WGE G+ RI+
Sbjct: 419 GTLKGAQGQKEKFWIAANSWGISWGENGYFRIL 451
Database: RefSeq49_CP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 18,874,504
Number of sequences in database: 33,336
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33336
Number of Hits to DB: 48,988,208
Number of extensions: 1456022
Number of successful extensions: 7307
Number of sequences better than 1.0e-05: 13
Number of HSP's gapped: 7253
Number of HSP's successfully gapped: 13
Length of query: 457
Length of database: 18,874,504
Length adjustment: 106
Effective length of query: 351
Effective length of database: 15,340,888
Effective search space: 5384651688
Effective search space used: 5384651688
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 36 (18.5 bits)
Search to RefSeqHP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-000959
(1371 letters)
Database: RefSeq49_HP.fasta
32,964 sequences; 18,297,164 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|NP_001327.2| cathepsin Z preproprotein [Homo sapiens]. 513 e-145
Alignment gi|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein ... 113 5e-25
Alignment gi|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapie... 99 1e-20
Alignment gi|NP_001186668.1| cathepsin S isoform 2 preproprotein [Homo sa... 95 1e-19
Alignment gi|NP_680093.1| cathepsin B preproprotein [Homo sapiens]. 93 7e-19
Alignment gi|NP_680092.1| cathepsin B preproprotein [Homo sapiens]. 93 7e-19
Alignment gi|NP_680091.1| cathepsin B preproprotein [Homo sapiens]. 93 7e-19
Alignment gi|NP_680090.1| cathepsin B preproprotein [Homo sapiens]. 93 7e-19
Alignment gi|NP_001899.1| cathepsin B preproprotein [Homo sapiens]. 93 7e-19
Alignment gi|NP_004381.2| pro-cathepsin H preproprotein [Homo sapiens]. 90 6e-18
>ref|NP_001327.2| cathepsin Z preproprotein [Homo sapiens].
Length = 303
Score = 513 bits (1322), Expect = e-145
Identities = 227/266 (85%), Positives = 246/266 (92%)
Frame = +2
Query: 167 HFRPGCSCYRPLRGDQRTQLGHRTYPRPHEYLSPSDLPRSWDWRNVNGVNYASVTRNQHI 346
+FR G +CYRPLRGD LG TYPRPHEYLSP+DLP+SWDWRNV+GVNYAS+TRNQHI
Sbjct: 26 YFRRGQTCYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHI 85
Query: 347 PQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNAGSCEGGDDLPVWAYAH 526
PQYCGSCWAH STSAMADRINIKRKGAWPSTLLSVQ+VIDCGNAGSCEGG+DL VW YAH
Sbjct: 86 PQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCGNAGSCEGGNDLSVWDYAH 145
Query: 527 RHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAE 706
+HGIPDETCNNYQAKDQ CDKFNQCGTC EFKECH I+NYTLW+VGDYGS+SGREKMMAE
Sbjct: 146 QHGIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAE 205
Query: 707 IYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWNVRNSWGE 886
IYANGPISCGIMATE+++NYTGGIYAEY+D YINH+VSVAGWG+S GTEYW VRNSWGE
Sbjct: 206 IYANGPISCGIMATERLANYTGGIYAEYQDTTYINHVVSVAGWGISDGTEYWIVRNSWGE 265
Query: 887 PWGERGWMRIVTSTYKDGGGAHYNLA 964
PWGERGW+RIVTSTYKDG GA YNLA
Sbjct: 266 PWGERGWLRIVTSTYKDGKGARYNLA 291
>ref|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein [Homo sapiens].
Length = 463
Score = 113 bits (282), Expect = 5e-25
Identities = 75/228 (32%), Positives = 103/228 (45%), Gaps = 10/228 (4%)
Frame = +2
Query: 275 LPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 454
LP SWDWRNV+G+N+ S RNQ CGSC++ S + RI I + + +LS Q
Sbjct: 231 LPTSWDWRNVHGINFVSPVRNQ---ASCGSCYSFASMGMLEARIRILTNNS-QTPILSPQ 286
Query: 455 HVIDCGN-AGSCEGG-DDLPVWAYAHRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKEC 628
V+ C A CEGG L YA G+ +E C Y D C C F+
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDC-----FRYY 341
Query: 629 HVIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGIMATEKMSNYTGGIYAE------Y 790
+Y VG + M E+ +GP++ + +Y GIY +
Sbjct: 342 SSEYHY----VGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPF 397
Query: 791 KDQAYINHIVSVAGWGV--SGGTEYWNVRNSWGEPWGERGWMRIVTST 928
NH V + G+G + G +YW V+NSWG WGE G+ RI T
Sbjct: 398 NPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGT 445
>ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens].
Length = 331
Score = 98.6 bits (244), Expect = 1e-20
Identities = 78/246 (31%), Positives = 110/246 (44%), Gaps = 10/246 (4%)
Frame = +2
Query: 275 LPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 454
LP S DWR V + CG+CWA + A+ ++ +K G S LS Q
Sbjct: 115 LPDSVDWREKGCVTEVKYQGS------CGACWAFSAVGALEAQLKLKT-GKLVS--LSAQ 165
Query: 455 HVIDC-----GNAGSCEGGDDLPVWAYA-HRHGIPDETCNNYQAKDQVC--DKFNQCGTC 610
+++DC GN G C GG + Y GI + Y+A DQ C D + TC
Sbjct: 166 NLVDCSTEKYGNKG-CNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATC 224
Query: 611 TEFKECHVIQNYTLWKVGDYGSVSGREKMMAEIYAN-GPISCGIMATE-KMSNYTGGIYA 784
+++ E GRE ++ E AN GP+S G+ A Y G+Y
Sbjct: 225 SKYTELPY----------------GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYY 268
Query: 785 EYKDQAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRIVTSTYKDGGGAHYNLA 964
E +NH V V G+G G EYW V+NSWG +GE G++R+ + G H +A
Sbjct: 269 EPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNK-----GNHCGIA 323
Query: 965 SRKTAP 982
S + P
Sbjct: 324 SFPSYP 329
>ref|NP_001186668.1| cathepsin S isoform 2 preproprotein [Homo sapiens].
Length = 281
Score = 95.1 bits (235), Expect = 1e-19
Identities = 71/219 (32%), Positives = 102/219 (46%), Gaps = 10/219 (4%)
Frame = +2
Query: 356 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDC-----GNAGSCEGGDDLPVWAY 520
CG+CWA + A+ ++ +K G S LS Q+++DC GN G C GG + Y
Sbjct: 86 CGACWAFSAVGALEAQLKLKT-GKLVS--LSAQNLVDCSTEKYGNKG-CNGGFMTTAFQY 141
Query: 521 A-HRHGIPDETCNNYQAKDQVC--DKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGRE 691
GI + Y+A DQ C D + TC+++ E GRE
Sbjct: 142 IIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPY----------------GRE 185
Query: 692 KMMAEIYAN-GPISCGIMATE-KMSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWN 865
++ E AN GP+S G+ A Y G+Y E +NH V V G+G G EYW
Sbjct: 186 DVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWL 245
Query: 866 VRNSWGEPWGERGWMRIVTSTYKDGGGAHYNLASRKTAP 982
V+NSWG +GE G++R+ + G H +AS + P
Sbjct: 246 VKNSWGHNFGEEGYIRMARNK-----GNHCGIASFPSYP 279
>ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens].
Length = 339
Score = 92.8 bits (229), Expect = 7e-19
Identities = 63/210 (30%), Positives = 85/210 (40%), Gaps = 22/210 (10%)
Frame = +2
Query: 356 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNA---GSCEGGDDLPVWAYAH 526
CGSCWA G+ A++DRI I A S +S + ++ C + C GG W +
Sbjct: 105 CGSCWAFGAVEAISDRICI-HTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWT 163
Query: 527 RHGI-------PDETCNNY---------QAKDQVCDKFNQCGTCTEFKECHVIQNYTLWK 658
R G+ C Y C C++ E Y K
Sbjct: 164 RKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDK 223
Query: 659 VGDYG--SVSGREK-MMAEIYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVA 829
Y SVS EK +MAEIY NGP+ Y G+Y + H + +
Sbjct: 224 HYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRIL 283
Query: 830 GWGVSGGTEYWNVRNSWGEPWGERGWMRIV 919
GWGV GT YW V NSW WG+ G+ +I+
Sbjct: 284 GWGVENGTPYWLVANSWNTDWGDNGFFKIL 313
>ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens].
Length = 339
Score = 92.8 bits (229), Expect = 7e-19
Identities = 63/210 (30%), Positives = 85/210 (40%), Gaps = 22/210 (10%)
Frame = +2
Query: 356 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNA---GSCEGGDDLPVWAYAH 526
CGSCWA G+ A++DRI I A S +S + ++ C + C GG W +
Sbjct: 105 CGSCWAFGAVEAISDRICI-HTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWT 163
Query: 527 RHGI-------PDETCNNY---------QAKDQVCDKFNQCGTCTEFKECHVIQNYTLWK 658
R G+ C Y C C++ E Y K
Sbjct: 164 RKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDK 223
Query: 659 VGDYG--SVSGREK-MMAEIYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVA 829
Y SVS EK +MAEIY NGP+ Y G+Y + H + +
Sbjct: 224 HYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRIL 283
Query: 830 GWGVSGGTEYWNVRNSWGEPWGERGWMRIV 919
GWGV GT YW V NSW WG+ G+ +I+
Sbjct: 284 GWGVENGTPYWLVANSWNTDWGDNGFFKIL 313
>ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens].
Length = 339
Score = 92.8 bits (229), Expect = 7e-19
Identities = 63/210 (30%), Positives = 85/210 (40%), Gaps = 22/210 (10%)
Frame = +2
Query: 356 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNA---GSCEGGDDLPVWAYAH 526
CGSCWA G+ A++DRI I A S +S + ++ C + C GG W +
Sbjct: 105 CGSCWAFGAVEAISDRICI-HTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWT 163
Query: 527 RHGI-------PDETCNNY---------QAKDQVCDKFNQCGTCTEFKECHVIQNYTLWK 658
R G+ C Y C C++ E Y K
Sbjct: 164 RKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDK 223
Query: 659 VGDYG--SVSGREK-MMAEIYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVA 829
Y SVS EK +MAEIY NGP+ Y G+Y + H + +
Sbjct: 224 HYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRIL 283
Query: 830 GWGVSGGTEYWNVRNSWGEPWGERGWMRIV 919
GWGV GT YW V NSW WG+ G+ +I+
Sbjct: 284 GWGVENGTPYWLVANSWNTDWGDNGFFKIL 313
>ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens].
Length = 339
Score = 92.8 bits (229), Expect = 7e-19
Identities = 63/210 (30%), Positives = 85/210 (40%), Gaps = 22/210 (10%)
Frame = +2
Query: 356 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNA---GSCEGGDDLPVWAYAH 526
CGSCWA G+ A++DRI I A S +S + ++ C + C GG W +
Sbjct: 105 CGSCWAFGAVEAISDRICI-HTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWT 163
Query: 527 RHGI-------PDETCNNY---------QAKDQVCDKFNQCGTCTEFKECHVIQNYTLWK 658
R G+ C Y C C++ E Y K
Sbjct: 164 RKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDK 223
Query: 659 VGDYG--SVSGREK-MMAEIYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVA 829
Y SVS EK +MAEIY NGP+ Y G+Y + H + +
Sbjct: 224 HYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRIL 283
Query: 830 GWGVSGGTEYWNVRNSWGEPWGERGWMRIV 919
GWGV GT YW V NSW WG+ G+ +I+
Sbjct: 284 GWGVENGTPYWLVANSWNTDWGDNGFFKIL 313
>ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens].
Length = 339
Score = 92.8 bits (229), Expect = 7e-19
Identities = 63/210 (30%), Positives = 85/210 (40%), Gaps = 22/210 (10%)
Frame = +2
Query: 356 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNA---GSCEGGDDLPVWAYAH 526
CGSCWA G+ A++DRI I A S +S + ++ C + C GG W +
Sbjct: 105 CGSCWAFGAVEAISDRICI-HTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWT 163
Query: 527 RHGI-------PDETCNNY---------QAKDQVCDKFNQCGTCTEFKECHVIQNYTLWK 658
R G+ C Y C C++ E Y K
Sbjct: 164 RKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDK 223
Query: 659 VGDYG--SVSGREK-MMAEIYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVA 829
Y SVS EK +MAEIY NGP+ Y G+Y + H + +
Sbjct: 224 HYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRIL 283
Query: 830 GWGVSGGTEYWNVRNSWGEPWGERGWMRIV 919
GWGV GT YW V NSW WG+ G+ +I+
Sbjct: 284 GWGVENGTPYWLVANSWNTDWGDNGFFKIL 313
>ref|NP_004381.2| pro-cathepsin H preproprotein [Homo sapiens].
Length = 335
Score = 89.7 bits (221), Expect = 6e-18
Identities = 69/220 (31%), Positives = 93/220 (42%), Gaps = 7/220 (3%)
Frame = +2
Query: 278 PRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQH 457
P S DWR N+ S +NQ CGSCW +T A+ I I L+ Q
Sbjct: 117 PPSVDWRKKG--NFVSPVKNQGA---CGSCWTFSTTGALESAIAI---ATGKMLSLAEQQ 168
Query: 458 VIDCG---NAGSCEGGDDLPVWAYA-HRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKE 625
++DC N C+GG + Y + GI E YQ KD C KF Q G F +
Sbjct: 169 LVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYC-KF-QPGKAIGFVK 226
Query: 626 CHVIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGIMATEKMSNYTGGIYAE---YKD 796
+ N T++ E M+ + P+S T+ Y GIY+ +K
Sbjct: 227 D--VANITIYD---------EEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKT 275
Query: 797 QAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRI 916
+NH V G+G G YW V+NSWG WG G+ I
Sbjct: 276 PDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLI 315
Database: RefSeq49_HP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 18,297,164
Number of sequences in database: 32,964
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 32964
Number of Hits to DB: 47,691,320
Number of extensions: 1399270
Number of successful extensions: 6716
Number of sequences better than 1.0e-05: 22
Number of HSP's gapped: 6641
Number of HSP's successfully gapped: 22
Length of query: 457
Length of database: 18,297,164
Length adjustment: 106
Effective length of query: 351
Effective length of database: 14,802,980
Effective search space: 5195845980
Effective search space used: 5195845980
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 36 (18.5 bits)
Search to RefSeqMP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-000959
(1371 letters)
Database: RefSeq49_MP.fasta
30,036 sequences; 15,617,559 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|NP_071720.1| cathepsin Z preproprotein [Mus musculus]. 512 e-145
Alignment gi|NP_034112.3| dipeptidyl peptidase 1 preproprotein [Mus muscu... 118 1e-26
Alignment gi|NP_064680.1| cathepsin R precursor [Mus musculus]. 99 6e-21
Alignment gi|NP_036137.1| cathepsin J [Mus musculus]. 94 3e-19
Alignment gi|NP_083912.2| cathepsin Q [Mus musculus]. 93 4e-19
Alignment gi|NP_067420.1| cathepsin 6 [Mus musculus]. 91 2e-18
Alignment gi|NP_031827.2| pro-cathepsin H preproprotein [Mus musculus]. 89 6e-18
Alignment gi|NP_067256.2| cathepsin S preproprotein [Mus musculus]. 86 5e-17
Alignment gi|NP_031828.2| cathepsin K precursor [Mus musculus]. 86 7e-17
Alignment gi|NP_808330.1| cathepsin O precursor [Mus musculus]. 86 9e-17
>ref|NP_071720.1| cathepsin Z preproprotein [Mus musculus].
Length = 306
Score = 512 bits (1319), Expect = e-145
Identities = 231/267 (86%), Positives = 246/267 (92%), Gaps = 1/267 (0%)
Frame = +2
Query: 167 HFRPGCSCYRPLRGDQRTQLGHRTYPRPHEYLSPSDLPRSWDWRNVNGVNYASVTRNQHI 346
+FR G +CY P+RGDQ LG RTYPRPHEYLSP+DLP++WDWRNVNGVNYASVTRNQHI
Sbjct: 28 YFRSGQTCYHPIRGDQLALLGRRTYPRPHEYLSPADLPKNWDWRNVNGVNYASVTRNQHI 87
Query: 347 PQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNAGSCEGGDDLPVWAYAH 526
PQYCGSCWAHGSTSAMADRINIKRKGAWPS LLSVQ+VIDCGNAGSCEGG+DLPVW YAH
Sbjct: 88 PQYCGSCWAHGSTSAMADRINIKRKGAWPSILLSVQNVIDCGNAGSCEGGNDLPVWEYAH 147
Query: 527 RHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAE 706
+HGIPDETCNNYQAKDQ CDKFNQCGTCTEFKECH IQNYTLW+VGDYGS+SGREKMMAE
Sbjct: 148 KHGIPDETCNNYQAKDQDCDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMAE 207
Query: 707 IYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVAGWGVSG-GTEYWNVRNSWG 883
IYANGPISCGIMATE MSNYTGGIYAE++DQA INHI+SVAGWGVS G EYW VRNSWG
Sbjct: 208 IYANGPISCGIMATEMMSNYTGGIYAEHQDQAVINHIISVAGWGVSNDGIEYWIVRNSWG 267
Query: 884 EPWGERGWMRIVTSTYKDGGGAHYNLA 964
EPWGE+GWMRIVTSTYK G G YNLA
Sbjct: 268 EPWGEKGWMRIVTSTYKGGTGDSYNLA 294
>ref|NP_034112.3| dipeptidyl peptidase 1 preproprotein [Mus musculus].
Length = 462
Score = 118 bits (295), Expect = 1e-26
Identities = 77/231 (33%), Positives = 107/231 (46%), Gaps = 12/231 (5%)
Frame = +2
Query: 272 DLPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSV 451
+LP SWDWRNV GVNY S RNQ + CGSC++ S + RI I + + +LS
Sbjct: 229 NLPESWDWRNVQGVNYVSPVRNQ---ESCGSCYSFASMGMLEARIRILTNNS-QTPILSP 284
Query: 452 QHVIDCG-NAGSCEGG-DDLPVWAYAHRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKE 625
Q V+ C A C+GG L YA G+ +E+C Y AKD C C
Sbjct: 285 QEVVSCSPYAQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDSPCKPRENC-------- 336
Query: 626 CHVIQNYT--LWKVGDYGSVSGREKMMAEIYANGPISCGIMATEKMSNYTGGIYAE---- 787
++ Y+ + VG + M E+ +GP++ + +Y GIY
Sbjct: 337 ---LRYYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLS 393
Query: 788 --YKDQAYINHIVSVAGWGVS--GGTEYWNVRNSWGEPWGERGWMRIVTST 928
+ NH V + G+G G EYW ++NSWG WGE G+ RI T
Sbjct: 394 DPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGT 444
>ref|NP_064680.1| cathepsin R precursor [Mus musculus].
Length = 334
Score = 99.4 bits (246), Expect = 6e-21
Identities = 79/243 (32%), Positives = 108/243 (44%), Gaps = 14/243 (5%)
Frame = +2
Query: 269 SDLPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPS---T 439
S LP+ DWR Y + R Q C +CWA T A I+ + W + T
Sbjct: 113 SILPKFVDWRKKG---YVTPVRRQGD---CDACWAFAVTGA------IEAQAIWQTGKLT 160
Query: 440 LLSVQHVIDC----GNAGSCEGGDDLPVWAYA-HRHGIPDETCNNYQAKDQVCDKFNQCG 604
LSVQ+++DC GN G C GGD + Y H G+ E Y+ KD C ++N
Sbjct: 161 PLSVQNLVDCSKPQGNNG-CLGGDTYNAFQYVLHNGGLESEATYPYEGKDGPC-RYNPKN 218
Query: 605 TCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGIMAT-EKMSNYTGGIY 781
+ E G + +MA + GPI+ GI A+ E NY GGIY
Sbjct: 219 SKAEI-------------TGFVSLPQSEDILMAAVATIGPITAGIDASHESFKNYKGGIY 265
Query: 782 AEYKDQA-YINHIVSVAGWGVSG----GTEYWNVRNSWGEPWGERGWMRIVTSTYKDGGG 946
E + + H V V G+G G G YW ++NSWG+ WG RG+M++ G
Sbjct: 266 HEPNCSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRWGIRGYMKLAKDKNNHCGI 325
Query: 947 AHY 955
A Y
Sbjct: 326 ASY 328
>ref|NP_036137.1| cathepsin J [Mus musculus].
Length = 333
Score = 93.6 bits (231), Expect = 3e-19
Identities = 80/248 (32%), Positives = 113/248 (45%), Gaps = 12/248 (4%)
Frame = +2
Query: 275 LPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 454
LP DWR Y + RNQ CGSCWA + A+ ++ K P LSVQ
Sbjct: 113 LPDYKDWREEG---YVTPVRNQG---KCGSCWAFAAAGAIEGQMFWKTGNLTP---LSVQ 163
Query: 455 HVIDC----GNAGSCEGGDDLPVWAYAHRH-GIPDETCNNYQAKDQVCDKFNQCGTCTEF 619
+++DC GN G C+ G + Y ++ G+ E Y+ KD C +
Sbjct: 164 NLLDCSKTVGNKG-CQSGTAHQAFEYVLKNKGLEAEATYPYEGKDGPC----------RY 212
Query: 620 KECHVIQNYTLWKVGDYGSVSGREKMMAEIYAN-GPISCGIMAT-EKMSNYTGGIYAEYK 793
+ + N T DY ++ E + A+ GP+S I A+ + Y GGIY E
Sbjct: 213 RSENASANIT-----DYVNLPPNELYLWVAVASIGPVSAAIDASHDSFRFYNGGIYYEPN 267
Query: 794 DQAY-INHIVSVAGWGVSG----GTEYWNVRNSWGEPWGERGWMRIVTSTYKDGGGAHYN 958
+Y +NH V V G+G G G YW ++NSWGE WG G+M+I KD H
Sbjct: 268 CSSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSWGEEWGMNGYMQIA----KDHNN-HCG 322
Query: 959 LASRKTAP 982
+AS + P
Sbjct: 323 IASLASYP 330
>ref|NP_083912.2| cathepsin Q [Mus musculus].
Length = 343
Score = 93.2 bits (230), Expect = 4e-19
Identities = 69/235 (29%), Positives = 105/235 (44%), Gaps = 10/235 (4%)
Frame = +2
Query: 242 PRPHEYLSPSDLPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRK 421
P P + LP+ DWRN Y + RNQ + C SCWA T A+ ++ K
Sbjct: 114 PFPKSWYWKDALPKFVDWRNEG---YVTRVRNQ---RNCNSCWAFPVTGAIEGQMFKKTG 167
Query: 422 GAWPSTLLSVQHVIDC----GNAGSCEGGDDLPVWAYA-HRHGIPDETCNNYQAKDQVCD 586
P LSVQ+++DC GN G C G+ + Y H G+ + Y+ K+ +C
Sbjct: 168 KLIP---LSVQNLVDCSRPQGNRG-CRWGNTYNGFQYVLHNGGLEAQATYPYEGKEGLC- 222
Query: 587 KFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGI-MATEKMSN 763
++N + + V+ + +M + GPI+ GI + +
Sbjct: 223 RYNPKNSAAKITGFVVLPE-------------SEDVLMDAVATKGPIATGIHVVSSSFRF 269
Query: 764 YTGGIYAEYKDQAYINHIVSVAGWGVSG----GTEYWNVRNSWGEPWGERGWMRI 916
Y GG+Y E + +NH V + G+G G G YW ++NSWG WG G+M I
Sbjct: 270 YDGGVYYEPNCTSSVNHAVLIIGYGYVGNETDGNNYWLIKNSWGRRWGLSGYMMI 324
>ref|NP_067420.1| cathepsin 6 [Mus musculus].
Length = 334
Score = 91.3 bits (225), Expect = 2e-18
Identities = 74/239 (30%), Positives = 111/239 (46%), Gaps = 12/239 (5%)
Frame = +2
Query: 275 LPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 454
LP+ DWR Y + R Q ++C SCWA A+ ++ K+ G T LSVQ
Sbjct: 115 LPKFVDWRKKG---YVTRVRRQ---KFCNSCWAFAVNGAIEGQM-FKKTGKL--TPLSVQ 165
Query: 455 HVIDC----GNAGSCEGGDDLPVWAYA-HRHGIPDETCNNYQAKDQVCDKFNQCGTCTEF 619
+++DC GN G C+ GD + Y + G+ E Y+ K+ C ++N + E
Sbjct: 166 NLVDCTKTQGNDG-CQWGDPYIAYEYVLNNGGLEAEATYPYEGKEGPC-RYNPKNSKAE- 222
Query: 620 KECHVIQNYTLWKVGDYGSVSGREKMMAEIYAN-GPISCGIMAT-EKMSNYTGGIYAEYK 793
+ + S+ E ++ E A GPIS + A+ + S Y GGIY +
Sbjct: 223 -------------ITGFVSLPESEDILMEAVATIGPISAAVDASFNRFSFYDGGIYHQPN 269
Query: 794 -DQAYINHIVSVAGWGVSG----GTEYWNVRNSWGEPWGERGWMRIVTSTYKDGGGAHY 955
+NH V V G+G G G +YW ++NSWG WG G+M+I+ G A Y
Sbjct: 270 CSNNTVNHAVLVVGYGTEGNETDGNKYWLIKNSWGRRWGIGGYMKIIRDQNNHCGIATY 328
>ref|NP_031827.2| pro-cathepsin H preproprotein [Mus musculus].
Length = 333
Score = 89.4 bits (220), Expect = 6e-18
Identities = 66/220 (30%), Positives = 92/220 (41%), Gaps = 7/220 (3%)
Frame = +2
Query: 278 PRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQH 457
P S DWR N S +NQ CGSCW +T A+ + I + L+ Q
Sbjct: 115 PSSMDWRKKG--NVVSPVKNQGA---CGSCWTFSTTGALESAVAI---ASGKMLSLAEQQ 166
Query: 458 VIDCGNAGS---CEGGDDLPVWAYA-HRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKE 625
++DC A + C+GG + Y + GI +E Y KD C +FN K
Sbjct: 167 LVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPYIGKDSSC-RFNPQKAVAFVKN 225
Query: 626 CHVIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGIMATEKMSNYTGGIYAE---YKD 796
+ N TL + M+ + P+S TE Y G+Y+ +K
Sbjct: 226 ---VVNITL---------NDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKSCHKT 273
Query: 797 QAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRI 916
+NH V G+G G YW V+NSWG WGE G+ I
Sbjct: 274 PDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLI 313
>ref|NP_067256.2| cathepsin S preproprotein [Mus musculus].
Length = 340
Score = 86.3 bits (212), Expect = 5e-17
Identities = 68/237 (28%), Positives = 104/237 (43%), Gaps = 10/237 (4%)
Frame = +2
Query: 275 LPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 454
LP + DWR V + CG+CWA + A+ ++ +K G S LS Q
Sbjct: 123 LPDTVDWREKGCVTEVKYQGS------CGACWAFSAVGALEGQLKLKT-GKLIS--LSAQ 173
Query: 455 HVIDC------GNAGSCEGGDDLPVWAYA-HRHGIPDETCNNYQAKDQVC--DKFNQCGT 607
+++DC GN G C GG + Y GI + Y+A D+ C + N+ T
Sbjct: 174 NLVDCSNEEKYGNKG-CGGGYMTEAFQYIIDNGGIEADASYPYKATDEKCHYNSKNRAAT 232
Query: 608 CTEFKECHVIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGIMATEK-MSNYTGGIYA 784
C+ Y GD + + + GP+S GI A+ Y G+Y
Sbjct: 233 CSR---------YIQLPFGD------EDALKEAVATKGPVSVGIDASHSSFFFYKSGVYD 277
Query: 785 EYKDQAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRIVTSTYKDGGGAHY 955
+ +NH V V G+G G +YW V+NSWG +G++G++R+ + G A Y
Sbjct: 278 DPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIASY 334
>ref|NP_031828.2| cathepsin K precursor [Mus musculus].
Length = 329
Score = 85.9 bits (211), Expect = 7e-17
Identities = 64/218 (29%), Positives = 99/218 (45%), Gaps = 7/218 (3%)
Frame = +2
Query: 275 LPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 454
+P S D+R Y + +NQ CGSCWA S A+ ++ +K LS Q
Sbjct: 115 VPDSIDYRKKG---YVTPVKNQG---QCGSCWAFSSAGALEGQL---KKKTGKLLALSPQ 165
Query: 455 HVIDCGNAG-SCEGGDDLPVWAYAHRHG-IPDETCNNYQAKDQVC--DKFNQCGTCTEFK 622
+++DC C GG + Y ++G I E Y +D+ C + + C ++
Sbjct: 166 NLVDCVTENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYR 225
Query: 623 ECHVIQNYTLWKVGDYGSVSGREKMMAEIYAN-GPISCGIMAT-EKMSNYTGGIYAEYK- 793
E V G EK + A GPIS I A+ Y+ G+Y +
Sbjct: 226 EIPV----------------GNEKALKRAVARVGPISVSIDASLASFQFYSRGVYYDENC 269
Query: 794 DQAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGW 907
D+ +NH V V G+G G+++W ++NSWGE WG +G+
Sbjct: 270 DRDNVNHAVLVVGYGTQKGSKHWIIKNSWGESWGNKGY 307
>ref|NP_808330.1| cathepsin O precursor [Mus musculus].
Length = 312
Score = 85.5 bits (210), Expect = 9e-17
Identities = 68/232 (29%), Positives = 99/232 (42%), Gaps = 6/232 (2%)
Frame = +2
Query: 239 YPRPHEYLSPS-DLPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIK 415
YP + P+ LP +DWR+ + VN RNQ + CG CWA SA+ I+
Sbjct: 86 YPAEGQRPIPNVSLPLRFDWRDKHVVN---PVRNQEM---CGGCWAFSVVSAIESARAIQ 139
Query: 416 RKGAWPSTLLSVQHVIDCG-NAGSCEGGDDLPV--WAYAHRHGIPDETCNNYQAKDQVCD 586
K LSVQ VIDC N C GG L W + + ++ ++A + C
Sbjct: 140 GKSL---DYLSVQQVIDCSFNNSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQCR 196
Query: 587 KFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGIMATEKMS-- 760
F Q K+ Y ++M + + GP+ ++ + MS
Sbjct: 197 HFPQSQAGVSVKDFSA-----------YNFRGQEDEMARALLSFGPL---VVIVDAMSWQ 242
Query: 761 NYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRI 916
+Y GGI + NH V + G+ +G T YW VRNSWG WG G+ +
Sbjct: 243 DYLGGIIQHHCSSGEANHAVLITGFDRTGNTPYWMVRNSWGSSWGVEGYAHV 294
Database: RefSeq49_MP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 15,617,559
Number of sequences in database: 30,036
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 30036
Number of Hits to DB: 39,848,258
Number of extensions: 1143707
Number of successful extensions: 5436
Number of sequences better than 1.0e-05: 27
Number of HSP's gapped: 5359
Number of HSP's successfully gapped: 28
Length of query: 457
Length of database: 15,617,559
Length adjustment: 105
Effective length of query: 352
Effective length of database: 12,463,779
Effective search space: 4387250208
Effective search space used: 4387250208
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 35 (18.1 bits)
Search to RefSeqSP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-000959
(1371 letters)
Database: RefSeq49_SP.fasta
24,897 sequences; 11,343,932 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|NP_001116576.1| cathepsin Z [Sus scrofa]. 585 e-167
Alignment gi|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus ... 108 9e-24
Alignment gi|NP_001090927.1| cathepsin B precursor [Sus scrofa]. 92 5e-19
Alignment gi|XP_001929706.1| PREDICTED: cathepsin S [Sus scrofa]. 92 9e-19
Alignment gi|XP_003355243.1| PREDICTED: cathepsin S-like [Sus scrofa]. 92 9e-19
Alignment gi|NP_999094.1| pro-cathepsin H precursor [Sus scrofa]. 86 6e-17
Alignment gi|XP_003129045.2| PREDICTED: tryptophan 2,3-dioxygenase [Sus s... 85 1e-16
Alignment gi|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]. 82 5e-16
Alignment gi|NP_999467.1| cathepsin K precursor [Sus scrofa]. 82 7e-16
Alignment gi|XP_001927698.3| PREDICTED: tubulointerstitial nephritis anti... 79 6e-15
>ref|NP_001116576.1| cathepsin Z [Sus scrofa].
Length = 304
Score = 585 bits (1507), Expect = e-167
Identities = 264/266 (99%), Positives = 264/266 (99%)
Frame = +2
Query: 167 HFRPGCSCYRPLRGDQRTQLGHRTYPRPHEYLSPSDLPRSWDWRNVNGVNYASVTRNQHI 346
HFRPGCSCYRPLRGDQRTQLGHRTYPRPHEYLSPSDLPRSWDWRNVNGVNYASVTRNQHI
Sbjct: 27 HFRPGCSCYRPLRGDQRTQLGHRTYPRPHEYLSPSDLPRSWDWRNVNGVNYASVTRNQHI 86
Query: 347 PQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNAGSCEGGDDLPVWAYAH 526
PQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNAGSCEGGDDLPVWAYAH
Sbjct: 87 PQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNAGSCEGGDDLPVWAYAH 146
Query: 527 RHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAE 706
RHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAE
Sbjct: 147 RHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAE 206
Query: 707 IYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWNVRNSWGE 886
IYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYW VRNSWGE
Sbjct: 207 IYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWIVRNSWGE 266
Query: 887 PWGERGWMRIVTSTYKDGGGAHYNLA 964
PWGERGWMRIVTSTYKDG GAHYNLA
Sbjct: 267 PWGERGWMRIVTSTYKDGRGAHYNLA 292
>ref|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus scrofa].
Length = 463
Score = 108 bits (269), Expect = 9e-24
Identities = 73/228 (32%), Positives = 99/228 (43%), Gaps = 10/228 (4%)
Frame = +2
Query: 275 LPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 454
LP SWDWRNV G N+ + RNQ CGSC++ S M RI I + +LS Q
Sbjct: 231 LPASWDWRNVRGTNFVTPVRNQ---ASCGSCYSFASMGMMEARIRILTNNT-QTPILSPQ 286
Query: 455 HVIDCGN-AGSCEGGDD-LPVWAYAHRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKEC 628
V+ C A C GG L YA G+ +E C Y D CT + C
Sbjct: 287 EVVSCSQYAQGCAGGFPYLIAGKYAQDFGLVEEACFPYTGTDS---------PCTVKEGC 337
Query: 629 HVIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGIMATEKMSNYTGGIYAE------Y 790
+ VG + M E+ +GP++ + +Y GIY +
Sbjct: 338 FRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLRDPF 397
Query: 791 KDQAYINHIVSVAGWG--VSGGTEYWNVRNSWGEPWGERGWMRIVTST 928
NH V + G+G ++ G +YW V+NSWG WGE G+ RI T
Sbjct: 398 NPFELTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGYFRIRRGT 445
>ref|NP_001090927.1| cathepsin B precursor [Sus scrofa].
Length = 335
Score = 92.4 bits (228), Expect = 5e-19
Identities = 68/259 (26%), Positives = 100/259 (38%), Gaps = 25/259 (9%)
Frame = +2
Query: 218 TQLGHRTYPRPHEYLSPSDLPRSWD----WRNVNGVNYASVTRNQHIPQYCGSCWAHGST 385
T LG P+ + + LP+S+D W N + R+Q CGSCWA G+
Sbjct: 61 TFLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEI---RDQGS---CGSCWAFGAV 114
Query: 386 SAMADRINIKRKGAWPSTLLSVQHVIDCGN--AGSCEGGDDLPVWAYAHRHGIPDET--- 550
A++DRI I+ G + + + CG+ C GG W + + G+
Sbjct: 115 EAISDRICIRSNGRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYD 174
Query: 551 ----CNNYQ---------AKDQVCDKFNQCGTCTEFKECHVIQNYTLWK---VGDYGSVS 682
C Y C C++ E +Y K Y
Sbjct: 175 SHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISR 234
Query: 683 GREKMMAEIYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYW 862
+++MAEIY NGP+ Y G+Y H + + GWGV GT YW
Sbjct: 235 NEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYW 294
Query: 863 NVRNSWGEPWGERGWMRIV 919
V NSW WG+ G+ +I+
Sbjct: 295 LVGNSWNTDWGDNGFFKIL 313
>ref|XP_001929706.1| PREDICTED: cathepsin S [Sus scrofa].
Length = 331
Score = 91.7 bits (226), Expect = 9e-19
Identities = 77/251 (30%), Positives = 109/251 (43%), Gaps = 12/251 (4%)
Frame = +2
Query: 239 YPRPHEYLSPSD--LPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINI 412
+PR Y S + LP S DWR V + CGSCWA + A+ ++ +
Sbjct: 101 WPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGS------CGSCWAFSAVGALEAQVKM 154
Query: 413 KRKGAWPSTLLSVQHVIDCG-----NAGSCEGGDDLPVWAYA-HRHGIPDETCNNYQAKD 574
K G S LS Q+++DC N G C GG + Y +GI E Y+A D
Sbjct: 155 KT-GRLVS--LSAQNLVDCSTEKYRNKG-CNGGFMTEAFQYIIDNNGIDSEASYPYKAVD 210
Query: 575 QVC--DKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAEIYAN-GPISCGIMA 745
C D N+ TC+ + E Y L E AN GP+S I A
Sbjct: 211 GKCKYDSKNRAATCSRYTELPFADEYAL----------------KEAVANKGPVSVAIDA 254
Query: 746 TEK-MSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRIVT 922
Y G+Y + +NH V V G+G G +YW V+NSWG +G+ G++R+
Sbjct: 255 KHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDGGYIRMAR 314
Query: 923 STYKDGGGAHY 955
++ G A+Y
Sbjct: 315 NSENHCGIANY 325
>ref|XP_003355243.1| PREDICTED: cathepsin S-like [Sus scrofa].
Length = 331
Score = 91.7 bits (226), Expect = 9e-19
Identities = 77/251 (30%), Positives = 109/251 (43%), Gaps = 12/251 (4%)
Frame = +2
Query: 239 YPRPHEYLSPSD--LPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINI 412
+PR Y S + LP S DWR V + CGSCWA + A+ ++ +
Sbjct: 101 WPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGS------CGSCWAFSAVGALEAQVKM 154
Query: 413 KRKGAWPSTLLSVQHVIDCG-----NAGSCEGGDDLPVWAYA-HRHGIPDETCNNYQAKD 574
K G S LS Q+++DC N G C GG + Y +GI E Y+A D
Sbjct: 155 KT-GRLVS--LSAQNLVDCSTEKYRNKG-CNGGFMTEAFQYIIDNNGIDSEASYPYKAVD 210
Query: 575 QVC--DKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAEIYAN-GPISCGIMA 745
C D N+ TC+ + E Y L E AN GP+S I A
Sbjct: 211 GKCKYDSKNRAATCSRYTELPFADEYAL----------------KEAVANKGPVSVAIDA 254
Query: 746 TEK-MSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRIVT 922
Y G+Y + +NH V V G+G G +YW V+NSWG +G+ G++R+
Sbjct: 255 KHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDGGYIRMAR 314
Query: 923 STYKDGGGAHY 955
++ G A+Y
Sbjct: 315 NSENHCGIANY 325
>ref|NP_999094.1| pro-cathepsin H precursor [Sus scrofa].
Length = 335
Score = 85.5 bits (210), Expect = 6e-17
Identities = 64/220 (29%), Positives = 90/220 (40%), Gaps = 7/220 (3%)
Frame = +2
Query: 278 PRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQH 457
P S DWR N+ S +NQ CGSCW +T A+ + I L+ Q
Sbjct: 117 PPSMDWRKKG--NFVSPVKNQGS---CGSCWTFSTTGALESAVAI---ATGKMLSLAEQQ 168
Query: 458 VIDCG---NAGSCEGGDDLPVWAYA-HRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKE 625
++DC N C+GG + Y + GI E Y+ +D C KF K+
Sbjct: 169 LVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDDHC-KFQPDKAIAFVKD 227
Query: 626 CHVIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGIMATEKMSNYTGGIYAE---YKD 796
+ N T+ + E M+ + P+S T Y GIY+ +K
Sbjct: 228 ---VANITM---------NDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSCHKT 275
Query: 797 QAYINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRI 916
+NH V G+G G YW V+NSWG WG G+ I
Sbjct: 276 PDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLI 315
>ref|XP_003129045.2| PREDICTED: tryptophan 2,3-dioxygenase [Sus scrofa].
Length = 411
Score = 84.7 bits (208), Expect = 1e-16
Identities = 63/198 (31%), Positives = 88/198 (44%), Gaps = 2/198 (1%)
Frame = +2
Query: 320 ASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCG-NAGSCEGG 496
A + R + +CG CWA SA+ IK + P +LSVQ VIDC N C GG
Sbjct: 207 AWLERTPGLEPHCGGCWAFSVVSAVESAYAIKGQ---PLEVLSVQQVIDCSYNNYGCNGG 263
Query: 497 DDLPVWAYAHRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGS 676
L + ++ + + + Y K Q G C F H + + D+
Sbjct: 264 STLNALYWLNKTQVKVVSDSEYPFKAQN-------GLCHYFSCSHSGVSIKDYSAYDF-- 314
Query: 677 VSGREKMMAE-IYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGT 853
SG+E MA+ + GP+ I+ +Y GGI + NH V V G+ +G T
Sbjct: 315 -SGQEDEMAKTLLTLGPLIV-IVDAVSWQDYLGGIIQHHCSSGEANHAVLVTGFDKTGST 372
Query: 854 EYWNVRNSWGEPWGERGW 907
YW VRNSWG WG G+
Sbjct: 373 PYWIVRNSWGSAWGIDGY 390
>ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa].
Length = 332
Score = 82.4 bits (202), Expect = 5e-16
Identities = 72/224 (32%), Positives = 104/224 (46%), Gaps = 11/224 (4%)
Frame = +2
Query: 278 PRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQH 457
P S DWR Y + +NQ +CGSCWA +T A+ ++ K LS Q+
Sbjct: 115 PHSVDWREKG---YVTAVKNQG---HCGSCWAFSATGALEGQMFRKTSKL---ISLSEQN 165
Query: 458 VIDC----GNAGSCEGGDDLPVWAYAHRHG-IPDETCNNYQAKDQVCDKFNQCGTCTEFK 622
++DC GN G C GG + Y +G + E Y KD C ++K
Sbjct: 166 LVDCSWPEGNEG-CNGGLMDNAFQYIKDNGGLDSEESYPYFGKDGSC----------KYK 214
Query: 623 ECHVIQNYTLWKVGDYGSVSGREKMMAEIYAN-GPISCGIMAT-EKMSNYTGGIYAEYKD 796
N T Y + +EK + + A GPIS GI A+ E Y+ GIY E +
Sbjct: 215 PQSSAANDT-----GYVDIPKQEKALMKAVATVGPISVGIDASHESFQFYSTGIYFEPQC 269
Query: 797 QAY-INHIVSVAGWGVSGG---TEYWNVRNSWGEPWGERGWMRI 916
+ ++H V V G+GV G +YW V+NSWG WG G++++
Sbjct: 270 SSEDLDHGVLVVGYGVEGAHSNNKYWLVKNSWGNTWGMDGYIKM 313
>ref|NP_999467.1| cathepsin K precursor [Sus scrofa].
Length = 330
Score = 82.0 bits (201), Expect = 7e-16
Identities = 67/235 (28%), Positives = 109/235 (46%), Gaps = 8/235 (3%)
Frame = +2
Query: 287 WDWRNVNGVNYAS---VTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQH 457
W+ R + ++Y VT ++ Q CGSCWA S A+ ++ K+ G + LS Q+
Sbjct: 112 WEGRTPDSIDYRKKGYVTPVKNQGQ-CGSCWAFSSVGALEGQLK-KKTGKLLN--LSPQN 167
Query: 458 VIDCGNAGS-CEGGDDLPVWAYAHRH-GIPDETCNNYQAKDQVCDKFNQCGTCTEFKECH 631
++DC + C GG + Y ++ GI E Y +D+ C +N G + +
Sbjct: 168 LVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDENC-MYNPTGKAAKCRGYR 226
Query: 632 VIQNYTLWKVGDYGSVSGREKMMAEIYAN-GPISCGIMAT-EKMSNYTGGIYAEYK-DQA 802
I G EK + A GP+S I A+ Y+ G+Y + +
Sbjct: 227 EIPE-------------GNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNSD 273
Query: 803 YINHIVSVAGWGVSGGTEYWNVRNSWGEPWGERGWMRIVTSTYKDGGGAHYNLAS 967
+NH V G+G+ G ++W ++NSWGE WG +G+ I+ + K+ NLAS
Sbjct: 274 NLNHAVLAVGYGIQKGKKHWIIKNSWGENWGNKGY--ILMARNKNNACGIANLAS 326
>ref|XP_001927698.3| PREDICTED: tubulointerstitial nephritis antigen [Sus scrofa].
Length = 476
Score = 79.0 bits (193), Expect = 6e-15
Identities = 59/213 (27%), Positives = 91/213 (42%), Gaps = 25/213 (11%)
Frame = +2
Query: 356 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDC--GNAGSCEGGDDLPVWAYAHR 529
C + WA + S ADRI I+ +G + + L S Q++I C N C G W Y +
Sbjct: 240 CAASWAFSTASVAADRIAIQSEGRYTANL-SPQNLISCCAKNRHGCNSGSIDRAWWYLRK 298
Query: 530 HGIPDETC----NNYQAKDQVCDKFNQC---GTCTEFKEC--HVIQNYTLWKVGDYGSVS 682
G+ C + A + C ++ G K C + ++ +++ VS
Sbjct: 299 RGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNFEKSNRIYQCSPPYRVS 358
Query: 683 GRE-KMMAEIYANGPISCGIMATEKMSNYTGGIYAEY--------KDQAYINHIVSVAGW 835
E ++M EI NGP+ + E +Y GIY K + H V + GW
Sbjct: 359 SNETEIMREIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNEESDKYRKLRTHAVKLTGW 418
Query: 836 GVSGGTE-----YWNVRNSWGEPWGERGWMRIV 919
G G + +W NSWG+ WGE G+ RI+
Sbjct: 419 GTLKGAQGRKEKFWIAANSWGKSWGENGYFRIL 451
Database: RefSeq49_SP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 11,343,932
Number of sequences in database: 24,897
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 24897
Number of Hits to DB: 31,949,446
Number of extensions: 1174906
Number of successful extensions: 4764
Number of sequences better than 1.0e-05: 14
Number of HSP's gapped: 4707
Number of HSP's successfully gapped: 14
Length of query: 457
Length of database: 11,343,932
Length adjustment: 102
Effective length of query: 355
Effective length of database: 8,804,438
Effective search space: 3125575490
Effective search space used: 3125575490
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 35 (18.1 bits)
Search to Sscrofa10_2
BLASTN 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-000959
(1371 letters)
Database: Sscrofa_10.2.fasta
4582 sequences; 2,808,509,378 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Sscrofa_Chr17 359 3e-96
>Sscrofa_Chr17
|| Length = 69701581
Score = 359 bits (181), Expect = 3e-96
Identities = 181/181 (100%)
Strand = Plus / Minus
Query: 398 gacagaatcaacatcaagaggaagggggcctggccctccacgctgctgtctgtgcagcat 457
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 66515890 gacagaatcaacatcaagaggaagggggcctggccctccacgctgctgtctgtgcagcat 66515831
Query: 458 gtcatcgactgcggcaacgccggctcctgcgaggggggcgacgacctgcccgtgtgggcc 517
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 66515830 gtcatcgactgcggcaacgccggctcctgcgaggggggcgacgacctgcccgtgtgggcc 66515771
Query: 518 tacgcccaccggcacggcatcccggatgagacctgcaacaactaccaggccaaggaccaa 577
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 66515770 tacgcccaccggcacggcatcccggatgagacctgcaacaactaccaggccaaggaccaa 66515711
Query: 578 g 578
|
Sbjct: 66515710 g 66515710
Score = 331 bits (167), Expect = 6e-88
Identities = 167/167 (100%)
Strand = Plus / Minus
Query: 232 caggacgtacccccggcctcatgagtacctgtccccatcggatctgcccaggagctggga 291
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 66519103 caggacgtacccccggcctcatgagtacctgtccccatcggatctgcccaggagctggga 66519044
Query: 292 ctggcgcaacgtgaatggggtcaactatgccagtgtcaccaggaaccagcacatccccca 351
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 66519043 ctggcgcaacgtgaatggggtcaactatgccagtgtcaccaggaaccagcacatccccca 66518984
Query: 352 gtactgcggctcctgctgggcccacggcagcaccagcgccatggctg 398
|||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 66518983 gtactgcggctcctgctgggcccacggcagcaccagcgccatggctg 66518937
Score = 323 bits (163), Expect = 1e-85
Identities = 166/167 (99%)
Strand = Plus / Minus
Query: 727 cagctgcggcatcatggccacggagaagatgtcgaactacaccggcggcatctacgccga 786
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 66513548 cagctgcggcatcatggccacggagaagatgtcgaactacaccggcggcatctacgccga 66513489
Query: 787 gtacaaggaccaggcctacatcaaccacatcgtgtcggtggccgggtggggcgtcagcgg 846
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 66513488 gtacaaggaccaggcctacatcaaccacatcgtgtcggtggccgggtggggcgtcagcgg 66513429
Query: 847 cgggacggagtactggaatgtccggaattcgtggggcgagccctggg 893
||||||||||||||||| |||||||||||||||||||||||||||||
Sbjct: 66513428 cgggacggagtactggattgtccggaattcgtggggcgagccctggg 66513382
Score = 303 bits (153), Expect = 1e-79
Identities = 153/153 (100%)
Strand = Plus / Minus
Query: 577 agtgtgtgacaagtttaaccagtgtggaacgtgcactgagttcaaagaatgccacgtcat 636
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 66514265 agtgtgtgacaagtttaaccagtgtggaacgtgcactgagttcaaagaatgccacgtcat 66514206
Query: 637 ccagaactacaccctctggaaggttggcgactacggctccgtctccgggcgggagaagat 696
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 66514205 ccagaactacaccctctggaaggttggcgactacggctccgtctccgggcgggagaagat 66514146
Query: 697 gatggctgaaatctacgccaacgggcccatcag 729
|||||||||||||||||||||||||||||||||
Sbjct: 66514145 gatggctgaaatctacgccaacgggcccatcag 66514113
Score = 167 bits (84), Expect = 2e-38
Identities = 120/128 (93%), Gaps = 3/128 (2%)
Strand = Plus / Minus
Query: 892 gggtgaacgaggctggatgaggattgtgaccagcacctacaaagacggcgggggcgccca 951
|||||| |||||||||||||||||||||||||||||||||||||||||| ||||||||||
Sbjct: 66512392 gggtgagcgaggctggatgaggattgtgaccagcacctacaaagacggcaggggcgccca 66512333
Query: 952 ctacaaccttgcc-tcaaggaaaactgcacc-tccgggaccccatcgtttgaggccgac- 1008
||||||||||||| || |||| ||||||||| || ||||||||||||||||||||||||
Sbjct: 66512332 ctacaaccttgccatcgaggagaactgcaccttcggggaccccatcgtttgaggccgacg 66512273
Query: 1009 tctccgga 1016
||||||||
Sbjct: 66512272 tctccgga 66512265
Score = 141 bits (71), Expect = 1e-30
Identities = 71/71 (100%)
Strand = Plus / Minus
Query: 165 tccacttccgcccgggctgcagctgctaccggcccctgcgcggggaccagcggacccagt 224
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 66519585 tccacttccgcccgggctgcagctgctaccggcccctgcgcggggaccagcggacccagt 66519526
Query: 225 tggggcacagg 235
|||||||||||
Sbjct: 66519525 tggggcacagg 66519515
Database: Sscrofa_10.2.fasta
Posted date: Nov 16, 2011 10:34 AM
Number of letters in database: 2,808,509,378
Number of sequences in database: 4582
Lambda K H
1.37 0.711 1.31
Gapped
Lambda K H
1.37 0.711 1.31
Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 5, Extension: 2
Number of Sequences: 4582
Number of Hits to DB: 26,048,558
Number of extensions: 130
Number of successful extensions: 130
Number of sequences better than 1.0e-05: 1
Number of HSP's gapped: 129
Number of HSP's successfully gapped: 6
Length of query: 1371
Length of database: 2,808,509,378
Length adjustment: 21
Effective length of query: 1350
Effective length of database: 2,808,413,156
Effective search space: 3791357760600
Effective search space used: 3791357760600
X1: 11 (21.8 bits)
X2: 15 (29.7 bits)
X3: 50 (99.1 bits)
S1: 18 (36.2 bits)
S2: 30 (60.0 bits)