Search against RefSeqBP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-002438
(1320 letters)
Database: RefSeq49_BP.fasta
33,088 sequences; 17,681,374 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E-value
Alignment gi|NP_001103540.1| cathepsin W [Bos taurus]. 360 2e-99
Alignment gi|NP_001068884.1| cathepsin F [Bos taurus]. 228 1e-59
Alignment gi|NP_001029557.1| pro-cathepsin H precursor [Bos taurus]. 137 2e-32
Alignment gi|NP_001028787.1| cathepsin S precursor [Bos taurus]. 135 8e-32
Alignment gi|NP_776457.1| cathepsin L2 precursor [Bos taurus]. 128 1e-29
Alignment gi|NP_001077155.1| cathepsin L1 [Bos taurus]. 127 2e-29
Alignment gi|NP_001029607.1| cathepsin K [Bos taurus]. 125 1e-28
Alignment gi|XP_002694471.1| PREDICTED: cathepsin O preproprotein-like [B... 121 2e-27
Alignment gi|XP_874012.3| PREDICTED: cathepsin O [Bos taurus]. 121 2e-27
Alignment gi|NP_001028789.1| dipeptidyl peptidase 1 [Bos taurus]. 98 2e-20
>ref|NP_001103540.1| cathepsin W [Bos taurus].
Length = 272
Score = 360 bits (924), Expect = 2e-99
Identities = 178/249 (71%), Positives = 206/249 (82%)
Frame = +2
Query: 35 MALTAHLSCLLVLVVAGPAQGLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHA 214
MA T HLSCLL L+VAG AQG+KD+LR QDPGPQP+ LKEVF LFQ+QYNRSY NPAE+A
Sbjct: 1 MAPTVHLSCLLALLVAGLAQGIKDSLRGQDPGPQPLELKEVFRLFQMQYNRSYPNPAEYA 60
Query: 215 RRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVG 394
RRLDIFAQNLAKAQRLQEEDLGTAEFGVT FSDLTEEEF QL+G AG+A + KVG
Sbjct: 61 RRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEFVQLYGSQ-VAGEALGVSRKVG 119
Query: 395 SEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQ 574
SEE GE+ PQ+CDWR K G IS ++ Q++CNCCWAMAA N+EA WAIK+ V++SVQQ
Sbjct: 120 SEEWGESEPQTCDWR-KVGTISPVRDQRNCNCCWAMAAAGNIEALWAIKFRHFVEVSVQQ 178
Query: 575 VLDCDRXXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDF 754
+LDCDR FVWDAFLTVLN SGLASE+DYP+ G+ KTHRCLAK+++KVAWIQDF
Sbjct: 179 LLDCDRCGNGCRGGFVWDAFLTVLNNSGLASEKDYPFNGSGKTHRCLAKKYKKVAWIQDF 238
Query: 755 LMLQFCEQS 781
++LQ CEQS
Sbjct: 239 IILQACEQS 247
>ref|NP_001068884.1| cathepsin F [Bos taurus].
Length = 460
Score = 228 bits (580), Expect = 1e-59
Identities = 122/322 (37%), Positives = 178/322 (55%), Gaps = 2/322 (0%)
Frame = +2
Query: 110 LRSQDPGPQPMGLK--EVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGT 283
L ++DP PQ +K +F F YNR+Y + E + R+ +FA N+ +AQ++Q D GT
Sbjct: 145 LLNKDPLPQDFSVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGT 204
Query: 284 AEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISA 463
A +GVT FSDLTEEEF ++ + AP ++ + PQ DWR K G ++
Sbjct: 205 ARYGVTKFSDLTEEEFRTIYLNPL-LKDAPGRNMRPAQPVTDVPPPQ-WDWRNK-GAVTN 261
Query: 464 IKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFVWDAFLTV 643
+K Q C CWA + NVE QW +K + LS Q++LDCD+ +A+ +
Sbjct: 262 VKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNAYSAI 321
Query: 644 LNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITVT 823
GL +E DY Y+G ++T C + +I D + L EQ +A +LA GP+++
Sbjct: 322 RTLGGLETEDDYSYRGRLQT--CSFSAEKAKVYINDSVELSKNEQKLAAWLAKNGPVSIA 379
Query: 824 INAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNS 1003
INA +Q Y+ G+ C P L++H+VLLVG+G +IP+W +KNS
Sbjct: 380 INAFGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRS-----------AIPFWAIKNS 428
Query: 1004 WGPDWGEEGYFRLHRGSNTCGI 1069
WG DWGEEGY+ LHRGS CG+
Sbjct: 429 WGTDWGEEGYYYLHRGSGACGV 450
>ref|NP_001029557.1| pro-cathepsin H precursor [Bos taurus].
Length = 335
Score = 137 bits (346), Expect = 2e-32
Identities = 87/308 (28%), Positives = 143/308 (46%), Gaps = 4/308 (1%)
Frame = +2
Query: 158 FTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 337
F + +Q+ + YS+ E+ RL FA NL + + T + G+ FSD++ F +
Sbjct: 35 FQSWMVQHQKKYSSE-EYYHRLQAFASNLREINAHNARN-HTFKMGLNQFSDMS---FDE 89
Query: 338 LHGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDN 517
L + + K P S DWRKK ++ +K+Q C CW +
Sbjct: 90 LKRKYLWSEPQNCSATKSNYLRGTGPYPPSMDWRKKGNFVTPVKNQGSCGSCWTFSTTGA 149
Query: 518 VEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXX--FVWDAFLTVLNTSGLASEQDYPYKG 691
+E+ AI + L+ QQ++DC + AF + G+ E YPY+G
Sbjct: 150 LESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYRG 209
Query: 692 TVKTHRCLAKQHRKVAWIQDFLMLQFC-EQSIARYLATEGPITVTINAGL-LQQYKRGVI 865
+ C + + +A+++D + E+++ +A P++ Y++G+
Sbjct: 210 --QDGDCKYQPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFAFEVTADFMMYRKGIY 267
Query: 866 RATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLH 1045
+T P VNH+VL VG+G+ K IPYWI+KNSWGP+WG +GYF +
Sbjct: 268 SSTSCHKTPDKVNHAVLAVGYGEEKG-----------IPYWIVKNSWGPNWGMKGYFLIE 316
Query: 1046 RGSNTCGI 1069
RG N CG+
Sbjct: 317 RGKNMCGL 324
>ref|NP_001028787.1| cathepsin S precursor [Bos taurus].
Length = 331
Score = 135 bits (340), Expect = 8e-32
Identities = 102/319 (31%), Positives = 160/319 (50%), Gaps = 13/319 (4%)
Frame = +2
Query: 164 LFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQ--EEDLG--TAEFGVTPFSDLTEEEF 331
L++ Y + Y E R I+ +NL K L E +G + E G+ D+T EE
Sbjct: 30 LWKKTYGKQYKEKNEEVARRLIWEKNL-KTVTLHNLEHSMGMHSYELGMNHLGDMTSEEV 88
Query: 332 GQLHGHHWGAGKAPSMGIKVGSEES--GETVPQSCDWRKKPGVISAIKHQKDCNCCWAMA 505
L + + PS + + +S + +P S DWR+K G ++ +K+Q C CWA +
Sbjct: 89 ISL----MSSLRVPSQWPRNVTYKSDPNQKLPDSMDWREK-GCVTEVKYQGACGSCWAFS 143
Query: 506 AVDNVEAQWAIKYHQAVQLSVQQVLDCDR---XXXXXXXXFVWDAFLTVLNTSGLASEQD 676
AV +EAQ +K + V LS Q ++DC F+ +AF +++ +G+ SE
Sbjct: 144 AVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEAS 203
Query: 677 YPYKGTVKTHRCLAKQHRKVAWIQDFLMLQF-CEQSIARYLATEGPITVTINA--GLLQQ 847
YPYK +C + A ++ L F E+++ +A +GP++V I+A
Sbjct: 204 YPYK--AMDGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDASHSSFFL 261
Query: 848 YKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEE 1027
YK GV T + VNH VL+VG+G +++GK YW++KNSWG +G++
Sbjct: 262 YKTGVYYDPSCTQN---VNHGVLVVGYG---NLDGK--------DYWLVKNSWGLHFGDQ 307
Query: 1028 GYFRLHRGS-NTCGITKYP 1081
GY R+ R S N CGI YP
Sbjct: 308 GYIRMARNSGNHCGIANYP 326
>ref|NP_776457.1| cathepsin L2 precursor [Bos taurus].
Length = 334
Score = 128 bits (322), Expect = 1e-29
Identities = 89/260 (34%), Positives = 131/260 (50%), Gaps = 5/260 (1%)
Frame = +2
Query: 305 FSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDC 484
F D+T EEF Q+ G K+ E VP+S DW KK G ++ +K+Q C
Sbjct: 80 FGDMTNEEFRQVMN---GFQNQKHKKGKLFHEPLLVDVPKSVDWTKK-GYVTPVKNQGQC 135
Query: 485 NCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDR--XXXXXXXXFVWDAFLTVLNTSG 658
CWA +A +E Q K + V LS Q ++DC R + +AF + + G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGG 195
Query: 659 LASEQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITVTINAG- 835
L SE+ YPY T T+ C K A F+ + E+++ + +AT GPI+V I+AG
Sbjct: 196 LDSEESYPYLAT-DTNSCNYKPECSAANDTGFVDIPQREKALMKAVATVGPISVAIDAGH 254
Query: 836 -LLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGP 1012
Q YK G+ +C ++H VL+VG+G EG ++ +WI+KNSWGP
Sbjct: 255 TSFQFYKSGIYYDPDCSCKD--LDHGVLVVGYG----FEG---TDSNNNKFWIVKNSWGP 305
Query: 1013 DWGEEGYFRLHRGSNT-CGI 1069
+WG GY ++ + N CGI
Sbjct: 306 EWGWNGYVKMAKDQNNHCGI 325
>ref|NP_001077155.1| cathepsin L1 [Bos taurus].
Length = 333
Score = 127 bits (320), Expect = 2e-29
Identities = 92/276 (33%), Positives = 134/276 (48%), Gaps = 7/276 (2%)
Frame = +2
Query: 263 QEEDLGTAEFGVT--PFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEESGETVPQSCDW 436
QE G F + F D+T EEF G + + K E ++P S DW
Sbjct: 64 QEYSQGKHSFSMAMNAFGDMTNEEFRHTMN---GFQRQKNKKGKEFHETIFASIPPSVDW 120
Query: 437 RKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDR--XXXXXX 610
R+K G ++ +K+Q C CWA +A +E Q K + V LS Q ++DC +
Sbjct: 121 REK-GYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCH 179
Query: 611 XXFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQSIAR 790
F+ +AF VL+ GL SE+ YPY G V T CL + A F+ L E+++ +
Sbjct: 180 GGFIDNAFQYVLDVGGLDSEESYPYTGLVGT--CLYNPNNSAANETGFVDLPKQEKALMK 237
Query: 791 YLATEGPITVTINA--GLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPR 964
+A GPI+V ++A Q YK G+ C V+H+VL+VG+G EG
Sbjct: 238 AVANLGPISVAVDAHNPSFQFYKSGIY--YEPNCSSESVDHAVLVVGYG----FEG---A 288
Query: 965 PGHSIPYWILKNSWGPDWGEEGYFRLHRG-SNTCGI 1069
YW++KNSWG WG GY ++ + +N CGI
Sbjct: 289 DSDDNKYWLVKNSWGEHWGMNGYIKMAKDRNNHCGI 324
>ref|NP_001029607.1| cathepsin K [Bos taurus].
Length = 334
Score = 125 bits (313), Expect = 1e-28
Identities = 94/318 (29%), Positives = 149/318 (46%), Gaps = 16/318 (5%)
Frame = +2
Query: 164 LFQIQYNRSYSNPAEHARRLDIFAQNLAKAQ-RLQEEDLG--TAEFGVTPFSDLTEEEFG 334
L++ Y + Y++ + R I+ +NL E LG T E + D+T EE
Sbjct: 33 LWKKTYRKQYNSKGDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVV 92
Query: 335 QLHGHHWGAGKAPSMGIKVGSEESGETV---------PQSCDWRKKPGVISAIKHQKDCN 487
Q K + + S +T+ P S D+RKK G ++ +K+Q C
Sbjct: 93 Q---------KMTGLKVPASRSRSNDTLYIPDWEGRAPDSVDYRKK-GYVTPVKNQGQCG 142
Query: 488 CCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFVWDAFLTVLNTSGLAS 667
CWA ++V +E Q K + + LS Q ++DC ++ +AF V G+ S
Sbjct: 143 SCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDS 202
Query: 668 EQDYPYKGTVKTHRCLAKQHRKVAWIQDFLML-QFCEQSIARYLATEGPITVTINAGL-- 838
E YPY G + C+ K A + + + + E+++ R +A GPI+V I+A L
Sbjct: 203 EDAYPYVG--QDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTS 260
Query: 839 LQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDW 1018
Q Y++GV C+ +NH+VL VG+G K + +WI+KNSWG +W
Sbjct: 261 FQFYRKGVY--YDENCNSDNLNHAVLAVGYGIQKGNK-----------HWIIKNSWGENW 307
Query: 1019 GEEGYFRLHRG-SNTCGI 1069
G +GY + R +N CGI
Sbjct: 308 GNKGYILMARNKNNACGI 325
>ref|XP_002694471.1| PREDICTED: cathepsin O preproprotein-like [Bos taurus].
Length = 375
Score = 121 bits (303), Expect = 2e-27
Identities = 89/271 (32%), Positives = 128/271 (47%), Gaps = 8/271 (2%)
Frame = +2
Query: 281 TAEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEE----SGETVPQSCDWRKKP 448
TA +G+ FS L EEF ++ +PS + +EE S ++P DWR K
Sbjct: 118 TAVYGINQFSYLFPEEFKAIY-----LRSSPSRFPRFPAEEYTSISNLSLPLRFDWRDKH 172
Query: 449 GVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFVWD 628
V++ +++QK C CWA + V VE+ AIK LSVQQV+DC
Sbjct: 173 -VVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQQVIDCSYSNYGCNGGSPLS 231
Query: 629 A--FLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFC--EQSIARYL 796
A +L L L + +YP++ R + H + I+ + F E +A L
Sbjct: 232 ALYWLNKLQVK-LVRDSEYPFQAQNGLCRYFSDSHSGSS-IKGYSAYDFSGQEDKMAEAL 289
Query: 797 ATEGPITVTINAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHS 976
GP+ V ++A Q Y G+I+ C NH+VL+ GF K+ S
Sbjct: 290 LALGPLIVVVDAMSWQDYLGGIIQHH---CSSGEANHAVLVTGFDKTGS----------- 335
Query: 977 IPYWILKNSWGPDWGEEGYFRLHRGSNTCGI 1069
IPYWI++NSWG WG +GY R+ G N CGI
Sbjct: 336 IPYWIVRNSWGTSWGIDGYVRVKMGGNVCGI 366
>ref|XP_874012.3| PREDICTED: cathepsin O [Bos taurus].
Length = 384
Score = 121 bits (303), Expect = 2e-27
Identities = 89/271 (32%), Positives = 128/271 (47%), Gaps = 8/271 (2%)
Frame = +2
Query: 281 TAEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEE----SGETVPQSCDWRKKP 448
TA +G+ FS L EEF ++ +PS + +EE S ++P DWR K
Sbjct: 127 TAVYGINQFSYLFPEEFKAIY-----LRSSPSRFPRFPAEEYTSISNLSLPLRFDWRDKH 181
Query: 449 GVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFVWD 628
V++ +++QK C CWA + V VE+ AIK LSVQQV+DC
Sbjct: 182 -VVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQQVIDCSYSNYGCNGGSPLS 240
Query: 629 A--FLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFC--EQSIARYL 796
A +L L L + +YP++ R + H + I+ + F E +A L
Sbjct: 241 ALYWLNKLQVK-LVRDSEYPFQAQNGLCRYFSDSHSGSS-IKGYSAYDFSGQEDKMAEAL 298
Query: 797 ATEGPITVTINAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHS 976
GP+ V ++A Q Y G+I+ C NH+VL+ GF K+ S
Sbjct: 299 LALGPLIVVVDAMSWQDYLGGIIQHH---CSSGEANHAVLVTGFDKTGS----------- 344
Query: 977 IPYWILKNSWGPDWGEEGYFRLHRGSNTCGI 1069
IPYWI++NSWG WG +GY R+ G N CGI
Sbjct: 345 IPYWIVRNSWGTSWGIDGYVRVKMGGNVCGI 375
>ref|NP_001028789.1| dipeptidyl peptidase 1 [Bos taurus].
Length = 463
Score = 97.8 bits (242), Expect = 2e-20
Identities = 70/239 (29%), Positives = 109/239 (45%), Gaps = 14/239 (5%)
Frame = +2
Query: 416 VPQSCDWRKKPGV--ISAIKHQKDCNCCWAMAAVDNVEAQWAIKYH--QAVQLSVQQVLD 583
+P S DWR G+ ++ +++Q C C++ A++ +EA+ I + Q LS Q+V+
Sbjct: 231 LPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290
Query: 584 CDRXXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGTVKTHR----CLAKQHRKVAWIQD 751
C + F + GL E +PY GT R C + ++
Sbjct: 291 CSQYAQGCEGGFPYLIAGKYAQDFGLVEEDCFPYTGTDSPCRLKEGCFRYYSSEYHYVGG 350
Query: 752 FLMLQFCEQSIARY-LATEGPITVTINA-GLLQQYKRGVIRATPATCDP----HLVNHSV 913
F C +++ + L +GP+ V Y++GV T DP L NH+V
Sbjct: 351 FY--GGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLR-DPFNPFELTNHAV 407
Query: 914 LLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNTCGITKYPVTA 1090
LLVG+G + + YWI+KNSWG WGE GYFR+ RG++ C I + A
Sbjct: 408 LLVGYGTDAA---------SGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAA 457
Database: RefSeq49_BP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 17,681,374
Number of sequences in database: 33,088
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33088
Number of Hits to DB: 57,740,396
Number of extensions: 1807372
Number of successful extensions: 7855
Number of sequences better than 1.0e-05: 14
Number of HSP's gapped: 7766
Number of HSP's successfully gapped: 14
Length of query: 440
Length of database: 17,681,374
Length adjustment: 105
Effective length of query: 335
Effective length of database: 14,207,134
Effective search space: 4759389890
Effective search space used: 4759389890
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 35 (18.1 bits)
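The search-space figures in the statistics block above can be re-derived from the other reported values. The following is a minimal sketch, not part of the BLAST output: it assumes the usual BLASTX conventions that the query length is counted in translated residues (1320 nt / 3) and that the length adjustment is subtracted once from the query and once per database sequence. The variable names and the `expect` helper are illustrative only.

```python
# Values copied from the report above (search against RefSeq49_BP.fasta).
query_nt = 1320            # "(1320 letters)"
db_letters = 17_681_374    # "Length of database"
db_seqs = 33_088           # "Number of sequences in database"
length_adj = 105           # "Length adjustment"

query_aa = query_nt // 3                      # translated query length (440)
eff_query = query_aa - length_adj             # "Effective length of query"
eff_db = db_letters - db_seqs * length_adj    # "Effective length of database"
search_space = eff_query * eff_db             # "Effective search space"


def expect(bit_score: float, space: int = search_space) -> float:
    """Expected number of chance hits for a bit score S': E = space * 2**(-S')."""
    return space * 2.0 ** (-bit_score)


print(eff_query, eff_db, search_space)   # 335 14207134 4759389890
print(f"{expect(360):.1e}")              # close to the reported 2e-99 for the top hit
```

Because the report rounds bit scores to whole numbers, E-values recomputed this way agree with the listed ones only to about one significant figure.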
Search against RefSeqCP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Query= 20110601C-002438
(1320 letters)
Database: RefSeq49_CP.fasta
33,336 sequences; 18,874,504 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E-value
Alignment gi|XP_540846.2| PREDICTED: similar to cathepsin W preproprotein... 530 e-151
Alignment gi|XP_533219.2| PREDICTED: similar to Cathepsin F precursor (CA... 229 4e-60
Alignment gi|NP_001002938.1| cathepsin S precursor [Canis lupus familiari... 137 2e-32
Alignment gi|XP_536212.2| PREDICTED: similar to Cathepsin H precursor [Ca... 133 4e-31
Alignment gi|NP_001003115.1| cathepsin L1 precursor [Canis lupus familiar... 129 5e-30
Alignment gi|NP_001029168.1| cathepsin K precursor [Canis lupus familiari... 123 4e-28
Alignment gi|XP_539782.2| PREDICTED: similar to Cathepsin O precursor [Ca... 118 1e-26
Alignment gi|XP_541257.2| PREDICTED: similar to Cathepsin L precursor (Ma... 117 3e-26
Alignment gi|NP_001182763.1| dipeptidyl peptidase 1 [Canis lupus familiar... 97 3e-20
Alignment gi|XP_535784.2| PREDICTED: similar to Dipeptidyl-peptidase I pr... 91 2e-18
>ref|XP_540846.2| PREDICTED: similar to cathepsin W preproprotein [Canis familiaris].
Length = 374
Score = 530 bits (1366), Expect = e-151
Identities = 256/362 (70%), Positives = 295/362 (81%), Gaps = 7/362 (1%)
Frame = +2
Query: 35 MALTAHLSCLLVLVVAGPAQGLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHA 214
MALT +LSCLL L VA A G+K +L++QDPGPQP+ LK+VF LFQIQYNRSYSNP E+A
Sbjct: 1 MALTIYLSCLLALSVASLAHGIKRSLKNQDPGPQPLELKQVFALFQIQYNRSYSNPEEYA 60
Query: 215 RRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVG 394
RRLDIFA NLA+AQ+L++EDLGTAEFGVTPFSDLTEEEFGQ +GH AG+APS+G KV
Sbjct: 61 RRLDIFAHNLAQAQQLEDEDLGTAEFGVTPFSDLTEEEFGQFYGHQRMAGEAPSVGRKVE 120
Query: 395 SEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQ 574
SEE GE VP +CDWRK PG+IS IK Q +C CCWAMAA N+EA W I+YHQ V++SVQ+
Sbjct: 121 SEEWGEPVPPTCDWRKLPGIISPIKQQGNCRCCWAMAAAGNIEALWGIRYHQPVEVSVQE 180
Query: 575 VLDCDRXXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDF 754
+LDC R F WDAF+TVLN SGLAS +DYP+ G K HRCLAK+++KVAWIQDF
Sbjct: 181 LLDCGRCGDGCKGGFTWDAFITVLNNSGLASAKDYPFLGNTKPHRCLAKKYKKVAWIQDF 240
Query: 755 LMLQFCEQSIARYLATEGPITVTINAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGK 934
+MLQ EQ+IA YLAT+GPITVTIN LLQ Y++GVI+AT TCDP V+HSVLLVGFGK
Sbjct: 241 IMLQGNEQAIAWYLATKGPITVTINMKLLQHYQKGVIQATHTTCDPQRVDHSVLLVGFGK 300
Query: 935 SKSVEGK-------RPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNTCGITKYPVTAR 1093
SKSV GK RPRP H IPYWILKNSWG +WGEEGYFRLHRG+NTCGITKYPVTAR
Sbjct: 301 SKSVAGKQAEGGSSRPRPHHPIPYWILKNSWGAEWGEEGYFRLHRGNNTCGITKYPVTAR 360
Query: 1094 VD 1099
VD
Sbjct: 361 VD 362
>ref|XP_533219.2| PREDICTED: similar to Cathepsin F precursor (CATSF) [Canis
familiaris].
Length = 442
Score = 229 bits (584), Expect = 4e-60
Identities = 126/335 (37%), Positives = 183/335 (54%), Gaps = 5/335 (1%)
Frame = +2
Query: 110 LRSQDPGPQPMGLK--EVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGT 283
L ++DP PQ +K VF F YNR+Y E R+ +F+ N+ +AQ++Q D GT
Sbjct: 126 LLNKDPLPQDFSVKMASVFKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGT 185
Query: 284 AEFGVTPFSDLTEEEFGQLHGH---HWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGV 454
A++G+T FSDLTEEEF ++ + GK +++ S P DWR K G
Sbjct: 186 AQYGITKFSDLTEEEFRTIYLNPLLRENRGKK----MRLAKSISDHAPPPEWDWRSK-GA 240
Query: 455 ISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFVWDAF 634
++ +K Q C CWA + NVE QW +K + LS Q++LDCD+ +A+
Sbjct: 241 VTKVKDQGMCGSCWAFSVTGNVEGQWFLKEGTLLSLSEQELLDCDKVDKACLGGLPSNAY 300
Query: 635 LTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPI 814
++ GL +E DY Y+G ++ AK+ R +I D + L EQ +A +LA +GPI
Sbjct: 301 SAIMTLGGLETEDDYSYQGHLQACSFSAKKAR--VYINDSMELSQNEQKLAAWLAKKGPI 358
Query: 815 TVTINAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWIL 994
+V INA +Q Y+ G+ C P L++H+VLLVG+G IP+W +
Sbjct: 359 SVAINAFGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRS-----------GIPFWAI 407
Query: 995 KNSWGPDWGEEGYFRLHRGSNTCGITKYPVTARVD 1099
KNSWG DWGEEGY+ LHRGS CG+ +A V+
Sbjct: 408 KNSWGTDWGEEGYYYLHRGSGACGVNTMASSAVVN 442
>ref|NP_001002938.1| cathepsin S precursor [Canis lupus familiaris].
Length = 331
Score = 137 bits (346), Expect = 2e-32
Identities = 99/320 (30%), Positives = 157/320 (49%), Gaps = 12/320 (3%)
Frame = +2
Query: 158 FTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQ-EEDLG--TAEFGVTPFSDLTEEE 328
+ L++ Y++ Y E R I+ +NL E +G + + G+ D+T EE
Sbjct: 28 WNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTGEE 87
Query: 329 FGQLHGHHWGAGKAPSMGIK--VGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAM 502
L G+ + PS + S + +P S DWR+K G ++ +K+Q C CWA
Sbjct: 88 VISL----MGSLRVPSQWQRNVTYRSNSNQKLPDSVDWREK-GCVTEVKYQGSCGACWAF 142
Query: 503 AAVDNVEAQWAIKYHQAVQLSVQQVLDCDR---XXXXXXXXFVWDAFLTVLNTSGLASEQ 673
+AV +EAQ +K + V LS Q ++DC F+ AF +++ +G+ SE
Sbjct: 143 SAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEA 202
Query: 674 DYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQF-CEQSIARYLATEGPITVTINAGLLQ-- 844
YPYK +C ++ A + L F E ++ +A +GP++V I+A
Sbjct: 203 SYPYK--AMNGKCRYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFF 260
Query: 845 QYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGE 1024
Y+ GV T + VNH VL+VG+G ++ GK YW++KNSWG ++G+
Sbjct: 261 LYRSGVYYEPSCTQN---VNHGVLVVGYG---NLNGK--------DYWLVKNSWGLNFGD 306
Query: 1025 EGYFRLHRGS-NTCGITKYP 1081
+GY R+ R S N CGI YP
Sbjct: 307 QGYIRMARNSGNHCGIASYP 326
>ref|XP_536212.2| PREDICTED: similar to Cathepsin H precursor [Canis familiaris].
Length = 304
Score = 133 bits (334), Expect = 4e-31
Identities = 91/312 (29%), Positives = 142/312 (45%), Gaps = 10/312 (3%)
Frame = +2
Query: 179 YNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLG--TAEFGVTPFSDLTEEEFGQLHGHH 352
+ + YS+ E+ +RL F N K + + G T + G+ FSD+ E H +
Sbjct: 10 HQKKYSSE-EYLQRLQTFVGNWRK---INAHNAGNHTFKMGLNQFSDMNFAEIK--HKYL 63
Query: 353 WGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQW 532
W + S K P DWRKK +S +K+Q C CW + +E+
Sbjct: 64 WSEPQNCS-ATKGNYLRGTGPYPPFVDWRKKGKFVSPVKNQGSCGSCWTFSTTGALESAI 122
Query: 533 AIKYHQAVQLSVQQVLDCDRXXXXXXXXFVW---DAFLTVLNTSGLASEQDYPYKGTVKT 703
AIK + + L+ QQ++DC + AF + G+ E YPYKG +
Sbjct: 123 AIKSGKLLSLAEQQLVDCAQNFNNHGCQGYGAPLQAFEYIRYNKGIMGEDSYPYKG--QD 180
Query: 704 HRCLAKQHRKVAWIQDFLMLQFC-EQSIARYLATEGPITVTINAGL-LQQYKRGVIRATP 877
C + + +A+++D + EQ++ +A P++ Y++G+ +T
Sbjct: 181 GDCKYQPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVTSDFMMYRKGIYSSTS 240
Query: 878 ATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSN 1057
P VNH+VL VG+G+ + IPYWI+KNSWGP WG GYF + RG N
Sbjct: 241 CHKTPDKVNHAVLAVGYGEQ-----------NGIPYWIVKNSWGPQWGMNGYFLMERGKN 289
Query: 1058 TCGI---TKYPV 1084
CG+ YP+
Sbjct: 290 MCGLAACASYPI 301
>ref|NP_001003115.1| cathepsin L1 precursor [Canis lupus familiaris].
Length = 333
Score = 129 bits (325), Expect = 5e-30
Identities = 98/306 (32%), Positives = 145/306 (47%), Gaps = 9/306 (2%)
Frame = +2
Query: 179 YNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVT----PFSDLTEEEFGQLHG 346
+ R Y E RR ++ +N+ K L + + G T F D+T EEF Q+
Sbjct: 36 HRRLYGMNEEGWRRA-VWEKNM-KMIELHNREYSQGKHGFTMAMNAFGDMTNEEFRQVMN 93
Query: 347 HHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEA 526
G K+ E +P+S DWR+K G ++ +K+Q C CWA +A +E
Sbjct: 94 ---GFQNQKHKKGKMFQEPLFAEIPKSVDWREK-GYVTPVKNQGQCGSCWAFSATGALEG 149
Query: 527 QWAIKYHQAVQLSVQQVLDCDR--XXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGTVK 700
Q K + V LS Q ++DC R + +AF V + GL SE+ YPY G
Sbjct: 150 QMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGR-D 208
Query: 701 THRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITVTINAG--LLQQYKRGVIRAT 874
T C K A F+ L E+++ + +AT GPI+V I+AG Q YK G+
Sbjct: 209 TETCNYKPECSAANDTGFVDLPQREKALMKAVATLGPISVAIDAGHQSFQFYKSGIY--F 266
Query: 875 PATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGS 1054
C ++H VL+VG+G + + +WI+KNSWGP+WG GY ++ +
Sbjct: 267 DPDCSSKDLDHGVLVVGYGFEGTDSNNK--------FWIVKNSWGPEWGWNGYVKMAKDQ 318
Query: 1055 NT-CGI 1069
N CGI
Sbjct: 319 NNHCGI 324
>ref|NP_001029168.1| cathepsin K precursor [Canis lupus familiaris].
Length = 330
Score = 123 bits (308), Expect = 4e-28
Identities = 94/318 (29%), Positives = 148/318 (46%), Gaps = 16/318 (5%)
Frame = +2
Query: 164 LFQIQYNRSYSNPAEHARRLDIFAQNLAKAQ-RLQEEDLG--TAEFGVTPFSDLTEEEFG 334
L++ Y + Y++ + R I+ +NL E LG T E + D+T EE
Sbjct: 29 LWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVV 88
Query: 335 QLHGHHWGAGKAPSMGIKVGSEESGETV---------PQSCDWRKKPGVISAIKHQKDCN 487
Q K + + S +T+ P S D+RKK G ++ +K+Q C
Sbjct: 89 Q---------KMTGLKVPPSHSRSNDTLYIPDWESRAPDSVDYRKK-GYVTPVKNQGQCG 138
Query: 488 CCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFVWDAFLTVLNTSGLAS 667
CWA ++V +E Q K + + LS Q ++DC ++ +AF V G+ S
Sbjct: 139 SCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDS 198
Query: 668 EQDYPYKGTVKTHRCLAKQHRKVAWIQDFLML-QFCEQSIARYLATEGPITVTINAGL-- 838
E YPY G + C+ K A + + + + E+++ R +A GPI+V I+A L
Sbjct: 199 EDAYPYVG--QDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTS 256
Query: 839 LQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDW 1018
Q Y +GV C+ +NH+VL VG+G K + +WI+KNSWG +W
Sbjct: 257 FQFYSKGVY--YDENCNSDNLNHAVLAVGYGIQKGNK-----------HWIIKNSWGENW 303
Query: 1019 GEEGYFRLHRG-SNTCGI 1069
G +GY + R +N CGI
Sbjct: 304 GNKGYILMARNKNNACGI 321
>ref|XP_539782.2| PREDICTED: similar to Cathepsin O precursor [Canis familiaris].
Length = 518
Score = 118 bits (296), Expect = 1e-26
Identities = 79/268 (29%), Positives = 127/268 (47%), Gaps = 3/268 (1%)
Frame = +2
Query: 281 TAEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVIS 460
+A +G+ FS L+ EEF ++ ++P +V + ++P DWR K V++
Sbjct: 259 SAVYGINQFSYLSPEEFKAIYLRS-KPSRSPRYPAEVRTSIRNVSLPLRFDWRDKR-VVT 316
Query: 461 AIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFVWDAFLT 640
+++Q+ C CWA + V VE+ +AIK +SVQQV+DC +A
Sbjct: 317 QVRNQQTCGGCWAFSVVGAVESAYAIKGKPLADISVQQVIDCSYNNYGCSGGSTLNALNW 376
Query: 641 VLNTS-GLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQS--IARYLATEGP 811
+ T L + +YP+K + + + I+ + F +Q +A+ L T GP
Sbjct: 377 LNKTQVKLVRDSEYPFKAQNGLCHYFSDSYSGFS-IRGYSAYDFSDQEDEMAKVLLTFGP 435
Query: 812 ITVTINAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWI 991
+ V ++A Q Y G+I+ C NH+VL+ GF K S PYWI
Sbjct: 436 LVVVVDAVSWQDYLGGIIQHH---CSSGEANHAVLITGFDKIGST-----------PYWI 481
Query: 992 LKNSWGPDWGEEGYFRLHRGSNTCGITK 1075
++NSWG WG +GY + G N C +K
Sbjct: 482 VRNSWGSSWGVDGYAHVKMGGNICDSSK 509
>ref|XP_541257.2| PREDICTED: similar to Cathepsin L precursor (Major excreted protein)
(MEP) [Canis familiaris].
Length = 333
Score = 117 bits (292), Expect = 3e-26
Identities = 102/357 (28%), Positives = 156/357 (43%), Gaps = 19/357 (5%)
Frame = +2
Query: 71 LVVAGPAQGLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAK 250
L +A G+ A QD L ++ ++ + + Y E RR ++ +N+
Sbjct: 5 LFLAALCLGIASAAPQQDHS-----LDAHWSQWKEAHGKLYDKDEEGWRRT-VWERNMEM 58
Query: 251 A-QRLQEEDLGTAEF--GVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEESGET-- 415
Q QE G F + F D+T EEF Q+ K+ + G+
Sbjct: 59 IEQHNQEYSQGEHSFTLAMNAFGDMTNEEFKQVLND-----------FKIQKHKKGKVFP 107
Query: 416 ------VPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQV 577
VP S DWR++ G ++ +K Q C CWA +A +E Q K + V LS Q +
Sbjct: 108 APLFAEVPSSVDWREQ-GYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 578 LDC--DRXXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQD 751
+DC + + AF V + GL SE+ YPY + C + + A +
Sbjct: 167 VDCSWSQGNRGCNGGLMEYAFQYVKDNGGLDSEESYPY--LARNEPCKYRPEKSAANVTA 224
Query: 752 FLMLQFCEQSIARYLATEGPITVTINAG--LLQQYKRGVIRATPATCDPHLVNHSVLLVG 925
F + E + +AT GP++ +++ Q YK+G+ C L+NH VL+VG
Sbjct: 225 FWPILNEEDGLMTTVATVGPVSAAVDSSPQSFQFYKKGIY--YDPKCSNKLLNHGVLVVG 282
Query: 926 FGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRG-SNTCGI---TKYPV 1084
+G EG + YWI+KNSWG +WG +GY L + N CGI YPV
Sbjct: 283 YG----FEGAE---SDNKKYWIVKNSWGTNWGMQGYMLLAKDRDNHCGIATRASYPV 332
>ref|NP_001182763.1| dipeptidyl peptidase 1 [Canis lupus familiaris].
Length = 459
Score = 97.4 bits (241), Expect = 3e-20
Identities = 70/238 (29%), Positives = 108/238 (45%), Gaps = 13/238 (5%)
Frame = +2
Query: 416 VPQSCDWRKKPGV--ISAIKHQKDCNCCWAMAAVDNVEAQWAIKYH--QAVQLSVQQVLD 583
+P S DWR G +S +++Q C C+A A+ +EA+ I + Q LS Q+++
Sbjct: 228 LPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVS 287
Query: 584 CDRXXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGT---VKTHRCLAKQHRKVAWIQDF 754
C + F + GL E +PY G+ K + C + ++ F
Sbjct: 288 CSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYVGGF 347
Query: 755 LMLQFCEQSIARY-LATEGPITVTINA-GLLQQYKRGVIRATPATCDP----HLVNHSVL 916
C +++ + L GP+ V Y++G+ T DP L NH+VL
Sbjct: 348 YGA--CNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLR-DPFNPFELTNHAVL 404
Query: 917 LVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNTCGITKYPVTA 1090
LVG+G + + YWI+KNSWG WGE+GYFR+ RG++ C I V A
Sbjct: 405 LVGYGTDSA---------SGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAA 453
>ref|XP_535784.2| PREDICTED: similar to Dipeptidyl-peptidase I precursor (DPP-I) (DPPI)
(Cathepsin C) (Cathepsin J) (Dipeptidyl transferase),
partial [Canis familiaris].
Length = 481
Score = 90.9 bits (224), Expect = 2e-18
Identities = 68/238 (28%), Positives = 107/238 (44%), Gaps = 13/238 (5%)
Frame = +2
Query: 416 VPQSCDWRKKPGV--ISAIKHQKDCNCCWAMAAVDNVEAQWAIKYH--QAVQLSVQQVLD 583
+P S DWR G +S +++Q C C+A A+ +EA+ I + Q LS Q+++
Sbjct: 250 LPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTVMLEARIRILTNNTQTPILSPQEIVS 309
Query: 584 CDRXXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGT---VKTHRCLAKQHRKVAWIQDF 754
C + F + GL E + Y G+ K + C + ++ F
Sbjct: 310 CSQYAQGCEGGFPYLIAGKYAQDFGLVDEACFSYAGSDSPCKPNDCFHYYSSEYHYVGGF 369
Query: 755 LMLQFCEQSIARY-LATEGPITVTINA-GLLQQYKRGVIRATPATCDP----HLVNHSVL 916
C +++ + L GP+ V Y++G+ T DP L NH+VL
Sbjct: 370 YGA--CNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLR-DPINPFELTNHAVL 426
Query: 917 LVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNTCGITKYPVTA 1090
LVG+G + + YWI+KNSWG WGE+GYF++ RG++ C I V A
Sbjct: 427 LVGYGTDSA---------SGMDYWIVKNSWGSRWGEDGYFQICRGTDECAIESIAVAA 475
Database: RefSeq49_CP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 18,874,504
Number of sequences in database: 33,336
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33336
Number of Hits to DB: 60,170,835
Number of extensions: 1852056
Number of successful extensions: 8099
Number of sequences better than 1.0e-05: 14
Number of HSP's gapped: 8007
Number of HSP's successfully gapped: 14
Length of query: 440
Length of database: 18,874,504
Length adjustment: 106
Effective length of query: 334
Effective length of database: 15,340,888
Effective search space: 5123856592
Effective search space used: 5123856592
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 35 (18.1 bits)
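The one-line hit summaries in these reports can be read mechanically. The sketch below is a hypothetical helper, not a BLAST or Biopython API: it assumes each summary row has the flattened form `Alignment gi|<accession>| <description> <bits> <E-value>` seen in this file, and it accounts for the legacy habit of printing very small E-values without a mantissa (for example `e-151`, meaning 1e-151).

```python
import re

# Hypothetical pattern for the flattened summary rows in this file:
# "Alignment gi|<accession>| <description> <bits> <E-value>"
HIT_LINE = re.compile(
    r"^Alignment gi\|([^|]+)\|\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\S+)$"
)


def parse_hit(line: str):
    """Return (accession, description, bit_score, e_value), or None if no match."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    accession, description, bits, evalue = match.groups()
    # Legacy BLAST writes "e-151" for 1e-151; float() needs the leading digit.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return accession, description, float(bits), float(evalue)


row = ("Alignment gi|XP_540846.2| PREDICTED: similar to cathepsin W "
       "preproprotein... 530 e-151")
print(parse_hit(row))
```

Rows whose description is truncated with `...` are returned as printed; the full description has to be taken from the corresponding `>ref|...` alignment header.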
Search against RefSeqHP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Query= 20110601C-002438
(1320 letters)
Database: RefSeq49_HP.fasta
32,964 sequences; 18,297,164 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E-value
Alignment gi|NP_001326.2| cathepsin W preproprotein [Homo sapiens]. 506 e-143
Alignment gi|NP_003784.2| cathepsin F precursor [Homo sapiens]. 233 2e-61
Alignment gi|NP_004381.2| pro-cathepsin H preproprotein [Homo sapiens]. 135 8e-32
Alignment gi|NP_001188504.1| cathepsin L2 preproprotein [Homo sapiens]. 129 5e-30
Alignment gi|NP_001324.2| cathepsin L2 preproprotein [Homo sapiens]. 129 5e-30
Alignment gi|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapie... 127 2e-29
Alignment gi|NP_001325.1| cathepsin O preproprotein [Homo sapiens]. 126 5e-29
Alignment gi|NP_666023.1| cathepsin L1 preproprotein [Homo sapiens]. 125 9e-29
Alignment gi|NP_001903.1| cathepsin L1 preproprotein [Homo sapiens]. 125 9e-29
Alignment gi|NP_000387.1| cathepsin K preproprotein [Homo sapiens]. 124 3e-28
>ref|NP_001326.2| cathepsin W preproprotein [Homo sapiens].
Length = 376
Score = 506 bits (1303), Expect = e-143
Identities = 248/373 (66%), Positives = 288/373 (77%), Gaps = 9/373 (2%)
Frame = +2
Query: 35 MALTAHLSCLLVLVVAGPAQGLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHA 214
MALTAH SCLL L+VAG AQG++ LR+QD GPQP+ LKE F LFQIQ+NRSY +P EHA
Sbjct: 1 MALTAHPSCLLALLVAGLAQGIRGPLRAQDLGPQPLELKEAFKLFQIQFNRSYLSPEEHA 60
Query: 215 RRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVG 394
RLDIFA NLA+AQRLQEEDLGTAEFGVTPFSDLTEEEFGQL+G+ AG PSMG ++
Sbjct: 61 HRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLYGYRRAAGGVPSMGREIR 120
Query: 395 SEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQ 574
SEE E+VP SCDWRK G IS IK QK+CNCCWAMAA N+E W I + V +SVQ+
Sbjct: 121 SEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQE 180
Query: 575 VLDCDRXXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDF 754
+LDC R FVWDAF+TVLN SGLASE+DYP++G V+ HRC K+++KVAWIQDF
Sbjct: 181 LLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHPKKYQKVAWIQDF 240
Query: 755 LMLQFCEQSIARYLATEGPITVTINAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGK 934
+MLQ E IA+YLAT GPITVTIN LQ Y++GVI+ATP TCDP LV+HSVLLVGFG
Sbjct: 241 IMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGS 300
Query: 935 SKSVEG---------KRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNTCGITKYPVT 1087
KS EG +P+P H PYWILKNSWG WGE+GYFRLHRGSNTCGITK+P+T
Sbjct: 301 VKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLT 360
Query: 1088 ARVDKPG*KHQIS 1126
ARV KP K ++S
Sbjct: 361 ARVQKPDMKPRVS 373
>ref|NP_003784.2| cathepsin F precursor [Homo sapiens].
Length = 484
Score = 233 bits (595), Expect = 2e-61
Identities = 127/333 (38%), Positives = 185/333 (55%), Gaps = 2/333 (0%)
Frame = +2
Query: 107 ALRSQDPGPQ--PMGLKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLG 280
+L ++DP Q P+ + +F F I YNR+Y + E RL +F N+ +AQ++Q D G
Sbjct: 168 SLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRG 227
Query: 281 TAEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVIS 460
TA++GVT FSDLTEEEF ++ + K P +K ++ G+ P DWR K G ++
Sbjct: 228 TAQYGVTKFSDLTEEEFRTIYLNTL-LRKEPGNKMK-QAKSVGDLAPPEWDWRSK-GAVT 284
Query: 461 AIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFVWDAFLT 640
+K Q C CWA + NVE QW + + LS Q++LDCD+ +A+
Sbjct: 285 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 344
Query: 641 VLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITV 820
+ N GL +E DY Y+G +++ C + +I D + L EQ +A +LA GPI+V
Sbjct: 345 IKNLGGLETEDDYSYQGHMQS--CNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 402
Query: 821 TINAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKN 1000
INA +Q Y+ G+ R C P L++H+VLLVG+G V P+W +KN
Sbjct: 403 AINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDV-----------PFWAIKN 451
Query: 1001 SWGPDWGEEGYFRLHRGSNTCGITKYPVTARVD 1099
SWG DWGE+GY+ LHRGS CG+ +A VD
Sbjct: 452 SWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD 484
>ref|NP_004381.2| pro-cathepsin H preproprotein [Homo sapiens].
Length = 335
Score = 135 bits (340), Expect = 8e-32
Identities = 91/316 (28%), Positives = 142/316 (44%), Gaps = 7/316 (2%)
Frame = +2
Query: 158 FTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 337
F + ++ ++YS H RL FA N K + T + + FSD++ E
Sbjct: 35 FKSWMSKHRKTYSTEEYH-HRLQTFASNWRKINAHNNGN-HTFKMALNQFSDMSFAEIK- 91
Query: 338 LHGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDN 517
H + W + S K P S DWRKK +S +K+Q C CW +
Sbjct: 92 -HKYLWSEPQNCS-ATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGA 149
Query: 518 VEAQWAIKYHQAVQLSVQQVLDC--DRXXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKG 691
+E+ AI + + L+ QQ++DC D AF +L G+ E YPY+G
Sbjct: 150 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQG 209
Query: 692 TVKTHRCLAKQHRKVAWIQDFLMLQ-FCEQSIARYLATEGPITVTINAGL-LQQYKRGVI 865
K C + + + +++D + + E+++ +A P++ Y+ G+
Sbjct: 210 --KDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIY 267
Query: 866 RATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLH 1045
+T P VNH+VL VG+G+ + IPYWI+KNSWGP WG GYF +
Sbjct: 268 SSTSCHKTPDKVNHAVLAVGYGEK-----------NGIPYWIVKNSWGPQWGMNGYFLIE 316
Query: 1046 RGSNTCGI---TKYPV 1084
RG N CG+ YP+
Sbjct: 317 RGKNMCGLAACASYPI 332
>ref|NP_001188504.1| cathepsin L2 preproprotein [Homo sapiens].
Length = 334
Score = 129 bits (325), Expect = 5e-30
Identities = 108/345 (31%), Positives = 161/345 (46%), Gaps = 10/345 (2%)
Frame = +2
Query: 65 LVLVVAGPAQGLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNL 244
L LV+A G+ A+ D L + ++ + R Y E RR ++ +N+
Sbjct: 3 LSLVLAAFCLGIASAVPKFD-----QNLDTKWYQWKATHRRLYGANEEGWRRA-VWEKNM 56
Query: 245 AKAQRLQEEDLGTAEFGVT----PFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEESGE 412
K L + + G T F D+T EEF Q+ G KV E
Sbjct: 57 -KMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQKFRKG---KVFREPLFL 112
Query: 413 TVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDR 592
+P+S DWRKK G ++ +K+QK C CWA +A +E Q K + V LS Q ++DC R
Sbjct: 113 DLPKSVDWRKK-GYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR 171
Query: 593 --XXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDF-LML 763
F+ AF V GL SE+ YPY + C + VA F ++
Sbjct: 172 PQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEI--CKYRPENSVANDTGFTVVA 229
Query: 764 QFCEQSIARYLATEGPITVTINAG--LLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKS 937
E+++ + +AT GPI+V ++AG Q YK G+ C ++H VL+VG+G
Sbjct: 230 PGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIY--FEPDCSSKNLDHGVLVVGYG-- 285
Query: 938 KSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNT-CGI 1069
EG ++ YW++KNSWGP+WG GY ++ + N CGI
Sbjct: 286 --FEGAN---SNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGI 325
>ref|NP_001324.2| cathepsin L2 preproprotein [Homo sapiens].
Length = 334
Score = 129 bits (325), Expect = 5e-30
Identities = 108/345 (31%), Positives = 161/345 (46%), Gaps = 10/345 (2%)
Frame = +2
Query: 65 LVLVVAGPAQGLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNL 244
L LV+A G+ A+ D L + ++ + R Y E RR ++ +N+
Sbjct: 3 LSLVLAAFCLGIASAVPKFD-----QNLDTKWYQWKATHRRLYGANEEGWRRA-VWEKNM 56
Query: 245 AKAQRLQEEDLGTAEFGVT----PFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEESGE 412
K L + + G T F D+T EEF Q+ G KV E
Sbjct: 57 -KMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQKFRKG---KVFREPLFL 112
Query: 413 TVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDR 592
+P+S DWRKK G ++ +K+QK C CWA +A +E Q K + V LS Q ++DC R
Sbjct: 113 DLPKSVDWRKK-GYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR 171
Query: 593 --XXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDF-LML 763
F+ AF V GL SE+ YPY + C + VA F ++
Sbjct: 172 PQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEI--CKYRPENSVANDTGFTVVA 229
Query: 764 QFCEQSIARYLATEGPITVTINAG--LLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKS 937
E+++ + +AT GPI+V ++AG Q YK G+ C ++H VL+VG+G
Sbjct: 230 PGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIY--FEPDCSSKNLDHGVLVVGYG-- 285
Query: 938 KSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNT-CGI 1069
EG ++ YW++KNSWGP+WG GY ++ + N CGI
Sbjct: 286 --FEGAN---SNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGI 325
>ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens].
Length = 331
Score = 127 bits (320), Expect = 2e-29
Identities = 103/355 (29%), Positives = 165/355 (46%), Gaps = 12/355 (3%)
Frame = +2
Query: 53 LSCLLVLVVAGPAQGLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHARRLDIF 232
L C+L++ + AQ KD L + L++ Y + Y E A R I+
Sbjct: 4 LVCVLLVCSSAVAQLHKDPT-----------LDHHWHLWKKTYGKQYKEKNEEAVRRLIW 52
Query: 233 AQNLAKAQRLQ-EEDLG--TAEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEE 403
+NL E +G + + G+ D+T EE L + + PS + + +
Sbjct: 53 EKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL----MSSLRVPSQWQRNITYK 108
Query: 404 SGET--VPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQV 577
S +P S DWR+K G ++ +K+Q C CWA +AV +EAQ +K + V LS Q +
Sbjct: 109 SNPNRILPDSVDWREK-GCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNL 167
Query: 578 LDCDR---XXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQ 748
+DC F+ AF +++ G+ S+ YPYK +C + A
Sbjct: 168 VDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYK--AMDQKCQYDSKYRAATCS 225
Query: 749 DFLMLQFCEQSIAR-YLATEGPITVTINA--GLLQQYKRGVIRATPATCDPHLVNHSVLL 919
+ L + + + + +A +GP++V ++A Y+ GV T + VNH VL+
Sbjct: 226 KYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQN---VNHGVLV 282
Query: 920 VGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRG-SNTCGITKYP 1081
VG+G + GK YW++KNSWG ++GEEGY R+ R N CGI +P
Sbjct: 283 VGYG---DLNGKE--------YWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFP 326
>ref|NP_001325.1| cathepsin O preproprotein [Homo sapiens].
Length = 321
Score = 126 bits (316), Expect = 5e-29
Identities = 92/297 (30%), Positives = 137/297 (46%), Gaps = 7/297 (2%)
Frame = +2
Query: 200 PAEHARRLDIFAQNLAKAQRLQE---EDLGTAEFGVTPFSDLTEEEFGQLHGHHWGAGKA 370
P R F ++L + + L + TA +G+ FS L EEF ++ K
Sbjct: 34 PRSREREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRS-KPSKF 92
Query: 371 PSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQ 550
P +V ++P DWR K V++ +++Q+ C CWA + V VE+ +AIK
Sbjct: 93 PRYSAEVHMSIPNVSLPLRFDWRDKQ-VVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKP 151
Query: 551 AVQLSVQQVLDCDRXXXXXXXXFVWDAFLTVLNTSG--LASEQDYPYKGTVKTHRCLAKQ 724
LSVQQV+DC +A L LN L + +YP+K +
Sbjct: 152 LEDLSVQQVIDCSYNNYGCNGGSTLNA-LNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGS 210
Query: 725 HRKVAWIQDFLMLQFCEQS--IARYLATEGPITVTINAGLLQQYKRGVIRATPATCDPHL 898
H + I+ + F +Q +A+ L T GP+ V ++A Q Y G+I+ C
Sbjct: 211 HSGFS-IKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHH---CSSGE 266
Query: 899 VNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNTCGI 1069
NH+VL+ GF K+ S PYWI++NSWG WG +GY + GSN CGI
Sbjct: 267 ANHAVLITGFDKTGST-----------PYWIVRNSWGSSWGVDGYAHVKMGSNVCGI 312
>ref|NP_666023.1| cathepsin L1 preproprotein [Homo sapiens].
Length = 333
Score = 125 bits (314), Expect = 9e-29
Identities = 99/316 (31%), Positives = 154/316 (48%), Gaps = 8/316 (2%)
Frame = +2
Query: 146 LKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQ-RLQEEDLGTAEF--GVTPFSDL 316
L+ +T ++ +NR Y E RR ++ +N+ + QE G F + F D+
Sbjct: 25 LEAQWTKWKAMHNRLYGMNEEGWRRA-VWEKNMKMIELHNQEYREGKHSFTMAMNAFGDM 83
Query: 317 TEEEFGQLHGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCW 496
T EEF Q+ + P G KV E P+S DWR+K G ++ +K+Q C CW
Sbjct: 84 TSEEFRQVMNGF--QNRKPRKG-KVFQEPLFYEAPRSVDWREK-GYVTPVKNQGQCGSCW 139
Query: 497 AMAAVDNVEAQWAIKYHQAVQLSVQQVLDCD--RXXXXXXXXFVWDAFLTVLNTSGLASE 670
A +A +E Q K + + LS Q ++DC + + AF V + GL SE
Sbjct: 140 AFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSE 199
Query: 671 QDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITVTINAG--LLQ 844
+ YPY+ T ++ C VA F+ + E+++ + +AT GPI+V I+AG
Sbjct: 200 ESYPYEATEES--CKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFL 257
Query: 845 QYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGE 1024
YK G+ C ++H VL+VG+G +S E + YW++KNSWG +WG
Sbjct: 258 FYKEGIY--FEPDCSSEDMDHGVLVVGYG-FESTESDNNK------YWLVKNSWGEEWGM 308
Query: 1025 EGYFRLHRG-SNTCGI 1069
GY ++ + N CGI
Sbjct: 309 GGYVKMAKDRRNHCGI 324
>ref|NP_001903.1| cathepsin L1 preproprotein [Homo sapiens].
Length = 333
Score = 125 bits (314), Expect = 9e-29
Identities = 99/316 (31%), Positives = 154/316 (48%), Gaps = 8/316 (2%)
Frame = +2
Query: 146 LKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQ-RLQEEDLGTAEF--GVTPFSDL 316
L+ +T ++ +NR Y E RR ++ +N+ + QE G F + F D+
Sbjct: 25 LEAQWTKWKAMHNRLYGMNEEGWRRA-VWEKNMKMIELHNQEYREGKHSFTMAMNAFGDM 83
Query: 317 TEEEFGQLHGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCW 496
T EEF Q+ + P G KV E P+S DWR+K G ++ +K+Q C CW
Sbjct: 84 TSEEFRQVMNGF--QNRKPRKG-KVFQEPLFYEAPRSVDWREK-GYVTPVKNQGQCGSCW 139
Query: 497 AMAAVDNVEAQWAIKYHQAVQLSVQQVLDCD--RXXXXXXXXFVWDAFLTVLNTSGLASE 670
A +A +E Q K + + LS Q ++DC + + AF V + GL SE
Sbjct: 140 AFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSE 199
Query: 671 QDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITVTINAG--LLQ 844
+ YPY+ T ++ C VA F+ + E+++ + +AT GPI+V I+AG
Sbjct: 200 ESYPYEATEES--CKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFL 257
Query: 845 QYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGE 1024
YK G+ C ++H VL+VG+G +S E + YW++KNSWG +WG
Sbjct: 258 FYKEGIY--FEPDCSSEDMDHGVLVVGYG-FESTESDNNK------YWLVKNSWGEEWGM 308
Query: 1025 EGYFRLHRG-SNTCGI 1069
GY ++ + N CGI
Sbjct: 309 GGYVKMAKDRRNHCGI 324
>ref|NP_000387.1| cathepsin K preproprotein [Homo sapiens].
Length = 329
Score = 124 bits (310), Expect = 3e-28
Identities = 93/318 (29%), Positives = 150/318 (47%), Gaps = 16/318 (5%)
Frame = +2
Query: 164 LFQIQYNRSYSNPAEHARRLDIFAQNLAKAQ-RLQEEDLG--TAEFGVTPFSDLTEEEFG 334
L++ + + Y+N + R I+ +NL E LG T E + D+T EE
Sbjct: 28 LWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVV 87
Query: 335 QLHGHHWGAGKAPSMGIKVGSEESGETV---------PQSCDWRKKPGVISAIKHQKDCN 487
Q K + + + S +T+ P S D+RKK G ++ +K+Q C
Sbjct: 88 Q---------KMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKK-GYVTPVKNQGQCG 137
Query: 488 CCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFVWDAFLTVLNTSGLAS 667
CWA ++V +E Q K + + LS Q ++DC ++ +AF V G+ S
Sbjct: 138 SCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDS 197
Query: 668 EQDYPYKGTVKTHRCLAKQHRKVAWIQDFLML-QFCEQSIARYLATEGPITVTINAGL-- 838
E YPY G + C+ K A + + + + E+++ R +A GP++V I+A L
Sbjct: 198 EDAYPYVG--QEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTS 255
Query: 839 LQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDW 1018
Q Y +GV +C+ +NH+VL VG+G K + +WI+KNSWG +W
Sbjct: 256 FQFYSKGVY--YDESCNSDNLNHAVLAVGYGIQKGNK-----------HWIIKNSWGENW 302
Query: 1019 GEEGYFRLHRG-SNTCGI 1069
G +GY + R +N CGI
Sbjct: 303 GNKGYILMARNKNNACGI 320
Database: RefSeq49_HP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 18,297,164
Number of sequences in database: 32,964
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 32964
Number of Hits to DB: 58,810,525
Number of extensions: 1793001
Number of successful extensions: 7796
Number of sequences better than 1.0e-05: 22
Number of HSP's gapped: 7687
Number of HSP's successfully gapped: 22
Length of query: 440
Length of database: 18,297,164
Length adjustment: 106
Effective length of query: 334
Effective length of database: 14,802,980
Effective search space: 4944195320
Effective search space used: 4944195320
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 35 (18.1 bits)
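
Note on the length figures above: the effective lengths and the effective search space follow from the printed query length, database size, and length adjustment by simple arithmetic. The sketch below only re-derives the numbers already printed in this footer. The function and variable names are illustrative and are not part of BLAST.

```python
# Figures copied from the statistics footer of the RefSeq49_HP.fasta search.
QUERY_LEN = 440            # "Length of query" (1320 nucleotides / 3)
DB_LETTERS = 18_297_164    # "Length of database"
DB_SEQUENCES = 32_964      # "Number of Sequences"
LENGTH_ADJUSTMENT = 106    # "Length adjustment"


def effective_search_space(query_len, db_letters, db_sequences, adjustment):
    """Return (effective query length, effective database length, search space).

    The adjustment is subtracted once from the query and once per database
    sequence; the search space is the product of the two effective lengths.
    """
    eff_query = query_len - adjustment
    eff_db = db_letters - db_sequences * adjustment
    return eff_query, eff_db, eff_query * eff_db


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQUENCES, LENGTH_ADJUSTMENT
)
print(eff_query, eff_db, space)  # 334 14802980 4944195320
```

The three printed values match the "Effective length of query", "Effective length of database", and "Effective search space" lines of this footer.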
Search to RefSeqMP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-002438
(1320 letters)
Database: RefSeq49_MP.fasta
30,036 sequences; 15,617,559 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|NP_034115.2| cathepsin W preproprotein [Mus musculus]. 473 e-133
Alignment gi|NP_063914.1| cathepsin F precursor [Mus musculus]. 229 4e-60
Alignment gi|NP_062414.3| cathepsin 8 [Mus musculus]. 142 8e-34
Alignment gi|NP_064680.1| cathepsin R precursor [Mus musculus]. 141 1e-33
Alignment gi|NP_067420.1| cathepsin 6 [Mus musculus]. 134 1e-31
Alignment gi|NP_031827.2| pro-cathepsin H preproprotein [Mus musculus]. 132 5e-31
Alignment gi|NP_067256.2| cathepsin S preproprotein [Mus musculus]. 130 2e-30
Alignment gi|XP_922074.1| PREDICTED: cathepsin M-like [Mus musculus]. 130 2e-30
Alignment gi|NP_031828.2| cathepsin K precursor [Mus musculus]. 130 3e-30
Alignment gi|NP_081620.2| cathepsin L-like 3 [Mus musculus]. 128 9e-30
>ref|NP_034115.2| cathepsin W preproprotein [Mus musculus].
Length = 371
Score = 473 bits (1217), Expect = e-133
Identities = 230/363 (63%), Positives = 280/363 (77%), Gaps = 6/363 (1%)
Frame = +2
Query: 35 MALTAHLSCLLVLVVAGPAQGLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHA 214
M LTAHLS LVL++AG QGL D+L ++D GP+P+ LKEVF LFQI++NRSY NPAE+
Sbjct: 1 MTLTAHLSYFLVLLLAG--QGLSDSLLTKDAGPRPLELKEVFKLFQIRFNRSYWNPAEYT 58
Query: 215 RRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVG 394
RRL IFA NLA+AQRLQ+EDLGTAEFG TPFSDLTEEEFGQL+G + P+M KV
Sbjct: 59 RRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDLTEEEFGQLYGQERSPERTPNMTKKVE 118
Query: 395 SEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQ 574
S GE+VP++CDWRK +IS++K+Q C CCWAMAA DN++A W IK+ Q V +SVQ+
Sbjct: 119 SNTWGESVPRTCDWRKAKNIISSVKNQGSCKCCWAMAAADNIQALWRIKHQQFVDVSVQE 178
Query: 575 VLDCDRXXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDF 754
+LDC+R FVWDA+LTVLN SGLASE+DYP++G K HRCLAK+++KVAWIQDF
Sbjct: 179 LLDCERCGNGCNGGFVWDAYLTVLNNSGLASEKDYPFQGDRKPHRCLAKKYKKVAWIQDF 238
Query: 755 LMLQFCEQSIARYLATEGPITVTINAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGK 934
ML EQ+IA YLA GPITVTIN LLQ Y++GVI+ATP++CDP V+HSVLLVGFGK
Sbjct: 239 TMLSNNEQAIAHYLAVHGPITVTINMKLLQHYQKGVIKATPSSCDPRQVDHSVLLVGFGK 298
Query: 935 SK------SVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNTCGITKYPVTARV 1096
K +V + HS PYWILKNSWG WGE+GYFRL+RG+NTCG+TKYP TA+V
Sbjct: 299 EKEGMQTGTVLSHSRKRRHSSPYWILKNSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQV 358
Query: 1097 DKP 1105
D P
Sbjct: 359 DSP 361
>ref|NP_063914.1| cathepsin F precursor [Mus musculus].
Length = 462
Score = 229 bits (584), Expect = 4e-60
Identities = 128/333 (38%), Positives = 181/333 (54%), Gaps = 3/333 (0%)
Frame = +2
Query: 110 LRSQDPGPQPMGLK--EVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGT 283
L +DP PQ +K +F F YNR+Y + E RL +FA+N+ +AQ++Q D GT
Sbjct: 147 LLDKDPLPQDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGT 206
Query: 284 AEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEES-GETVPQSCDWRKKPGVIS 460
A++G+T FSDLTEEEF H + G K+ +S + P DWRKK G ++
Sbjct: 207 AQYGITKFSDLTEEEF---HTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKK-GAVT 262
Query: 461 AIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFVWDAFLT 640
+K+Q C CWA + NVE QW + + LS Q++LDCD+ +A+
Sbjct: 263 EVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSNAYAA 322
Query: 641 VLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITV 820
+ N GL +E DY Y+G V+T C +I D + L E IA +LA +GPI+V
Sbjct: 323 IKNLGGLETEDDYGYQGHVQT--CNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISV 380
Query: 821 TINAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKN 1000
INA +Q Y+ G+ C P ++H+VLLVG+G +IPYW +KN
Sbjct: 381 AINAFGMQFYRHGIAHPFRPLCSPWFIDHAVLLVGYGNRS-----------NIPYWAIKN 429
Query: 1001 SWGPDWGEEGYFRLHRGSNTCGITKYPVTARVD 1099
SWG DWGEEGY+ L+RGS CG+ +A V+
Sbjct: 430 SWGSDWGEEGYYYLYRGSGACGVNTMASSAVVN 462
>ref|NP_062414.3| cathepsin 8 [Mus musculus].
Length = 333
Score = 142 bits (357), Expect = 8e-34
Identities = 109/345 (31%), Positives = 166/345 (48%), Gaps = 11/345 (3%)
Frame = +2
Query: 95 GLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQ-EE 271
G+ + +S DP L + ++ ++N++YS E +R ++ +N+ ++ E
Sbjct: 13 GVAEVTQSSDPS-----LDSEWQEWKRKFNKNYSMEEEGQKRA-VWEENMKLVKQHNIEY 66
Query: 272 DLGTAEF--GVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEES-GETVPQSCDWRK 442
D G F V F D+T EE+ ++ P+ K + +P+ DWRK
Sbjct: 67 DQGKKNFTMDVNAFGDMTGEEYRKMLTDI----PVPNFRKKKSIHQPIAGYLPKFVDWRK 122
Query: 443 KPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFV 622
+ G ++ +K+Q CN CWA +A +E Q K + V LS Q ++DC R F
Sbjct: 123 R-GCVTPVKNQGTCNSCWAFSAAGAIEGQMFRKTGKLVPLSTQNLVDCSR-LEGNFGCFK 180
Query: 623 WDAFLT---VLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARY 793
FL V GL +E YPYKGT C R A I F + E+ + R
Sbjct: 181 GSTFLALKYVWKNRGLEAESTYPYKGT--DGHCRYHPERSAARITSFSFVSNSEKDLMRA 238
Query: 794 LATEGPITVTINA--GLLQQYKRGVIRATPATCDPHLVNHSVLLVGFG-KSKSVEGKRPR 964
+AT GPI+V I+A + Y+ G+ C +++NHSVL+VG+G + K +G +
Sbjct: 239 VATIGPISVGIDARHKSFRLYREGIY--YEPKCSSNIINHSVLVVGYGYEGKESDGNK-- 294
Query: 965 PGHSIPYWILKNSWGPDWGEEGYFRLHRG-SNTCGITKYPVTARV 1096
YW++KNS G WG GY +L RG +N CGI Y V RV
Sbjct: 295 ------YWLIKNSHGEQWGMNGYMKLARGRNNHCGIASYAVYPRV 333
>ref|NP_064680.1| cathepsin R precursor [Mus musculus].
Length = 334
Score = 141 bits (356), Expect = 1e-33
Identities = 103/315 (32%), Positives = 151/315 (47%), Gaps = 11/315 (3%)
Frame = +2
Query: 167 FQIQYNRSYSNPAEHARRLDIFAQNLAKAQ-RLQEEDLGTAEFGV--TPFSDLTEEEFGQ 337
++I+YN+SYS E +R+ ++ + L + +E LG F + F D T+EEF +
Sbjct: 32 WKIKYNKSYSLKEEKLKRV-VWEEKLKMIKLHNRENSLGKNGFTMKMNEFGDQTDEEFRK 90
Query: 338 L--HGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAV 511
+ W + S + E+G +P+ DWRKK G ++ ++ Q DC+ CWA A
Sbjct: 91 MMIEISVWTHREGKS----IMKREAGSILPKFVDWRKK-GYVTPVRRQGDCDACWAFAVT 145
Query: 512 DNVEAQWAIKYHQAVQLSVQQVLDCDR--XXXXXXXXFVWDAFLTVLNTSGLASEQDYPY 685
+EAQ + + LSVQ ++DC + ++AF VL+ GL SE YPY
Sbjct: 146 GAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLESEATYPY 205
Query: 686 KGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITVTINAG--LLQQYKRG 859
+G K C A I F+ L E + +AT GPIT I+A + YK G
Sbjct: 206 EG--KDGPCRYNPKNSKAEITGFVSLPQSEDILMAAVATIGPITAGIDASHESFKNYKGG 263
Query: 860 VIRATPATCDPHLVNHSVLLVGFG-KSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYF 1036
+ C V H VL+VG+G K +G YW++KNSWG WG GY
Sbjct: 264 IYH--EPNCSSDTVTHGVLVVGYGFKGIETDGNH--------YWLIKNSWGKRWGIRGYM 313
Query: 1037 RLHRGSNT-CGITKY 1078
+L + N CGI Y
Sbjct: 314 KLAKDKNNHCGIASY 328
>ref|NP_067420.1| cathepsin 6 [Mus musculus].
Length = 334
Score = 134 bits (338), Expect = 1e-31
Identities = 100/311 (32%), Positives = 147/311 (47%), Gaps = 10/311 (3%)
Frame = +2
Query: 176 QYNRSYSNPAEHARRLDIFAQNLAKAQRLQ-EEDLGTAEFGV--TPFSDLTEEEFGQLHG 346
QY +SY+ E RR I+ +N+ + E LG F + F DLT EE ++
Sbjct: 35 QYEKSYTMEEEGLRRA-IWEENMRMIKLHNWENSLGKNNFTLKMNEFGDLTPEELRKMMN 93
Query: 347 HH--WGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNV 520
+ W K + G+ +P+ DWRKK G ++ ++ QK CN CWA A +
Sbjct: 94 NFPIWSHKKRKI----IRKRAVGDVLPKFVDWRKK-GYVTRVRRQKFCNSCWAFAVNGAI 148
Query: 521 EAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXF--VWDAFLTVLNTSGLASEQDYPYKGT 694
E Q K + LSVQ ++DC + + + A+ VLN GL +E YPY+G
Sbjct: 149 EGQMFKKTGKLTPLSVQNLVDCTKTQGNDGCQWGDPYIAYEYVLNNGGLEAEATYPYEG- 207
Query: 695 VKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITVTINAGLLQ-QYKRGVIRA 871
K C A I F+ L E + +AT GPI+ ++A + + G I
Sbjct: 208 -KEGPCRYNPKNSKAEITGFVSLPESEDILMEAVATIGPISAAVDASFNRFSFYDGGIYH 266
Query: 872 TPATCDPHLVNHSVLLVGFG-KSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHR 1048
P C + VNH+VL+VG+G + +G + YW++KNSWG WG GY ++ R
Sbjct: 267 QP-NCSNNTVNHAVLVVGYGTEGNETDGNK--------YWLIKNSWGRRWGIGGYMKIIR 317
Query: 1049 GSNT-CGITKY 1078
N CGI Y
Sbjct: 318 DQNNHCGIATY 328
>ref|NP_031827.2| pro-cathepsin H preproprotein [Mus musculus].
Length = 333
Score = 132 bits (333), Expect = 5e-31
Identities = 92/316 (29%), Positives = 144/316 (45%), Gaps = 7/316 (2%)
Frame = +2
Query: 158 FTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 337
F + Q+ ++YS+ E+ RL +FA N K Q + + T + + FSD++ E
Sbjct: 33 FKSWMKQHQKTYSS-VEYNHRLQMFANNWRKIQAHNQRN-HTFKMALNQFSDMSFAEIK- 89
Query: 338 LHGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDN 517
H W + S K P S DWRKK V+S +K+Q C CW +
Sbjct: 90 -HKFLWSEPQNCS-ATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGA 147
Query: 518 VEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXX--FVWDAFLTVLNTSGLASEQDYPYKG 691
+E+ AI + + L+ QQ++DC + AF +L G+ E YPY G
Sbjct: 148 LESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPYIG 207
Query: 692 TVKTHRCLAKQHRKVAWIQDFLMLQFCEQS-IARYLATEGPITVTINAGL-LQQYKRGVI 865
K C + VA++++ + + +++ + +A P++ YK GV
Sbjct: 208 --KDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVY 265
Query: 866 RATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLH 1045
+ P VNH+VL VG+G+ + + YWI+KNSWG WGE GYF +
Sbjct: 266 SSKSCHKTPDKVNHAVLAVGYGEQ-----------NGLLYWIVKNSWGSQWGENGYFLIE 314
Query: 1046 RGSNTCGI---TKYPV 1084
RG N CG+ YP+
Sbjct: 315 RGKNMCGLAACASYPI 330
>ref|NP_067256.2| cathepsin S preproprotein [Mus musculus].
Length = 340
Score = 130 bits (328), Expect = 2e-30
Identities = 94/318 (29%), Positives = 155/318 (48%), Gaps = 13/318 (4%)
Frame = +2
Query: 164 LFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEE---DLGTAEFGVTPFSDLTEEEFG 334
L++ + + Y + E R I+ +NL E + T + G+ D+T EE
Sbjct: 38 LWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEIL 97
Query: 335 QLHGHHWGAGKAPSMGIKVGS--EESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAA 508
GA + P K + S T+P + DWR+K G ++ +K+Q C CWA +A
Sbjct: 98 C----RMGALRIPRQSPKTVTFRSYSNRTLPDTVDWREK-GCVTEVKYQGSCGACWAFSA 152
Query: 509 VDNVEAQWAIKYHQAVQLSVQQVLDCDR----XXXXXXXXFVWDAFLTVLNTSGLASEQD 676
V +E Q +K + + LS Q ++DC ++ +AF +++ G+ ++
Sbjct: 153 VGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADAS 212
Query: 677 YPYKGTVKTHRCLAKQHRKVAWIQDFLMLQF-CEQSIARYLATEGPITVTINA--GLLQQ 847
YPYK T +C + A ++ L F E ++ +AT+GP++V I+A
Sbjct: 213 YPYKAT--DEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFF 270
Query: 848 YKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEE 1027
YK GV T + VNH VL+VG+G +++GK YW++KNSWG ++G++
Sbjct: 271 YKSGVYDDPSCTGN---VNHGVLVVGYG---TLDGK--------DYWLVKNSWGLNFGDQ 316
Query: 1028 GYFRLHRGS-NTCGITKY 1078
GY R+ R + N CGI Y
Sbjct: 317 GYIRMARNNKNHCGIASY 334
>ref|XP_922074.1| PREDICTED: cathepsin M-like [Mus musculus].
Length = 333
Score = 130 bits (327), Expect = 2e-30
Identities = 102/342 (29%), Positives = 160/342 (46%), Gaps = 13/342 (3%)
Frame = +2
Query: 110 LRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAE 289
+ S P P P+ L + ++I++ + YS E +R ++ +N+ K +L + G +
Sbjct: 14 MASSSPSPDPI-LDAEWQKWKIKHGKPYSLEEEEQKRA-VWEENMKKI-KLHNGENGLGK 70
Query: 290 FGVT----PFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGS---EESGETVPQSCDWRKKP 448
G T F D+T EEF ++ + P +K G + +P+ +W+K+
Sbjct: 71 HGFTMEMNAFGDMTLEEFRKV------MIEIPVPTVKKGKSVQKHLSVNLPKFINWKKR- 123
Query: 449 GVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXF--V 622
G ++ ++ Q CN CWA++ +E Q K Q + LSVQ ++DC R
Sbjct: 124 GYVTPVRTQGRCNSCWAISVTGAIEGQMFQKTGQLIPLSVQNLVDCSRPQGNRGCYVGNT 183
Query: 623 WDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARYLAT 802
+ A V+ GL SE YPY+ K C A I F + E ++ +AT
Sbjct: 184 YRALKYVVENGGLESEATYPYE--EKEGSCRYNPENSTASITGFDFVPENEDALMNAVAT 241
Query: 803 EGPITVTINA--GLLQQYKRGVIRATPATCDPHLVNHSVLLVGFG-KSKSVEGKRPRPGH 973
GPI+V I+A YKRG+ C +V H++LLVG+G EG++
Sbjct: 242 IGPISVAIDARHESFLFYKRGIYH--EPNCSSSVVTHAMLLVGYGFVGNESEGRK----- 294
Query: 974 SIPYWILKNSWGPDWGEEGYFRLHRG-SNTCGITKYPVTARV 1096
YWI+KNS G WG +GY ++ R N CGI Y + RV
Sbjct: 295 ---YWIVKNSMGTKWGSKGYMKIARDQGNHCGIATYALYPRV 333
>ref|NP_031828.2| cathepsin K precursor [Mus musculus].
Length = 329
Score = 130 bits (326), Expect = 3e-30
Identities = 102/322 (31%), Positives = 154/322 (47%), Gaps = 8/322 (2%)
Frame = +2
Query: 131 PQPMGLKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQ-EEDLG--TAEFGVT 301
P+ M L + L++ + + Y++ + R I+ +NL + E LG T E +
Sbjct: 18 PEEM-LDTQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLEASLGVHTYELAMN 76
Query: 302 PFSDLTEEEFGQ-LHGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQK 478
D+T EE Q + G ++ S E G VP S D+RKK G ++ +K+Q
Sbjct: 77 HLGDMTSEEVVQKMTGLRIPPSRSYSNDTLYTPEWEGR-VPDSIDYRKK-GYVTPVKNQG 134
Query: 479 DCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFVWDAFLTVLNTSG 658
C CWA ++ +E Q K + + LS Q ++DC ++ AF V G
Sbjct: 135 QCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTENYGCGGGYMTTAFQYVQQNGG 194
Query: 659 LASEQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQF-CEQSIARYLATEGPITVTINAG 835
+ SE YPY G + C+ K A + + + E+++ R +A GPI+V+I+A
Sbjct: 195 IDSEDAYPYVG--QDESCMYNATAKAAKCRGYREIPVGNEKALKRAVARVGPISVSIDAS 252
Query: 836 L--LQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWG 1009
L Q Y RGV CD VNH+VL+VG+G K + +WI+KNSWG
Sbjct: 253 LASFQFYSRGVY--YDENCDRDNVNHAVLVVGYGTQKGSK-----------HWIIKNSWG 299
Query: 1010 PDWGEEGYFRLHRG-SNTCGIT 1072
WG +GY L R +N CGIT
Sbjct: 300 ESWGNKGYALLARNKNNACGIT 321
>ref|NP_081620.2| cathepsin L-like 3 [Mus musculus].
Length = 331
Score = 128 bits (322), Expect = 9e-30
Identities = 105/351 (29%), Positives = 165/351 (47%), Gaps = 12/351 (3%)
Frame = +2
Query: 68 VLVVAGPAQGLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLA 247
V ++A G+ A + +P L V+ ++ ++ ++Y+ E +R +N
Sbjct: 4 VFLLATLCLGVVSAAPAHNPS-----LDAVWEEWKTKHKKTYNMNDEGQKRA--VWENNK 56
Query: 248 KAQRLQEEDLGTAEFG----VTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEESGET 415
K L ED + G + F DLT EF +L G+ M +KV E
Sbjct: 57 KMIDLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGF--QGQKTKMMMKVFQEPLLGD 114
Query: 416 VPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDC--D 589
VP+S DWR G ++ +K Q C CWA +AV ++E Q K + V LSVQ ++DC
Sbjct: 115 VPKSVDWRDH-GYVTPVKDQGSCGSCWAFSAVGSLEGQMFRKTGKLVPLSVQNLVDCSWS 173
Query: 590 RXXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQF 769
+ AF V + GL + YPY+ T C A + F+ +Q
Sbjct: 174 QGNQGCDGGLPDLAFQYVKDNGGLDTSVSYPYEALNGT--CRYNPKNSAATVTGFVNVQS 231
Query: 770 CEQSIARYLATEGPITVTINA--GLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKS 943
E ++ + +AT GPI+V I+ Q YK G+ C +++H+VL+VG+G+
Sbjct: 232 SEDALMKAVATVGPISVGIDTKHKSFQFYKEGMY--YEPDCSSTVLDHAVLVVGYGEES- 288
Query: 944 VEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRG-SNTCGI---TKYPV 1084
+G++ YW++KNSWG DWG GY ++ + +N CGI YPV
Sbjct: 289 -DGRK--------YWLVKNSWGRDWGMNGYIKMAKDRNNNCGIASDASYPV 330
Database: RefSeq49_MP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 15,617,559
Number of sequences in database: 30,036
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 30036
Number of Hits to DB: 49,310,996
Number of extensions: 1476856
Number of successful extensions: 6277
Number of sequences better than 1.0e-05: 27
Number of HSP's gapped: 6146
Number of HSP's successfully gapped: 27
Length of query: 440
Length of database: 15,617,559
Length adjustment: 104
Effective length of query: 336
Effective length of database: 12,493,815
Effective search space: 4197921840
Effective search space used: 4197921840
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 35 (18.1 bits)
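
Note on the Expect values above: each alignment header prints a raw score in parentheses after the bit score, and an E-value can be estimated from it with the Karlin-Altschul relation E = K * m * n * exp(-Lambda * S), where m * n is the effective search space. The sketch below applies that relation using the gapped Lambda and K printed in this footer. It is an approximation under stated assumptions: the printed parameters are rounded, and this program version may apply further adjustments, so agreement with the reported values should be expected only to about the order of magnitude. The function name is illustrative.

```python
import math

# Values copied from the statistics footer of the RefSeq49_MP.fasta search.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 4_197_921_840  # "Effective search space"


def expect_value(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K, space=SEARCH_SPACE):
    """Estimate an E-value from a raw alignment score (Karlin-Altschul)."""
    return k * space * math.exp(-lam * raw_score)


# Raw scores taken from the "Score = ... bits (raw)" lines of two hits above.
for name, raw in [("cathepsin F [Mus musculus]", 584),
                  ("cathepsin 8 [Mus musculus]", 357)]:
    print(f"{name}: raw score {raw}, estimated E = {expect_value(raw):.1e}")
```

For comparison, the report lists Expect = 4e-60 and 8e-34 for these two hits; the estimates fall in the same decades.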
Search to RefSeqSP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-002438
(1320 letters)
Database: RefSeq49_SP.fasta
24,897 sequences; 11,343,932 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|XP_003122571.1| PREDICTED: cathepsin W-like [Sus scrofa]. 732 0.0
Alignment gi|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa]. 227 1e-59
Alignment gi|NP_999094.1| pro-cathepsin H precursor [Sus scrofa]. 139 3e-33
Alignment gi|NP_999057.1| cathepsin L1 precursor [Sus scrofa]. 134 9e-32
Alignment gi|XP_001929706.1| PREDICTED: cathepsin S [Sus scrofa]. 131 7e-31
Alignment gi|XP_003355243.1| PREDICTED: cathepsin S-like [Sus scrofa]. 131 7e-31
Alignment gi|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]. 124 9e-29
Alignment gi|NP_999467.1| cathepsin K precursor [Sus scrofa]. 124 9e-29
Alignment gi|XP_003129045.2| PREDICTED: tryptophan 2,3-dioxygenase [Sus s... 104 1e-22
Alignment gi|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus ... 96 3e-20
>ref|XP_003122571.1| PREDICTED: cathepsin W-like [Sus scrofa].
Length = 367
Score = 732 bits (1890), Expect = 0.0
Identities = 353/364 (96%), Positives = 354/364 (97%)
Frame = +2
Query: 35 MALTAHLSCLLVLVVAGPAQGLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHA 214
MALTAHLSCLLVLVVAGPAQGLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHA
Sbjct: 1 MALTAHLSCLLVLVVAGPAQGLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHA 60
Query: 215 RRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVG 394
RRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVG
Sbjct: 61 RRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVG 120
Query: 395 SEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQ 574
SEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQ
Sbjct: 121 SEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQ 180
Query: 575 VLDCDRXXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDF 754
VLDCDR FVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDF
Sbjct: 181 VLDCDRCGNGCNGGFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDF 240
Query: 755 LMLQFCEQSIARYLATEGPITVTINAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGK 934
LMLQFCEQSIARYLATEGPITVTINAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGK
Sbjct: 241 LMLQFCEQSIARYLATEGPITVTINAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGK 300
Query: 935 SKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNTCGITKYPVTARVDKPG*K 1114
SKSVEG+RPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNTCGITKYPVTARVDKP K
Sbjct: 301 SKSVEGRRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNTCGITKYPVTARVDKPVKK 360
Query: 1115 HQIS 1126
HQIS
Sbjct: 361 HQIS 364
>ref|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa].
Length = 490
Score = 227 bits (579), Expect = 1e-59
Identities = 122/322 (37%), Positives = 179/322 (55%), Gaps = 2/322 (0%)
Frame = +2
Query: 110 LRSQDPGPQPMGLK--EVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGT 283
L ++DP PQ +K +F F YNR+Y E R+ +FA N+ +AQ++Q D GT
Sbjct: 175 LLNKDPLPQDFSVKMASIFKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGT 234
Query: 284 AEFGVTPFSDLTEEEFGQLHGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISA 463
A +GVT FSDLTEEEF ++ + + P +++ S P+ DWRKK G ++
Sbjct: 235 ARYGVTKFSDLTEEEFRTIYLNPL-LQEEPGRKMRLAKSVSSLPPPE-WDWRKK-GAVTK 291
Query: 464 IKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFVWDAFLTV 643
+K Q C CWA + NVE QW +K + LS Q++LDCD+ +A+ +
Sbjct: 292 VKDQGMCGSCWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCDKVDKGCMGGLPSNAYSAI 351
Query: 644 LNTSGLASEQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITVT 823
GL +E+DY Y+G ++T C + +I D + L EQ +A +LA +GPI+V
Sbjct: 352 KTLGGLETEEDYSYRGHLQT--CSFNAEKAKVYINDSVELSQNEQKLAAWLAEKGPISVA 409
Query: 824 INAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNS 1003
INA +Q Y+ G+ C P L++H+VLLVG+G + P+W +KNS
Sbjct: 410 INAFGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRSAT-----------PFWAIKNS 458
Query: 1004 WGPDWGEEGYFRLHRGSNTCGI 1069
WG DWGEEGY+ L+RGS CG+
Sbjct: 459 WGTDWGEEGYYYLYRGSGACGV 480
>ref|NP_999094.1| pro-cathepsin H precursor [Sus scrofa].
Length = 335
Score = 139 bits (351), Expect = 3e-33
Identities = 92/318 (28%), Positives = 146/318 (45%), Gaps = 9/318 (2%)
Frame = +2
Query: 158 FTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLG--TAEFGVTPFSDLTEEEF 331
F + +Q+ + YS H RL +F N K + + G T + G+ FSD++ +E
Sbjct: 35 FKSWMVQHQKKYSLEEYH-HRLQVFVSNWRK---INAHNAGNHTFKLGLNQFSDMSFDEI 90
Query: 332 GQLHGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAV 511
H + W + S K P S DWRKK +S +K+Q C CW +
Sbjct: 91 R--HKYLWSEPQNCS-ATKGNYLRGTGPYPPSMDWRKKGNFVSPVKNQGSCGSCWTFSTT 147
Query: 512 DNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXX--FVWDAFLTVLNTSGLASEQDYPY 685
+E+ AI + + L+ QQ++DC + AF + G+ E YPY
Sbjct: 148 GALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPY 207
Query: 686 KGTVKTHRCLAKQHRKVAWIQDFLMLQFC-EQSIARYLATEGPITVTINA-GLLQQYKRG 859
KG + C + + +A+++D + E+++ +A P++ Y++G
Sbjct: 208 KG--QDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKG 265
Query: 860 VIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFR 1039
+ +T P VNH+VL VG+G+ + IPYWI+KNSWGP WG GYF
Sbjct: 266 IYSSTSCHKTPDKVNHAVLAVGYGEE-----------NGIPYWIVKNSWGPQWGMNGYFL 314
Query: 1040 LHRGSNTCGI---TKYPV 1084
+ RG N CG+ YP+
Sbjct: 315 IERGKNMCGLAACASYPI 332
>ref|NP_999057.1| cathepsin L1 precursor [Sus scrofa].
Length = 334
Score = 134 bits (338), Expect = 9e-32
Identities = 101/307 (32%), Positives = 152/307 (49%), Gaps = 9/307 (2%)
Frame = +2
Query: 179 YNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFG----VTPFSDLTEEEFGQLHG 346
+ R Y E RR ++ +N+ K L ++ + G + F D+T EEF Q+
Sbjct: 36 HGRLYGMNEEGWRRA-VWEKNM-KMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQVMN 93
Query: 347 HHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEA 526
G KV E VP+S DWR+K G ++A+K+Q C CWA +A +E
Sbjct: 94 ---GFQNQKHKKGKVFHESLVLEVPKSVDWREK-GYVTAVKNQGQCGSCWAFSATGALEG 149
Query: 527 QWAIKYHQAVQLSVQQVLDCDR--XXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGTVK 700
Q K + V LS Q ++DC R + +AF V + GL +E+ YPY G +
Sbjct: 150 QMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGR-E 208
Query: 701 THRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITVTINAG--LLQQYKRGVIRAT 874
T+ C K A F+ + E+++ + +AT GPI+V I+AG Q YK G+
Sbjct: 209 TNSCTYKPECSAANDTGFVDIPQREKALMKAVATVGPISVAIDAGHSSFQFYKSGIY--Y 266
Query: 875 PATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGS 1054
C ++H VL+VG+G EG +S +WI+KNSWGP+WG GY ++ +
Sbjct: 267 DPDCSSKDLDHGVLVVGYG----FEG---TDSNSSKFWIVKNSWGPEWGWNGYVKMAKDQ 319
Query: 1055 NT-CGIT 1072
N CGI+
Sbjct: 320 NNHCGIS 326
>ref|XP_001929706.1| PREDICTED: cathepsin S [Sus scrofa].
Length = 331
Score = 131 bits (330), Expect = 7e-31
Identities = 98/318 (30%), Positives = 154/318 (48%), Gaps = 12/318 (3%)
Frame = +2
Query: 164 LFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQ-EEDLG--TAEFGVTPFSDLTEEEFG 334
L++ Y + Y E R I+ +NL E +G + + G+ D+T EE
Sbjct: 30 LWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVI 89
Query: 335 QLHGHHWGAGKAPSMGIKVGSEESG--ETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAA 508
L + PS + + +S + +P S DWR+K G ++ +K+Q C CWA +A
Sbjct: 90 SL----MSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREK-GCVTEVKYQGSCGSCWAFSA 144
Query: 509 VDNVEAQWAIKYHQAVQLSVQQVLDCDR---XXXXXXXXFVWDAFLTVLNTSGLASEQDY 679
V +EAQ +K + V LS Q ++DC F+ +AF +++ +G+ SE Y
Sbjct: 145 VGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASY 204
Query: 680 PYKGTVKTHRCLAKQHRKVAWIQDFLMLQFC-EQSIARYLATEGPITVTINA--GLLQQY 850
PYK +C + A + L F E ++ +A +GP++V I+A Y
Sbjct: 205 PYKAV--DGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFY 262
Query: 851 KRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEG 1030
+ GV T + VNH VL+VG+G ++ GK YW++KNSWG ++G+ G
Sbjct: 263 RSGVYYDPSCTQN---VNHGVLVVGYG---NLNGK--------DYWLVKNSWGLNFGDGG 308
Query: 1031 YFRLHRGS-NTCGITKYP 1081
Y R+ R S N CGI YP
Sbjct: 309 YIRMARNSENHCGIANYP 326
>ref|XP_003355243.1| PREDICTED: cathepsin S-like [Sus scrofa].
Length = 331
Score = 131 bits (330), Expect = 7e-31
Identities = 98/318 (30%), Positives = 154/318 (48%), Gaps = 12/318 (3%)
Frame = +2
Query: 164 LFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQ-EEDLG--TAEFGVTPFSDLTEEEFG 334
L++ Y + Y E R I+ +NL E +G + + G+ D+T EE
Sbjct: 30 LWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVI 89
Query: 335 QLHGHHWGAGKAPSMGIKVGSEESG--ETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAA 508
L + PS + + +S + +P S DWR+K G ++ +K+Q C CWA +A
Sbjct: 90 SL----MSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREK-GCVTEVKYQGSCGSCWAFSA 144
Query: 509 VDNVEAQWAIKYHQAVQLSVQQVLDCDR---XXXXXXXXFVWDAFLTVLNTSGLASEQDY 679
V +EAQ +K + V LS Q ++DC F+ +AF +++ +G+ SE Y
Sbjct: 145 VGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASY 204
Query: 680 PYKGTVKTHRCLAKQHRKVAWIQDFLMLQFC-EQSIARYLATEGPITVTINA--GLLQQY 850
PYK +C + A + L F E ++ +A +GP++V I+A Y
Sbjct: 205 PYKAV--DGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFY 262
Query: 851 KRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEG 1030
+ GV T + VNH VL+VG+G ++ GK YW++KNSWG ++G+ G
Sbjct: 263 RSGVYYDPSCTQN---VNHGVLVVGYG---NLNGK--------DYWLVKNSWGLNFGDGG 308
Query: 1031 YFRLHRGS-NTCGITKYP 1081
Y R+ R S N CGI YP
Sbjct: 309 YIRMARNSENHCGIANYP 326
>ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa].
Length = 332
Score = 124 bits (312), Expect = 9e-29
Identities = 96/305 (31%), Positives = 145/305 (47%), Gaps = 12/305 (3%)
Frame = +2
Query: 206 EHARRLDIFAQNLAKAQRLQ-EEDLGTAEF--GVTPFSDLTEEEFGQ-LHGHHWGAGKAP 373
E RR I+ +N+ +R E G F + F D+T EEF + ++G K
Sbjct: 44 EEGRRRAIWEKNMKMIERHNWEHRQGKHSFTMAMNAFGDMTNEEFRKTMNGFQNQKHKKG 103
Query: 374 SMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQA 553
+ + GS P S DWR+K G ++A+K+Q C CWA +A +E Q K +
Sbjct: 104 KVFLDAGSA----LTPHSVDWREK-GYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKL 158
Query: 554 VQLSVQQVLDCD--RXXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQH 727
+ LS Q ++DC + +AF + + GL SE+ YPY G K C K
Sbjct: 159 ISLSEQNLVDCSWPEGNEGCNGGLMDNAFQYIKDNGGLDSEESYPYFG--KDGSCKYKPQ 216
Query: 728 RKVAWIQDFLMLQFCEQSIARYLATEGPITVTINAG--LLQQYKRGVIRATPATCDPHLV 901
A ++ + E+++ + +AT GPI+V I+A Q Y G+ C +
Sbjct: 217 SSAANDTGYVDIPKQEKALMKAVATVGPISVGIDASHESFQFYSTGIY--FEPQCSSEDL 274
Query: 902 NHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNT-CGI--- 1069
+H VL+VG+G VEG + YW++KNSWG WG +GY ++ + N CGI
Sbjct: 275 DHGVLVVGYG----VEGAH----SNNKYWLVKNSWGNTWGMDGYIKMTKDQNNHCGIATM 326
Query: 1070 TKYPV 1084
YPV
Sbjct: 327 ASYPV 331
>ref|NP_999467.1| cathepsin K precursor [Sus scrofa].
Length = 330
Score = 124 bits (312), Expect = 9e-29
Identities = 95/318 (29%), Positives = 150/318 (47%), Gaps = 16/318 (5%)
Frame = +2
Query: 164 LFQIQYNRSYSNPAEHARRLDIFAQNLAKAQ-RLQEEDLG--TAEFGVTPFSDLTEEEFG 334
L++ Y + Y++ + R I+ +NL E LG T E + D+T EE
Sbjct: 29 LWKKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVV 88
Query: 335 QLHGHHWGAGKAPSMGIKVGSEESGETV---------PQSCDWRKKPGVISAIKHQKDCN 487
Q K + + S +T+ P S D+RKK G ++ +K+Q C
Sbjct: 89 Q---------KMTGLKVPPSHSRSNDTLYIPDWEGRTPDSIDYRKK-GYVTPVKNQGQCG 138
Query: 488 CCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFVWDAFLTVLNTSGLAS 667
CWA ++V +E Q K + + LS Q ++DC ++ +AF V G+ S
Sbjct: 139 SCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDS 198
Query: 668 EQDYPYKGTVKTHRCLAKQHRKVAWIQDFLML-QFCEQSIARYLATEGPITVTINAGL-- 838
E YPY G + C+ K A + + + + E+++ R +A GP++V I+A L
Sbjct: 199 EDAYPYVG--QDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTS 256
Query: 839 LQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDW 1018
Q Y +GV C+ +NH+VL VG+G K GK+ +WI+KNSWG +W
Sbjct: 257 FQFYSKGVY--YDENCNSDNLNHAVLAVGYGIQK---GKK--------HWIIKNSWGENW 303
Query: 1019 GEEGYFRLHRG-SNTCGI 1069
G +GY + R +N CGI
Sbjct: 304 GNKGYILMARNKNNACGI 321
>ref|XP_003129045.2| PREDICTED: tryptophan 2,3-dioxygenase [Sus scrofa].
Length = 411
Score = 104 bits (259), Expect = 1e-22
Identities = 87/316 (27%), Positives = 131/316 (41%), Gaps = 4/316 (1%)
Frame = +2
Query: 134 QPMGLKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSD 313
Q + E T R Y +PA + L RL E +G + P++
Sbjct: 127 QQFSVLETMTALDFNDFREYLSPASGFQSLQF---------RLLENKIGVLQSLRVPYN- 176
Query: 314 LTEEEFGQLHGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKK-PGVISAIKHQKDCNC 490
H+ K + + SE+ + W ++ PG+ + C
Sbjct: 177 ---------RRHYRDTFKGKDNELLLKSEQERTLLQLVEAWLERTPGL------EPHCGG 221
Query: 491 CWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRXXXXXXXXFVWDAFLTVLNTS-GLAS 667
CWA + V VE+ +AIK LSVQQV+DC +A + T + S
Sbjct: 222 CWAFSVVSAVESAYAIKGQPLEVLSVQQVIDCSYNNYGCNGGSTLNALYWLNKTQVKVVS 281
Query: 668 EQDYPYKGTVKTHRCLAKQHRKVAWIQDFLMLQFC--EQSIARYLATEGPITVTINAGLL 841
+ +YP+K + H V+ I+D+ F E +A+ L T GP+ V ++A
Sbjct: 282 DSEYPFKAQNGLCHYFSCSHSGVS-IKDYSAYDFSGQEDEMAKTLLTLGPLIVIVDAVSW 340
Query: 842 QQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWG 1021
Q Y G+I+ C NH+VL+ GF K+ S PYWI++NSWG WG
Sbjct: 341 QDYLGGIIQHH---CSSGEANHAVLVTGFDKTGST-----------PYWIVRNSWGSAWG 386
Query: 1022 EEGYFRLHRGSNTCGI 1069
+GY + G N CGI
Sbjct: 387 IDGYALVKMGGNICGI 402
>ref|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus scrofa].
Length = 463
Score = 96.3 bits (238), Expect = 3e-20
Identities = 70/245 (28%), Positives = 109/245 (44%), Gaps = 14/245 (5%)
Frame = +2
Query: 398 EESGETVPQSCDWRKKPGV--ISAIKHQKDCNCCWAMAAVDNVEAQWAIKYH--QAVQLS 565
+E +P S DWR G ++ +++Q C C++ A++ +EA+ I + Q LS
Sbjct: 225 QEKSLHLPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNNTQTPILS 284
Query: 566 VQQVLDCDRXXXXXXXXFVWDAFLTVLNTSGLASEQDYPYKGT----VKTHRCLAKQHRK 733
Q+V+ C + F + GL E +PY GT C +
Sbjct: 285 PQEVVSCSQYAQGCAGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCTVKEGCFRYYSSE 344
Query: 734 VAWIQDFLMLQFCEQSIARY-LATEGPITVTINA-GLLQQYKRGVIRATPATCDP----H 895
++ F C +++ + L GP+ V Y++G+ T DP
Sbjct: 345 YHYVGGFY--GGCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLR-DPFNPFE 401
Query: 896 LVNHSVLLVGFGKSKSVEGKRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNTCGITK 1075
L NH+VLLVG+G + + YWI+KNSWG WGE+GYFR+ RG++ C I
Sbjct: 402 LTNHAVLLVGYGTDLA---------SGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIES 452
Query: 1076 YPVTA 1090
V A
Sbjct: 453 IAVAA 457
Database: RefSeq49_SP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 11,343,932
Number of sequences in database: 24,897
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 24897
Number of Hits to DB: 37,613,384
Number of extensions: 1209522
Number of successful extensions: 5252
Number of sequences better than 1.0e-05: 14
Number of HSP's gapped: 5186
Number of HSP's successfully gapped: 14
Length of query: 440
Length of database: 11,343,932
Length adjustment: 102
Effective length of query: 338
Effective length of database: 8,804,438
Effective search space: 2975900044
Effective search space used: 2975900044
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 35 (18.1 bits)
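The length figures in the statistics block above are tied together by simple arithmetic, which the following minimal sketch reproduces. It assumes the usual BLAST convention that the length adjustment is subtracted once from the query and once from every database sequence; the variable names are illustrative and do not come from the report.

```python
# Values copied from the BLASTX statistics block above.
query_nt = 1320             # query length in nucleotides ("1320 letters")
db_letters = 11_343_932     # "Number of letters in database"
db_sequences = 24_897       # "Number of sequences in database"
length_adjustment = 102     # "Length adjustment"

# BLASTX translates the query, so its length is counted in codons.
query_aa = query_nt // 3

# Assumption: the adjustment is removed once from the query and
# once from each database sequence.
eff_query = query_aa - length_adjustment
eff_db = db_letters - db_sequences * length_adjustment
search_space = eff_query * eff_db

print(query_aa, eff_query, eff_db, search_space)
```

Under that assumption the four printed values correspond to the "Length of query", "Effective length of query", "Effective length of database" and "Effective search space" lines of the report.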
Search to Sscrofa10_2
BLASTN 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-002438
(1320 letters)
Database: Sscrofa_10.2.fasta
4582 sequences; 2,808,509,378 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Sscrofa_Chr02 363 2e-97
>Sscrofa_Chr02
Length = 162569375
Score = 363 bits (183), Expect = 2e-97
Identities = 186/187 (99%)
Strand = Plus / Minus
Query: 842 cagcaatacaagaggggagtgatcagggccactcctgccacctgtgacccccaccttgtg 901
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5550645 cagcaatacaagaggggagtgatcagggccactcctgccacctgtgacccccaccttgtg 5550586
Query: 902 aatcactctgtcctgctggtgggctttggtaaaagcaagtctgtggaggggaagcggccg 961
|||||||||||||||||||||||||||||||||||||||||||||||||||| |||||||
Sbjct: 5550585 aatcactctgtcctgctggtgggctttggtaaaagcaagtctgtggaggggaggcggccg 5550526
Query: 962 cgtcctggccactccatcccatactggatcctgaagaactcctgggggcccgactggggt 1021
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5550525 cgtcctggccactccatcccatactggatcctgaagaactcctgggggcccgactggggt 5550466
Query: 1022 gaagagg 1028
|||||||
Sbjct: 5550465 gaagagg 5550459
Score = 313 bits (158), Expect = 1e-82
Identities = 158/158 (100%)
Strand = Plus / Minus
Query: 318 cagaggaggagtttggccagctccatgggcatcactggggggctggcaaggcccccagta 377
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5551991 cagaggaggagtttggccagctccatgggcatcactggggggctggcaaggcccccagta 5551932
Query: 378 tgggcataaaggtagggtctgaagagtcgggggagacagtgccccagagctgtgactggc 437
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5551931 tgggcataaaggtagggtctgaagagtcgggggagacagtgccccagagctgtgactggc 5551872
Query: 438 ggaagaagcctggtgtcatctcggccatcaagcatcag 475
||||||||||||||||||||||||||||||||||||||
Sbjct: 5551871 ggaagaagcctggtgtcatctcggccatcaagcatcag 5551834
Score = 252 bits (127), Expect = 4e-64
Identities = 127/127 (100%)
Strand = Plus / Minus
Query: 654 gcggcctggccagcgaacaggactacccgtacaaggggaccgtcaaaacccacaggtgcc 713
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5551365 gcggcctggccagcgaacaggactacccgtacaaggggaccgtcaaaacccacaggtgcc 5551306
Query: 714 tggccaagcagcacaggaaggtggcctggatccaggatttcctcatgctgcagttctgcg 773
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5551305 tggccaagcagcacaggaaggtggcctggatccaggatttcctcatgctgcagttctgcg 5551246
Query: 774 agcagag 780
|||||||
Sbjct: 5551245 agcagag 5551239
Score = 234 bits (118), Expect = 1e-58
Identities = 121/122 (99%)
Strand = Plus / Minus
Query: 1 gcttcctgtccccgacactctggttggcggcatcatggcactaactgcccacctctcctg 60
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5554210 gcttcctgtccccgacactctggttggcggcatcatggcactaactgcccacctctcctg 5554151
Query: 61 tctcctggtcctggtggtcgcaggcccggcccaaggcctcaaggacgccctcagaagcca 120
|||||||||||||||||| |||||||||||||||||||||||||||||||||||||||||
Sbjct: 5554150 tctcctggtcctggtggtggcaggcccggcccaaggcctcaaggacgccctcagaagcca 5554091
Query: 121 gg 122
||
Sbjct: 5554090 gg 5554089
Score = 232 bits (117), Expect = 4e-58
Identities = 117/117 (100%)
Strand = Plus / Minus
Query: 204 cagagcacgctcgccgcctggacatctttgcccaaaacctggccaaggctcagcggctgc 263
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5552489 cagagcacgctcgccgcctggacatctttgcccaaaacctggccaaggctcagcggctgc 5552430
Query: 264 aggaggaggacttgggcacagccgagtttggagtgactccattcagtgatctcacag 320
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5552429 aggaggaggacttgggcacagccgagtttggagtgactccattcagtgatctcacag 5552373
Score = 206 bits (104), Expect = 2e-50
Identities = 120/124 (96%), Gaps = 1/124 (0%)
Strand = Plus / Minus
Query: 1026 agggctatttccggctgcaccgagggagtaacacctgcggcatcaccaagtacccggtca 1085
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5550372 agggctatttccggctgcaccgagggagtaacacctgcggcatcaccaagtacccggtca 5550313
Query: 1086 ctgccagagtggacaaacccggttagaagcaccagatctcctggccggccctgagcccac 1145
||||||||||||||||||||| | |||||||||||||||||| ||| |||||||||||||
Sbjct: 5550312 ctgccagagtggacaaacccgttaagaagcaccagatctcct-gcccgccctgagcccac 5550254
Query: 1146 ccgg 1149
||||
Sbjct: 5550253 ccgg 5550250
Score = 196 bits (99), Expect = 2e-47
Identities = 99/99 (100%)
Strand = Plus / Minus
Query: 474 agaaggactgcaactgttgctgggccatggcggcagtggacaacgtggaggctcaatggg 533
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5551717 agaaggactgcaactgttgctgggccatggcggcagtggacaacgtggaggctcaatggg 5551658
Query: 534 ccatcaagtaccaccaggctgtgcaactctctgtgcagc 572
|||||||||||||||||||||||||||||||||||||||
Sbjct: 5551657 ccatcaagtaccaccaggctgtgcaactctctgtgcagc 5551619
Score = 176 bits (89), Expect = 2e-41
Identities = 89/89 (100%)
Strand = Plus / Minus
Query: 118 ccaggacccaggtcctcagccaatggggctgaaagaggtcttcaccttgttccagatcca 177
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5553864 ccaggacccaggtcctcagccaatggggctgaaagaggtcttcaccttgttccagatcca 5553805
Query: 178 atacaaccggagttactcgaacccagcag 206
|||||||||||||||||||||||||||||
Sbjct: 5553804 atacaaccggagttactcgaacccagcag 5553776
Score = 163 bits (82), Expect = 3e-37
Identities = 82/82 (100%)
Strand = Plus / Minus
Query: 573 aggtgcttgactgtgaccgctgtgggaatggctgcaacggcggcttcgtctgggacgcgt 632
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5551536 aggtgcttgactgtgaccgctgtgggaatggctgcaacggcggcttcgtctgggacgcgt 5551477
Query: 633 tcctgactgtcctcaacaccag 654
||||||||||||||||||||||
Sbjct: 5551476 tcctgactgtcctcaacaccag 5551455
Score = 129 bits (65), Expect = 4e-27
Identities = 65/65 (100%)
Strand = Plus / Minus
Query: 780 gcatcgccaggtacttggccaccgaaggccccatcaccgtgaccatcaacgcgggcctac 839
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5550780 gcatcgccaggtacttggccaccgaaggccccatcaccgtgaccatcaacgcgggcctac 5550721
Query: 840 tgcag 844
|||||
Sbjct: 5550720 tgcag 5550716
Database: Sscrofa_10.2.fasta
Posted date: Nov 16, 2011 10:34 AM
Number of letters in database: 2,808,509,378
Number of sequences in database: 4582
Lambda K H
1.37 0.711 1.31
Gapped
Lambda K H
1.37 0.711 1.31
Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 5, Extension: 2
Number of Sequences: 4582
Number of Hits to DB: 32,933,525
Number of extensions: 191
Number of successful extensions: 191
Number of sequences better than 1.0e-05: 1
Number of HSP's gapped: 191
Number of HSP's successfully gapped: 10
Length of query: 1320
Length of database: 2,808,509,378
Length adjustment: 21
Effective length of query: 1299
Effective length of database: 2,808,413,156
Effective search space: 3648128689644
Effective search space used: 3648128689644
X1: 11 (21.8 bits)
X2: 15 (29.7 bits)
X3: 50 (99.1 bits)
S1: 18 (36.2 bits)
S2: 30 (60.0 bits)
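The Expect values in the BLASTN report can be related to the bit scores and the effective search space through the standard Karlin-Altschul relation E = m * n * 2^(-S'), where S' is the bit score and m * n is the effective search space. The sketch below is a rough check, not the program's own calculation: the report prints rounded bit scores, so agreement is expected only in the leading digit, and the function name is hypothetical.

```python
# Values copied from the BLASTN statistics block above.
query_len = 1320
db_letters = 2_808_509_378
db_sequences = 4582
length_adjustment = 21

# Assumption: the adjustment is removed once from the query and
# once from each database sequence.
search_space = (query_len - length_adjustment) * (
    db_letters - db_sequences * length_adjustment
)

def expect_from_bits(bit_score: float, space: int) -> float:
    """Karlin-Altschul E-value for a bit score in a given search space."""
    return space * 2.0 ** (-bit_score)

# Top HSP on Sscrofa_Chr02: 363 bits, reported Expect = 2e-97.
e_top = expect_from_bits(363, search_space)
print(search_space, f"{e_top:.1e}")
```

The computed search space matches the "Effective search space" line, and the E-value for the 363-bit HSP comes out close to the reported 2e-97.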