Search to RefSeqBP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-012224
(1166 letters)
Database: RefSeq49_BP.fasta
33,088 sequences; 17,681,374 total letters
Searching..................................................done
Sequences producing significant alignments:    Score (bits)    E Value
Alignment gi|NP_001028787.1| cathepsin S precursor [Bos taurus]. 586 e-167
Alignment gi|NP_001029607.1| cathepsin K [Bos taurus]. 341 6e-94
Alignment gi|NP_001077155.1| cathepsin L1 [Bos taurus]. 288 5e-78
Alignment gi|NP_776457.1| cathepsin L2 precursor [Bos taurus]. 279 3e-75
Alignment gi|NP_001029557.1| pro-cathepsin H precursor [Bos taurus]. 184 1e-46
Alignment gi|NP_001068884.1| cathepsin F [Bos taurus]. 128 8e-30
Alignment gi|XP_002694471.1| PREDICTED: cathepsin O preproprotein-like [B... 119 4e-27
Alignment gi|XP_874012.3| PREDICTED: cathepsin O [Bos taurus]. 119 4e-27
Alignment gi|NP_001028789.1| dipeptidyl peptidase 1 [Bos taurus]. 105 6e-23
Alignment gi|NP_001103540.1| cathepsin W [Bos taurus]. 88 2e-17
>ref|NP_001028787.1| cathepsin S precursor [Bos taurus].
Length = 331
Score = 586 bits (1510), Expect = e-167
Identities = 279/331 (84%), Positives = 292/331 (88%)
Frame = +3
Query: 144 MKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVM 323
M LVW LLLCSSAMA +HRDPTLD HWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTV
Sbjct: 1 MNWLVWALLLCSSAMAHVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVT 60
Query: 324 LHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMD 503
LHNLEHSMGMHSY+LGMNHLGDMTSEEVISLMS +RVPSQWPRNVTYKS+PNQKLPDSMD
Sbjct: 61 LHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKLPDSMD 120
Query: 504 WREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGC 683
WREKGCVTEVKYQG+CGSCWAFSAVGALEAQVK+KTG+LVSLSAQNLVDCST KY NKGC
Sbjct: 121 WREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGC 180
Query: 684 NGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKE 863
NGGFMTEAFQYIIDNNGIDSEASYPYKA+DGKC+YD KNRAATCSRY ELPF E ALKE
Sbjct: 181 NGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSRYIELPFGSEEALKE 240
Query: 864 AVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKDYWLVKNSW 1043
AVANKGPVSV IDA HSSFF Y++GVYYDPSCTQ KDYWLVKNSW
Sbjct: 241 AVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVKNSW 300
Query: 1044 GLNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
GL+FGD GYIRMARNS N G ANYP YP+I
Sbjct: 301 GLHFGDQGYIRMARNSGNHCGIANYPSYPEI 331
>ref|NP_001029607.1| cathepsin K [Bos taurus].
Length = 334
Score = 341 bits (875), Expect = 6e-94
Identities = 173/328 (52%), Positives = 221/328 (67%), Gaps = 3/328 (0%)
Frame = +3
Query: 162 VLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEH 341
VLLL + A L+ + LD W+LWKKTY KQY K +E++RRLIWEKNLK + +HNLE
Sbjct: 11 VLLLPVVSFA-LYPEEILDTQWELWKKTYRKQYNSKGDEISRRLIWEKNLKHISIHNLEA 69
Query: 342 SMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRN--VTYKSNPNQKLPDSMDWREK 515
S+G+H+Y+L MNHLGDMTSEEV+ M+ ++VP+ R+ Y + + PDS+D+R+K
Sbjct: 70 SLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPASRSRSNDTLYIPDWEGRAPDSVDYRKK 129
Query: 516 GCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGF 695
G VT VK QG CGSCWAFS+VGALE Q+K KTG+L++LS QNLVDC +E N GC GG+
Sbjct: 130 GYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE---NDGCGGGY 186
Query: 696 MTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVAN 875
MT AFQY+ N GIDSE +YPY D C Y+ +AA C Y E+P +E ALK AVA
Sbjct: 187 MTNAFQYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVAR 246
Query: 876 KGPVSVAIDAKHSSFFFYRSGVYYDPSC-TQXXXXXXXXXXXXXXXXKDYWLVKNSWGLN 1052
GP+SVAIDA +SF FYR GVYYD +C + +W++KNSWG N
Sbjct: 247 VGPISVAIDASLTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGEN 306
Query: 1053 FGDGGYIRMARNSENPWGNANYPPYPKI 1136
+G+ GYI MARN N G AN +PK+
Sbjct: 307 WGNKGYILMARNKNNACGIANLASFPKM 334
>ref|NP_001077155.1| cathepsin L1 [Bos taurus].
Length = 333
Score = 288 bits (738), Expect = 5e-78
Identities = 153/333 (45%), Positives = 201/333 (60%), Gaps = 5/333 (1%)
Frame = +3
Query: 153 LVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHN 332
L+ L A A D +LD W LWK + K Y + NEE R+ +W+KN+K + LHN
Sbjct: 5 LLLTALCLGIASAAPKFDHSLDTQWKLWKAAHRKPY-DLNEEGWRKAVWKKNMKMIELHN 63
Query: 333 LEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMDWRE 512
E+S G HS+ + MN GDMT+EE M+ + + + +P S+DWRE
Sbjct: 64 QEYSQGKHSFSMAMNAFGDMTNEEFRHTMNGFQRQKN-KKGKEFHETIFASIPPSVDWRE 122
Query: 513 KGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGG 692
KG VT VK QG CGSCWAFSA GALE Q+ KTG+LVSLS QNLVDCS + N+GC+GG
Sbjct: 123 KGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCS-QPEGNRGCHGG 181
Query: 693 FMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVA 872
F+ AFQY++D G+DSE SYPY + G C Y+ N AA + + +LP E AL +AVA
Sbjct: 182 FIDNAFQYVLDVGGLDSEESYPYTGLVGTCLYNPNNSAANETGFVDLP-KQEKALMKAVA 240
Query: 873 NKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKD-----YWLVKN 1037
N GP+SVA+DA + SF FY+SG+YY+P+C+ D YWLVKN
Sbjct: 241 NLGPISVAVDAHNPSFQFYKSGIYYEPNCSSESVDHAVLVVGYGFEGADSDDNKYWLVKN 300
Query: 1038 SWGLNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
SWG ++G GYI+MA++ N G A YP +
Sbjct: 301 SWGEHWGMNGYIKMAKDRNNHCGIATMASYPTV 333
>ref|NP_776457.1| cathepsin L2 precursor [Bos taurus].
Length = 334
Score = 279 bits (714), Expect = 3e-75
Identities = 153/330 (46%), Positives = 196/330 (59%), Gaps = 6/330 (1%)
Frame = +3
Query: 165 LLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHS 344
+L A A DP LD HW WK T+ + Y NEE RR +WEKN K + LHN E+S
Sbjct: 9 VLCLGVASAAPKLDPNLDAHWHQWKATHRRLYG-MNEEEWRRAVWEKNKKIIDLHNQEYS 67
Query: 345 MGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCV 524
G H++ + MN GDMT+EE +M+ + + + + +P S+DW +KG V
Sbjct: 68 EGKHAFRMAMNAFGDMTNEEFRQVMNGFQ-NQKHKKGKLFHEPLLVDVPKSVDWTKKGYV 126
Query: 525 TEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTE 704
T VK QG CGSCWAFSA GALE Q+ KTG+LVSLS QNLVDCS + N+GCNGG M
Sbjct: 127 TPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQ-GNQGCNGGLMDN 185
Query: 705 AFQYIIDNNGIDSEASYPYKAVD-GKCKYDSKNRAATCSRYTELPFADEYALKEAVANKG 881
AFQYI DN G+DSE SYPY A D C Y + AA + + ++P E AL +AVA G
Sbjct: 186 AFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIP-QREKALMKAVATVG 244
Query: 882 PVSVAIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKD-----YWLVKNSWG 1046
P+SVAIDA H+SF FY+SG+YYDP C+ D +W+VKNSWG
Sbjct: 245 PISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWG 304
Query: 1047 LNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
+G GY++MA++ N G A YP +
Sbjct: 305 PEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>ref|NP_001029557.1| pro-cathepsin H precursor [Bos taurus].
Length = 335
Score = 184 bits (467), Expect = 1e-46
Identities = 124/348 (35%), Positives = 171/348 (49%), Gaps = 7/348 (2%)
Frame = +3
Query: 108 ILCIGAPAGSIIMKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVAR 287
+LC GA W+L + A+L + H+ W + K+Y +EE
Sbjct: 7 LLCAGA-----------WLLGAPACGAAELAANSLEKFHFQSWMVQHQKKYS--SEEYYH 53
Query: 288 RL-IWEKNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVIS--LMSCVRVPSQWPRNV 458
RL + NL+ + HN + H++ +G+N DM+ +E+ L S + S N
Sbjct: 54 RLQAFASNLREINAHNARN----HTFKMGLNQFSDMSFDELKRKYLWSEPQNCSATKSNY 109
Query: 459 TYKSNPNQKLPDSMDWREKG-CVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSA 635
+ P P SMDWR+KG VT VK QGSCGSCW FS GALE+ V + TG+L L+
Sbjct: 110 LRGTGP---YPPSMDWRKKGNFVTPVKNQGSCGSCWTFSTTGALESAVAIATGKLPFLAE 166
Query: 636 QNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATC 815
Q LVDC+ + + N GC GG ++AF+YI N GI E +YPY+ DG CKY A
Sbjct: 167 QQLVDCA-QNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYRGQDGDCKYQPSKAIAFV 225
Query: 816 SRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXX 995
+ DE A+ EAVA PVS A + + F YR G+Y SC +
Sbjct: 226 KDVANITLNDEEAMVEAVALHNPVSFAFEVT-ADFMMYRKGIYSSTSCHKTPDKVNHAVL 284
Query: 996 XXXXXXK---DYWLVKNSWGLNFGDGGYIRMARNSENPWGNANYPPYP 1130
+ YW+VKNSWG N+G GY + R +N G A +P
Sbjct: 285 AVGYGEEKGIPYWIVKNSWGPNWGMKGYFLIER-GKNMCGLAACASFP 331
>ref|NP_001068884.1| cathepsin F [Bos taurus].
Length = 460
Score = 128 bits (322), Expect = 8e-30
Identities = 87/287 (30%), Positives = 130/287 (45%), Gaps = 4/287 (1%)
Frame = +3
Query: 243 TYGKQYKEKNEEVARRLIWEKNL-KTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLM 419
TY + Y + E R ++ N+ + + L+ + G+ D+T EE ++
Sbjct: 169 TYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARY----GVTKFSDLTEEEFRTIY 224
Query: 420 SCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQV 599
+ RN+ P DWR KG VT VK QG CGSCWAFS G +E Q
Sbjct: 225 LNPLLKDAPGRNMRPAQPVTDVPPPQWDWRNKGAVTNVKDQGMCGSCWAFSVTGNVEGQW 284
Query: 600 KMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGK 779
+K G L+SLS Q L+DC +K C GG + A+ I G+++E Y Y+
Sbjct: 285 FLKRGTLLSLSEQELLDCDK---TDKACLGGLPSNAYSAIRTLGGLETEDDYSYRGRLQT 341
Query: 780 CKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYD--P 953
C + ++ + EL +E L +A GPVS+AI+A FYR G+ + P
Sbjct: 342 CSFSAEKAKVYINDSVELS-KNEQKLAAWLAKNGPVSIAINA--FGMQFYRHGISHPLRP 398
Query: 954 SCTQXXXXXXXXXXXXXXXXK-DYWLVKNSWGLNFGDGGYIRMARNS 1091
C+ +W +KNSWG ++G+ GY + R S
Sbjct: 399 LCSPWLIDHAVLLVGYGNRSAIPFWAIKNSWGTDWGEEGYYYLHRGS 445
>ref|XP_002694471.1| PREDICTED: cathepsin O preproprotein-like [Bos taurus].
Length = 375
Score = 119 bits (299), Expect = 4e-27
Identities = 75/220 (34%), Positives = 114/220 (51%), Gaps = 5/220 (2%)
Frame = +3
Query: 435 PSQWPRNVT--YKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMK 608
PS++PR Y S N LP DWR+K VT+V+ Q +CG CWAFS VGA+E+ +K
Sbjct: 143 PSRFPRFPAEEYTSISNLSLPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIK 202
Query: 609 TGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIID-NNGIDSEASYPYKAVDGKCK 785
L LS Q ++DCS Y N GCNGG A ++ + ++ YP++A +G C+
Sbjct: 203 GQPLEVLSVQQVIDCS---YSNYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGLCR 259
Query: 786 YDSKNRAATCSR-YTELPFA-DEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSC 959
Y S + + + + Y+ F+ E + EA+ GP+ V +DA S+ Y G+
Sbjct: 260 YFSDSHSGSSIKGYSAYDFSGQEDKMAEALLALGPLIVVVDA--MSWQDYLGGIIQHHCS 317
Query: 960 TQXXXXXXXXXXXXXXXXKDYWLVKNSWGLNFGDGGYIRM 1079
+ YW+V+NSWG ++G GY+R+
Sbjct: 318 SGEANHAVLVTGFDKTGSIPYWIVRNSWGTSWGIDGYVRV 357
>ref|XP_874012.3| PREDICTED: cathepsin O [Bos taurus].
Length = 384
Score = 119 bits (299), Expect = 4e-27
Identities = 75/220 (34%), Positives = 114/220 (51%), Gaps = 5/220 (2%)
Frame = +3
Query: 435 PSQWPRNVT--YKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMK 608
PS++PR Y S N LP DWR+K VT+V+ Q +CG CWAFS VGA+E+ +K
Sbjct: 152 PSRFPRFPAEEYTSISNLSLPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIK 211
Query: 609 TGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIID-NNGIDSEASYPYKAVDGKCK 785
L LS Q ++DCS Y N GCNGG A ++ + ++ YP++A +G C+
Sbjct: 212 GQPLEVLSVQQVIDCS---YSNYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGLCR 268
Query: 786 YDSKNRAATCSR-YTELPFA-DEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSC 959
Y S + + + + Y+ F+ E + EA+ GP+ V +DA S+ Y G+
Sbjct: 269 YFSDSHSGSSIKGYSAYDFSGQEDKMAEALLALGPLIVVVDA--MSWQDYLGGIIQHHCS 326
Query: 960 TQXXXXXXXXXXXXXXXXKDYWLVKNSWGLNFGDGGYIRM 1079
+ YW+V+NSWG ++G GY+R+
Sbjct: 327 SGEANHAVLVTGFDKTGSIPYWIVRNSWGTSWGIDGYVRV 366
>ref|NP_001028789.1| dipeptidyl peptidase 1 [Bos taurus].
Length = 463
Score = 105 bits (263), Expect = 6e-23
Identities = 72/245 (29%), Positives = 112/245 (45%), Gaps = 23/245 (9%)
Frame = +3
Query: 429 RVPSQWPRNVTYKSNPN-QKLPDSMDWREK---GCVTEVKYQGSCGSCWAFSAVGALEAQ 596
R+P P +T + LP S DWR VT V+ QGSCGSC++F+++G +EA+
Sbjct: 211 RIPRPKPAPITAEIQKKILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEAR 270
Query: 597 VKMKTGRLVS--LSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAV 770
+++ T + LS Q +V CS +GC GGF + G+ E +PY
Sbjct: 271 IRILTNNTQTPILSPQEVVSCSQYA---QGCEGGFPYLIAGKYAQDFGLVEEDCFPYTGT 327
Query: 771 DGKCKYDSKNRAATCSRYTELPF---------ADEYALKEAVANKGPVSVAIDAKHSSFF 923
D C+ C RY + +E +K + ++GP++VA + + F
Sbjct: 328 DSPCRLKEG-----CFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEV-YDDFL 381
Query: 924 FYRSGVY--------YDPSCTQXXXXXXXXXXXXXXXXKDYWLVKNSWGLNFGDGGYIRM 1079
YR GVY ++P DYW+VKNSWG ++G+ GY R+
Sbjct: 382 HYRKGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRI 441
Query: 1080 ARNSE 1094
R ++
Sbjct: 442 RRGTD 446
>ref|NP_001103540.1| cathepsin W [Bos taurus].
Length = 272
Score = 87.8 bits (216), Expect = 2e-17
Identities = 56/213 (26%), Positives = 96/213 (45%), Gaps = 1/213 (0%)
Frame = +3
Query: 231 LWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVI 410
L++ Y + Y E R I+ +NL E +G + + G+ D+T EE +
Sbjct: 44 LFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQ-EEDLG--TAEFGVTQFSDLTEEEFV 100
Query: 411 SLM-SCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGAL 587
L S V + + P + DWR+ G ++ V+ Q +C CWA +A G +
Sbjct: 101 QLYGSQVAGEALGVSRKVGSEEWGESEPQTCDWRKVGTISPVRDQRNCNCCWAMAAAGNI 160
Query: 588 EAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKA 767
EA +K V +S Q L+DC GC GGF+ +AF +++N+G+ SE YP+
Sbjct: 161 EALWAIKFRHFVEVSVQQLLDCDR---CGNGCRGGFVWDAFLTVLNNSGLASEKDYPFNG 217
Query: 768 VDGKCKYDSKNRAATCSRYTELPFADEYALKEA 866
K +Y ++ + ++ + +A
Sbjct: 218 -------SGKTHRCLAKKYKKVAWIQDFIILQA 243
Database: RefSeq49_BP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 17,681,374
Number of sequences in database: 33,088
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33088
Number of Hits to DB: 43,625,092
Number of extensions: 1102599
Number of successful extensions: 3079
Number of sequences better than 1.0e-05: 15
Number of HSP's gapped: 3042
Number of HSP's successfully gapped: 20
Length of query: 388
Length of database: 17,681,374
Length adjustment: 104
Effective length of query: 284
Effective length of database: 14,240,222
Effective search space: 4044223048
Effective search space used: 4044223048
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 35 (18.1 bits)
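The length figures in the statistics block above are consistent with one another. As a minimal sketch, and assuming (this is an interpretation, not something the report states) that the protein query length is the nucleotide length divided by three and rounded down, and that the length adjustment is subtracted once from the query and once per database sequence, the reported effective search space can be reproduced like this:

```python
# Figures copied from the report above (RefSeq49_BP.fasta search).
query_nt = 1166            # "(1166 letters)"
db_letters = 17_681_374    # "Length of database"
db_seqs = 33_088           # "Number of Sequences"
length_adjustment = 104    # "Length adjustment"

# Assumption: BLASTX reports the translated query length as floor(nt / 3).
query_aa = query_nt // 3

# Assumption: the adjustment is removed once from the query
# and once from every database sequence.
eff_query = query_aa - length_adjustment
eff_db = db_letters - db_seqs * length_adjustment

search_space = eff_query * eff_db

print("Length of query:", query_aa)
print("Effective length of query:", eff_query)
print("Effective length of database:", f"{eff_db:,}")
print("Effective search space:", search_space)
```

Under these assumptions the four printed values agree with the "Length of query", "Effective length of query", "Effective length of database" and "Effective search space" lines of the report.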
Animal-Genome cDNA 20110601C-012224
Search to RefSeqCP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-012224
(1166 letters)
Database: RefSeq49_CP.fasta
33,336 sequences; 18,874,504 total letters
Searching..................................................done
Sequences producing significant alignments:    Score (bits)    E Value
Alignment gi|NP_001002938.1| cathepsin S precursor [Canis lupus familiari... 561 e-160
Alignment gi|NP_001029168.1| cathepsin K precursor [Canis lupus familiari... 343 2e-94
Alignment gi|NP_001003115.1| cathepsin L1 precursor [Canis lupus familiar... 274 1e-73
Alignment gi|XP_541257.2| PREDICTED: similar to Cathepsin L precursor (Ma... 271 7e-73
Alignment gi|XP_855060.1| PREDICTED: similar to Cathepsin L2 precursor (C... 174 1e-43
Alignment gi|XP_536212.2| PREDICTED: similar to Cathepsin H precursor [Ca... 173 2e-43
Alignment gi|XP_533219.2| PREDICTED: similar to Cathepsin F precursor (CA... 134 2e-31
Alignment gi|XP_539782.2| PREDICTED: similar to Cathepsin O precursor [Ca... 114 2e-25
Alignment gi|XP_540846.2| PREDICTED: similar to cathepsin W preproprotein... 111 2e-24
Alignment gi|NP_001182763.1| dipeptidyl peptidase 1 [Canis lupus familiar... 107 2e-23
>ref|NP_001002938.1| cathepsin S precursor [Canis lupus familiaris].
Length = 331
Score = 561 bits (1445), Expect = e-160
Identities = 268/331 (80%), Positives = 288/331 (87%)
Frame = +3
Query: 144 MKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVM 323
MK LV +L LCS A+AQ+H+DPTLD HW+LWKKTY KQYKE+NEEVARRLIWEKNLK VM
Sbjct: 1 MKWLVGLLPLCSYAVAQVHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVM 60
Query: 324 LHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMD 503
LHNLEHSMGMHSYDLGMNHLGDMT EEVISLM +RVPSQW RNVTY+SN NQKLPDS+D
Sbjct: 61 LHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVTYRSNSNQKLPDSVD 120
Query: 504 WREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGC 683
WREKGCVTEVKYQGSCG+CWAFSAVGALEAQ+K+KTG+LVSLSAQNLVDCSTEKY NKGC
Sbjct: 121 WREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGC 180
Query: 684 NGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKE 863
NGGFMT AFQYIIDNNGIDSEASYPYKA++GKC+YDSK RAATCS+YTELPF E ALKE
Sbjct: 181 NGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKRAATCSKYTELPFGSEDALKE 240
Query: 864 AVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKDYWLVKNSW 1043
AVANKGPVSVAIDA H SFF YRSGVYY+PSCTQ KDYWLVKNSW
Sbjct: 241 AVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSW 300
Query: 1044 GLNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
GLNFGD GYIRMARNS N G A+YP YP+I
Sbjct: 301 GLNFGDQGYIRMARNSGNHCGIASYPSYPEI 331
>ref|NP_001029168.1| cathepsin K precursor [Canis lupus familiaris].
Length = 330
Score = 343 bits (880), Expect = 2e-94
Identities = 173/328 (52%), Positives = 218/328 (66%), Gaps = 3/328 (0%)
Frame = +3
Query: 162 VLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEH 341
VLLL A L+ + LD WDLWKKTY KQY K +E++RRLIWEKNLK + +HNLE
Sbjct: 6 VLLLLPMASFALYPEEILDTQWDLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEA 65
Query: 342 SMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPR--NVTYKSNPNQKLPDSMDWREK 515
S+G+H+Y+L MNHLGDMTSEEV+ M+ ++VP R + Y + + PDS+D+R+K
Sbjct: 66 SLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWESRAPDSVDYRKK 125
Query: 516 GCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGF 695
G VT VK QG CGSCWAFS+VGALE Q+K KTG+L++LS QNLVDC +E N GC GG+
Sbjct: 126 GYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE---NDGCGGGY 182
Query: 696 MTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVAN 875
MT AFQY+ N GIDSE +YPY D C Y+ +AA C Y E+P +E ALK AVA
Sbjct: 183 MTNAFQYVQKNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVAR 242
Query: 876 KGPVSVAIDAKHSSFFFYRSGVYYDPSC-TQXXXXXXXXXXXXXXXXKDYWLVKNSWGLN 1052
GP+SVAIDA +SF FY GVYYD +C + +W++KNSWG N
Sbjct: 243 VGPISVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGEN 302
Query: 1053 FGDGGYIRMARNSENPWGNANYPPYPKI 1136
+G+ GYI MARN N G AN +PK+
Sbjct: 303 WGNKGYILMARNKNNACGIANLASFPKM 330
>ref|NP_001003115.1| cathepsin L1 precursor [Canis lupus familiaris].
Length = 333
Score = 274 bits (701), Expect = 1e-73
Identities = 146/316 (46%), Positives = 193/316 (61%), Gaps = 5/316 (1%)
Frame = +3
Query: 204 DPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHL 383
D +L+ W WK T+ + Y NEE RR +WEKN+K + LHN E+S G H + + MN
Sbjct: 22 DQSLNAQWYQWKATHRRLYG-MNEEGWRRAVWEKNMKMIELHNREYSQGKHGFTMAMNAF 80
Query: 384 GDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCW 563
GDMT+EE +M+ + + + ++ ++P S+DWREKG VT VK QG CGSCW
Sbjct: 81 GDMTNEEFRQVMNGFQ-NQKHKKGKMFQEPLFAEIPKSVDWREKGYVTPVKNQGQCGSCW 139
Query: 564 AFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDS 743
AFSA GALE Q+ KTG+LVSLS QNLVDCS + N+GCNGG M AF+Y+ DN G+DS
Sbjct: 140 AFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQ-GNEGCNGGLMDNAFRYVKDNGGLDS 198
Query: 744 EASYPYKAVDGK-CKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSF 920
E SYPY D + C Y + AA + + +LP E AL +AVA GP+SVAIDA H SF
Sbjct: 199 EESYPYLGRDTETCNYKPECSAANDTGFVDLP-QREKALMKAVATLGPISVAIDAGHQSF 257
Query: 921 FFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKD----YWLVKNSWGLNFGDGGYIRMARN 1088
FY+SG+Y+DP C+ D +W+VKNSWG +G GY++MA++
Sbjct: 258 QFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGPEWGWNGYVKMAKD 317
Query: 1089 SENPWGNANYPPYPKI 1136
N G A YP +
Sbjct: 318 QNNHCGIATAASYPTV 333
>ref|XP_541257.2| PREDICTED: similar to Cathepsin L precursor (Major excreted protein)
(MEP) [Canis familiaris].
Length = 333
Score = 271 bits (694), Expect = 7e-73
Identities = 142/323 (43%), Positives = 193/323 (59%), Gaps = 5/323 (1%)
Frame = +3
Query: 183 AMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSY 362
A A +D +LD HW WK+ +GK Y +K+EE RR +WE+N++ + HN E+S G HS+
Sbjct: 15 ASAAPQQDHSLDAHWSQWKEAHGKLY-DKDEEGWRRTVWERNMEMIEQHNQEYSQGEHSF 73
Query: 363 DLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQ 542
L MN GDMT+EE +++ ++ + + + ++P S+DWRE+G VT VK Q
Sbjct: 74 TLAMNAFGDMTNEEFKQVLNDFKIQKH-KKGKVFPAPLFAEVPSSVDWREQGYVTPVKDQ 132
Query: 543 GSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYII 722
G C CWAFSA GALE Q+ KTG+LVSLS QNLVDCS + N+GCNGG M AFQY+
Sbjct: 133 GQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWSQ-GNRGCNGGLMEYAFQYVK 191
Query: 723 DNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAID 902
DN G+DSE SYPY A + CKY + AA + + + +E L VA GPVS A+D
Sbjct: 192 DNGGLDSEESYPYLARNEPCKYRPEKSAANVTAFWPI-LNEEDGLMTTVATVGPVSAAVD 250
Query: 903 AKHSSFFFYRSGVYYDPSCT-----QXXXXXXXXXXXXXXXXKDYWLVKNSWGLNFGDGG 1067
+ SF FY+ G+YYDP C+ K YW+VKNSWG N+G G
Sbjct: 251 SSPQSFQFYKKGIYYDPKCSNKLLNHGVLVVGYGFEGAESDNKKYWIVKNSWGTNWGMQG 310
Query: 1068 YIRMARNSENPWGNANYPPYPKI 1136
Y+ +A++ +N G A YP +
Sbjct: 311 YMLLAKDRDNHCGIATRASYPVV 333
>ref|XP_855060.1| PREDICTED: similar to Cathepsin L2 precursor (Cathepsin V) (Cathepsin
U) [Canis familiaris].
Length = 289
Score = 174 bits (441), Expect = 1e-43
Identities = 111/314 (35%), Positives = 151/314 (48%), Gaps = 3/314 (0%)
Frame = +3
Query: 204 DPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHL 383
D +L+ W WK + + NEE RR +WEKN+K + LHN E+S G H + + MN
Sbjct: 19 DQSLNAQWYQWKAMH--RLYAMNEEGWRRAVWEKNMKMIELHNREYSQGKHGFTMAMNAF 76
Query: 384 GDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCW 563
G MT+EE +M+ + + + ++ ++P S+DWREKG VT VK QG CGSCW
Sbjct: 77 GYMTNEEFRQVMNGFQ-NQKHKKGKVFQEPLFAEIPKSVDWREKGYVTPVKNQGHCGSCW 135
Query: 564 AFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDS 743
AFSA G L+ Q+ KTG+L NLVDC + NKGCNGG M AFQY+ DN G+DS
Sbjct: 136 AFSATGDLKGQMFQKTGKL------NLVDC-YQAQGNKGCNGGLMDNAFQYVKDNGGLDS 188
Query: 744 EASYPYKAVDGKCKYDSK---NRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHS 914
E Y Y D Y+ K + A T + +T L + G V + S
Sbjct: 189 EECYLYLGRDTD-TYNYKPECSAAMTLASWTSLNGRRLQQQRSRSWCYGVVGYGFEGTDS 247
Query: 915 SFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKDYWLVKNSWGLNFGDGGYIRMARNSE 1094
+ W+VKNSWG +G GY++ A+
Sbjct: 248 N--------------------------------NKLWIVKNSWGTEWGWNGYVKTAKGQN 275
Query: 1095 NPWGNANYPPYPKI 1136
N G A YP +
Sbjct: 276 NHCGIATAASYPTV 289
>ref|XP_536212.2| PREDICTED: similar to Cathepsin H precursor [Canis familiaris].
Length = 304
Score = 173 bits (439), Expect = 2e-43
Identities = 115/300 (38%), Positives = 152/300 (50%), Gaps = 8/300 (2%)
Frame = +3
Query: 255 QYKEKNEEVARRL-IWEKNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVIS--LMSC 425
Q K +EE +RL + N + + HN G H++ +G+N DM E+ L S
Sbjct: 11 QKKYSSEEYLQRLQTFVGNWRKINAHNA----GNHTFKMGLNQFSDMNFAEIKHKYLWSE 66
Query: 426 VRVPSQWPRNVTYKSNPNQKLPDSMDWREKG-CVTEVKYQGSCGSCWAFSAVGALEAQVK 602
+ S N + P P +DWR+KG V+ VK QGSCGSCW FS GALE+ +
Sbjct: 67 PQNCSATKGNYLRGTGP---YPPFVDWRKKGKFVSPVKNQGSCGSCWTFSTTGALESAIA 123
Query: 603 MKTGRLVSLSAQNLVDCSTEKYRNKGCNG-GFMTEAFQYIIDNNGIDSEASYPYKAVDGK 779
+K+G+L+SL+ Q LVDC+ + + N GC G G +AF+YI N GI E SYPYK DG
Sbjct: 124 IKSGKLLSLAEQQLVDCA-QNFNNHGCQGYGAPLQAFEYIRYNKGIMGEDSYPYKGQDGD 182
Query: 780 CKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSC 959
CKY A + DE A+ EAVA PVS A + S F YR G+Y SC
Sbjct: 183 CKYQPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVT-SDFMMYRKGIYSSTSC 241
Query: 960 TQXXXXXXXXXXXXXXXXKD---YWLVKNSWGLNFGDGGYIRMARNSENPWGNANYPPYP 1130
+ ++ YW+VKNSWG +G GY M R +N G A YP
Sbjct: 242 HKTPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLMER-GKNMCGLAACASYP 300
>ref|XP_533219.2| PREDICTED: similar to Cathepsin F precursor (CATSF) [Canis
familiaris].
Length = 442
Score = 134 bits (336), Expect = 2e-31
Identities = 90/288 (31%), Positives = 139/288 (48%), Gaps = 5/288 (1%)
Frame = +3
Query: 243 TYGKQYKEKNEEVARRLIWEKNL-KTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISL- 416
TY + Y+ K E R ++ N+ + + L+ + G+ D+T EE ++
Sbjct: 150 TYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQY----GITKFSDLTEEEFRTIY 205
Query: 417 MSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQ 596
++ + ++ + KS + P DWR KG VT+VK QG CGSCWAFS G +E Q
Sbjct: 206 LNPLLRENRGKKMRLAKSISDHAPPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQ 265
Query: 597 VKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDG 776
+K G L+SLS Q L+DC +K C GG + A+ I+ G+++E Y Y+
Sbjct: 266 WFLKEGTLLSLSEQELLDCDKV---DKACLGGLPSNAYSAIMTLGGLETEDDYSYQGHLQ 322
Query: 777 KCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYD-- 950
C + +K + EL +E L +A KGP+SVAI+A FYR G+ +
Sbjct: 323 ACSFSAKKARVYINDSMELS-QNEQKLAAWLAKKGPISVAINA--FGMQFYRHGISHPLR 379
Query: 951 PSCTQXXXXXXXXXXXXXXXXK-DYWLVKNSWGLNFGDGGYIRMARNS 1091
P C+ +W +KNSWG ++G+ GY + R S
Sbjct: 380 PLCSPWLIDHAVLLVGYGNRSGIPFWAIKNSWGTDWGEEGYYYLHRGS 427
>ref|XP_539782.2| PREDICTED: similar to Cathepsin O precursor [Canis familiaris].
Length = 518
Score = 114 bits (285), Expect = 2e-25
Identities = 79/247 (31%), Positives = 120/247 (48%), Gaps = 7/247 (2%)
Frame = +3
Query: 369 GMNHLGDMTSEE--VISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQ 542
G+N ++ EE I L S ++P V S N LP DWR+K VT+V+ Q
Sbjct: 263 GINQFSYLSPEEFKAIYLRSKPSRSPRYPAEVR-TSIRNVSLPLRFDWRDKRVVTQVRNQ 321
Query: 543 GSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYII 722
+CG CWAFS VGA+E+ +K L +S Q ++DCS Y N GC+GG A ++
Sbjct: 322 QTCGGCWAFSVVGAVESAYAIKGKPLADISVQQVIDCS---YNNYGCSGGSTLNALNWLN 378
Query: 723 DNN-GIDSEASYPYKAVDGKCKYDSKNRAATCSR-YTELPFAD-EYALKEAVANKGPVSV 893
+ ++ YP+KA +G C Y S + + R Y+ F+D E + + + GP+ V
Sbjct: 379 KTQVKLVRDSEYPFKAQNGLCHYFSDSYSGFSIRGYSAYDFSDQEDEMAKVLLTFGPLVV 438
Query: 894 AIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKDYWLVKNSWGLNFGDGGY- 1070
+DA S+ Y G+ + YW+V+NSWG ++G GY
Sbjct: 439 VVDA--VSWQDYLGGIIQHHCSSGEANHAVLITGFDKIGSTPYWIVRNSWGSSWGVDGYA 496
Query: 1071 -IRMARN 1088
++M N
Sbjct: 497 HVKMGGN 503
>ref|XP_540846.2| PREDICTED: similar to cathepsin W preproprotein [Canis familiaris].
Length = 374
Score = 111 bits (277), Expect = 2e-24
Identities = 90/328 (27%), Positives = 137/328 (41%), Gaps = 31/328 (9%)
Frame = +3
Query: 231 LWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHLG-----DMT 395
L++ Y + Y EE ARRL + HNL + + DLG G D+T
Sbjct: 44 LFQIQYNRSYSNP-EEYARRL-------DIFAHNLAQAQQLEDEDLGTAEFGVTPFSDLT 95
Query: 396 SEEVISLMSCVRVPSQWPR--NVTYKSNPNQKLPDSMDWRE-KGCVTEVKYQGSCGSCWA 566
EE R+ + P + +P + DWR+ G ++ +K QG+C CWA
Sbjct: 96 EEEFGQFYGHQRMAGEAPSVGRKVESEEWGEPVPPTCDWRKLPGIISPIKQQGNCRCCWA 155
Query: 567 FSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSE 746
+A G +EA ++ + V +S Q L+DC GC GGF +AF +++N+G+ S
Sbjct: 156 MAAAGNIEALWGIRYHQPVEVSVQELLDCGR---CGDGCKGGFTWDAFITVLNNSGLASA 212
Query: 747 ASYPY--KAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSF 920
YP+ +C + A + L +E A+ +A KGP++V I+ K
Sbjct: 213 KDYPFLGNTKPHRCLAKKYKKVAWIQDFIMLQ-GNEQAIAWYLATKGPITVTINMK--LL 269
Query: 921 FFYRSGVYY------DPSCTQXXXXXXXXXXXXXXXXKD---------------YWLVKN 1037
Y+ GV DP K YW++KN
Sbjct: 270 QHYQKGVIQATHTTCDPQRVDHSVLLVGFGKSKSVAGKQAEGGSSRPRPHHPIPYWILKN 329
Query: 1038 SWGLNFGDGGYIRMARNSENPWGNANYP 1121
SWG +G+ GY R+ R + N G YP
Sbjct: 330 SWGAEWGEEGYFRLHRGN-NTCGITKYP 356
>ref|NP_001182763.1| dipeptidyl peptidase 1 [Canis lupus familiaris].
Length = 459
Score = 107 bits (267), Expect = 2e-23
Identities = 73/252 (28%), Positives = 118/252 (46%), Gaps = 17/252 (6%)
Frame = +3
Query: 390 MTSEEVISLMSCVRVPSQWPRNVTYKSNPN-QKLPDSMDWRE---KGCVTEVKYQGSCGS 557
+T ++++ + ++P P +T + + +LP S DWR V+ V+ Q SCGS
Sbjct: 195 LTLRDMMTRVGGRKIPRPKPTPLTAEIHEEISRLPTSWDWRNVRGTNFVSPVRNQASCGS 254
Query: 558 CWAFSAVGALEAQVKMKTGRLVS--LSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNN 731
C+AF++ LEA++++ T + LS Q +V CS +GC GGF +
Sbjct: 255 CYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQYA---QGCEGGFPYLIAGKYAQDF 311
Query: 732 GIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPF---ADEYALKEAVANKGPVSVAID 902
G+ EA +PY D CK + R + Y F +E +K + GP++VA +
Sbjct: 312 GLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFE 371
Query: 903 AKHSSFFFYRSGVYY--------DPSCTQXXXXXXXXXXXXXXXXKDYWLVKNSWGLNFG 1058
+ FF Y+ G+YY +P DYW+VKNSWG +G
Sbjct: 372 V-YDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWG 430
Query: 1059 DGGYIRMARNSE 1094
+ GY R+ R ++
Sbjct: 431 EDGYFRIRRGTD 442
Database: RefSeq49_CP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 18,874,504
Number of sequences in database: 33,336
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33336
Number of Hits to DB: 45,457,616
Number of extensions: 1126924
Number of successful extensions: 3536
Number of sequences better than 1.0e-05: 14
Number of HSP's gapped: 3498
Number of HSP's successfully gapped: 14
Length of query: 388
Length of database: 18,874,504
Length adjustment: 105
Effective length of query: 283
Effective length of database: 15,374,224
Effective search space: 4350905392
Effective search space used: 4350905392
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 35 (18.1 bits)
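The bit scores and E-values listed for each hit can be related to the raw scores (the numbers in parentheses) through the gapped Lambda and K printed above. The sketch below uses the standard Karlin-Altschul relations, bits = (Lambda * S - ln K) / ln 2 and E = search_space * 2^(-bits). Because the report prints Lambda and K rounded to three significant figures, the results are approximate; the function names are illustrative only.

```python
import math

# Gapped statistical parameters as printed in the report (rounded values).
LAMBDA = 0.267
K = 0.0410
# "Effective search space" for the RefSeq49_CP.fasta search.
SEARCH_SPACE = 4_350_905_392


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to a bit score."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)


def log10_evalue(raw_score: int) -> float:
    """Return log10 of the expected number of chance hits.

    Working in log space avoids floating-point underflow for very
    small E-values.
    """
    return math.log10(SEARCH_SPACE) - bit_score(raw_score) * math.log10(2)


# Raw scores taken from the cathepsin S and cathepsin K hits above.
for name, raw in [("cathepsin S", 1445), ("cathepsin K", 880)]:
    print(f"{name}: about {bit_score(raw):.1f} bits, "
          f"log10(E) about {log10_evalue(raw):.1f}")
```

The values obtained this way land within about one bit of the reported 561 and 343 bits, and within roughly one order of magnitude of the reported E-values; the residual difference is expected from the rounding of Lambda and K.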
Search to RefSeqSP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-012224
(1166 letters)
Database: RefSeq49_SP.fasta
24,897 sequences; 11,343,932 total letters
Searching..................................................done
Sequences producing significant alignments:    Score (bits)    E Value
Alignment gi|XP_001929706.1| PREDICTED: cathepsin S [Sus scrofa]. 648 0.0
Alignment gi|XP_003355243.1| PREDICTED: cathepsin S-like [Sus scrofa]. 648 0.0
Alignment gi|NP_999467.1| cathepsin K precursor [Sus scrofa]. 342 3e-94
Alignment gi|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]. 281 5e-76
Alignment gi|NP_999057.1| cathepsin L1 precursor [Sus scrofa]. 281 6e-76
Alignment gi|NP_999094.1| pro-cathepsin H precursor [Sus scrofa]. 178 4e-45
Alignment gi|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa]. 135 4e-32
Alignment gi|XP_003122571.1| PREDICTED: cathepsin W-like [Sus scrofa]. 120 1e-27
Alignment gi|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus ... 103 2e-22
Alignment gi|XP_003129045.2| PREDICTED: tryptophan 2,3-dioxygenase [Sus s... 90 3e-18
>ref|XP_001929706.1| PREDICTED: cathepsin S [Sus scrofa].
Length = 331
Score = 648 bits (1672), Expect = 0.0
Identities = 310/331 (93%), Positives = 311/331 (93%)
Frame = +3
Query: 144 MKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVM 323
MKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVM
Sbjct: 1 MKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVM 60
Query: 324 LHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMD 503
LHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMD
Sbjct: 61 LHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMD 120
Query: 504 WREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGC 683
WREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGC
Sbjct: 121 WREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGC 180
Query: 684 NGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKE 863
NGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKE
Sbjct: 181 NGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKE 240
Query: 864 AVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKDYWLVKNSW 1043
AVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQ KDYWLVKNSW
Sbjct: 241 AVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSW 300
Query: 1044 GLNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
GLNFGDGGYIRMARNSEN G ANYP YP+I
Sbjct: 301 GLNFGDGGYIRMARNSENHCGIANYPSYPEI 331
>ref|XP_003355243.1| PREDICTED: cathepsin S-like [Sus scrofa].
Length = 331
Score = 648 bits (1672), Expect = 0.0
Identities = 310/331 (93%), Positives = 311/331 (93%)
Frame = +3
Query: 144 MKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVM 323
MKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVM
Sbjct: 1 MKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVM 60
Query: 324 LHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMD 503
LHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMD
Sbjct: 61 LHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMD 120
Query: 504 WREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGC 683
WREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGC
Sbjct: 121 WREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGC 180
Query: 684 NGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKE 863
NGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKE
Sbjct: 181 NGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKE 240
Query: 864 AVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKDYWLVKNSW 1043
AVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQ KDYWLVKNSW
Sbjct: 241 AVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSW 300
Query: 1044 GLNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
GLNFGDGGYIRMARNSEN G ANYP YP+I
Sbjct: 301 GLNFGDGGYIRMARNSENHCGIANYPSYPEI 331
>ref|NP_999467.1| cathepsin K precursor [Sus scrofa].
Length = 330
Score = 342 bits (876), Expect = 3e-94
Identities = 172/328 (52%), Positives = 219/328 (66%), Gaps = 3/328 (0%)
Frame = +3
Query: 162 VLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEH 341
V+LL + L+ + LD W+LWKKTY KQY K +E++RRLIWEKNLK + +HNLE
Sbjct: 6 VVLLLPVMSSALYPEEILDTQWELWKKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEA 65
Query: 342 SMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPR--NVTYKSNPNQKLPDSMDWREK 515
S+G+H+Y+L MNHLGDMTSEEV+ M+ ++VP R + Y + + PDS+D+R+K
Sbjct: 66 SLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWEGRTPDSIDYRKK 125
Query: 516 GCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGF 695
G VT VK QG CGSCWAFS+VGALE Q+K KTG+L++LS QNLVDC +E N GC GG+
Sbjct: 126 GYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE---NDGCGGGY 182
Query: 696 MTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVAN 875
MT AFQY+ N GIDSE +YPY D C Y+ +AA C Y E+P +E ALK AVA
Sbjct: 183 MTNAFQYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVAR 242
Query: 876 KGPVSVAIDAKHSSFFFYRSGVYYDPSC-TQXXXXXXXXXXXXXXXXKDYWLVKNSWGLN 1052
GPVSVAIDA +SF FY GVYYD +C + K +W++KNSWG N
Sbjct: 243 VGPVSVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGEN 302
Query: 1053 FGDGGYIRMARNSENPWGNANYPPYPKI 1136
+G+ GYI MARN N G AN +PK+
Sbjct: 303 WGNKGYILMARNKNNACGIANLASFPKM 330
>ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa].
Length = 332
Score = 281 bits (719), Expect = 5e-76
Identities = 151/322 (46%), Positives = 188/322 (58%), Gaps = 4/322 (1%)
Frame = +3
Query: 183 AMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSY 362
A A D +LD W WK T+ K Y NEE RR IWEKN+K + HN EH G HS+
Sbjct: 15 ASAAPRHDHSLDADWYKWKATHRKLYG-LNEEGRRRAIWEKNMKMIERHNWEHRQGKHSF 73
Query: 363 DLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQ 542
+ MN GDMT+EE M+ + + + + + P S+DWREKG VT VK Q
Sbjct: 74 TMAMNAFGDMTNEEFRKTMNGFQ-NQKHKKGKVFLDAGSALTPHSVDWREKGYVTAVKNQ 132
Query: 543 GSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYII 722
G CGSCWAFSA GALE Q+ KT +L+SLS QNLVDCS + N+GCNGG M AFQYI
Sbjct: 133 GHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPE-GNEGCNGGLMDNAFQYIK 191
Query: 723 DNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAID 902
DN G+DSE SYPY DG CKY ++ AA + Y ++P E AL +AVA GP+SV ID
Sbjct: 192 DNGGLDSEESYPYFGKDGSCKYKPQSSAANDTGYVDIP-KQEKALMKAVATVGPISVGID 250
Query: 903 AKHSSFFFYRSGVYYDPSCT----QXXXXXXXXXXXXXXXXKDYWLVKNSWGLNFGDGGY 1070
A H SF FY +G+Y++P C+ YWLVKNSWG +G GY
Sbjct: 251 ASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKYWLVKNSWGNTWGMDGY 310
Query: 1071 IRMARNSENPWGNANYPPYPKI 1136
I+M ++ N G A YP +
Sbjct: 311 IKMTKDQNNHCGIATMASYPVV 332
>ref|NP_999057.1| cathepsin L1 precursor [Sus scrofa].
Length = 334
Score = 281 bits (718), Expect = 6e-76
Identities = 155/338 (45%), Positives = 202/338 (59%), Gaps = 7/338 (2%)
Frame = +3
Query: 144 MKCLVWVLLLCSS-AMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTV 320
MK +++ LC A A D LD W WK T+G+ Y NEE RR +WEKN+K +
Sbjct: 1 MKPSLFLTALCLGIASAAPKLDQNLDADWYKWKATHGRLYG-MNEEGWRRAVWEKNMKMI 59
Query: 321 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSM 500
LHN E+S G H + + MN GDMT+EE +M+ + V ++S + +P S+
Sbjct: 60 ELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKVFHESLVLE-VPKSV 118
Query: 501 DWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKG 680
DWREKG VT VK QG CGSCWAFSA GALE Q+ KTG+LVSLS QNLVDCS + N+G
Sbjct: 119 DWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ-GNQG 177
Query: 681 CNGGFMTEAFQYIIDNNGIDSEASYPYKAVD-GKCKYDSKNRAATCSRYTELPFADEYAL 857
CNGG M AFQY+ DN G+D+E SYPY + C Y + AA + + ++P E AL
Sbjct: 178 CNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDIP-QREKAL 236
Query: 858 KEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKD-----Y 1022
+AVA GP+SVAIDA HSSF FY+SG+YYDP C+ D +
Sbjct: 237 MKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKF 296
Query: 1023 WLVKNSWGLNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
W+VKNSWG +G GY++MA++ N G + YP +
Sbjct: 297 WIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334
>ref|NP_999094.1| pro-cathepsin H precursor [Sus scrofa].
Length = 335
Score = 178 bits (452), Expect = 4e-45
Identities = 115/310 (37%), Positives = 158/310 (50%), Gaps = 7/310 (2%)
Frame = +3
Query: 222 HWDLWKKTYGKQYKEKNEEVARRL-IWEKNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTS 398
H+ W + K+Y EE RL ++ N + + HN G H++ LG+N DM+
Sbjct: 34 HFKSWMVQHQKKYSL--EEYHHRLQVFVSNWRKINAHNA----GNHTFKLGLNQFSDMSF 87
Query: 399 EEVIS--LMSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKG-CVTEVKYQGSCGSCWAF 569
+E+ L S + S N + P P SMDWR+KG V+ VK QGSCGSCW F
Sbjct: 88 DEIRHKYLWSEPQNCSATKGNYLRGTGP---YPPSMDWRKKGNFVSPVKNQGSCGSCWTF 144
Query: 570 SAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEA 749
S GALE+ V + TG+++SL+ Q LVDC+ + + N GC GG ++AF+YI N GI E
Sbjct: 145 STTGALESAVAIATGKMLSLAEQQLVDCA-QNFNNHGCQGGLPSQAFEYIRYNKGIMGED 203
Query: 750 SYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFY 929
+YPYK D CK+ A + DE A+ EAVA PVS A + + F Y
Sbjct: 204 TYPYKGQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVT-NDFLMY 262
Query: 930 RSGVYYDPSCTQXXXXXXXXXXXXXXXXKD---YWLVKNSWGLNFGDGGYIRMARNSENP 1100
R G+Y SC + ++ YW+VKNSWG +G GY + R +N
Sbjct: 263 RKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIER-GKNM 321
Query: 1101 WGNANYPPYP 1130
G A YP
Sbjct: 322 CGLAACASYP 331
>ref|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa].
Length = 490
Score = 135 bits (340), Expect = 4e-32
Identities = 89/286 (31%), Positives = 138/286 (48%), Gaps = 3/286 (1%)
Frame = +3
Query: 243 TYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMS 422
TY + Y K E R ++ N+ V ++ ++ + G+ D+T EE ++
Sbjct: 199 TYNRTYDTKEEARWRMSVFANNM--VRAQKIQ-ALDTGTARYGVTKFSDLTEEEFRTIYL 255
Query: 423 CVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVK 602
+ + R + + + P DWR+KG VT+VK QG CGSCWAFS G +E Q
Sbjct: 256 NPLLQEEPGRKMRLAKSVSSLPPPEWDWRKKGAVTKVKDQGMCGSCWAFSVTGNVEGQWF 315
Query: 603 MKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKC 782
+K G L+SLS Q L+DC +KGC GG + A+ I G+++E Y Y+ C
Sbjct: 316 LKQGTLLSLSEQELLDCDKV---DKGCMGGLPSNAYSAIKTLGGLETEEDYSYRGHLQTC 372
Query: 783 KYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYD--PS 956
++++ + EL +E L +A KGP+SVAI+A FYR G+ + P
Sbjct: 373 SFNAEKAKVYINDSVELS-QNEQKLAAWLAEKGPISVAINA--FGMQFYRHGISHPLRPL 429
Query: 957 CTQ-XXXXXXXXXXXXXXXXKDYWLVKNSWGLNFGDGGYIRMARNS 1091
C+ +W +KNSWG ++G+ GY + R S
Sbjct: 430 CSPWLIDHAVLLVGYGNRSATPFWAIKNSWGTDWGEEGYYYLYRGS 475
>ref|XP_003122571.1| PREDICTED: cathepsin W-like [Sus scrofa].
Length = 367
Score = 120 bits (301), Expect = 1e-27
Identities = 92/318 (28%), Positives = 144/318 (45%), Gaps = 21/318 (6%)
Frame = +3
Query: 231 LWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVI 410
L++ Y + Y E R I+ +NL E +G + + G+ D+T EE
Sbjct: 44 LFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQ-EEDLG--TAEFGVTPFSDLTEEEFG 100
Query: 411 SL----MSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREK-GCVTEVKYQGSCGSCWAFSA 575
L + PS + + +S + +P S DWR+K G ++ +K+Q C CWA +A
Sbjct: 101 QLHGHHWGAGKAPSMGIKVGSEESG--ETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAA 158
Query: 576 VGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASY 755
V +EAQ +K + V LS Q ++DC GCNGGF+ +AF +++ +G+ SE Y
Sbjct: 159 VDNVEAQWAIKYHQAVQLSVQQVLDCDR---CGNGCNGGFVWDAFLTVLNTSGLASEQDY 215
Query: 756 PYKAV--DGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFY 929
PYK +C + A + L F E ++ +A +GP++V I+A Y
Sbjct: 216 PYKGTVKTHRCLAKQHRKVAWIQDFLMLQFC-EQSIARYLATEGPITVTINA--GLLQQY 272
Query: 930 RSGVY------YDPSCTQXXXXXXXXXXXXXXXXK--------DYWLVKNSWGLNFGDGG 1067
+ GV DP + YW++KNSWG ++G+ G
Sbjct: 273 KRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGPDWGEEG 332
Query: 1068 YIRMARNSENPWGNANYP 1121
Y R+ R S N G YP
Sbjct: 333 YFRLHRGS-NTCGITKYP 349
>ref|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus scrofa].
Length = 463
Score = 103 bits (256), Expect = 2e-22
Identities = 71/245 (28%), Positives = 111/245 (45%), Gaps = 23/245 (9%)
Frame = +3
Query: 429 RVPSQWPRNVTYK-SNPNQKLPDSMDWRE---KGCVTEVKYQGSCGSCWAFSAVGALEAQ 596
R+P P +T + + LP S DWR VT V+ Q SCGSC++F+++G +EA+
Sbjct: 211 RLPRPKPAPITAEIQEKSLHLPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEAR 270
Query: 597 VKMKTGRLVS--LSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAV 770
+++ T + LS Q +V CS +GC GGF + G+ EA +PY
Sbjct: 271 IRILTNNTQTPILSPQEVVSCSQYA---QGCAGGFPYLIAGKYAQDFGLVEEACFPYTGT 327
Query: 771 DGKCKYDSKNRAATCSRYTELPF---------ADEYALKEAVANKGPVSVAIDAKHSSFF 923
D C C RY + +E +K + + GP++VA + + F
Sbjct: 328 DSPCTVKEG-----CFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEV-YDDFL 381
Query: 924 FYRSGVY--------YDPSCTQXXXXXXXXXXXXXXXXKDYWLVKNSWGLNFGDGGYIRM 1079
YR G+Y ++P DYW+VKNSWG ++G+ GY R+
Sbjct: 382 HYRKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGYFRI 441
Query: 1080 ARNSE 1094
R ++
Sbjct: 442 RRGTD 446
>ref|XP_003129045.2| PREDICTED: tryptophan 2,3-dioxygenase [Sus scrofa].
Length = 411
Score = 89.7 bits (221), Expect = 3e-18
Identities = 70/242 (28%), Positives = 109/242 (45%), Gaps = 12/242 (4%)
Frame = +3
Query: 399 EEVISLMSCVRVP-SQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGS------CGS 557
E I ++ +RVP ++ T+K N+ L S R + E + + CG
Sbjct: 162 ENKIGVLQSLRVPYNRRHYRDTFKGKDNELLLKSEQERTLLQLVEAWLERTPGLEPHCGG 221
Query: 558 CWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNN-G 734
CWAFS V A+E+ +K L LS Q ++DCS Y N GCNGG A ++
Sbjct: 222 CWAFSVVSAVESAYAIKGQPLEVLSVQQVIDCS---YNNYGCNGGSTLNALYWLNKTQVK 278
Query: 735 IDSEASYPYKAVDGKCKYDS-KNRAATCSRYTELPFA-DEYALKEAVANKGPVSVAIDAK 908
+ S++ YP+KA +G C Y S + + Y+ F+ E + + + GP+ V +DA
Sbjct: 279 VVSDSEYPFKAQNGLCHYFSCSHSGVSIKDYSAYDFSGQEDEMAKTLLTLGPLIVIVDA- 337
Query: 909 HSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKDYWLVKNSWGLNFGDGGY--IRMA 1082
S+ Y G+ + YW+V+NSWG +G GY ++M
Sbjct: 338 -VSWQDYLGGIIQHHCSSGEANHAVLVTGFDKTGSTPYWIVRNSWGSAWGIDGYALVKMG 396
Query: 1083 RN 1088
N
Sbjct: 397 GN 398
Database: RefSeq49_SP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 11,343,932
Number of sequences in database: 24,897
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 24897
Number of Hits to DB: 28,001,433
Number of extensions: 702212
Number of successful extensions: 2054
Number of sequences better than 1.0e-05: 13
Number of HSP's gapped: 2023
Number of HSP's successfully gapped: 13
Length of query: 388
Length of database: 11,343,932
Length adjustment: 101
Effective length of query: 287
Effective length of database: 8,829,335
Effective search space: 2534019145
Effective search space used: 2534019145
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 34 (17.7 bits)
Search to RefSeqMP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Query= 20110601C-012224
(1166 letters)
Database: RefSeq49_MP.fasta
30,036 sequences; 15,617,559 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|NP_067256.2| cathepsin S preproprotein [Mus musculus]. 492 e-139
Alignment gi|NP_031828.2| cathepsin K precursor [Mus musculus]. 329 3e-90
Alignment gi|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]. 289 3e-78
Alignment gi|NP_954599.2| hypothetical protein LOC218275 [Mus musculus]. 277 1e-74
Alignment gi|NP_036137.1| cathepsin J [Mus musculus]. 271 7e-73
Alignment gi|NP_081620.2| cathepsin L-like 3 [Mus musculus]. 270 2e-72
Alignment gi|XP_922074.1| PREDICTED: cathepsin M-like [Mus musculus]. 260 2e-69
Alignment gi|NP_071721.2| cathepsin M precursor [Mus musculus]. 259 4e-69
Alignment gi|NP_835199.1| testin-2 precursor [Mus musculus]. 256 2e-68
Alignment gi|NP_067420.1| cathepsin 6 [Mus musculus]. 256 2e-68
>ref|NP_067256.2| cathepsin S preproprotein [Mus musculus].
Length = 340
Score = 492 bits (1266), Expect = e-139
Identities = 231/336 (68%), Positives = 271/336 (80%), Gaps = 1/336 (0%)
Frame = +3
Query: 132 GSIIMKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNL 311
G ++ L W+ L+CS AM QL RDPTLD HWDLWKKT+ K+YK+KNEE RRLIWEKNL
Sbjct: 5 GHAAIRWLFWMPLVCSVAMEQLQRDPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNL 64
Query: 312 KTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLP 491
K +M+HNLE+SMGMH+Y +GMN +GDMT+EE++ M +R+P Q P+ VT++S N+ LP
Sbjct: 65 KFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILCRMGALRIPRQSPKTVTFRSYSNRTLP 124
Query: 492 DSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCST-EKY 668
D++DWREKGCVTEVKYQGSCG+CWAFSAVGALE Q+K+KTG+L+SLSAQNLVDCS EKY
Sbjct: 125 DTVDWREKGCVTEVKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKY 184
Query: 669 RNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADE 848
NKGC GG+MTEAFQYIIDN GI+++ASYPYKA D KC Y+SKNRAATCSRY +LPF DE
Sbjct: 185 GNKGCGGGYMTEAFQYIIDNGGIEADASYPYKATDEKCHYNSKNRAATCSRYIQLPFGDE 244
Query: 849 YALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKDYWL 1028
ALKEAVA KGPVSV IDA HSSFFFY+SGVY DPSCT KDYWL
Sbjct: 245 DALKEAVATKGPVSVGIDASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWL 304
Query: 1029 VKNSWGLNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
VKNSWGLNFGD GYIRMARN++N G A+Y YP+I
Sbjct: 305 VKNSWGLNFGDQGYIRMARNNKNHCGIASYCSYPEI 340
>ref|NP_031828.2| cathepsin K precursor [Mus musculus].
Length = 329
Score = 329 bits (843), Expect = 3e-90
Identities = 163/330 (49%), Positives = 213/330 (64%), Gaps = 3/330 (0%)
Frame = +3
Query: 156 VWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNL 335
V+ LL L + LD W+LWKKT+ KQY K +E++RRLIWEKNLK + HNL
Sbjct: 3 VFKFLLLPMVSFALSPEEMLDTQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNL 62
Query: 336 EHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQ--WPRNVTYKSNPNQKLPDSMDWR 509
E S+G+H+Y+L MNHLGDMTSEEV+ M+ +R+P + + Y ++PDS+D+R
Sbjct: 63 EASLGVHTYELAMNHLGDMTSEEVVQKMTGLRIPPSRSYSNDTLYTPEWEGRVPDSIDYR 122
Query: 510 EKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNG 689
+KG VT VK QG CGSCWAFS+ GALE Q+K KTG+L++LS QNLVDC TE Y GC G
Sbjct: 123 KKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTENY---GCGG 179
Query: 690 GFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAV 869
G+MT AFQY+ N GIDSE +YPY D C Y++ +AA C Y E+P +E ALK AV
Sbjct: 180 GYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALKRAV 239
Query: 870 ANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQ-XXXXXXXXXXXXXXXXKDYWLVKNSWG 1046
A GP+SV+IDA +SF FY GVYYD +C + +W++KNSWG
Sbjct: 240 ARVGPISVSIDASLASFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWG 299
Query: 1047 LNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
++G+ GY +ARN N G N +PK+
Sbjct: 300 ESWGNKGYALLARNKNNACGITNMASFPKM 329
>ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus].
Length = 334
Score = 289 bits (740), Expect = 3e-78
Identities = 154/337 (45%), Positives = 205/337 (60%), Gaps = 6/337 (1%)
Frame = +3
Query: 144 MKCLVWVLLLC-SSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTV 320
M L+ + +LC +A+A D T W WK T+ + Y NEE RR IWEKN++ +
Sbjct: 1 MNLLLLLAVLCLGTALATPKFDQTFSAEWHQWKSTHRRLYGT-NEEEWRRAIWEKNMRMI 59
Query: 321 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSM 500
LHN E+S G H + + MN GDMT+EE +++ R + + ++ K+P S+
Sbjct: 60 QLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYR-HQKHKKGRLFQEPLMLKIPKSV 118
Query: 501 DWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKG 680
DWREKGCVT VK QG CGSCWAFSA G LE Q+ +KTG+L+SLS QNLVDCS N+G
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCS-HAQGNQG 177
Query: 681 CNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALK 860
CNGG M AFQYI +N G+DSE SYPY+A DG CKY ++ A + + ++P E AL
Sbjct: 178 CNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIP-QQEKALM 236
Query: 861 EAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKD-----YW 1025
+AVA GP+SVA+DA H S FY SG+YY+P+C+ D YW
Sbjct: 237 KAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYW 296
Query: 1026 LVKNSWGLNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
LVKNSWG +G GYI++A++ +N G A YP +
Sbjct: 297 LVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>ref|NP_954599.2| hypothetical protein LOC218275 [Mus musculus].
Length = 330
Score = 277 bits (709), Expect = 1e-74
Identities = 145/314 (46%), Positives = 192/314 (61%), Gaps = 3/314 (0%)
Frame = +3
Query: 204 DPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHL 383
DP+LD W+ WK + K Y NEE +R +WE N+K + LHN ++ G H ++L MN
Sbjct: 22 DPSLDAVWEEWKTKHRKTYN-MNEEAQKRAVWENNMKMIGLHNEDYLKGKHGFNLEMNAF 80
Query: 384 GDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQ-KLPDSMDWREKGCVTEVKYQGSCGSC 560
GD+T+ E LM+ + S + +T P +P S+DWR+ G VT VK QG CGSC
Sbjct: 81 GDLTNTEFRELMTGFQ--SMGHKEMTIFQEPLLGDVPKSVDWRDHGYVTPVKDQGHCGSC 138
Query: 561 WAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGID 740
WAFSAVG+LE Q+ KTG+LV LS QNL+DCS Y N GCNGG M AFQY+ +N G+D
Sbjct: 139 WAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSW-SYGNVGCNGGLMELAFQYVKENRGLD 197
Query: 741 SEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSF 920
+ SY Y+A DG C+YD K A + + ++P +++ AL AVA+ GPVSV ID H SF
Sbjct: 198 TRESYAYEAWDGPCRYDPKYSAVNITGFVKVPLSED-ALMNAVASVGPVSVGIDTHHHSF 256
Query: 921 FFYRSGVYYDPSC--TQXXXXXXXXXXXXXXXXKDYWLVKNSWGLNFGDGGYIRMARNSE 1094
FYR G YY+P C T + YWLVKNSWG ++G GYI+MA++ +
Sbjct: 257 RFYRGGTYYEPDCSSTNLDHAVLVVGYGEESDGRKYWLVKNSWGEDWGMDGYIKMAKDRD 316
Query: 1095 NPWGNANYPPYPKI 1136
N G A Y YP +
Sbjct: 317 NNCGIATYAIYPTV 330
>ref|NP_036137.1| cathepsin J [Mus musculus].
Length = 333
Score = 271 bits (693), Expect = 7e-73
Identities = 142/334 (42%), Positives = 199/334 (59%), Gaps = 7/334 (2%)
Frame = +3
Query: 156 VWVLLLCSSAM--AQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLH 329
V +L+LC AQ H DP LD W WK Y K Y EE RR +WE+N++ + LH
Sbjct: 5 VLLLILCFGVASGAQAH-DPKLDAEWKDWKTKYAKSYSP--EEALRRAVWEENMRMIKLH 61
Query: 330 NLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMDWR 509
N E+S+G +++ + MN GD TSEE + + +P+ + +++ + LPD DWR
Sbjct: 62 NKENSLGKNNFTMKMNKFGDQTSEEFRKSIDNIPIPAAMT-DPHAQNHVSIGLPDYKDWR 120
Query: 510 EKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNG 689
E+G VT V+ QG CGSCWAF+A GA+E Q+ KTG L LS QNL+DCS + NKGC
Sbjct: 121 EEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCS-KTVGNKGCQS 179
Query: 690 GFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAV 869
G +AF+Y++ N G+++EA+YPY+ DG C+Y S+N +A + Y LP +E L AV
Sbjct: 180 GTAHQAFEYVLKNKGLEAEATYPYEGKDGPCRYRSENASANITDYVNLP-PNELYLWVAV 238
Query: 870 ANKGPVSVAIDAKHSSFFFYRSGVYYDPSCT-----QXXXXXXXXXXXXXXXXKDYWLVK 1034
A+ GPVS AIDA H SF FY G+YY+P+C+ +YWL+K
Sbjct: 239 ASIGPVSAAIDASHDSFRFYNGGIYYEPNCSSYFVNHAVLVVGYGSEGDVKDGNNYWLIK 298
Query: 1035 NSWGLNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
NSWG +G GY+++A++ N G A+ YP I
Sbjct: 299 NSWGEEWGMNGYMQIAKDHNNHCGIASLASYPNI 332
>ref|NP_081620.2| cathepsin L-like 3 [Mus musculus].
Length = 331
Score = 270 bits (690), Expect = 2e-72
Identities = 140/313 (44%), Positives = 191/313 (61%), Gaps = 2/313 (0%)
Frame = +3
Query: 204 DPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHL 383
+P+LD W+ WK + K Y N+E +R +WE N K + LHN ++ G H + L MN
Sbjct: 22 NPSLDAVWEEWKTKHKKTYN-MNDEGQKRAVWENNKKMIDLHNEDYLKGKHGFSLEMNAF 80
Query: 384 GDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCW 563
GD+T+ E LM+ + ++ +P S+DWR+ G VT VK QGSCGSCW
Sbjct: 81 GDLTNTEFRELMTGFQGQKTKMMMKVFQEPLLGDVPKSVDWRDHGYVTPVKDQGSCGSCW 140
Query: 564 AFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDS 743
AFSAVG+LE Q+ KTG+LV LS QNLVDCS + N+GC+GG AFQY+ DN G+D+
Sbjct: 141 AFSAVGSLEGQMFRKTGKLVPLSVQNLVDCSWSQ-GNQGCDGGLPDLAFQYVKDNGGLDT 199
Query: 744 EASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFF 923
SYPY+A++G C+Y+ KN AAT + + + + E AL +AVA GP+SV ID KH SF
Sbjct: 200 SVSYPYEALNGTCRYNPKNSAATVTGFVNVQ-SSEDALMKAVATVGPISVGIDTKHKSFQ 258
Query: 924 FYRSGVYYDPSC--TQXXXXXXXXXXXXXXXXKDYWLVKNSWGLNFGDGGYIRMARNSEN 1097
FY+ G+YY+P C T + YWLVKNSWG ++G GYI+MA++ N
Sbjct: 259 FYKEGMYYEPDCSSTVLDHAVLVVGYGEESDGRKYWLVKNSWGRDWGMNGYIKMAKDRNN 318
Query: 1098 PWGNANYPPYPKI 1136
G A+ YP +
Sbjct: 319 NCGIASDASYPVV 331
>ref|XP_922074.1| PREDICTED: cathepsin M-like [Mus musculus].
Length = 333
Score = 260 bits (664), Expect = 2e-69
Identities = 131/337 (38%), Positives = 201/337 (59%), Gaps = 5/337 (1%)
Frame = +3
Query: 141 IMKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTV 320
+ ++W+++ SS DP LD W WK +GK Y + EE +R +WE+N+K +
Sbjct: 5 VFLAILWLVMASSSPSP----DPILDAEWQKWKIKHGKPYSLEEEE-QKRAVWEENMKKI 59
Query: 321 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSM 500
LHN E+ +G H + + MN GDMT EE +M + VP+ + + + + + LP +
Sbjct: 60 KLHNGENGLGKHGFTMEMNAFGDMTLEEFRKVMIEIPVPTV-KKGKSVQKHLSVNLPKFI 118
Query: 501 DWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKG 680
+W+++G VT V+ QG C SCWA S GA+E Q+ KTG+L+ LS QNLVDCS + N+G
Sbjct: 119 NWKKRGYVTPVRTQGRCNSCWAISVTGAIEGQMFQKTGQLIPLSVQNLVDCSRPQ-GNRG 177
Query: 681 CNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALK 860
C G A +Y+++N G++SEA+YPY+ +G C+Y+ +N A+ + + +P +E AL
Sbjct: 178 CYVGNTYRALKYVVENGGLESEATYPYEEKEGSCRYNPENSTASITGFDFVP-ENEDALM 236
Query: 861 EAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSC-----TQXXXXXXXXXXXXXXXXKDYW 1025
AVA GP+SVAIDA+H SF FY+ G+Y++P+C T + YW
Sbjct: 237 NAVATIGPISVAIDARHESFLFYKRGIYHEPNCSSSVVTHAMLLVGYGFVGNESEGRKYW 296
Query: 1026 LVKNSWGLNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
+VKNS G +G GY+++AR+ N G A Y YP++
Sbjct: 297 IVKNSMGTKWGSKGYMKIARDQGNHCGIATYALYPRV 333
>ref|NP_071721.2| cathepsin M precursor [Mus musculus].
Length = 333
Score = 259 bits (661), Expect = 4e-69
Identities = 136/337 (40%), Positives = 199/337 (59%), Gaps = 6/337 (1%)
Frame = +3
Query: 144 MKCLVWVLLLC-SSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTV 320
M +++ +LC A+ DP LD W WK YGK Y + EE +R +WE N+K +
Sbjct: 1 MTSAIFLAMLCLGMALPSPAPDPILDVEWQKWKIKYGKAYSLE-EEGQKRAVWEDNMKKI 59
Query: 321 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSM 500
LHN E+ +G H + + MN GDMT EE +M + VP+ + + + + LP +
Sbjct: 60 KLHNGENGLGKHGFTMEMNAFGDMTLEEFRKVMIEIPVPTV-KKGKSVQKRLSVNLPKFI 118
Query: 501 DWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKG 680
+W+++G VT V+ QG C SCWAFS GA+E Q+ KTG+L+ LS QNLVDCS + N G
Sbjct: 119 NWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQ-GNWG 177
Query: 681 CNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALK 860
C G A Y+++N G++SEA+YPY+ DG C+Y +N A + + +P +E AL
Sbjct: 178 CYLGNTYLALHYVMENGGLESEATYPYEEKDGSCRYSPENSTANITGFEFVP-KNEDALM 236
Query: 861 EAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSC-----TQXXXXXXXXXXXXXXXXKDYW 1025
AVA+ GP+SVAIDA+H+SF FY+ G+YY+P+C T + YW
Sbjct: 237 NAVASIGPISVAIDARHASFLFYKRGIYYEPNCSSSVVTHSMLLVGYGFTGRESDGRKYW 296
Query: 1026 LVKNSWGLNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
LVKNS G +G+ GY++++R+ N G A Y YP++
Sbjct: 297 LVKNSMGTQWGNKGYMKISRDKGNHCGIATYALYPRV 333
>ref|NP_835199.1| testin-2 precursor [Mus musculus].
Length = 333
Score = 256 bits (654), Expect = 2e-68
Identities = 136/314 (43%), Positives = 184/314 (58%), Gaps = 5/314 (1%)
Frame = +3
Query: 204 DPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHL 383
DP+LD W+ W+ +GK Y NEE RR +WEKN K + LHN E+ G H + + MN
Sbjct: 22 DPSLDVQWNEWRTKHGKAYNV-NEERLRRAVWEKNFKMIELHNWEYLEGKHDFTMTMNAF 80
Query: 384 GDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCW 563
GD+T+ E + +M+ R + R ++ + +P +DWR G VT VK QG C S W
Sbjct: 81 GDLTNTEFVKMMTGFR-RQKIKRMHVFQDHQFLYVPKYVDWRMLGYVTPVKNQGYCASSW 139
Query: 564 AFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDS 743
AFSA G+LE Q+ KTGRLV LS QNL+DC + C+GGFM AFQY+ DN G+ +
Sbjct: 140 AFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNVTH-DCSGGFMQNAFQYVKDNGGLAT 198
Query: 744 EASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFF 923
E SYPY KC+Y ++N AA + ++P +E AL +AVA GP+SVA+DA H SF
Sbjct: 199 EESYPYIGPGRKCRYHAENSAANVRDFVQIPGREE-ALMKAVAKVGPISVAVDASHDSFQ 257
Query: 924 FYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKD-----YWLVKNSWGLNFGDGGYIRMARN 1088
FY SG+YY+P C + ++ YWLVKNSWG +G GYI++A++
Sbjct: 258 FYDSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSYWLVKNSWGEEWGMKGYIKIAKD 317
Query: 1089 SENPWGNANYPPYP 1130
N G A YP
Sbjct: 318 WNNHCGIATLATYP 331
>ref|NP_067420.1| cathepsin 6 [Mus musculus].
Length = 334
Score = 256 bits (654), Expect = 2e-68
Identities = 133/346 (38%), Positives = 195/346 (56%), Gaps = 5/346 (1%)
Frame = +3
Query: 108 ILCIGAPAGSIIMKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVAR 287
ILC+G +G++ + DP L+ W WKK Y K Y + EE R
Sbjct: 9 ILCLGVGSGALAL-------------------DPNLNAEWHDWKKQYEKSYTME-EEGLR 48
Query: 288 RLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYK 467
R IWE+N++ + LHN E+S+G +++ L MN GD+T EE+ +M+ + S R + K
Sbjct: 49 RAIWEENMRMIKLHNWENSLGKNNFTLKMNEFGDLTPEELRKMMNNFPIWSHKKRKIIRK 108
Query: 468 SNPNQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLV 647
LP +DWR+KG VT V+ Q C SCWAF+ GA+E Q+ KTG+L LS QNLV
Sbjct: 109 RAVGDVLPKFVDWRKKGYVTRVRRQKFCNSCWAFAVNGAIEGQMFKKTGKLTPLSVQNLV 168
Query: 648 DCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYT 827
DC T+ N GC G A++Y+++N G+++EA+YPY+ +G C+Y+ KN A + +
Sbjct: 169 DC-TKTQGNDGCQWGDPYIAYEYVLNNGGLEAEATYPYEGKEGPCRYNPKNSKAEITGFV 227
Query: 828 ELPFADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXX 1007
LP +++ L EAVA GP+S A+DA + F FY G+Y+ P+C+
Sbjct: 228 SLPESED-ILMEAVATIGPISAAVDASFNRFSFYDGGIYHQPNCSNNTVNHAVLVVGYGT 286
Query: 1008 XXKD-----YWLVKNSWGLNFGDGGYIRMARNSENPWGNANYPPYP 1130
+ YWL+KNSWG +G GGY+++ R+ N G A Y YP
Sbjct: 287 EGNETDGNKYWLIKNSWGRRWGIGGYMKIIRDQNNHCGIATYAHYP 332
Database: RefSeq49_MP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 15,617,559
Number of sequences in database: 30,036
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 30036
Number of Hits to DB: 38,324,955
Number of extensions: 966512
Number of successful extensions: 2943
Number of sequences better than 1.0e-05: 29
Number of HSP's gapped: 2847
Number of HSP's successfully gapped: 30
Length of query: 388
Length of database: 15,617,559
Length adjustment: 103
Effective length of query: 285
Effective length of database: 12,523,851
Effective search space: 3569297535
Effective search space used: 3569297535
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 35 (18.1 bits)
Search to RefSeqHP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Query= 20110601C-012224
(1166 letters)
Database: RefSeq49_HP.fasta
32,964 sequences; 18,297,164 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapie... 549 e-156
Alignment gi|NP_001186668.1| cathepsin S isoform 2 preproprotein [Homo sa... 436 e-122
Alignment gi|NP_000387.1| cathepsin K preproprotein [Homo sapiens]. 338 7e-93
Alignment gi|NP_001188504.1| cathepsin L2 preproprotein [Homo sapiens]. 290 2e-78
Alignment gi|NP_001324.2| cathepsin L2 preproprotein [Homo sapiens]. 290 2e-78
Alignment gi|NP_666023.1| cathepsin L1 preproprotein [Homo sapiens]. 287 1e-77
Alignment gi|NP_001903.1| cathepsin L1 preproprotein [Homo sapiens]. 287 1e-77
Alignment gi|NP_004381.2| pro-cathepsin H preproprotein [Homo sapiens]. 174 1e-43
Alignment gi|NP_003784.2| cathepsin F precursor [Homo sapiens]. 129 5e-30
Alignment gi|NP_001325.1| cathepsin O preproprotein [Homo sapiens]. 120 2e-27
>ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens].
Length = 331
Score = 549 bits (1414), Expect = e-156
Identities = 260/331 (78%), Positives = 285/331 (86%)
Frame = +3
Query: 144 MKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVM 323
MK LV VLL+CSSA+AQLH+DPTLD HW LWKKTYGKQYKEKNEE RRLIWEKNLK VM
Sbjct: 1 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVM 60
Query: 324 LHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMD 503
LHNLEHSMGMHSYDLGMNHLGDMTSEEV+SLMS +RVPSQW RN+TYKSNPN+ LPDS+D
Sbjct: 61 LHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYKSNPNRILPDSVD 120
Query: 504 WREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGC 683
WREKGCVTEVKYQGSCG+CWAFSAVGALEAQ+K+KTG+LVSLSAQNLVDCSTEKY NKGC
Sbjct: 121 WREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGC 180
Query: 684 NGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKE 863
NGGFMT AFQYIIDN GIDS+ASYPYKA+D KC+YDSK RAATCS+YTELP+ E LKE
Sbjct: 181 NGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKE 240
Query: 864 AVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKDYWLVKNSW 1043
AVANKGPVSV +DA+H SFF YRSGVYY+PSCTQ K+YWLVKNSW
Sbjct: 241 AVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSW 300
Query: 1044 GLNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
G NFG+ GYIRMARN N G A++P YP+I
Sbjct: 301 GHNFGEEGYIRMARNKGNHCGIASFPSYPEI 331
>ref|NP_001186668.1| cathepsin S isoform 2 preproprotein [Homo sapiens].
Length = 281
Score = 436 bits (1121), Expect = e-122
Identities = 218/331 (65%), Positives = 238/331 (71%), Gaps = 50/331 (15%)
Frame = +3
Query: 144 MKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVM 323
MK LV VLL+CSSA+AQLH+DPTLD HW LWKKTYGKQYKEKNEE RRLIWEKNLK VM
Sbjct: 1 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVM 60
Query: 324 LHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMD 503
LHNLEHSMGMHSYDLGMNHLGDM
Sbjct: 61 LHNLEHSMGMHSYDLGMNHLGDM------------------------------------- 83
Query: 504 WREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGC 683
GSCG+CWAFSAVGALEAQ+K+KTG+LVSLSAQNLVDCSTEKY NKGC
Sbjct: 84 -------------GSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGC 130
Query: 684 NGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKE 863
NGGFMT AFQYIIDN GIDS+ASYPYKA+D KC+YDSK RAATCS+YTELP+ E LKE
Sbjct: 131 NGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKE 190
Query: 864 AVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKDYWLVKNSW 1043
AVANKGPVSV +DA+H SFF YRSGVYY+PSCTQ K+YWLVKNSW
Sbjct: 191 AVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSW 250
Query: 1044 GLNFGDGGYIRMARNSENPWGNANYPPYPKI 1136
G NFG+ GYIRMARN N G A++P YP+I
Sbjct: 251 GHNFGEEGYIRMARNKGNHCGIASFPSYPEI 281
>ref|NP_000387.1| cathepsin K preproprotein [Homo sapiens].
Length = 329
Score = 338 bits (866), Expect = 7e-93
Identities = 173/328 (52%), Positives = 219/328 (66%), Gaps = 3/328 (0%)
Frame = +3
Query: 162 VLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEH 341
VLLL + A L+ + LD HW+LWKKT+ KQY K +E++RRLIWEKNLK + +HNLE
Sbjct: 6 VLLLPVVSFA-LYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEA 64
Query: 342 SMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPR--NVTYKSNPNQKLPDSMDWREK 515
S+G+H+Y+L MNHLGDMTSEEV+ M+ ++VP R + Y + PDS+D+R+K
Sbjct: 65 SLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKK 124
Query: 516 GCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGF 695
G VT VK QG CGSCWAFS+VGALE Q+K KTG+L++LS QNLVDC +E N GC GG+
Sbjct: 125 GYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE---NDGCGGGY 181
Query: 696 MTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVAN 875
MT AFQY+ N GIDSE +YPY + C Y+ +AA C Y E+P +E ALK AVA
Sbjct: 182 MTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVAR 241
Query: 876 KGPVSVAIDAKHSSFFFYRSGVYYDPSC-TQXXXXXXXXXXXXXXXXKDYWLVKNSWGLN 1052
GPVSVAIDA +SF FY GVYYD SC + +W++KNSWG N
Sbjct: 242 VGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGEN 301
Query: 1053 FGDGGYIRMARNSENPWGNANYPPYPKI 1136
+G+ GYI MARN N G AN +PK+
Sbjct: 302 WGNKGYILMARNKNNACGIANLASFPKM 329
>ref|NP_001188504.1| cathepsin L2 preproprotein [Homo sapiens].
Length = 334
Score = 290 bits (742), Expect = 2e-78
Identities = 150/316 (47%), Positives = 194/316 (61%), Gaps = 5/316 (1%)
Frame = +3
Query: 204 DPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHL 383
D LD W WK T+ + Y NEE RR +WEKN+K + LHN E+S G H + + MN
Sbjct: 22 DQNLDTKWYQWKATHRRLYGA-NEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAF 80
Query: 384 GDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCW 563
GDMT+EE +M C R ++ + ++ LP S+DWR+KG VT VK Q CGSCW
Sbjct: 81 GDMTNEEFRQMMGCFR-NQKFRKGKVFREPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCW 139
Query: 564 AFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDS 743
AFSA GALE Q+ KTG+LVSLS QNLVDCS + N+GCNGGFM AFQY+ +N G+DS
Sbjct: 140 AFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ-GNQGCNGGFMARAFQYVKENGGLDS 198
Query: 744 EASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFF 923
E SYPY AVD CKY +N A + +T + E AL +AVA GP+SVA+DA HSSF
Sbjct: 199 EESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQ 258
Query: 924 FYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKD-----YWLVKNSWGLNFGDGGYIRMARN 1088
FY+SG+Y++P C+ + YWLVKNSWG +G GY+++A++
Sbjct: 259 FYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKD 318
Query: 1089 SENPWGNANYPPYPKI 1136
N G A YP +
Sbjct: 319 KNNHCGIATAASYPNV 334
>ref|NP_001324.2| cathepsin L2 preproprotein [Homo sapiens].
Length = 334
Score = 290 bits (742), Expect = 2e-78
Identities = 150/316 (47%), Positives = 194/316 (61%), Gaps = 5/316 (1%)
Frame = +3
Query: 204 DPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHL 383
D LD W WK T+ + Y NEE RR +WEKN+K + LHN E+S G H + + MN
Sbjct: 22 DQNLDTKWYQWKATHRRLYGA-NEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAF 80
Query: 384 GDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCW 563
GDMT+EE +M C R ++ + ++ LP S+DWR+KG VT VK Q CGSCW
Sbjct: 81 GDMTNEEFRQMMGCFR-NQKFRKGKVFREPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCW 139
Query: 564 AFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDS 743
AFSA GALE Q+ KTG+LVSLS QNLVDCS + N+GCNGGFM AFQY+ +N G+DS
Sbjct: 140 AFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ-GNQGCNGGFMARAFQYVKENGGLDS 198
Query: 744 EASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFF 923
E SYPY AVD CKY +N A + +T + E AL +AVA GP+SVA+DA HSSF
Sbjct: 199 EESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQ 258
Query: 924 FYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKD-----YWLVKNSWGLNFGDGGYIRMARN 1088
FY+SG+Y++P C+ + YWLVKNSWG +G GY+++A++
Sbjct: 259 FYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKD 318
Query: 1089 SENPWGNANYPPYPKI 1136
N G A YP +
Sbjct: 319 KNNHCGIATAASYPNV 334
>ref|NP_666023.1| cathepsin L1 preproprotein [Homo sapiens].
Length = 333
Score = 287 bits (734), Expect = 1e-77
Identities = 153/324 (47%), Positives = 198/324 (61%), Gaps = 6/324 (1%)
Frame = +3
Query: 183 AMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSY 362
A A L D +L+ W WK + + Y NEE RR +WEKN+K + LHN E+ G HS+
Sbjct: 15 ASATLTFDHSLEAQWTKWKAMHNRLYG-MNEEGWRRAVWEKNMKMIELHNQEYREGKHSF 73
Query: 363 DLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPN-QKLPDSMDWREKGCVTEVKY 539
+ MN GDMTSEE +M+ + ++ PR P + P S+DWREKG VT VK
Sbjct: 74 TMAMNAFGDMTSEEFRQVMNGFQ--NRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKN 131
Query: 540 QGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYI 719
QG CGSCWAFSA GALE Q+ KTGRL+SLS QNLVDCS + N+GCNGG M AFQY+
Sbjct: 132 QGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQ-GNEGCNGGLMDYAFQYV 190
Query: 720 IDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAI 899
DN G+DSE SYPY+A + CKY+ K A + + ++P E AL +AVA GP+SVAI
Sbjct: 191 QDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKALMKAVATVGPISVAI 249
Query: 900 DAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKD-----YWLVKNSWGLNFGDG 1064
DA H SF FY+ G+Y++P C+ + YWLVKNSWG +G G
Sbjct: 250 DAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMG 309
Query: 1065 GYIRMARNSENPWGNANYPPYPKI 1136
GY++MA++ N G A+ YP +
Sbjct: 310 GYVKMAKDRRNHCGIASAASYPTV 333
>ref|NP_001903.1| cathepsin L1 preproprotein [Homo sapiens].
Length = 333
Score = 287 bits (734), Expect = 1e-77
Identities = 153/324 (47%), Positives = 198/324 (61%), Gaps = 6/324 (1%)
Frame = +3
Query: 183 AMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSY 362
A A L D +L+ W WK + + Y NEE RR +WEKN+K + LHN E+ G HS+
Sbjct: 15 ASATLTFDHSLEAQWTKWKAMHNRLYG-MNEEGWRRAVWEKNMKMIELHNQEYREGKHSF 73
Query: 363 DLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPN-QKLPDSMDWREKGCVTEVKY 539
+ MN GDMTSEE +M+ + ++ PR P + P S+DWREKG VT VK
Sbjct: 74 TMAMNAFGDMTSEEFRQVMNGFQ--NRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKN 131
Query: 540 QGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYI 719
QG CGSCWAFSA GALE Q+ KTGRL+SLS QNLVDCS + N+GCNGG M AFQY+
Sbjct: 132 QGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQ-GNEGCNGGLMDYAFQYV 190
Query: 720 IDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAI 899
DN G+DSE SYPY+A + CKY+ K A + + ++P E AL +AVA GP+SVAI
Sbjct: 191 QDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKALMKAVATVGPISVAI 249
Query: 900 DAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXXXXXXXXKD-----YWLVKNSWGLNFGDG 1064
DA H SF FY+ G+Y++P C+ + YWLVKNSWG +G G
Sbjct: 250 DAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMG 309
Query: 1065 GYIRMARNSENPWGNANYPPYPKI 1136
GY++MA++ N G A+ YP +
Sbjct: 310 GYVKMAKDRRNHCGIASAASYPTV 333
>ref|NP_004381.2| pro-cathepsin H preproprotein [Homo sapiens].
Length = 335
Score = 174 bits (442), Expect = 1e-43
Identities = 118/348 (33%), Positives = 168/348 (48%), Gaps = 7/348 (2%)
Frame = +3
Query: 108 ILCIGAPAGSIIMKCLVWVLLLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVAR 287
+LC GA W+L + A+L + H+ W + K Y EE
Sbjct: 7 LLCAGA-----------WLLGVPVCGAAELCVNSLEKFHFKSWMSKHRKTYS--TEEYHH 53
Query: 288 RL-IWEKNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVIS--LMSCVRVPSQWPRNV 458
RL + N + + HN G H++ + +N DM+ E+ L S + S N
Sbjct: 54 RLQTFASNWRKINAHN----NGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNY 109
Query: 459 TYKSNPNQKLPDSMDWREKG-CVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSA 635
+ P P S+DWR+KG V+ VK QG+CGSCW FS GALE+ + + TG+++SL+
Sbjct: 110 LRGTGP---YPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAE 166
Query: 636 QNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATC 815
Q LVDC+ + + N GC GG ++AF+YI+ N GI E +YPY+ DG CK+
Sbjct: 167 QQLVDCAQD-FNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFV 225
Query: 816 SRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQXXXXXXXXXX 995
+ DE A+ EAVA PVS A + F YR+G+Y SC +
Sbjct: 226 KDVANITIYDEEAMVEAVALYNPVSFAFEVT-QDFMMYRTGIYSSTSCHKTPDKVNHAVL 284
Query: 996 XXXXXXKD---YWLVKNSWGLNFGDGGYIRMARNSENPWGNANYPPYP 1130
K+ YW+VKNSWG +G GY + R +N G A YP
Sbjct: 285 AVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIER-GKNMCGLAACASYP 331
>ref|NP_003784.2| cathepsin F precursor [Homo sapiens].
Length = 484
Score = 129 bits (324), Expect = 5e-30
Identities = 85/287 (29%), Positives = 132/287 (45%), Gaps = 4/287 (1%)
Frame = +3
Query: 243 TYGKQYKEKNEEVARRLIWEKNL-KTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLM 419
TY + Y+ K E R ++ N+ + + L+ + G+ D+T EE ++
Sbjct: 193 TYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQY----GVTKFSDLTEEEFRTIY 248
Query: 420 SCVRVPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQV 599
+ + + + P DWR KG VT+VK QG CGSCWAFS G +E Q
Sbjct: 249 LNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQW 308
Query: 600 KMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGK 779
+ G L+SLS Q L+DC +K C GG + A+ I + G+++E Y Y+
Sbjct: 309 FLNQGTLLSLSEQELLDCDK---MDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQS 365
Query: 780 CKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVY--YDP 953
C + ++ + EL +E L +A +GP+SVAI+A FYR G+ P
Sbjct: 366 CNFSAEKAKVYINDSVELS-QNEQKLAAWLAKRGPISVAINA--FGMQFYRHGISRPLRP 422
Query: 954 SCTQXXXXXXXXXXXXXXXXK-DYWLVKNSWGLNFGDGGYIRMARNS 1091
C+ +W +KNSWG ++G+ GY + R S
Sbjct: 423 LCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLHRGS 469
>ref|NP_001325.1| cathepsin O preproprotein [Homo sapiens].
Length = 321
Score = 120 bits (302), Expect = 2e-27
Identities = 77/225 (34%), Positives = 114/225 (50%), Gaps = 7/225 (3%)
Frame = +3
Query: 435 PSQWPRNVT--YKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMK 608
PS++PR + S PN LP DWR+K VT+V+ Q CG CWAFS VGA+E+ +K
Sbjct: 89 PSKFPRYSAEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIK 148
Query: 609 TGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNN-GIDSEASYPYKAVDGKCK 785
L LS Q ++DCS Y N GCNGG A ++ + ++ YP+KA +G C
Sbjct: 149 GKPLEDLSVQQVIDCS---YNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCH 205
Query: 786 YDSKNRAA-TCSRYTELPFAD-EYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSC 959
Y S + + + Y+ F+D E + +A+ GP+ V +DA S+ Y G+
Sbjct: 206 YFSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDA--VSWQDYLGGIIQHHCS 263
Query: 960 TQXXXXXXXXXXXXXXXXKDYWLVKNSWGLNFGDGGY--IRMARN 1088
+ YW+V+NSWG ++G GY ++M N
Sbjct: 264 SGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGVDGYAHVKMGSN 308
Database: RefSeq49_HP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 18,297,164
Number of sequences in database: 32,964
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 32964
Number of Hits to DB: 44,889,166
Number of extensions: 1124993
Number of successful extensions: 3389
Number of sequences better than 1.0e-05: 21
Number of HSP's gapped: 3345
Number of HSP's successfully gapped: 23
Length of query: 388
Length of database: 18,297,164
Length adjustment: 104
Effective length of query: 284
Effective length of database: 14,868,908
Effective search space: 4222769872
Effective search space used: 4222769872
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 35 (18.1 bits)
Search to Sscrofa10_2
BLASTN 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-012224
(1166 letters)
Database: Sscrofa_10.2.fasta
4582 sequences; 2,808,509,378 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Sscrofa_Chr04 458 e-126
>Sscrofa_Chr04
|| Length = 143465943
Score = 458 bits (231), Expect = e-126
Identities = 231/231 (100%)
Strand = Plus / Plus
Query: 541 agggttcttgtggttcttgttgggctttcagcgctgtgggagctctggaagcacaagtga 600
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107674018 agggttcttgtggttcttgttgggctttcagcgctgtgggagctctggaagcacaagtga 107674077
Query: 601 agatgaaaacaggaaggctggtgtctctgagtgcacagaacctggtggattgctcaactg 660
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107674078 agatgaaaacaggaaggctggtgtctctgagtgcacagaacctggtggattgctcaactg 107674137
Query: 661 aaaaatacaggaataaaggctgcaatggcggcttcatgacagaggctttccaatatatca 720
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107674138 aaaaatacaggaataaaggctgcaatggcggcttcatgacagaggctttccaatatatca 107674197
Query: 721 ttgataacaacggcatcgattcagaagcctcctatccctacaaagccgtgg 771
|||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107674198 ttgataacaacggcatcgattcagaagcctcctatccctacaaagccgtgg 107674248
Score = 335 bits (169), Expect = 3e-89
Identities = 169/169 (100%)
Strand = Plus / Plus
Query: 770 ggatggaaaatgcaagtatgactcaaaaaatcgagctgccacgtgttcaaggtatactga 829
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107675987 ggatggaaaatgcaagtatgactcaaaaaatcgagctgccacgtgttcaaggtatactga 107676046
Query: 830 acttcctttcgccgatgaatatgccttaaaagaagctgtggccaataagggacctgtgtc 889
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107676047 acttcctttcgccgatgaatatgccttaaaagaagctgtggccaataagggacctgtgtc 107676106
Query: 890 cgttgctatagatgcgaagcattcttctttcttcttctacaggagtggt 938
|||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107676107 cgttgctatagatgcgaagcattcttctttcttcttctacaggagtggt 107676155
Score = 301 bits (152), Expect = 4e-79
Identities = 152/152 (100%)
Strand = Plus / Plus
Query: 392 gaccagtgaagaagtgatatcattgatgagttgcgtgagagttcccagccaatggccgag 451
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107671672 gaccagtgaagaagtgatatcattgatgagttgcgtgagagttcccagccaatggccgag 107671731
Query: 452 aaatgtcacttacaagtcaaaccctaatcagaaattgcctgattctatggactggagaga 511
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107671732 aaatgtcacttacaagtcaaaccctaatcagaaattgcctgattctatggactggagaga 107671791
Query: 512 gaaggggtgtgttactgaagtgaaataccagg 543
||||||||||||||||||||||||||||||||
Sbjct: 107671792 gaaggggtgtgttactgaagtgaaataccagg 107671823
Score = 281 bits (142), Expect = 4e-73
Identities = 142/142 (100%)
Strand = Plus / Plus
Query: 1 ctcattgtggatacctcatgtgacaagttccaatttctttttaaagtctctttaactgaa 60
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107666257 ctcattgtggatacctcatgtgacaagttccaatttctttttaaagtctctttaactgaa 107666316
Query: 61 gtctctttgctgcctttggaatctttggagagagcccactgtcacgcattctttgtatag 120
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107666317 gtctctttgctgcctttggaatctttggagagagcccactgtcacgcattctttgtatag 107666376
Query: 121 gagcacctgctggttctatcat 142
||||||||||||||||||||||
Sbjct: 107666377 gagcacctgctggttctatcat 107666398
Score = 252 bits (127), Expect = 4e-64
Identities = 127/127 (100%)
Strand = Plus / Plus
Query: 143 aatgaaatgcctggtttgggtgctcctcctgtgctcctcagcgatggcacagctgcacag 202
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107667386 aatgaaatgcctggtttgggtgctcctcctgtgctcctcagcgatggcacagctgcacag 107667445
Query: 203 agaccccaccttggatcgtcactgggatctctggaagaaaacctatggaaaacaatacaa 262
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107667446 agaccccaccttggatcgtcactgggatctctggaagaaaacctatggaaaacaatacaa 107667505
Query: 263 ggaaaag 269
|||||||
Sbjct: 107667506 ggaaaag 107667512
Score = 248 bits (125), Expect = 6e-63
Identities = 125/125 (100%)
Strand = Plus / Plus
Query: 268 agaatgaggaagtagcacggcgtctcatctgggaaaagaacctaaaaactgtaatgcttc 327
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107670741 agaatgaggaagtagcacggcgtctcatctgggaaaagaacctaaaaactgtaatgcttc 107670800
Query: 328 acaatctggagcattcgatgggaatgcattcatatgatctaggcatgaaccacctgggag 387
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107670801 acaatctggagcattcgatgggaatgcattcatatgatctaggcatgaaccacctgggag 107670860
Query: 388 acatg 392
|||||
Sbjct: 107670861 acatg 107670865
Score = 198 bits (100), Expect = 5e-48
Identities = 103/104 (99%)
Strand = Plus / Plus
Query: 936 ggtgtctactatgacccctcctgtactcagaatgtgaatcatggtgtactagtggtcggc 995
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107677452 ggtgtctactatgacccctcctgtactcagaatgtgaatcatggtgtactagtggtcggc 107677511
Query: 996 tacggtaaccttaatgggaaagactactggcttgtgaaaaacag 1039
||||||||||||||||| ||||||||||||||||||||||||||
Sbjct: 107677512 tacggtaaccttaatggaaaagactactggcttgtgaaaaacag 107677555
Score = 176 bits (89), Expect = 2e-41
Identities = 119/129 (92%)
Strand = Plus / Plus
Query: 1038 agctggggactcaactttggtgacggaggatacatacggatggcaagaaatagtgaaaat 1097
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 107691124 agctggggactcaactttggtgacggaggatacatacggatggcaagaaatagtgaaaat 107691183
Query: 1098 ccctgggggaatgctaattaccccccttacccaaaaatctagaggaattcttctttttaa 1157
| ||| |||| ||||||||| ||| |||||||| |||||||||||| ||||| |||||
Sbjct: 107691184 cactgtgggattgctaattatccctcttacccagaaatctagaggatatcttcattttat 107691243
Query: 1158 aacatttca 1166
|||||||||
Sbjct: 107691244 aacatttca 107691252
Database: Sscrofa_10.2.fasta
Posted date: Nov 16, 2011 10:34 AM
Number of letters in database: 2,808,509,378
Number of sequences in database: 4582
Lambda K H
1.37 0.711 1.31
Gapped
Lambda K H
1.37 0.711 1.31
Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 5, Extension: 2
Number of Sequences: 4582
Number of Hits to DB: 40,555,053
Number of extensions: 335
Number of successful extensions: 335
Number of sequences better than 1.0e-05: 1
Number of HSP's gapped: 335
Number of HSP's successfully gapped: 8
Length of query: 1166
Length of database: 2,808,509,378
Length adjustment: 21
Effective length of query: 1145
Effective length of database: 2,808,413,156
Effective search space: 3215633063620
Effective search space used: 3215633063620
X1: 11 (21.8 bits)
X2: 15 (29.7 bits)
X3: 50 (99.1 bits)
S1: 18 (36.2 bits)
S2: 30 (60.0 bits)