Search to RefSeqBP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-001419
(1440 letters)
Database: RefSeq49_BP.fasta
33,088 sequences; 17,681,374 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
gi|NP_776456.1| cathepsin B precursor [Bos taurus].                 606   e-173
gi|XP_002685665.1| PREDICTED: tubulointerstitial nephritis anti...  138   1e-32
gi|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen...  138   1e-32
gi|NP_001030279.1| tubulointerstitial nephritis antigen [Bos ta...  125   9e-29
gi|NP_001028789.1| dipeptidyl peptidase 1 [Bos taurus].             100   2e-21
gi|NP_001029557.1| pro-cathepsin H precursor [Bos taurus].           85   2e-16
gi|NP_001071303.1| cathepsin Z precursor [Bos taurus].               80   3e-15
gi|NP_001028787.1| cathepsin S precursor [Bos taurus].               74   3e-13
gi|NP_001077155.1| cathepsin L1 [Bos taurus].                        73   6e-13
gi|NP_776457.1| cathepsin L2 precursor [Bos taurus].                 72   9e-13
>ref|NP_776456.1| cathepsin B precursor [Bos taurus].
Length = 335
Score = 606 bits (1563), Expect = e-173
Identities = 279/335 (83%), Positives = 294/335 (87%)
Frame = +2
Query: 254 MWRXXXXXXXXXXXXXXRESLHFQPLSDELVNFINKQNTTWTAGHNFYNVDLSYVKKLCG 433
MWR R SL+F PLSDELVNF+NKQNTTW AGHNFYNVDLSYVKKLCG
Sbjct: 1 MWRLLATLSCLLVLTSARSSLYFPPLSDELVNFVNKQNTTWKAGHNFYNVDLSYVKKLCG 60
Query: 434 TFLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDR 613
LGGPKLPQR AFAAD++LP+SFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDR
Sbjct: 61 AILGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDR 120
Query: 614 ICIRSNGRVNVEVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCR 793
ICI SNGRVNVEVSAEDMLT FPSGAWNFWTKKGLVSGGLY+SHVGCR
Sbjct: 121 ICIHSNGRVNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCR 180
Query: 794 PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIM 973
PYSIPPCEHHVNGSRPPCTGEGDTPKCSK CEPGY+PSYKEDKHFGCSSYS++ NEKEIM
Sbjct: 181 PYSIPPCEHHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIM 240
Query: 974 AEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSC 1153
AEIYKNGPVEGAF+VYSDFL YKSGVYQHV+G++MGGHAIRILGWGVENGTPYWLVGNS
Sbjct: 241 AEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNSW 300
Query: 1154 NTDWGDNGFFKILIGQDHCGIESEIVAGIPCTPHF 1258
NTDWGDNGFFKIL GQDHCGIESEIVAG+PCT +
Sbjct: 301 NTDWGDNGFFKILRGQDHCGIESEIVAGMPCTHQY 335
>ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1-like [Bos
taurus].
Length = 534
Score = 138 bits (348), Expect = 1e-32
Identities = 96/330 (29%), Positives = 151/330 (45%), Gaps = 29/330 (8%)
Frame = +2
Query: 329 LSDELVNFINKQNTTWTAGHN--FYNVDLSY-VKKLCGTFLGGPKLPQRAAFAADM---- 487
+ ++++ IN + W AG++ F+ + L ++ GT + ++F A+M
Sbjct: 209 VDEDMIEAINHGDYGWRAGNHSAFWGMTLDEGIRYRLGTV-------RPSSFVANMNEIH 261
Query: 488 -------ILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNV 646
+LP++F+A E+WPN I + DQG+C WAF SDR+ I S G ++
Sbjct: 262 TVLGPGEVLPRTFEASEKWPNL--IHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSP 319
Query: 647 EVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLY--DSHVGCRPYSIPPCEH 820
+S +++L+ GAW F ++G+VS Y H PPC
Sbjct: 320 VLSPQNLLSCDTHNQQGCRGGRL-DGAWWFLRRRGVVSDHCYPFSGHGRDEAVPAPPCMM 378
Query: 821 HVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPV 1000
H G G ++ C Y + D + +Y + NEKEIM E+ +NGPV
Sbjct: 379 HSRAM-----GRGKRQATAR-CPNSYV--HANDIYQVTPAYRLGSNEKEIMKELMENGPV 430
Query: 1001 EGAFTVYSDFLQYKSGVYQHVTGDL--------MGGHAIRILGWGVE-----NGTPYWLV 1141
+ V+ DF Y+SG+Y H L G H+++I GWG E YW
Sbjct: 431 QALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTIKYWTA 490
Query: 1142 GNSCNTDWGDNGFFKILIGQDHCGIESEIV 1231
NS WG+ G F+I+ G + C IES ++
Sbjct: 491 ANSWGPAWGERGHFRIVRGANECDIESFVL 520
>ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2 [Bos
taurus].
Length = 534
Score = 138 bits (348), Expect = 1e-32
Identities = 96/330 (29%), Positives = 151/330 (45%), Gaps = 29/330 (8%)
Frame = +2
Query: 329 LSDELVNFINKQNTTWTAGHN--FYNVDLSY-VKKLCGTFLGGPKLPQRAAFAADM---- 487
+ ++++ IN + W AG++ F+ + L ++ GT + ++F A+M
Sbjct: 209 VDEDMIEAINHGDYGWRAGNHSAFWGMTLDEGIRYRLGTV-------RPSSFVANMNEIH 261
Query: 488 -------ILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNV 646
+LP++F+A E+WPN I + DQG+C WAF SDR+ I S G ++
Sbjct: 262 TVLGPGEVLPRTFEASEKWPNL--IHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSP 319
Query: 647 EVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLY--DSHVGCRPYSIPPCEH 820
+S +++L+ GAW F ++G+VS Y H PPC
Sbjct: 320 VLSPQNLLSCDTHNQQGCRGGRL-DGAWWFLRRRGVVSDHCYPFSGHGRDEAVPAPPCMM 378
Query: 821 HVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPV 1000
H G G ++ C Y + D + +Y + NEKEIM E+ +NGPV
Sbjct: 379 HSRAM-----GRGKRQATAR-CPNSYV--HANDIYQVTPAYRLGSNEKEIMKELMENGPV 430
Query: 1001 EGAFTVYSDFLQYKSGVYQHVTGDL--------MGGHAIRILGWGVE-----NGTPYWLV 1141
+ V+ DF Y+SG+Y H L G H+++I GWG E YW
Sbjct: 431 QALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTIKYWTA 490
Query: 1142 GNSCNTDWGDNGFFKILIGQDHCGIESEIV 1231
NS WG+ G F+I+ G + C IES ++
Sbjct: 491 ANSWGPAWGERGHFRIVRGANECDIESFVL 520
>ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus].
Length = 476
Score = 125 bits (314), Expect = 9e-29
Identities = 98/338 (28%), Positives = 142/338 (42%), Gaps = 30/338 (8%)
Frame = +2
Query: 311 SLHFQPLSDELVNFINKQNTTWTAGH--NFYNVDLSY-VKKLCGTFLGGPKLPQRAAFAA 481
S H + L+ +NK + WTA + F+ + L K GT P L A
Sbjct: 150 SQHVCLVQPGLIEHVNKGDYGWTAQNYSQFWGMTLEEGFKYRLGTLPPSPLLLSMNEVTA 209
Query: 482 DMI----LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVE 649
+ LP+ F A +WP DQ +C + WAF +DRI I+S GR
Sbjct: 210 SLTKTTDLPEFFIASYKWPGWT--HGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTAN 267
Query: 650 VSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVN 829
+S +++++ AW + K+GLVS Y P N
Sbjct: 268 LSPQNLISCCAKKRHGCNSGSVDR-AWWYLRKRGLVSHACY------------PLFKDQN 314
Query: 830 GSRPPCT------GEGD---TPKCSKICEPGYTPSYKEDKHFGCSS-YSISRNEKEIMAE 979
+ C G G T C E K ++ + CS Y +S NE EIM E
Sbjct: 315 ATNNGCAMASRSDGRGKRHATTPCPNSIE-------KSNRIYQCSPPYRVSSNETEIMRE 367
Query: 980 IYKNGPVEGAFTVYSDFLQYKSGVYQHVTGD--------LMGGHAIRILGWGVENGT--- 1126
I +NGPV+ V+ DF YK+G+Y+H+T HA+++ GWG G
Sbjct: 368 IMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQ 427
Query: 1127 --PYWLVGNSCNTDWGDNGFFKILIGQDHCGIESEIVA 1234
+W+ NS WG+NG+F+IL G + IE I+A
Sbjct: 428 KEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIA 465
>ref|NP_001028789.1| dipeptidyl peptidase 1 [Bos taurus].
Length = 463
Score = 100 bits (250), Expect = 2e-21
Identities = 88/320 (27%), Positives = 135/320 (42%), Gaps = 18/320 (5%)
Frame = +2
Query: 338 ELVNFINKQNTTWTAGH--NFYNVDLSYVKKLCGTFLGGPKLPQRAAFAAD-----MILP 496
+ V IN +WTA + + L + + G P+ A A+ + LP
Sbjct: 173 DFVKAINAIQKSWTAAPYMEYETLTLKEMIRRGGGHSRRIPRPKPAPITAEIQKKILHLP 232
Query: 497 KSFDAREQWPNCPTIK---EIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDM 667
S+D W N I +R+QGSCGSC++F ++ + RI I +N +S +++
Sbjct: 233 TSWD----WRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEV 288
Query: 668 LTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPC 847
++ + A + GLV C PY+ G+ PC
Sbjct: 289 VSCSQYAQGCEGGFPYLI-AGKYAQDFGLVEED-------CFPYT---------GTDSPC 331
Query: 848 TGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSD 1027
+ G Y + H+ Y NE + E+ GP+ AF VY D
Sbjct: 332 R-----------LKEGCFRYYSSEYHYVGGFYG-GCNEALMKLELVHQGPMAVAFEVYDD 379
Query: 1028 FLQYKSGVYQHV------TGDLMGGHAIRILGWGVE--NGTPYWLVGNSCNTDWGDNGFF 1183
FL Y+ GVY H + HA+ ++G+G + +G YW+V NS T WG+NG+F
Sbjct: 380 FLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYF 439
Query: 1184 KILIGQDHCGIESEIVAGIP 1243
+I G D C IES +A P
Sbjct: 440 RIRRGTDECAIESIALAATP 459
>ref|NP_001029557.1| pro-cathepsin H precursor [Bos taurus].
Length = 335
Score = 84.7 bits (208), Expect = 2e-16
Identities = 79/303 (26%), Positives = 133/303 (43%), Gaps = 12/303 (3%)
Frame = +2
Query: 344 VNFINKQNTTWTAGHNFYNVDLSYVKKLCGTFLGGPKLPQRAAFAADMIL------PKSF 505
+N N +N T+ G N ++ D+S+ +L +L PQ + L P S
Sbjct: 65 INAHNARNHTFKMGLNQFS-DMSF-DELKRKYLWSE--PQNCSATKSNYLRGTGPYPPSM 120
Query: 506 DAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTXXXX 685
D R++ + +++QGSCGSCW F A+ + I + G++ ++ + ++
Sbjct: 121 DWRKKGN---FVTPVKNQGSCGSCWTFSTTGALESAVAI-ATGKLPF-LAEQQLVDCAQN 175
Query: 686 XXXXXXXXXFPSGAWNFWT-KKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGD 862
PS A+ + KG++ Y PY G
Sbjct: 176 FNNHGCQGGLPSQAFEYIRYNKGIMGEDTY-------PY------------------RGQ 210
Query: 863 TPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAE-IYKNGPVEGAFTVYSDFLQY 1039
C +P ++ +D +I+ N++E M E + + PV AF V +DF+ Y
Sbjct: 211 DGDCKY--QPSKAIAFVKDVA------NITLNDEEAMVEAVALHNPVSFAFEVTADFMMY 262
Query: 1040 KSGVYQ----HVTGDLMGGHAIRILGWGVENGTPYWLVGNSCNTDWGDNGFFKILIGQDH 1207
+ G+Y H T D + HA+ +G+G E G PYW+V NS +WG G+F I G++
Sbjct: 263 RKGIYSSTSCHKTPDKVN-HAVLAVGYGEEKGIPYWIVKNSWGPNWGMKGYFLIERGKNM 321
Query: 1208 CGI 1216
CG+
Sbjct: 322 CGL 324
>ref|NP_001071303.1| cathepsin Z precursor [Bos taurus].
Length = 304
Score = 80.5 bits (197), Expect = 3e-15
Identities = 64/259 (24%), Positives = 97/259 (37%), Gaps = 6/259 (2%)
Frame = +2
Query: 434 TFLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEI---RDQGS---CGSCWAFGAV 595
T LG P+ + + LPKS+D W N + R+Q CGSCWA G+
Sbjct: 44 TQLGRRTYPRPHEYLSPSDLPKSWD----WRNVNGVNYASVTRNQHIPQYCGSCWAHGST 99
Query: 596 EAISDRICIRSNGRVNVEVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYD 775
A++DRI I+ G + + + P W + + G+
Sbjct: 100 SAMADRINIKRKGAWPSTLLSVQHVIDCGDAGSCEGGNDLP--VWEYAHRHGIPDET--- 154
Query: 776 SHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISR 955
C Y E C C++ E +Y K Y
Sbjct: 155 ----CNNYQAKDQE---------CDKFNQCGTCTEFKECHVIKNYTLWK---VGDYGSLS 198
Query: 956 NEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYW 1135
+++MAEIY NGP+ Y G+Y H + + GWGV +G YW
Sbjct: 199 GREKMMAEIYTNGPISCGIMATEKMSNYTGGIYSEYNDQAFINHIVSVAGWGVSDGMEYW 258
Query: 1136 LVGNSCNTDWGDNGFFKIL 1192
+V NS WG++G+ +I+
Sbjct: 259 IVRNSWGEPWGEHGWMRIV 277
>ref|NP_001028787.1| cathepsin S precursor [Bos taurus].
Length = 331
Score = 73.9 bits (180), Expect = 3e-13
Identities = 68/260 (26%), Positives = 114/260 (43%), Gaps = 7/260 (2%)
Frame = +2
Query: 458 PQRAAFAAD--MILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSN 631
P+ + +D LP S D RE+ C T E++ QG+CGSCWAF AV A+ ++ +++
Sbjct: 102 PRNVTYKSDPNQKLPDSMDWREK--GCVT--EVKYQGACGSCWAFSAVGALEAQVKLKT- 156
Query: 632 GRVNVEVSAEDMLTXXXXXXXXXXXXX-FPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIP 808
G++ V +SA++++ F + A+ + ++ DS PY
Sbjct: 157 GKL-VSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQY-----IIDNNGIDSEASY-PYKAM 209
Query: 809 P--CEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEI 982
C++ V C+ + P FG +E+ + +
Sbjct: 210 DGKCQYDVKNRAATCSRYIELP-------------------FG--------SEEALKEAV 242
Query: 983 YKNGPVE-GAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSCNT 1159
GPV G +S F YK+GVY + H + ++G+G +G YWLV NS
Sbjct: 243 ANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVKNSWGL 302
Query: 1160 DWGDNGFFKILIGQ-DHCGI 1216
+GD G+ ++ +HCGI
Sbjct: 303 HFGDQGYIRMARNSGNHCGI 322
>ref|NP_001077155.1| cathepsin L1 [Bos taurus].
Length = 333
Score = 73.2 bits (178), Expect = 6e-13
Identities = 70/251 (27%), Positives = 106/251 (42%), Gaps = 9/251 (3%)
Frame = +2
Query: 491 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 670
+P S D RE+ P +++QG CGSCWAF A A+ ++ + G++ V +S ++++
Sbjct: 114 IPPSVDWREKGYVTP----VKNQGKCGSCWAFSATGALEGQM-FQKTGKL-VSLSEQNLV 167
Query: 671 TXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYS--IPPCEHHVNGSRPP 844
F A+ + L GGL DS PY+ + C ++ N S
Sbjct: 168 DCSQPEGNRGCHGGFIDNAFQYV----LDVGGL-DSEESY-PYTGLVGTCLYNPNNSAAN 221
Query: 845 CTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYS 1024
TG D PK EK +M + GP+ A ++
Sbjct: 222 ETGFVDLPK----------------------------QEKALMKAVANLGPISVAVDAHN 253
Query: 1025 DFLQ-YKSGVYQHVTGDLMG-GHAIRILGWGVENG----TPYWLVGNSCNTDWGDNGFFK 1186
Q YKSG+Y HA+ ++G+G E YWLV NS WG NG+ K
Sbjct: 254 PSFQFYKSGIYYEPNCSSESVDHAVLVVGYGFEGADSDDNKYWLVKNSWGEHWGMNGYIK 313
Query: 1187 ILIGQ-DHCGI 1216
+ + +HCGI
Sbjct: 314 MAKDRNNHCGI 324
>ref|NP_776457.1| cathepsin L2 precursor [Bos taurus].
Length = 334
Score = 72.4 bits (176), Expect = 9e-13
Identities = 64/249 (25%), Positives = 104/249 (41%), Gaps = 7/249 (2%)
Frame = +2
Query: 491 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 670
+PKS D W + +++QG CGSCWAF A A+ ++ R G++ V +S ++++
Sbjct: 114 VPKSVD----WTKKGYVTPVKNQGQCGSCWAFSATGALEGQM-FRKTGKL-VSLSEQNLV 167
Query: 671 TXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 850
A+ + +GGL DS Y + + +P C+
Sbjct: 168 DCSRAQGNQGCNGGLMDNAFQYIKD----NGGL-DSE---ESYPYLATDTNSCNYKPECS 219
Query: 851 GEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTV-YSD 1027
DT G+ I + EK +M + GP+ A ++
Sbjct: 220 AANDT---------GFV--------------DIPQREKALMKAVATVGPISVAIDAGHTS 256
Query: 1028 FLQYKSGVYQHVTGDLMG-GHAIRILGWGVE----NGTPYWLVGNSCNTDWGDNGFFKIL 1192
F YKSG+Y H + ++G+G E N +W+V NS +WG NG+ K+
Sbjct: 257 FQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMA 316
Query: 1193 IGQ-DHCGI 1216
Q +HCGI
Sbjct: 317 KDQNNHCGI 325
Database: RefSeq49_BP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 17,681,374
Number of sequences in database: 33,088
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33088
Number of Hits to DB: 62,256,848
Number of extensions: 1918387
Number of successful extensions: 8802
Number of sequences better than 1.0e-05: 14
Number of HSP's gapped: 8718
Number of HSP's successfully gapped: 15
Length of query: 480
Length of database: 17,681,374
Length adjustment: 106
Effective length of query: 374
Effective length of database: 14,174,046
Effective search space: 5301093204
Effective search space used: 5301093204
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 36 (18.5 bits)
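Note on the statistics above: the figures in this report can be cross-checked with the standard Karlin-Altschul relations. The sketch below is not part of the BLAST output; it only recomputes the effective search space, the bit score and the E-value of the top hit (raw score 1563) from the numbers printed in this report. The function names are illustrative, and it assumes that the BLASTX query length is the nucleotide length divided by three and that the gapped Lambda and K apply to the reported alignments.

```python
import math

# Values copied from the report above (Bos taurus search, gapped statistics).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
QUERY_NUCLEOTIDES = 1440
LENGTH_ADJUSTMENT = 106
DB_LETTERS = 17_681_374
DB_SEQUENCES = 33_088


def effective_search_space(query_nt, db_letters, db_sequences, adjustment):
    """Effective query length times effective database length."""
    query_aa = query_nt // 3                      # translated query length (480)
    effective_query = query_aa - adjustment       # 374 in the report
    effective_db = db_letters - db_sequences * adjustment
    return effective_query * effective_db


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Normalised score: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


space = effective_search_space(
    QUERY_NUCLEOTIDES, DB_LETTERS, DB_SEQUENCES, LENGTH_ADJUSTMENT
)
top_hit_bits = bit_score(1563)
top_hit_evalue = expect_value(top_hit_bits, space)
print(space, top_hit_bits, top_hit_evalue)
```

The recomputed search space agrees with the "Effective search space" line, and the top-hit values fall on the reported 606 bits and e-173. Because the printed Lambda and K are rounded, small differences in the last digit are to be expected.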
Search to RefSeqCP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Query= 20110601C-001419
(1440 letters)
Database: RefSeq49_CP.fasta
33,336 sequences; 18,874,504 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
gi|XP_543203.2| PREDICTED: similar to cathepsin B preproprotein...  582   e-166
gi|XP_535330.2| PREDICTED: similar to P3ECSL [Canis familiaris].    135   1e-31
gi|XP_538969.2| PREDICTED: similar to tubulointerstitial nephri...  133   5e-31
gi|NP_001182763.1| dipeptidyl peptidase 1 [Canis lupus familiar...  100   3e-21
gi|XP_535784.2| PREDICTED: similar to Dipeptidyl-peptidase I pr...   96   7e-20
gi|XP_536212.2| PREDICTED: similar to Cathepsin H precursor [Ca...   92   9e-19
gi|XP_854795.1| PREDICTED: similar to Cathepsin Z precursor (Ca...   79   8e-15
gi|NP_001002938.1| cathepsin S precursor [Canis lupus familiari...   76   9e-14
gi|NP_001003115.1| cathepsin L1 precursor [Canis lupus familiar...   70   4e-12
gi|XP_533219.2| PREDICTED: similar to Cathepsin F precursor (CA...   70   4e-12
>ref|XP_543203.2| PREDICTED: similar to cathepsin B preproprotein [Canis familiaris].
Length = 420
Score = 582 bits (1499), Expect = e-166
Identities = 266/338 (78%), Positives = 292/338 (86%)
Frame = +2
Query: 251 KMWRXXXXXXXXXXXXXXRESLHFQPLSDELVNFINKQNTTWTAGHNFYNVDLSYVKKLC 430
KMW+ + L F+ LSDELV+++NK+NTTW AGHNF+NVD SY+++LC
Sbjct: 81 KMWQLLTTLSCLVMLTGAQSRLPFRALSDELVDYVNKRNTTWKAGHNFHNVDPSYLRRLC 140
Query: 431 GTFLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISD 610
GTFLGGPKLPQR FA ++ILP+SFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISD
Sbjct: 141 GTFLGGPKLPQRVQFAKNLILPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISD 200
Query: 611 RICIRSNGRVNVEVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGC 790
RICIR+NG VNVEVSAEDMLT FP+ AWNFWTK+GLVSGGLYDSHVGC
Sbjct: 201 RICIRTNGHVNVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGC 260
Query: 791 RPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEI 970
RPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGY+PSYKEDKH+GCSSYS+S NEKEI
Sbjct: 261 RPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVSDNEKEI 320
Query: 971 MAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNS 1150
MAEIYKNGPVE AFTVYSDFL YKSGVYQHVTG++MGGHA+RILGWGVE+GTPYWLVGNS
Sbjct: 321 MAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVEDGTPYWLVGNS 380
Query: 1151 CNTDWGDNGFFKILIGQDHCGIESEIVAGIPCTPHF*K 1264
NTDWGDNGFFKIL G+DHCGIESEIVAGIPCT + K
Sbjct: 381 WNTDWGDNGFFKILRGRDHCGIESEIVAGIPCTDQYWK 418
>ref|XP_535330.2| PREDICTED: similar to P3ECSL [Canis familiaris].
Length = 550
Score = 135 bits (339), Expect = 1e-31
Identities = 94/322 (29%), Positives = 144/322 (44%), Gaps = 21/322 (6%)
Frame = +2
Query: 329 LSDELVNFINKQNTTWTAGHN--FYNVDLSY-VKKLCGTFLGGPKLPQ----RAAFAADM 487
+ +++N IN+ N W AG++ F+ + L ++ GT +
Sbjct: 225 VDQDMINAINQGNYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVTNMNEIHTVLRPGE 284
Query: 488 ILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDM 667
+LP +F+A E+WPN I E DQG+C WAF SDR+ I S G + +S +++
Sbjct: 285 VLPTAFEAAEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNL 342
Query: 668 LTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPC 847
L+ GAW F ++G+VS Y VG P + SR
Sbjct: 343 LSCDTHNQQGCRGGRL-DGAWWFLRRRGVVSDHCYP-FVGREQDEAGPAPRCMMHSRAMG 400
Query: 848 TGEGD-TPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYS 1024
G+ T +C + + D + +Y + NEKEIM E+ +NGPV+ V+
Sbjct: 401 RGKRQATARCPS------SHVHANDIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVHE 454
Query: 1025 DFLQYKSGVYQHVTGDL--------MGGHAIRILGWGVE-----NGTPYWLVGNSCNTDW 1165
DF Y+ G+Y H L G H+++I GWG E YW NS W
Sbjct: 455 DFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAW 514
Query: 1166 GDNGFFKILIGQDHCGIESEIV 1231
G+ G F+I+ G + C IES ++
Sbjct: 515 GERGHFRIVRGANECDIESFVL 536
>ref|XP_538969.2| PREDICTED: similar to tubulointerstitial nephritis antigen [Canis
familiaris].
Length = 476
Score = 133 bits (334), Expect = 5e-31
Identities = 98/327 (29%), Positives = 145/327 (44%), Gaps = 28/327 (8%)
Frame = +2
Query: 338 ELVNFINKQNTTWTAGH--NFYNVDLSY-VKKLCGTFLGGPKL----PQRAAFAADMILP 496
EL+ +NK + WTA + F+ + L K GT P L A+ A LP
Sbjct: 159 ELIEHVNKGDYGWTAQNYSQFWGMTLEEGFKYRLGTLPPSPMLLSMNEMTASLPATTDLP 218
Query: 497 KSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTX 676
+ F A +WP DQ +C + WAF +DRI I+SNGR +S +++++
Sbjct: 219 EFFIASYKWPGWT--HGPLDQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISC 276
Query: 677 XXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYD-------SHVGCRPYSIPPCEHHVNGS 835
AW F K+GLVS Y ++ GC S + +
Sbjct: 277 CAKNRHGCNSGSIDR-AWWFLRKRGLVSHACYPLFKDQNATNYGCAMASRSDGRGKRHAT 335
Query: 836 RPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSS-YSISRNEKEIMAEIYKNGPVEGAF 1012
+P C E K ++ + CS Y +S NE EIM EI +NGPV+
Sbjct: 336 KP----------CPNNIE-------KSNRIYQCSPPYRVSSNETEIMKEIMQNGPVQAIM 378
Query: 1013 TVYSDFLQYKSGVYQHVTG--------DLMGGHAIRILGWGVENGT-----PYWLVGNSC 1153
V+ DF YK+G+Y+H+T + HA+++ GWG G +W+ NS
Sbjct: 379 QVHEDFFHYKTGIYRHITRTNEESRKYQKLQTHAVKLTGWGTLKGAQGQKEKFWIAANSW 438
Query: 1154 NTDWGDNGFFKILIGQDHCGIESEIVA 1234
WG+NG+F+IL G + IE I+A
Sbjct: 439 GISWGENGYFRILRGVNESDIEKLIIA 465
>ref|NP_001182763.1| dipeptidyl peptidase 1 [Canis lupus familiaris].
Length = 459
Score = 100 bits (249), Expect = 3e-21
Identities = 83/322 (25%), Positives = 134/322 (41%), Gaps = 20/322 (6%)
Frame = +2
Query: 338 ELVNFINKQNTTWTAGHNFYNVDLSYVKKLCGTFLGGPKLPQRAAFAADMILPKSFDARE 517
E V IN +WTA L+ + T +GG K+P+ P + + E
Sbjct: 172 EFVKAINTIQKSWTATRYIEYETLTLRDMM--TRVGGRKIPRPKP------TPLTAEIHE 223
Query: 518 QWPNCPT------------IKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAE 661
+ PT + +R+Q SCGSC+AF + + RI I +N +S +
Sbjct: 224 EISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQ 283
Query: 662 DMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRP 841
++++ + A + GLV C PY+ GS
Sbjct: 284 EIVSCSQYAQGCEGGFPYLI-AGKYAQDFGLV-------EEACFPYA---------GSDS 326
Query: 842 PCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVY 1021
P C+P Y +++ + + NE + E+ ++GP+ AF VY
Sbjct: 327 P-------------CKPNDCFRYYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVY 373
Query: 1022 SDFLQYKSGVYQHV------TGDLMGGHAIRILGWGVE--NGTPYWLVGNSCNTDWGDNG 1177
DF Y+ G+Y H + HA+ ++G+G + +G YW+V NS + WG++G
Sbjct: 374 DDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDG 433
Query: 1178 FFKILIGQDHCGIESEIVAGIP 1243
+F+I G D C IES VA P
Sbjct: 434 YFRIRRGTDECAIESIAVAATP 455
>ref|XP_535784.2| PREDICTED: similar to Dipeptidyl-peptidase I precursor (DPP-I) (DPPI)
(Cathepsin C) (Cathepsin J) (Dipeptidyl transferase),
partial [Canis familiaris].
Length = 481
Score = 96.3 bits (238), Expect = 7e-20
Identities = 86/315 (27%), Positives = 132/315 (41%), Gaps = 13/315 (4%)
Frame = +2
Query: 338 ELVNFINKQNTTWTAGHNFYNVDLSY---VKKLCGTFLGGPKLPQRAAFAADMI--LPKS 502
E V IN +WTA L+ +++ G + PK A + I LP S
Sbjct: 194 EFVKAINTIQKSWTATRYIEYETLTLRDMMRRAGGRKIPRPKPTPLTAEIHEEISRLPTS 253
Query: 503 FDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTXXX 682
+D R + +R+Q SCGSC+AF + + RI I +N +S +++++
Sbjct: 254 WDWRNV-RGTNFVSPVRNQASCGSCYAFASTVMLEARIRILTNNTQTPILSPQEIVSCSQ 312
Query: 683 XXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGD 862
+ K GL D C Y+ GS PC
Sbjct: 313 YAQGCEGGFPYLIAG------KYAQDFGLVDE--ACFSYA---------GSDSPCKPND- 354
Query: 863 TPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYK 1042
C Y+ Y H+ Y NE + E+ ++GP+ AF VY DF Y+
Sbjct: 355 -------CFHYYSSEY----HYVGGFYGAC-NEALMKLELVRHGPMAVAFEVYDDFFHYQ 402
Query: 1043 SGVYQH------VTGDLMGGHAIRILGWGVEN--GTPYWLVGNSCNTDWGDNGFFKILIG 1198
G+Y H + + HA+ ++G+G ++ G YW+V NS + WG++G+F+I G
Sbjct: 403 KGIYYHTGLRDPINPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFQICRG 462
Query: 1199 QDHCGIESEIVAGIP 1243
D C IES VA P
Sbjct: 463 TDECAIESIAVAATP 477
>ref|XP_536212.2| PREDICTED: similar to Cathepsin H precursor [Canis familiaris].
Length = 304
Score = 92.4 bits (228), Expect = 9e-19
Identities = 68/231 (29%), Positives = 103/231 (44%), Gaps = 5/231 (2%)
Frame = +2
Query: 539 IKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTXXXXXXXXXXXXXFP 718
+ +++QGSCGSCW F A+ I I+S +++ AE L
Sbjct: 97 VSPVKNQGSCGSCWTFSTTGALESAIAIKSGKLLSL---AEQQLVDC------------- 140
Query: 719 SGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGY 898
A NF ++ GC+ Y P GE P + + Y
Sbjct: 141 --AQNF-------------NNHGCQGYGAPLQAFEYIRYNKGIMGEDSYPYKGQDGDCKY 185
Query: 899 TPSYKEDKHFGCSSYSISRNEKEIMAE-IYKNGPVEGAFTVYSDFLQYKSGVYQ----HV 1063
PS + F +I+ N+++ M E + PV AF V SDF+ Y+ G+Y H
Sbjct: 186 QPS--KAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVTSDFMMYRKGIYSSTSCHK 243
Query: 1064 TGDLMGGHAIRILGWGVENGTPYWLVGNSCNTDWGDNGFFKILIGQDHCGI 1216
T D + HA+ +G+G +NG PYW+V NS WG NG+F + G++ CG+
Sbjct: 244 TPDKVN-HAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLMERGKNMCGL 293
>ref|XP_854795.1| PREDICTED: similar to Cathepsin Z precursor (Cathepsin X) (Cathepsin
P) [Canis familiaris].
Length = 375
Score = 79.3 bits (194), Expect = 8e-15
Identities = 68/251 (27%), Positives = 103/251 (41%), Gaps = 6/251 (2%)
Frame = +2
Query: 458 PQRAAFAADMILPKSFDAREQWPNCPTIK---EIRDQGS---CGSCWAFGAVEAISDRIC 619
P+ + + LPKS+D W N + R+Q CGSCWA G+ A++DRI
Sbjct: 123 PRPHEYLSPSDLPKSWD----WRNVNGVNYASATRNQHIPQYCGSCWAHGSTSAMADRIN 178
Query: 620 IRSNGRVNVEVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPY 799
I+ G + + + P W++ + G+ C Y
Sbjct: 179 IKRKGAWPSTLLSVQHVLDCANAGSCEGGNDLP--VWSYAHEHGIPDET-------CNNY 229
Query: 800 SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAE 979
E + CT + +C I YT D +G S+S EK +MAE
Sbjct: 230 QAKDQECNKFNQCGTCT---EFKECHAI--QNYTLWRVGD--YG----SLSGREK-MMAE 277
Query: 980 IYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSCNT 1159
IY NGP+ + Y G++ H I ++GWGV +GT YW+V NS
Sbjct: 278 IYANGPISCGIMATEKMVNYTGGIHAEYQEQAYINHVISVVGWGVSDGTEYWIVRNSWGE 337
Query: 1160 DWGDNGFFKIL 1192
WG+ G+ +I+
Sbjct: 338 PWGERGWMRIV 348
>ref|NP_001002938.1| cathepsin S precursor [Canis lupus familiaris].
Length = 331
Score = 75.9 bits (185), Expect = 9e-14
Identities = 77/306 (25%), Positives = 135/306 (44%), Gaps = 13/306 (4%)
Frame = +2
Query: 344 VNFINKQNTTWTAGHNFYNVDLSYVKKLCG----TFLGGPKLPQR------AAFAADMIL 493
+ F+ N + G + Y++ ++++ + G + +G ++P + ++ L
Sbjct: 56 LKFVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVTYRSNSNQKL 115
Query: 494 PKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLT 673
P S D RE+ C T E++ QGSCG+CWAF AV A+ ++ +++ G++ V +SA++++
Sbjct: 116 PDSVDWREK--GCVT--EVKYQGSCGACWAFSAVGALEAQLKLKT-GKL-VSLSAQNLVD 169
Query: 674 XXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTG 853
+G + + ++ DS PY + + + T
Sbjct: 170 CSTEKYGNKGC----NGGFMTTAFQYIIDNNGIDSEASY-PYKAMNGKCRYDSKKRAAT- 223
Query: 854 EGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTV--YSD 1027
CSK E FG +E + + GPV A YS
Sbjct: 224 ------CSKYTE----------LPFG--------SEDALKEAVANKGPVSVAIDASHYSF 259
Query: 1028 FLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSCNTDWGDNGFFKILIGQ-D 1204
FL Y+SGVY + H + ++G+G NG YWLV NS ++GD G+ ++ +
Sbjct: 260 FL-YRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGN 318
Query: 1205 HCGIES 1222
HCGI S
Sbjct: 319 HCGIAS 324
>ref|NP_001003115.1| cathepsin L1 precursor [Canis lupus familiaris].
Length = 333
Score = 70.5 bits (171), Expect = 4e-12
Identities = 67/263 (25%), Positives = 111/263 (42%), Gaps = 8/263 (3%)
Frame = +2
Query: 452 KLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSN 631
K+ Q FA +PKS D RE+ P +++QG CGSCWAF A A+ ++ R
Sbjct: 104 KMFQEPLFAE---IPKSVDWREKGYVTP----VKNQGQCGSCWAFSATGALEGQM-FRKT 155
Query: 632 GRVNVEVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPP 811
G++ V +S ++++ A+ + G + ++G
Sbjct: 156 GKL-VSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLG---RDTET 211
Query: 812 CEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKN 991
C + +P C+ DT G+ + + EK +M +
Sbjct: 212 CNY-----KPECSAANDT---------GFV--------------DLPQREKALMKAVATL 243
Query: 992 GPVEGAFTV-YSDFLQYKSGVY---QHVTGDLMGGHAIRILGWGVE---NGTPYWLVGNS 1150
GP+ A + F YKSG+Y + DL H + ++G+G E + +W+V NS
Sbjct: 244 GPISVAIDAGHQSFQFYKSGIYFDPDCSSKDL--DHGVLVVGYGFEGTDSNNKFWIVKNS 301
Query: 1151 CNTDWGDNGFFKILIGQ-DHCGI 1216
+WG NG+ K+ Q +HCGI
Sbjct: 302 WGPEWGWNGYVKMAKDQNNHCGI 324
>ref|XP_533219.2| PREDICTED: similar to Cathepsin F precursor (CATSF) [Canis
familiaris].
Length = 442
Score = 70.5 bits (171), Expect = 4e-12
Identities = 54/237 (22%), Positives = 102/237 (43%), Gaps = 3/237 (1%)
Frame = +2
Query: 521 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTXXXXXXXXX 700
W + + +++DQG CGSCWAF + + ++ +++ S +++L
Sbjct: 235 WRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLKEGTLLSL--SEQELLDCDKVDKACL 292
Query: 701 XXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSK 880
PS A++ + GGL YS +G CS
Sbjct: 293 GG--LPSNAYSAI----MTLGGLETED----DYSY----------------QGHLQACSF 326
Query: 881 ICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGV--- 1051
S K+ + + S +S+NE+++ A + K GP+ A + Y+ G+
Sbjct: 327 --------SAKKARVYINDSMELSQNEQKLAAWLAKKGPISVAINAFG-MQFYRHGISHP 377
Query: 1052 YQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSCNTDWGDNGFFKILIGQDHCGIES 1222
+ + + HA+ ++G+G +G P+W + NS TDWG+ G++ + G CG+ +
Sbjct: 378 LRPLCSPWLIDHAVLLVGYGNRSGIPFWAIKNSWGTDWGEEGYYYLHRGSGACGVNT 434
Database: RefSeq49_CP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 18,874,504
Number of sequences in database: 33,336
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 33336
Number of Hits to DB: 64,685,692
Number of extensions: 1955539
Number of successful extensions: 8895
Number of sequences better than 1.0e-05: 12
Number of HSP's gapped: 8839
Number of HSP's successfully gapped: 12
Length of query: 480
Length of database: 18,874,504
Length adjustment: 107
Effective length of query: 373
Effective length of database: 15,307,552
Effective search space: 5709716896
Effective search space used: 5709716896
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 36 (18.5 bits)
Search to RefSeqHP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Query= 20110601C-001419
(1440 letters)
Database: RefSeq49_HP.fasta
32,964 sequences; 18,297,164 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
gi|NP_680093.1| cathepsin B preproprotein [Homo sapiens].           558   e-159
gi|NP_680092.1| cathepsin B preproprotein [Homo sapiens].           558   e-159
gi|NP_680091.1| cathepsin B preproprotein [Homo sapiens].           558   e-159
gi|NP_680090.1| cathepsin B preproprotein [Homo sapiens].           558   e-159
gi|NP_001899.1| cathepsin B preproprotein [Homo sapiens].           558   e-159
gi|NP_001191344.1| tubulointerstitial nephritis antigen-like is...  133   5e-31
gi|NP_071447.1| tubulointerstitial nephritis antigen-like isofo...  133   5e-31
gi|NP_001191343.1| tubulointerstitial nephritis antigen-like is...  130   2e-30
gi|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapi...  129   9e-30
gi|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein ...  101   2e-21
>ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens].
Length = 339
Score = 558 bits (1437), Expect = e-159
Identities = 253/335 (75%), Positives = 281/335 (83%)
Frame = +2
Query: 254 MWRXXXXXXXXXXXXXXRESLHFQPLSDELVNFINKQNTTWTAGHNFYNVDLSYVKKLCG 433
MW+ R F PLSDELVN++NK+NTTW AGHNFYNVD+SY+K+LCG
Sbjct: 1 MWQLWASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNFYNVDMSYLKRLCG 60
Query: 434 TFLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDR 613
TFLGGPK PQR F D+ LP SFDAREQWP CPTIKEIRDQGSCGSCWAFGAVEAISDR
Sbjct: 61 TFLGGPKPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDR 120
Query: 614 ICIRSNGRVNVEVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCR 793
ICI +N V+VEVSAED+LT +P+ AWNFWT+KGLVSGGLY+SHVGCR
Sbjct: 121 ICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCR 180
Query: 794 PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIM 973
PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGY+P+YK+DKH+G +SYS+S +EK+IM
Sbjct: 181 PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIM 240
Query: 974 AEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSC 1153
AEIYKNGPVEGAF+VYSDFL YKSGVYQHVTG++MGGHAIRILGWGVENGTPYWLV NS
Sbjct: 241 AEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVANSW 300
Query: 1154 NTDWGDNGFFKILIGQDHCGIESEIVAGIPCTPHF 1258
NTDWGDNGFFKIL GQDHCGIESE+VAGIP T +
Sbjct: 301 NTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQY 335
>ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens].
Length = 339
Score = 558 bits (1437), Expect = e-159
Identities = 253/335 (75%), Positives = 281/335 (83%)
Frame = +2
Query: 254 MWRXXXXXXXXXXXXXXRESLHFQPLSDELVNFINKQNTTWTAGHNFYNVDLSYVKKLCG 433
MW+ R F PLSDELVN++NK+NTTW AGHNFYNVD+SY+K+LCG
Sbjct: 1 MWQLWASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNFYNVDMSYLKRLCG 60
Query: 434 TFLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDR 613
TFLGGPK PQR F D+ LP SFDAREQWP CPTIKEIRDQGSCGSCWAFGAVEAISDR
Sbjct: 61 TFLGGPKPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDR 120
Query: 614 ICIRSNGRVNVEVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCR 793
ICI +N V+VEVSAED+LT +P+ AWNFWT+KGLVSGGLY+SHVGCR
Sbjct: 121 ICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCR 180
Query: 794 PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIM 973
PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGY+P+YK+DKH+G +SYS+S +EK+IM
Sbjct: 181 PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIM 240
Query: 974 AEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSC 1153
AEIYKNGPVEGAF+VYSDFL YKSGVYQHVTG++MGGHAIRILGWGVENGTPYWLV NS
Sbjct: 241 AEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVANSW 300
Query: 1154 NTDWGDNGFFKILIGQDHCGIESEIVAGIPCTPHF 1258
NTDWGDNGFFKIL GQDHCGIESE+VAGIP T +
Sbjct: 301 NTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQY 335
>ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens].
Length = 339
Score = 558 bits (1437), Expect = e-159
Identities = 253/335 (75%), Positives = 281/335 (83%)
Frame = +2
Query: 254 MWRXXXXXXXXXXXXXXRESLHFQPLSDELVNFINKQNTTWTAGHNFYNVDLSYVKKLCG 433
MW+ R F PLSDELVN++NK+NTTW AGHNFYNVD+SY+K+LCG
Sbjct: 1 MWQLWASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNFYNVDMSYLKRLCG 60
Query: 434 TFLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDR 613
TFLGGPK PQR F D+ LP SFDAREQWP CPTIKEIRDQGSCGSCWAFGAVEAISDR
Sbjct: 61 TFLGGPKPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDR 120
Query: 614 ICIRSNGRVNVEVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCR 793
ICI +N V+VEVSAED+LT +P+ AWNFWT+KGLVSGGLY+SHVGCR
Sbjct: 121 ICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCR 180
Query: 794 PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIM 973
PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGY+P+YK+DKH+G +SYS+S +EK+IM
Sbjct: 181 PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIM 240
Query: 974 AEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSC 1153
AEIYKNGPVEGAF+VYSDFL YKSGVYQHVTG++MGGHAIRILGWGVENGTPYWLV NS
Sbjct: 241 AEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVANSW 300
Query: 1154 NTDWGDNGFFKILIGQDHCGIESEIVAGIPCTPHF 1258
NTDWGDNGFFKIL GQDHCGIESE+VAGIP T +
Sbjct: 301 NTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQY 335
>ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens].
Length = 339
Score = 558 bits (1437), Expect = e-159
Identities = 253/335 (75%), Positives = 281/335 (83%)
Frame = +2
Query: 254 MWRXXXXXXXXXXXXXXRESLHFQPLSDELVNFINKQNTTWTAGHNFYNVDLSYVKKLCG 433
MW+ R F PLSDELVN++NK+NTTW AGHNFYNVD+SY+K+LCG
Sbjct: 1 MWQLWASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNFYNVDMSYLKRLCG 60
Query: 434 TFLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDR 613
TFLGGPK PQR F D+ LP SFDAREQWP CPTIKEIRDQGSCGSCWAFGAVEAISDR
Sbjct: 61 TFLGGPKPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDR 120
Query: 614 ICIRSNGRVNVEVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCR 793
ICI +N V+VEVSAED+LT +P+ AWNFWT+KGLVSGGLY+SHVGCR
Sbjct: 121 ICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCR 180
Query: 794 PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIM 973
PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGY+P+YK+DKH+G +SYS+S +EK+IM
Sbjct: 181 PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIM 240
Query: 974 AEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSC 1153
AEIYKNGPVEGAF+VYSDFL YKSGVYQHVTG++MGGHAIRILGWGVENGTPYWLV NS
Sbjct: 241 AEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVANSW 300
Query: 1154 NTDWGDNGFFKILIGQDHCGIESEIVAGIPCTPHF 1258
NTDWGDNGFFKIL GQDHCGIESE+VAGIP T +
Sbjct: 301 NTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQY 335
>ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens].
Length = 339
Score = 558 bits (1437), Expect = e-159
Identities = 253/335 (75%), Positives = 281/335 (83%)
Frame = +2
Query: 254 MWRXXXXXXXXXXXXXXRESLHFQPLSDELVNFINKQNTTWTAGHNFYNVDLSYVKKLCG 433
MW+ R F PLSDELVN++NK+NTTW AGHNFYNVD+SY+K+LCG
Sbjct: 1 MWQLWASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNFYNVDMSYLKRLCG 60
Query: 434 TFLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDR 613
TFLGGPK PQR F D+ LP SFDAREQWP CPTIKEIRDQGSCGSCWAFGAVEAISDR
Sbjct: 61 TFLGGPKPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDR 120
Query: 614 ICIRSNGRVNVEVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCR 793
ICI +N V+VEVSAED+LT +P+ AWNFWT+KGLVSGGLY+SHVGCR
Sbjct: 121 ICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCR 180
Query: 794 PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIM 973
PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGY+P+YK+DKH+G +SYS+S +EK+IM
Sbjct: 181 PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIM 240
Query: 974 AEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSC 1153
AEIYKNGPVEGAF+VYSDFL YKSGVYQHVTG++MGGHAIRILGWGVENGTPYWLV NS
Sbjct: 241 AEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVANSW 300
Query: 1154 NTDWGDNGFFKILIGQDHCGIESEIVAGIPCTPHF 1258
NTDWGDNGFFKIL GQDHCGIESE+VAGIP T +
Sbjct: 301 NTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQY 335
>ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens].
Length = 362
Score = 133 bits (334), Expect = 5e-31
Identities = 96/321 (29%), Positives = 142/321 (44%), Gaps = 23/321 (7%)
Frame = +2
Query: 338 ELVNFINKQNTTWTAGHN--FYNVDLSY-VKKLCGTFLGGPKLPQRAAFAADM----ILP 496
+++ IN+ N W AG++ F+ + L ++ GT + + +LP
Sbjct: 40 DMIKAINQGNYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNPGEVLP 99
Query: 497 KSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTX 676
+F+A E+WPN I E DQG+C WAF SDR+ I S G + +S +++L+
Sbjct: 100 TAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC 157
Query: 677 XXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGE 856
GAW F ++G+VS C P+S E G PPC
Sbjct: 158 DTHQQQGCRGGRL-DGAWWFLRRRGVVSDH-------CYPFS--GRERDEAGPAPPCMMH 207
Query: 857 GDTPKCSKICEPGYTP-SY--KEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSD 1027
K + P SY D + Y + N+KEIM E+ +NGPV+ V+ D
Sbjct: 208 SRAMGRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHED 267
Query: 1028 FLQYKSGVYQHVTGDL--------MGGHAIRILGWGVE-----NGTPYWLVGNSCNTDWG 1168
F YK G+Y H L G H+++I GWG E YW NS WG
Sbjct: 268 FFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWG 327
Query: 1169 DNGFFKILIGQDHCGIESEIV 1231
+ G F+I+ G + C IES ++
Sbjct: 328 ERGHFRIVRGVNECDIESFVL 348
>ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
sapiens].
Length = 467
Score = 133 bits (334), Expect = 5e-31
Identities = 96/321 (29%), Positives = 142/321 (44%), Gaps = 23/321 (7%)
Frame = +2
Query: 338 ELVNFINKQNTTWTAGHN--FYNVDLSY-VKKLCGTFLGGPKLPQRAAFAADM----ILP 496
+++ IN+ N W AG++ F+ + L ++ GT + + +LP
Sbjct: 145 DMIKAINQGNYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNPGEVLP 204
Query: 497 KSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTX 676
+F+A E+WPN I E DQG+C WAF SDR+ I S G + +S +++L+
Sbjct: 205 TAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC 262
Query: 677 XXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGE 856
GAW F ++G+VS C P+S E G PPC
Sbjct: 263 DTHQQQGCRGGRL-DGAWWFLRRRGVVSDH-------CYPFS--GRERDEAGPAPPCMMH 312
Query: 857 GDTPKCSKICEPGYTP-SY--KEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSD 1027
K + P SY D + Y + N+KEIM E+ +NGPV+ V+ D
Sbjct: 313 SRAMGRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHED 372
Query: 1028 FLQYKSGVYQHVTGDL--------MGGHAIRILGWGVE-----NGTPYWLVGNSCNTDWG 1168
F YK G+Y H L G H+++I GWG E YW NS WG
Sbjct: 373 FFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWG 432
Query: 1169 DNGFFKILIGQDHCGIESEIV 1231
+ G F+I+ G + C IES ++
Sbjct: 433 ERGHFRIVRGVNECDIESFVL 453
>ref|NP_001191343.1| tubulointerstitial nephritis antigen-like isoform 2 precursor [Homo
sapiens].
Length = 436
Score = 130 bits (328), Expect = 2e-30
Identities = 86/264 (32%), Positives = 120/264 (45%), Gaps = 16/264 (6%)
Frame = +2
Query: 488 ILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDM 667
+LP +F+A E+WPN I E DQG+C WAF SDR+ I S G + +S +++
Sbjct: 171 VLPTAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNL 228
Query: 668 LTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPC 847
L+ GAW F ++G+VS C P+S E G PPC
Sbjct: 229 LSCDTHQQQGCRGGRL-DGAWWFLRRRGVVSDH-------CYPFS--GRERDEAGPAPPC 278
Query: 848 TGEGDTPKCSKICEPGYTP-SY--KEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTV 1018
K + P SY D + Y + N+KEIM E+ +NGPV+ V
Sbjct: 279 MMHSRAMGRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEV 338
Query: 1019 YSDFLQYKSGVYQHVTGDL--------MGGHAIRILGWGVE-----NGTPYWLVGNSCNT 1159
+ DF YK G+Y H L G H+++I GWG E YW NS
Sbjct: 339 HEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGP 398
Query: 1160 DWGDNGFFKILIGQDHCGIESEIV 1231
WG+ G F+I+ G + C IES ++
Sbjct: 399 AWGERGHFRIVRGVNECDIESFVL 422
>ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens].
Length = 476
Score = 129 bits (323), Expect = 9e-30
Identities = 98/331 (29%), Positives = 142/331 (42%), Gaps = 23/331 (6%)
Frame = +2
Query: 311 SLHFQPLSDELVNFINKQNTTWTAGH--NFYNVDLSYVKKL-CGTFLGGPKL----PQRA 469
S H + EL+ +NK + WTA + F+ + L K GT P L A
Sbjct: 150 SQHVCLVRSELIEQVNKGDYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTA 209
Query: 470 AFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVE 649
+ A LP+ F A +WP DQ +C + WAF +DRI I+S GR
Sbjct: 210 SLPATTDLPEFFVASYKWPGWT--HGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTAN 267
Query: 650 VSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVN 829
+S +++++ AW + K+GLVS Y P N
Sbjct: 268 LSPQNLISCCAKNRHGCNSGSIDR-AWWYLRKRGLVSHACY------------PLFKDQN 314
Query: 830 GSRPPCT--GEGDTPKCSKICEPGYTPSYKEDKHFGCSS-YSISRNEKEIMAEIYKNGPV 1000
+ C D +P K ++ + CS Y +S NE EIM EI +NGPV
Sbjct: 315 ATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCSPPYRVSSNETEIMKEIMQNGPV 374
Query: 1001 EGAFTVYSDFLQYKSGVYQHVTGD--------LMGGHAIRILGWGVENGT-----PYWLV 1141
+ V DF YK+G+Y+HVT + HA+++ GWG G +W+
Sbjct: 375 QAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIA 434
Query: 1142 GNSCNTDWGDNGFFKILIGQDHCGIESEIVA 1234
NS WG+NG+F+IL G + IE I+A
Sbjct: 435 ANSWGKSWGENGYFRILRGVNESDIEKLIIA 465
>ref|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein [Homo sapiens].
Length = 463
Score = 101 bits (252), Expect = 2e-21
Identities = 79/262 (30%), Positives = 117/262 (44%), Gaps = 11/262 (4%)
Frame = +2
Query: 491 LPKSFDAREQWPNCPTIK---EIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAE 661
LP S+D W N I +R+Q SCGSC++F ++ + RI I +N +S +
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286
Query: 662 DMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRP 841
++++ + A + GLV C PY+ G+
Sbjct: 287 EVVSCSQYAQGCEGGFPYLI-AGKYAQDFGLV-------EEACFPYT---------GTDS 329
Query: 842 PCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVY 1021
PC K + C Y+ Y H+ Y NE + E+ +GP+ AF VY
Sbjct: 330 PC-------KMKEDCFRYYSSEY----HYVGGFYG-GCNEALMKLELVHHGPMAVAFEVY 377
Query: 1022 SDFLQYKSGVYQHV------TGDLMGGHAIRILGWGVE--NGTPYWLVGNSCNTDWGDNG 1177
DFL YK G+Y H + HA+ ++G+G + +G YW+V NS T WG+NG
Sbjct: 378 DDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENG 437
Query: 1178 FFKILIGQDHCGIESEIVAGIP 1243
+F+I G D C IES VA P
Sbjct: 438 YFRIRRGTDECAIESIAVAATP 459
Database: RefSeq49_HP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 18,297,164
Number of sequences in database: 32,964
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 32964
Number of Hits to DB: 63,503,963
Number of extensions: 1927102
Number of successful extensions: 8741
Number of sequences better than 1.0e-05: 21
Number of HSP's gapped: 8572
Number of HSP's successfully gapped: 21
Length of query: 480
Length of database: 18,297,164
Length adjustment: 106
Effective length of query: 374
Effective length of database: 14,802,980
Effective search space: 5536314520
Effective search space used: 5536314520
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 36 (18.5 bits)
Search to RefSeqMP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-001419
(1440 letters)
Database: RefSeq49_MP.fasta
30,036 sequences; 15,617,559 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|NP_031824.1| cathepsin B preproprotein [Mus musculus]. 533 e-151
Alignment gi|NP_036163.3| tubulointerstitial nephritis antigen [Mus muscu... 130 2e-30
Alignment gi|NP_001161805.1| tubulointerstitial nephritis antigen-like pr... 126 4e-29
Alignment gi|NP_075965.2| tubulointerstitial nephritis antigen-like precu... 126 4e-29
Alignment gi|NP_034112.3| dipeptidyl peptidase 1 preproprotein [Mus muscu... 94 2e-19
Alignment gi|NP_031827.2| pro-cathepsin H preproprotein [Mus musculus]. 86 6e-17
Alignment gi|NP_081620.2| cathepsin L-like 3 [Mus musculus]. 78 2e-14
Alignment gi|NP_067256.2| cathepsin S preproprotein [Mus musculus]. 78 2e-14
Alignment gi|NP_071720.1| cathepsin Z preproprotein [Mus musculus]. 73 5e-13
Alignment gi|NP_062414.3| cathepsin 8 [Mus musculus]. 73 5e-13
>ref|NP_031824.1| cathepsin B preproprotein [Mus musculus].
Length = 339
Score = 533 bits (1373), Expect = e-151
Identities = 241/313 (76%), Positives = 269/313 (85%)
Frame = +2
Query: 320 FQPLSDELVNFINKQNTTWTAGHNFYNVDLSYVKKLCGTFLGGPKLPQRAAFAADMILPK 499
F PLSD+L+N+INKQNTTW AG NFYNVD+SY+KKLCGT LGGPKLP R AF D+ LP+
Sbjct: 23 FHPLSDDLINYINKQNTTWQAGRNFYNVDISYLKKLCGTVLGGPKLPGRVAFGEDIDLPE 82
Query: 500 SFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTXX 679
+FDAREQW NCPTI +IRDQGSCGSCWAFGAVEAISDR CI +NGRVNVEVSAED+LT
Sbjct: 83 TFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCC 142
Query: 680 XXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEG 859
+PSGAW+FWTKKGLVSGG+Y+SHVGC PY+IPPCEHHVNGSRPPCTGEG
Sbjct: 143 GIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGEG 202
Query: 860 DTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQY 1039
DTP+C+K CE GY+PSYKEDKHFG +SYS+S + KEIMAEIYKNGPVEGAFTV+SDFL Y
Sbjct: 203 DTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTY 262
Query: 1040 KSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSCNTDWGDNGFFKILIGQDHCGIE 1219
KSGVY+H GD+MGGHAIRILGWGVENG PYWL NS N DWGDNGFFKIL G++HCGIE
Sbjct: 263 KSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAANSWNLDWGDNGFFKILRGENHCGIE 322
Query: 1220 SEIVAGIPCTPHF 1258
SEIVAGIP T +
Sbjct: 323 SEIVAGIPRTDQY 335
>ref|NP_036163.3| tubulointerstitial nephritis antigen [Mus musculus].
Length = 475
Score = 130 bits (328), Expect = 2e-30
Identities = 98/329 (29%), Positives = 146/329 (44%), Gaps = 21/329 (6%)
Frame = +2
Query: 311 SLHFQPLSDELVNFINKQNTTWTAGH--NFYNVDLSYVKKL-CGTFLGGPKL----PQRA 469
S H + EL++ INK + WTA + F+ + L K GT P L A
Sbjct: 149 SQHVCLVHPELIDHINKGDYGWTAQNYSQFWGMTLEEGFKFRLGTLPPSPMLLSMNEMTA 208
Query: 470 AFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVE 649
+F LP+ F A +WP DQ +C + WAF +DRI I+S GR
Sbjct: 209 SFPPRADLPEIFIASYKWPGWT--HGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTAN 266
Query: 650 VSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVN 829
+S +++++ AW F K+GLVS Y + +++
Sbjct: 267 LSPQNLISCCAKNRHGCNSGSIDR-AWWFLRKRGLVSHACYPL------FKDQNTTNNIC 319
Query: 830 GSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSS-YSISRNEKEIMAEIYKNGPVEG 1006
G G +K C + K ++ + CS Y +S NE EIM EI +NGPV+
Sbjct: 320 AMASRSDGRGKR-HATKPCPNSFE---KSNRIYQCSPPYRVSSNETEIMREIIQNGPVQA 375
Query: 1007 AFTVYSDFLQYKSGVYQHVTG--------DLMGGHAIRILGWGVENGT-----PYWLVGN 1147
V+ DF YK+G+Y+HV + HA+++ GWG G +W+ N
Sbjct: 376 IMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAAN 435
Query: 1148 SCNTDWGDNGFFKILIGQDHCGIESEIVA 1234
S WG+NG+F+IL G + IE I+A
Sbjct: 436 SWGKSWGENGYFRILRGVNESDIEKLIIA 464
>ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus].
Length = 466
Score = 126 bits (317), Expect = 4e-29
Identities = 92/318 (28%), Positives = 140/318 (44%), Gaps = 20/318 (6%)
Frame = +2
Query: 338 ELVNFINKQNTTWTAGHN--FYNVDLSY-VKKLCGTFLGGPKLPQR----AAFAADMILP 496
+++ IN+ N W AG++ F+ + L ++ GT + +LP
Sbjct: 144 DMIKAINRGNYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSTVMNMNEIYTVLGQGEVLP 203
Query: 497 KSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTX 676
+F+A E+WPN I E DQG+C WAF SDR+ I S G + +S +++L+
Sbjct: 204 TAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSC 261
Query: 677 XXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGE 856
GAW F ++G+VS Y G P + SR G
Sbjct: 262 DTHHQQGCRGGRL-DGAWWFLRRRGVVSDNCYPFS-GREQNEASPTPRCMMHSR--AMGR 317
Query: 857 GDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQ 1036
G S+ C G S D + +Y + +EKEIM E+ +NGPV+ V+ DF
Sbjct: 318 GKRQATSR-CPNGQVDS--NDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFL 374
Query: 1037 YKSGVYQHVTGD--------LMGGHAIRILGWGVE-----NGTPYWLVGNSCNTDWGDNG 1177
Y+ G+Y H G H+++I GWG E YW NS WG+ G
Sbjct: 375 YQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERG 434
Query: 1178 FFKILIGQDHCGIESEIV 1231
F+I+ G + C IE+ ++
Sbjct: 435 HFRIVRGTNECDIETFVL 452
>ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus].
Length = 466
Score = 126 bits (317), Expect = 4e-29
Identities = 92/318 (28%), Positives = 140/318 (44%), Gaps = 20/318 (6%)
Frame = +2
Query: 338 ELVNFINKQNTTWTAGHN--FYNVDLSY-VKKLCGTFLGGPKLPQR----AAFAADMILP 496
+++ IN+ N W AG++ F+ + L ++ GT + +LP
Sbjct: 144 DMIKAINRGNYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSTVMNMNEIYTVLGQGEVLP 203
Query: 497 KSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTX 676
+F+A E+WPN I E DQG+C WAF SDR+ I S G + +S +++L+
Sbjct: 204 TAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSC 261
Query: 677 XXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGE 856
GAW F ++G+VS Y G P + SR G
Sbjct: 262 DTHHQQGCRGGRL-DGAWWFLRRRGVVSDNCYPFS-GREQNEASPTPRCMMHSR--AMGR 317
Query: 857 GDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQ 1036
G S+ C G S D + +Y + +EKEIM E+ +NGPV+ V+ DF
Sbjct: 318 GKRQATSR-CPNGQVDS--NDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFL 374
Query: 1037 YKSGVYQHVTGD--------LMGGHAIRILGWGVE-----NGTPYWLVGNSCNTDWGDNG 1177
Y+ G+Y H G H+++I GWG E YW NS WG+ G
Sbjct: 375 YQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERG 434
Query: 1178 FFKILIGQDHCGIESEIV 1231
F+I+ G + C IE+ ++
Sbjct: 435 HFRIVRGTNECDIETFVL 452
>ref|NP_034112.3| dipeptidyl peptidase 1 preproprotein [Mus musculus].
Length = 462
Score = 94.4 bits (233), Expect = 2e-19
Identities = 73/261 (27%), Positives = 115/261 (44%), Gaps = 10/261 (3%)
Frame = +2
Query: 491 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 670
LP+S+D R + +R+Q SCGSC++F ++ + RI I +N +S ++++
Sbjct: 230 LPESWDWRNV-QGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVV 288
Query: 671 TXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 850
+ + + G Y G V S P T
Sbjct: 289 SCSPYAQGCDGGFPY-------------LIAGKYAQDFGV-----------VEESCFPYT 324
Query: 851 GEGDTPKCSKICEPGYTPSYKEDKHF--GCSSYSISRNEKEIMAEIYKNGPVEGAFTVYS 1024
+ K + C Y+ Y F GC NE + E+ K+GP+ AF V+
Sbjct: 325 AKDSPCKPRENCLRYYSSDYYYVGGFYGGC-------NEALMKLELVKHGPMAVAFEVHD 377
Query: 1025 DFLQYKSGVYQHV------TGDLMGGHAIRILGWGVE--NGTPYWLVGNSCNTDWGDNGF 1180
DFL Y SG+Y H + HA+ ++G+G + G YW++ NS ++WG++G+
Sbjct: 378 DFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGY 437
Query: 1181 FKILIGQDHCGIESEIVAGIP 1243
F+I G D C IES VA IP
Sbjct: 438 FRIRRGTDECAIESIAVAAIP 458
>ref|NP_031827.2| pro-cathepsin H preproprotein [Mus musculus].
Length = 333
Score = 86.3 bits (212), Expect = 6e-17
Identities = 68/247 (27%), Positives = 109/247 (44%), Gaps = 6/247 (2%)
Frame = +2
Query: 494 PKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLT 673
P S D R++ + +++QG+CGSCW F A+ + I S +++ + + ++
Sbjct: 115 PSSMDWRKKGN---VVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLSL--AEQQLVD 169
Query: 674 XXXXXXXXXXXXXFPSGAWNFWT-KKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 850
PS A+ + KG++ Y PY
Sbjct: 170 CAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSY-------PYI---------------- 206
Query: 851 GEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAE-IYKNGPVEGAFTVYSD 1027
G C + P ++ F + +I+ N++ M E + PV AF V D
Sbjct: 207 --GKDSSCR------FNP--QKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTED 256
Query: 1028 FLQYKSGVYQ----HVTGDLMGGHAIRILGWGVENGTPYWLVGNSCNTDWGDNGFFKILI 1195
FL YKSGVY H T D + HA+ +G+G +NG YW+V NS + WG+NG+F I
Sbjct: 257 FLMYKSGVYSSKSCHKTPDKVN-HAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIER 315
Query: 1196 GQDHCGI 1216
G++ CG+
Sbjct: 316 GKNMCGL 322
>ref|NP_081620.2| cathepsin L-like 3 [Mus musculus].
Length = 331
Score = 78.2 bits (191), Expect = 2e-14
Identities = 71/249 (28%), Positives = 114/249 (45%), Gaps = 4/249 (1%)
Frame = +2
Query: 491 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 670
+PKS D W + + ++DQGSCGSCWAF AV ++ ++ R G++ V +S ++++
Sbjct: 115 VPKSVD----WRDHGYVTPVKDQGSCGSCWAFSAVGSLEGQM-FRKTGKL-VPLSVQNLV 168
Query: 671 TXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 850
P A+ + +GGL D+ V PY +NG+ C
Sbjct: 169 DCSWSQGNQGCDGGLPDLAFQYVKD----NGGL-DTSVSY-PYEA------LNGT---CR 213
Query: 851 GEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVE-GAFTVYSD 1027
PK S G+ ++ +E +M + GP+ G T +
Sbjct: 214 YN---PKNSAATVTGFV--------------NVQSSEDALMKAVATVGPISVGIDTKHKS 256
Query: 1028 FLQYKSGVYQHVT-GDLMGGHAIRILGWGVEN-GTPYWLVGNSCNTDWGDNGFFKILIGQ 1201
F YK G+Y + HA+ ++G+G E+ G YWLV NS DWG NG+ K+ +
Sbjct: 257 FQFYKEGMYYEPDCSSTVLDHAVLVVGYGEESDGRKYWLVKNSWGRDWGMNGYIKMAKDR 316
Query: 1202 -DHCGIESE 1225
++CGI S+
Sbjct: 317 NNNCGIASD 325
>ref|NP_067256.2| cathepsin S preproprotein [Mus musculus].
Length = 340
Score = 77.8 bits (190), Expect = 2e-14
Identities = 76/307 (24%), Positives = 136/307 (44%), Gaps = 14/307 (4%)
Frame = +2
Query: 344 VNFINKQNTTWTAGHNFYNV------DLSYVKKLCGTFLGGPKLPQRAAFA------ADM 487
+ FI N ++ G + Y V D++ + LC +G ++P+++ ++
Sbjct: 64 LKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILCR--MGALRIPRQSPKTVTFRSYSNR 121
Query: 488 ILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDM 667
LP + D RE+ C T E++ QGSCG+CWAF AV A+ ++ +++ G++ + +SA+++
Sbjct: 122 TLPDTVDWREK--GCVT--EVKYQGSCGACWAFSAVGALEGQLKLKT-GKL-ISLSAQNL 175
Query: 668 LTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPC 847
+ G + + ++ G ++ PY + H N
Sbjct: 176 VDCSNEEKYGNKGC---GGGYMTEAFQYIIDNGGIEADASY-PYKATDEKCHYNSKNRAA 231
Query: 848 TGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVE-GAFTVYS 1024
T CS+ + FG +E + + GPV G +S
Sbjct: 232 T-------CSRYIQ----------LPFG--------DEDALKEAVATKGPVSVGIDASHS 266
Query: 1025 DFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSCNTDWGDNGFFKIL-IGQ 1201
F YKSGVY + H + ++G+G +G YWLV NS ++GD G+ ++ +
Sbjct: 267 SFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNK 326
Query: 1202 DHCGIES 1222
+HCGI S
Sbjct: 327 NHCGIAS 333
>ref|NP_071720.1| cathepsin Z preproprotein [Mus musculus].
Length = 306
Score = 73.2 bits (178), Expect = 5e-13
Identities = 64/265 (24%), Positives = 100/265 (37%), Gaps = 14/265 (5%)
Frame = +2
Query: 440 LGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEI---RDQGS---CGSCWAFGAVEA 601
LG P+ + + LPK++D W N + R+Q CGSCWA G+ A
Sbjct: 47 LGRRTYPRPHEYLSPADLPKNWD----WRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSA 102
Query: 602 ISDRICIRSNGR-VNVEVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDS 778
++DRI I+ G ++ +S ++++ W + K G+
Sbjct: 103 MADRINIKRKGAWPSILLSVQNVIDCGNAGSCEGGNDL---PVWEYAHKHGIPDET---- 155
Query: 779 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKH------FGCSS 940
C Y + C K + G +KE +
Sbjct: 156 ---CNNY------------------QAKDQDCDKFNQCGTCTEFKECHTIQNYTLWRVGD 194
Query: 941 YSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVEN 1120
Y +++MAEIY NGP+ Y G+Y + H I + GWGV N
Sbjct: 195 YGSLSGREKMMAEIYANGPISCGIMATEMMSNYTGGIYAEHQDQAVINHIISVAGWGVSN 254
Query: 1121 -GTPYWLVGNSCNTDWGDNGFFKIL 1192
G YW+V NS WG+ G+ +I+
Sbjct: 255 DGIEYWIVRNSWGEPWGEKGWMRIV 279
>ref|NP_062414.3| cathepsin 8 [Mus musculus].
Length = 333
Score = 73.2 bits (178), Expect = 5e-13
Identities = 68/258 (26%), Positives = 110/258 (42%), Gaps = 11/258 (4%)
Frame = +2
Query: 491 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 670
LPK D R++ C T +++QG+C SCWAF A AI ++ R G++ V +S ++++
Sbjct: 114 LPKFVDWRKR--GCVT--PVKNQGTCNSCWAFSAAGAIEGQM-FRKTGKL-VPLSTQNLV 167
Query: 671 TXXXXXXXXXXXXXFPSGAWNF-WTKKGLVSGGLYDSHVGCRPYSIPP--CEHHVNGSRP 841
A + W +GL + Y PY C +H S
Sbjct: 168 DCSRLEGNFGCFKGSTFLALKYVWKNRGLEAESTY-------PYKGTDGHCRYHPERSAA 220
Query: 842 PCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYS-ISRNEKEIMAEIYKNGPVE-GAFT 1015
T S+S +S +EK++M + GP+ G
Sbjct: 221 RIT-----------------------------SFSFVSNSEKDLMRAVATIGPISVGIDA 251
Query: 1016 VYSDFLQYKSGVY-QHVTGDLMGGHAIRILGWGVE----NGTPYWLVGNSCNTDWGDNGF 1180
+ F Y+ G+Y + + H++ ++G+G E +G YWL+ NS WG NG+
Sbjct: 252 RHKSFRLYREGIYYEPKCSSNIINHSVLVVGYGYEGKESDGNKYWLIKNSHGEQWGMNGY 311
Query: 1181 FKILIGQ-DHCGIESEIV 1231
K+ G+ +HCGI S V
Sbjct: 312 MKLARGRNNHCGIASYAV 329
Database: RefSeq49_MP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 15,617,559
Number of sequences in database: 30,036
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 30036
Number of Hits to DB: 53,533,890
Number of extensions: 1602550
Number of successful extensions: 7028
Number of sequences better than 1.0e-05: 25
Number of HSP's gapped: 6878
Number of HSP's successfully gapped: 26
Length of query: 480
Length of database: 15,617,559
Length adjustment: 105
Effective length of query: 375
Effective length of database: 12,463,779
Effective search space: 4673917125
Effective search space used: 4673917125
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 35 (18.1 bits)
Search to RefSeqSP_Rel49
BLASTX 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-001419
(1440 letters)
Database: RefSeq49_SP.fasta
24,897 sequences; 11,343,932 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|NP_001090927.1| cathepsin B precursor [Sus scrofa]. 653 0.0
Alignment gi|XP_003127800.2| PREDICTED: tubulointerstitial nephritis anti... 132 5e-31
Alignment gi|XP_001927698.3| PREDICTED: tubulointerstitial nephritis anti... 125 6e-29
Alignment gi|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus ... 100 3e-21
Alignment gi|NP_999094.1| pro-cathepsin H precursor [Sus scrofa]. 91 2e-18
Alignment gi|NP_001116576.1| cathepsin Z [Sus scrofa]. 77 2e-14
Alignment gi|NP_999057.1| cathepsin L1 precursor [Sus scrofa]. 77 3e-14
Alignment gi|XP_001929706.1| PREDICTED: cathepsin S [Sus scrofa]. 76 5e-14
Alignment gi|XP_003355243.1| PREDICTED: cathepsin S-like [Sus scrofa]. 76 5e-14
Alignment gi|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa]. 70 3e-12
>ref|NP_001090927.1| cathepsin B precursor [Sus scrofa].
Length = 335
Score = 653 bits (1685), Expect = 0.0
Identities = 306/335 (91%), Positives = 306/335 (91%)
Frame = +2
Query: 254 MWRXXXXXXXXXXXXXXRESLHFQPLSDELVNFINKQNTTWTAGHNFYNVDLSYVKKLCG 433
MWR RESLHFQPLSDELVNFINKQNTTWTAGHNFYNVDLSYVKKLCG
Sbjct: 1 MWRLLATLSCLVLLTSARESLHFQPLSDELVNFINKQNTTWTAGHNFYNVDLSYVKKLCG 60
Query: 434 TFLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDR 613
TFLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDR
Sbjct: 61 TFLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDR 120
Query: 614 ICIRSNGRVNVEVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCR 793
ICIRSNGRVNVEVSAEDMLT FPSGAWNFWTKKGLVSGGLYDSHVGCR
Sbjct: 121 ICIRSNGRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCR 180
Query: 794 PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIM 973
PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIM
Sbjct: 181 PYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIM 240
Query: 974 AEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSC 1153
AEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNS
Sbjct: 241 AEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSW 300
Query: 1154 NTDWGDNGFFKILIGQDHCGIESEIVAGIPCTPHF 1258
NTDWGDNGFFKIL GQDHCGIESEIVAGIPCTPHF
Sbjct: 301 NTDWGDNGFFKILRGQDHCGIESEIVAGIPCTPHF 335
>ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen [Sus scrofa].
Length = 362
Score = 132 bits (332), Expect = 5e-31
Identities = 91/320 (28%), Positives = 141/320 (44%), Gaps = 22/320 (6%)
Frame = +2
Query: 338 ELVNFINKQNTTWTAGHN--FYNVDLSY-VKKLCGTFLGGPKLPQ----RAAFAADMILP 496
+++ IN+ N W AG++ F+ + L ++ GT + +LP
Sbjct: 40 DMIKAINQGNYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVANMNEIHTVLGPGEVLP 99
Query: 497 KSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTX 676
++F+A E+WPN I + DQG+C WAF SDR+ I S G + +S +++L+
Sbjct: 100 RAFEASEKWPNL--IHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC 157
Query: 677 XXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLY--DSHVGCRPYSIPPCEHHVNGSRPPCT 850
GAW F ++G+VS Y H P C H
Sbjct: 158 DTHNQQGCQGGRL-DGAWWFLRRRGVVSDHCYPFSGHERNEAGPAPRCMMHSRAM----- 211
Query: 851 GEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDF 1030
G G ++ C Y + D + +Y + NEK+IM E+ +NGPV+ V+ DF
Sbjct: 212 GRGKRQATAR-CPNSYV--HANDIYQVTPAYRLGSNEKDIMKELMENGPVQALMEVHEDF 268
Query: 1031 LQYKSGVYQHVTGD--------LMGGHAIRILGWGVE-----NGTPYWLVGNSCNTDWGD 1171
Y+SG+Y H G H+++I GWG E YW NS WG+
Sbjct: 269 FLYQSGIYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDGRMLKYWTAANSWGPGWGE 328
Query: 1172 NGFFKILIGQDHCGIESEIV 1231
G F+I+ G + C IES ++
Sbjct: 329 RGHFRIVRGANECDIESFVL 348
>ref|XP_001927698.3| PREDICTED: tubulointerstitial nephritis antigen [Sus scrofa].
Length = 476
Score = 125 bits (314), Expect = 6e-29
Identities = 97/335 (28%), Positives = 144/335 (42%), Gaps = 27/335 (8%)
Frame = +2
Query: 311 SLHFQPLSDELVNFINKQNTTWTAGH--NFYNVDLSY-VKKLCGTFLGGPKLPQRAAFAA 481
S H + L+ +N+ + WTA + F+ + L K GT P L A
Sbjct: 150 SQHVCLVQPGLIEHVNEGDFGWTAQNYSQFWGMTLEEGFKYRLGTLPPSPLLLSMNEVTA 209
Query: 482 DMI----LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVE 649
+ LP+ F A +WP DQ +C + WAF +DRI I+S GR
Sbjct: 210 SLPETTDLPEFFVASYKWPGWT--HGPLDQKNCAASWAFSTASVAADRIAIQSEGRYTAN 267
Query: 650 VSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVN 829
+S +++++ AW + K+GLVS Y P N
Sbjct: 268 LSPQNLISCCAKNRHGCNSGSIDR-AWWYLRKRGLVSHACY------------PLFKDQN 314
Query: 830 GSRPPCT------GEGDTPKCSKICEPGYTPSYKEDKHFGCSS-YSISRNEKEIMAEIYK 988
+ C G G +K C + K ++ + CS Y +S NE EIM EI +
Sbjct: 315 ATNNGCAMASRSDGRGKR-HATKPCPNNFE---KSNRIYQCSPPYRVSSNETEIMREIMQ 370
Query: 989 NGPVEGAFTVYSDFLQYKSGVYQHVTGD--------LMGGHAIRILGWGVENGT-----P 1129
NGPV+ V+ DF YK+G+Y+HVT + HA+++ GWG G
Sbjct: 371 NGPVQAIMQVHEDFFHYKTGIYRHVTSTNEESDKYRKLRTHAVKLTGWGTLKGAQGRKEK 430
Query: 1130 YWLVGNSCNTDWGDNGFFKILIGQDHCGIESEIVA 1234
+W+ NS WG+NG+F+IL G + IE I+A
Sbjct: 431 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIA 465
>ref|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus scrofa].
Length = 463
Score = 100 bits (248), Expect = 3e-21
Identities = 85/317 (26%), Positives = 136/317 (42%), Gaps = 15/317 (4%)
Frame = +2
Query: 338 ELVNFINKQNTTWTAGH--NFYNVDLSYVKKLCGTFLGGPKLPQRAAFAAD-----MILP 496
+ V IN +WTA + + L + + G + P+ A A+ + LP
Sbjct: 173 DFVKAINGIQKSWTATAYMEYETLTLKEMTQRGGGYNQRLPRPKPAPITAEIQEKSLHLP 232
Query: 497 KSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTX 676
S+D R + +R+Q SCGSC++F ++ + RI I +N +S +++++
Sbjct: 233 ASWDWRNV-RGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSC 291
Query: 677 XXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGE 856
+ A + GLV C PY+ G+ PCT
Sbjct: 292 SQYAQGCAGGFPYLI-AGKYAQDFGLVEEA-------CFPYT---------GTDSPCT-- 332
Query: 857 GDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQ 1036
+ G Y + H+ Y NE + E+ +GP+ AF VY DFL
Sbjct: 333 ---------VKEGCFRYYSSEYHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLH 382
Query: 1037 YKSGVYQHV------TGDLMGGHAIRILGWGVE--NGTPYWLVGNSCNTDWGDNGFFKIL 1192
Y+ G+Y H + HA+ ++G+G + +G YW+V NS T WG++G+F+I
Sbjct: 383 YRKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGYFRIR 442
Query: 1193 IGQDHCGIESEIVAGIP 1243
G D C IES VA P
Sbjct: 443 RGTDECAIESIAVAATP 459
>ref|NP_999094.1| pro-cathepsin H precursor [Sus scrofa].
Length = 335
Score = 90.9 bits (224), Expect = 2e-18
Identities = 65/232 (28%), Positives = 106/232 (45%), Gaps = 6/232 (2%)
Frame = +2
Query: 539 IKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTXXXXXXXXXXXXXFP 718
+ +++QGSCGSCW F A+ + I + G++ + ++ + ++ P
Sbjct: 129 VSPVKNQGSCGSCWTFSTTGALESAVAI-ATGKM-LSLAEQQLVDCAQNFNNHGCQGGLP 186
Query: 719 SGAWNFWT-KKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPG 895
S A+ + KG++ Y P G+ D K +P
Sbjct: 187 SQAFEYIRYNKGIMGEDTY-----------------------PYKGQDDHCKF----QPD 219
Query: 896 YTPSYKEDKHFGCSSYSISRNEKEIMAE-IYKNGPVEGAFTVYSDFLQYKSGVYQ----H 1060
++ +D +I+ N++E M E + PV AF V +DFL Y+ G+Y H
Sbjct: 220 KAIAFVKDVA------NITMNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSCH 273
Query: 1061 VTGDLMGGHAIRILGWGVENGTPYWLVGNSCNTDWGDNGFFKILIGQDHCGI 1216
T D + HA+ +G+G ENG PYW+V NS WG NG+F I G++ CG+
Sbjct: 274 KTPDKVN-HAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGL 324
>ref|NP_001116576.1| cathepsin Z [Sus scrofa].
Length = 304
Score = 77.4 bits (189), Expect = 2e-14
Identities = 63/259 (24%), Positives = 95/259 (36%), Gaps = 6/259 (2%)
Frame = +2
Query: 434 TFLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEI---RDQGS---CGSCWAFGAV 595
T LG P+ + + LP+S+D W N + R+Q CGSCWA G+
Sbjct: 44 TQLGHRTYPRPHEYLSPSDLPRSWD----WRNVNGVNYASVTRNQHIPQYCGSCWAHGST 99
Query: 596 EAISDRICIRSNGRVNVEVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYD 775
A++DRI I+ G + + + P W + + G+
Sbjct: 100 SAMADRINIKRKGAWPSTLLSVQHVIDCGNAGSCEGGDDLP--VWAYAHRHGIPDET--- 154
Query: 776 SHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISR 955
C Y C C++ E +Y K Y
Sbjct: 155 ----CNNYQ---------AKDQVCDKFNQCGTCTEFKECHVIQNYTLWK---VGDYGSVS 198
Query: 956 NEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYW 1135
+++MAEIY NGP+ Y G+Y H + + GWGV GT YW
Sbjct: 199 GREKMMAEIYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYW 258
Query: 1136 LVGNSCNTDWGDNGFFKIL 1192
+V NS WG+ G+ +I+
Sbjct: 259 IVRNSWGEPWGERGWMRIV 277
>ref|NP_999057.1| cathepsin L1 precursor [Sus scrofa].
Length = 334
Score = 76.6 bits (187), Expect = 3e-14
Identities = 69/253 (27%), Positives = 111/253 (43%), Gaps = 9/253 (3%)
Frame = +2
Query: 491 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 670
+PKS D RE+ + +++QG CGSCWAF A A+ ++ R G++ V +S ++++
Sbjct: 114 VPKSVDWREKG----YVTAVKNQGQCGSCWAFSATGALEGQM-FRKTGKL-VSLSEQNLV 167
Query: 671 TXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 850
A+ + +GGL S P N CT
Sbjct: 168 DCSRPQGNQGCNGGLMDNAFQYVKD----NGGLDTEE------SYPYLGRETNS----CT 213
Query: 851 GEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTV-YSD 1027
+ P+CS + G+ I + EK +M + GP+ A +S
Sbjct: 214 YK---PECSAANDTGFV--------------DIPQREKALMKAVATVGPISVAIDAGHSS 256
Query: 1028 FLQYKSGVYQHV---TGDLMGGHAIRILGWGVE----NGTPYWLVGNSCNTDWGDNGFFK 1186
F YKSG+Y + DL H + ++G+G E N + +W+V NS +WG NG+ K
Sbjct: 257 FQFYKSGIYYDPDCSSKDL--DHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVK 314
Query: 1187 ILIGQ-DHCGIES 1222
+ Q +HCGI +
Sbjct: 315 MAKDQNNHCGIST 327
>ref|XP_001929706.1| PREDICTED: cathepsin S [Sus scrofa].
Length = 331
Score = 75.9 bits (185), Expect = 5e-14
Identities = 70/246 (28%), Positives = 110/246 (44%), Gaps = 4/246 (1%)
Frame = +2
Query: 491 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 670
LP S D RE+ C T E++ QGSCGSCWAF AV A+ ++ +++ GR+ V +SA++++
Sbjct: 115 LPDSMDWREK--GCVT--EVKYQGSCGSCWAFSAVGALEAQVKMKT-GRL-VSLSAQNLV 168
Query: 671 TXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 850
+ KG G + ++ Y I S P
Sbjct: 169 DCSTEK----------------YRNKGCNGGFMTEAF----QYIIDNNGIDSEASYPYKA 208
Query: 851 GEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSIS--RNEKEIMAEIYKNGPVEGAFTV-Y 1021
+G SK ++ CS Y+ +E + + GPV A +
Sbjct: 209 VDGKCKYDSK------------NRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKH 256
Query: 1022 SDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSCNTDWGDNGFFKIL-IG 1198
S F Y+SGVY + H + ++G+G NG YWLV NS ++GD G+ ++
Sbjct: 257 SSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDGGYIRMARNS 316
Query: 1199 QDHCGI 1216
++HCGI
Sbjct: 317 ENHCGI 322
>ref|XP_003355243.1| PREDICTED: cathepsin S-like [Sus scrofa].
Length = 331
Score = 75.9 bits (185), Expect = 5e-14
Identities = 70/246 (28%), Positives = 110/246 (44%), Gaps = 4/246 (1%)
Frame = +2
Query: 491 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 670
LP S D RE+ C T E++ QGSCGSCWAF AV A+ ++ +++ GR+ V +SA++++
Sbjct: 115 LPDSMDWREK--GCVT--EVKYQGSCGSCWAFSAVGALEAQVKMKT-GRL-VSLSAQNLV 168
Query: 671 TXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 850
+ KG G + ++ Y I S P
Sbjct: 169 DCSTEK----------------YRNKGCNGGFMTEAF----QYIIDNNGIDSEASYPYKA 208
Query: 851 GEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSIS--RNEKEIMAEIYKNGPVEGAFTV-Y 1021
+G SK ++ CS Y+ +E + + GPV A +
Sbjct: 209 VDGKCKYDSK------------NRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKH 256
Query: 1022 SDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSCNTDWGDNGFFKIL-IG 1198
S F Y+SGVY + H + ++G+G NG YWLV NS ++GD G+ ++
Sbjct: 257 SSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDGGYIRMARNS 316
Query: 1199 QDHCGI 1216
++HCGI
Sbjct: 317 ENHCGI 322
>ref|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa].
Length = 490
Score = 70.1 bits (170), Expect = 3e-12
Identities = 58/259 (22%), Positives = 107/259 (41%), Gaps = 3/259 (1%)
Frame = +2
Query: 449 PKLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRS 628
P R A + + P +D W + +++DQG CGSCWAF + + ++
Sbjct: 263 PGRKMRLAKSVSSLPPPEWD----WRKKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLKQ 318
Query: 629 NGRVNVEVSAEDMLTXXXXXXXXXXXXXFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIP 808
+++ S +++L PS A++ K L GGL
Sbjct: 319 GTLLSL--SEQELLDCDKVDKGCMGG--LPSNAYS--AIKTL--GGLETEE--------- 361
Query: 809 PCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYK 988
++ G C+ + K Y D S +S+NE+++ A + +
Sbjct: 362 --DYSYRGHLQTCSFNAEKAKV-----------YIND------SVELSQNEQKLAAWLAE 402
Query: 989 NGPVEGAFTVYSDFLQYKSGV---YQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSCNT 1159
GP+ A + Y+ G+ + + + HA+ ++G+G + TP+W + NS T
Sbjct: 403 KGPISVAINAFG-MQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRSATPFWAIKNSWGT 461
Query: 1160 DWGDNGFFKILIGQDHCGI 1216
DWG+ G++ + G CG+
Sbjct: 462 DWGEEGYYYLYRGSGACGV 480
Database: RefSeq49_SP.fasta
Posted date: Oct 17, 2011 1:42 PM
Number of letters in database: 11,343,932
Number of sequences in database: 24,897
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 24897
Number of Hits to DB: 40,155,185
Number of extensions: 1232396
Number of successful extensions: 5636
Number of sequences better than 1.0e-05: 14
Number of HSP's gapped: 5605
Number of HSP's successfully gapped: 14
Length of query: 480
Length of database: 11,343,932
Length adjustment: 103
Effective length of query: 377
Effective length of database: 8,779,541
Effective search space: 3309886957
Effective search space used: 3309886957
Neighboring words threshold: 12
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 35 (18.1 bits)
Search to Sscrofa10_2
BLASTN 2.2.24 [Aug-08-2010]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 20110601C-001419
(1440 letters)
Database: Sscrofa_10.2.fasta
4582 sequences; 2,808,509,378 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Sscrofa_Chr14 333 2e-88
>Sscrofa_Chr14
|| Length = 153851969
Score = 333 bits (168), Expect = 2e-88
Identities = 244/267 (91%), Gaps = 2/267 (0%)
Strand = Plus / Minus
Query: 1175 ggcttctttaagatcctcataggacaggatcactgtggcatcgagtcagagatcgtggct 1234
||||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||
Sbjct: 16203879 ggcttctttaagatcctcagaggacaggatcactgtggcatcgagtcagagatcgtggct 16203820
Query: 1235 ggaatcccatgtactccccatttctagaagtgctgatctgtttggccgcacctggggcag 1294
||||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||
Sbjct: 16203819 ggaatcccatgtactccccatttctagaagtgctgatctgtttggccgcccctggggcag 16203760
Query: 1295 tttttcccgcagtatacccctagaagattgcgggtgtgatgcggggaatagcaatgtctt 1354
|||||||||||||||| |||| | ||||| |||||||||| ||||| | ||||||||
Sbjct: 16203759 tttttcccgcagtatagcccttg-ggattgggggtgtgatggggggataggaaatgtctt 16203701
Query: 1355 ttattctttgagttcaaatcagatgcangcgttttgagactggactcaaagactggatcg 1414
|||||||||||||||| || ||||||| ||||||| |||| || ||||| |||||| |||
Sbjct: 16203700 ttattctttgagttcagataagatgcaggcgtttttagacaggcctcaaggactgggtcg 16203641
Query: 1415 ggtctag-ctcgtgtctgccatcagca 1440
|| | || |||||||||||||||||||
Sbjct: 16203640 ggccaagcctcgtgtctgccatcagca 16203614
Score = 307 bits (155), Expect = 9e-81
Identities = 155/155 (100%)
Strand = Plus / Minus
Query: 226 aggtgaatctaggatccacctgccaaaaatgtggcggctcttggccaccctcagctgcct 285
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16212200 aggtgaatctaggatccacctgccaaaaatgtggcggctcttggccaccctcagctgcct 16212141
Query: 286 ggtgctgctgaccagtgcccgggagagtctgcatttccagcctctgtcggatgagctggt 345
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16212140 ggtgctgctgaccagtgcccgggagagtctgcatttccagcctctgtcggatgagctggt 16212081
Query: 346 caattttattaacaagcaaaacactacgtggacgg 380
|||||||||||||||||||||||||||||||||||
Sbjct: 16212080 caattttattaacaagcaaaacactacgtggacgg 16212046
Score = 283 bits (143), Expect = 1e-73
Identities = 143/143 (100%)
Strand = Plus / Minus
Query: 87 gatcccaggggccccagtaagtcctgcttcagggtcacgctgggagcagccctgctgagc 146
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16217292 gatcccaggggccccagtaagtcctgcttcagggtcacgctgggagcagccctgctgagc 16217233
Query: 147 agaggttagtcgtcacctggacgctggccctacttcgtacaggacagtgttccgcagagc 206
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16217232 agaggttagtcgtcacctggacgctggccctacttcgtacaggacagtgttccgcagagc 16217173
Query: 207 ttgagcctcatctcctttcaggt 229
|||||||||||||||||||||||
Sbjct: 16217172 ttgagcctcatctcctttcaggt 16217150
Score = 281 bits (142), Expect = 5e-73
Identities = 145/146 (99%)
Strand = Plus / Minus
Query: 785 ggttgcaggccctactccatcccaccttgcgaacaccacgtgaacggctcccggcccccg 844
||||||||||||||||||||||||||||| ||||||||||||||||||||||||||||||
Sbjct: 16207492 ggttgcaggccctactccatcccaccttgtgaacaccacgtgaacggctcccggcccccg 16207433
Query: 845 tgcactggggagggggacacccccaagtgcagcaagatctgcgagcctggctacaccccg 904
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16207432 tgcactggggagggggacacccccaagtgcagcaagatctgcgagcctggctacaccccg 16207373
Query: 905 tcctacaaagaagacaagcacttcgg 930
||||||||||||||||||||||||||
Sbjct: 16207372 tcctacaaagaagacaagcacttcgg 16207347
Score = 252 bits (127), Expect = 5e-64
Identities = 130/131 (99%)
Strand = Plus / Minus
Query: 1046 ggagtgtaccagcacgtcacaggagacttgatgggaggccatgccatccgcatcctgggc 1105
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16204417 ggagtgtaccagcacgtcacaggagacttgatgggaggccatgccatccgcatcctgggc 16204358
Query: 1106 tggggagtggagaatggcaccccctactggctggtcggcaactcctgtaacacagactgg 1165
||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Sbjct: 16204357 tggggagtggagaatggcaccccctactggctggtcggcaactcctggaacacagactgg 16204298
Query: 1166 ggtgacaatgg 1176
|||||||||||
Sbjct: 16204297 ggtgacaatgg 16204287
Score = 234 bits (118), Expect = 1e-58
Identities = 118/118 (100%)
Strand = Plus / Minus
Query: 464 agagctgcttttgctgcggacatgatcctgcccaaaagcttcgatgcccgggaacagtgg 523
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16210409 agagctgcttttgctgcggacatgatcctgcccaaaagcttcgatgcccgggaacagtgg 16210350
Query: 524 cccaactgcccgaccatcaaagagatcagagaccagggctcctgtggctcctgctggg 581
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16210349 cccaactgcccgaccatcaaagagatcagagaccagggctcctgtggctcctgctggg 16210292
Score = 230 bits (116), Expect = 2e-57
Identities = 122/124 (98%)
Strand = Plus / Minus
Query: 929 ggatgcagctcctacagcatctctaggaacgagaaggagatcatggcggagatctacaaa 988
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16206177 ggatgcagctcctacagcatctctaggaacgagaaggagatcatggcggagatctacaaa 16206118
Query: 989 aacggcccggtcgagggggccttcactgtgtactcggacttcctgcagtataagtctgga 1048
|| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16206117 aatggcccggtcgagggggccttcactgtgtactcggacttcctgcagtataagtctggt 16206058
Query: 1049 gtgt 1052
||||
Sbjct: 16206057 gtgt 16206054
Score = 230 bits (116), Expect = 2e-57
Identities = 119/120 (99%)
Strand = Plus / Minus
Query: 580 ggcgtttggggctgtggaagccatctctgaccggatctgcatccgcagcaacgggcgtgt 639
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16209010 ggcgtttggggctgtggaagccatctctgaccggatctgcatccgcagcaacgggcgtgt 16208951
Query: 640 caatgtggaggtgtccgctgaggacatgctcacctgttgtggcgacgagtgtggggatgg 699
||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16208950 caacgtggaggtgtccgctgaggacatgctcacctgttgtggcgacgagtgtggggatgg 16208891
Score = 176 bits (89), Expect = 2e-41
Identities = 89/89 (100%)
Strand = Plus / Minus
Query: 699 gctgtaacggtggctttccctctggagcctggaacttctggacaaagaagggcctggtgt 758
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16207843 gctgtaacggtggctttccctctggagcctggaacttctggacaaagaagggcctggtgt 16207784
Query: 759 ccgggggcctctatgactcgcatgtgggt 787
|||||||||||||||||||||||||||||
Sbjct: 16207783 ccgggggcctctatgactcgcatgtgggt 16207755
Score = 172 bits (87), Expect = 3e-40
Identities = 87/87 (100%)
Strand = Plus / Minus
Query: 379 ggccggacacaatttctacaatgtggacctgagctacgtgaagaagctctgtggcacctt 438
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16211143 ggccggacacaatttctacaatgtggacctgagctacgtgaagaagctctgtggcacctt 16211084
Query: 439 cctgggtggacccaagctgccccagag 465
|||||||||||||||||||||||||||
Sbjct: 16211083 cctgggtggacccaagctgccccagag 16211057
Score = 159 bits (80), Expect = 5e-36
Identities = 80/80 (100%)
Strand = Plus / Minus
Query: 2 aaggcggctggctgttcgggcgtcagaacctgcccgagcgctcggaggctgcagacctag 61
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 16223475 aaggcggctggctgttcgggcgtcagaacctgcccgagcgctcggaggctgcagacctag 16223416
Query: 62 gccctcggcggcggcggcgg 81
||||||||||||||||||||
Sbjct: 16223415 gccctcggcggcggcggcgg 16223396
Database: Sscrofa_10.2.fasta
Posted date: Nov 16, 2011 10:34 AM
Number of letters in database: 2,808,509,378
Number of sequences in database: 4582
Lambda K H
1.37 0.711 1.31
Gapped
Lambda K H
1.37 0.711 1.31
Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 5, Extension: 2
Number of Sequences: 4582
Number of Hits to DB: 36,859,575
Number of extensions: 266
Number of successful extensions: 266
Number of sequences better than 1.0e-05: 1
Number of HSP's gapped: 264
Number of HSP's successfully gapped: 11
Length of query: 1440
Length of database: 2,808,509,378
Length adjustment: 21
Effective length of query: 1419
Effective length of database: 2,808,413,156
Effective search space: 3985138268364
Effective search space used: 3985138268364
X1: 11 (21.8 bits)
X2: 15 (29.7 bits)
X3: 50 (99.1 bits)
S1: 18 (36.2 bits)
S2: 30 (60.0 bits)