Search to RefSeq (Human) database
Query= 20040204S-034613 (20040204S-034613) 20040204S-034613
(953 letters)
Database: RefSeq (Homo Sapiens/Amino Acid)
27,437 sequences; 13,743,509 total letters
Searching...................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|23957706|ref|NP_705839.1| hypothetical protein MGC20446 [Homo... 214 1e-61
Alignment gi|19923603|ref|NP_079119.2| cytochrome b reductase 1; duodenal ... 108 2e-26
Alignment gi|27597088|ref|NP_001906.2| cytochrome b-561 [Homo sapiens] 75 5e-16
Alignment gi|32698982|ref|NP_872386.1| hypothetical protein FLJ39035 [Homo... 47 2e-05
Alignment gi|4507913|ref|NP_003922.1| WAS protein family, member 1; WASP f... 34 0.13
Alignment gi|45827771|ref|NP_055144.3| autoantigen RCD8 [Homo sapiens] 33 0.22
Alignment gi|33438582|ref|NP_005471.2| trophinin associated protein (tasti... 33 0.28
Alignment gi|42733592|ref|NP_976223.1| hypothetical MGC50722 [Homo sapiens] 32 0.63
Alignment gi|16507208|ref|NP_055940.2| capicua homolog [Homo sapiens] 32 0.63
Alignment gi|42657429|ref|XP_291144.5| similar to bA110H4.2 (similar to me... 32 0.82
>gi|23957706|ref|NP_705839.1| hypothetical protein MGC20446 [Homo
sapiens]
Length = 242
Score = 214 bits (546), Expect(2) = 1e-61
Identities = 100/132 (75%), Positives = 107/132 (81%)
Frame = +2
Query: 431 MAVGWFYLSALALFSLGSMCILSTIYWMRYWHEGFA*DGTILTFNWHPVLMVTGMVVLYS 610
M G FYLS L L SLGSMCIL TIYWM+YW GFA +G+I FNWHPVLMV GMVV Y
Sbjct: 1 MVSGRFYLSCLLLGSLGSMCILFTIYWMQYWRGGFAWNGSIYMFNWHPVLMVAGMVVFYG 60
Query: 611 AASLAYRLPQSWVGPKLPWKLGHAAMHLMAFILTVLGLAGVFNSHNHEKIPNLYSLHSWL 790
ASL YRLPQSWVGPKLPWKL HAA+HLMAF+LTV+GL VF HNH + NLYSLHSWL
Sbjct: 61 GASLVYRLPQSWVGPKLPWKLLHAALHLMAFVLTVVGLVAVFTFHNHGRTANLYSLHSWL 120
Query: 791 GITTVFLFACQW 826
GITTVFLFACQW
Sbjct: 121 GITTVFLFACQW 132
Score = 40.8 bits (94), Expect(2) = 1e-61
Identities = 18/24 (75%), Positives = 20/24 (83%)
Frame = +3
Query: 846 FCCPWASVRLRSLLKPIHVLFGAS 917
F PWAS+ LRSLLKPIHV FGA+
Sbjct: 139 FLLPWASMWLRSLLKPIHVFFGAA 162
>gi|19923603|ref|NP_079119.2| cytochrome b reductase 1; duodenal
cytochrome b; cytochrome b reducatse 1 [Homo sapiens]
Length = 286
Score = 108 bits (269), Expect(2) = 2e-26
Identities = 48/116 (41%), Positives = 71/116 (61%)
Frame = +2
Query: 476 LGSMCILSTIYWMRYWHEGFA*DGTILTFNWHPVLMVTGMVVLYSAASLAYRLPQSWVGP 655
+G + ++ + W+ ++ EG DG+ L FNWHPVLMVTG V + A + YRLP +W
Sbjct: 19 VGFLSVIFALVWVLHYREGLGWDGSALEFNWHPVLMVTGFVFIQGIAIIVYRLPWTWKCS 78
Query: 656 KLPWKLGHAAMHLMAFILTVLGLAGVFNSHNHEKIPNLYSLHSWLGITTVFLFACQ 823
KL K HA ++ +A IL ++ + VF +HN I N+YSLHSW+G+ V + Q
Sbjct: 79 KLLMKSIHAGLNAVAAILAIISVVAVFENHNVNNIANMYSLHSWVGLIAVICYLLQ 134
Score = 29.3 bits (64), Expect(2) = 2e-26
Identities = 12/22 (54%), Positives = 14/22 (63%)
Frame = +3
Query: 846 FCCPWASVRLRSLLKPIHVLFG 911
F PWA + LR+ L PIHV G
Sbjct: 142 FLLPWAPLSLRAFLMPIHVYSG 163
>gi|27597088|ref|NP_001906.2| cytochrome b-561 [Homo sapiens]
Length = 251
Score = 74.7 bits (182), Expect(2) = 5e-16
Identities = 41/117 (35%), Positives = 62/117 (52%)
Frame = +2
Query: 476 LGSMCILSTIYWMRYWHEGFA*DGTILTFNWHPVLMVTGMVVLYSAASLAYRLPQSWVGP 655
LG + T W+ + G A + L FN HP+ MV G++ L A L YR+ ++
Sbjct: 23 LGLTLVAMTGAWLGLYRGGIAWESD-LQFNAHPLCMVIGLIFLQGNALLVYRVFRNEA-- 79
Query: 656 KLPWKLGHAAMHLMAFILTVLGLAGVFNSHNHEKIPNLYSLHSWLGITTVFLFACQW 826
K K+ H +H+ A ++ ++GL VF+ H + +LYSLHSW GI L+ QW
Sbjct: 80 KRTTKVLHGLLHIFALVIALVGLVAVFDYHRKKGYADLYSLHSWCGILVFVLYFVQW 136
Score = 27.7 bits (60), Expect(2) = 5e-16
Identities = 13/27 (48%), Positives = 16/27 (59%)
Frame = +3
Query: 846 FCCPWASVRLRSLLKPIHVLFGASFSL 926
F P AS LRS +P H+ FGA+ L
Sbjct: 143 FLFPGASFSLRSRYRPQHIFFGATIFL 169
>gi|32698982|ref|NP_872386.1| hypothetical protein FLJ39035 [Homo
sapiens]
Length = 229
Score = 47.0 bits (110), Expect = 2e-05
Identities = 31/96 (32%), Positives = 46/96 (47%), Gaps = 3/96 (3%)
Frame = +2
Query: 545 GTILTFNWHPVLMVTGMVVLYSAASLAYRLPQS---WVGPKLPWKLGHAAMHLMAFILTV 715
GT L F+WHPV M + + A L + S + K +L H A +A +
Sbjct: 48 GTSL-FSWHPVFMALAFCLCMAEAILLFSPEHSLFFFCSRKARIRL-HWAGQTLAILCAA 105
Query: 716 LGLAGVFNSHNHEKIPNLYSLHSWLGITTVFLFACQ 823
LGL + +S ++P+L S HSW+G T+ A Q
Sbjct: 106 LGLGFIISSRTRSELPHLVSWHSWVGALTLLATAVQ 141
>gi|4507913|ref|NP_003922.1| WAS protein family, member 1; WASP
family Verprolin-homologous protein; scar,
dictyostelium, homology of, 1 [Homo sapiens]
Length = 559
Score = 34.3 bits (77), Expect = 0.13
Identities = 24/71 (33%), Positives = 31/71 (43%), Gaps = 1/71 (1%)
Frame = +1
Query: 628 PPAPVMG-RAQAALEVGPRSHAPDGLHPDCAGAGGRL*LSQPREDPQPLLPAQLAGHHHR 804
PP P G R + + V +H P GLHP + A G P P ++PA H
Sbjct: 430 PPLPPPGIRPSSPVTVTALAHPPSGLHPTPSTAPGPHVPLMPPSPPSQVIPASEPKRH-- 487
Query: 805 LPLRLPVVLGA 837
P LPV+ A
Sbjct: 488 -PSTLPVISDA 497
>gi|45827771|ref|NP_055144.3| autoantigen RCD8 [Homo sapiens]
Length = 1401
Score = 33.5 bits (75), Expect = 0.22
Identities = 27/68 (39%), Positives = 28/68 (41%), Gaps = 8/68 (11%)
Frame = +1
Query: 652 AQAALEVGPRSHAPDGLHPD-CAGAGGRL*LSQPREDPQPLLPAQLA-------GHHHRL 807
A AL G S AP+GL PD A A L L PR P P L QL G H
Sbjct: 748 ASEALSRGFGSSAPEGLEPDSMASAASALHLLSPRPRPGPELGPQLGLDGGPGDGDRHNT 807
Query: 808 PLRLPVVL 831
P L L
Sbjct: 808 PSLLEAAL 815
>gi|33438582|ref|NP_005471.2| trophinin associated protein (tastin);
trophinin assisting protein; trophinin-assisting protein
(tastin) [Homo sapiens]
Length = 778
Score = 33.1 bits (74), Expect = 0.28
Identities = 24/79 (30%), Positives = 32/79 (40%), Gaps = 11/79 (13%)
Frame = -1
Query: 284 PPFGLCSGLTQVYGRQLQE-LGDPRQIPPVEPRALPYLKTTTPEPQWDARR--------- 135
PP C ++ LQE L P PP EPR L + PE +R+
Sbjct: 522 PPEAFCRSEPEIPEPSLQEQLEVPEPYPPAEPRPLESCCRSEPEIPESSRQEQLEVPEPC 581
Query: 134 -LASPRPASCHLAPEPQLP 81
A PRP + EP++P
Sbjct: 582 PPAEPRPLESYCRIEPEIP 600
>gi|42733592|ref|NP_976223.1| hypothetical MGC50722 [Homo sapiens]
Length = 953
Score = 32.0 bits (71), Expect = 0.63
Identities = 31/124 (25%), Positives = 48/124 (38%), Gaps = 8/124 (6%)
Frame = -1
Query: 929 PERE*GSKEDVDGFKEAAQPHRRPGA-AERPQAPXXXXXXXXXRW*CPASCAGSRGWGSS 753
P+R G++ F+ PH R G ++RP + +C+ R WG+
Sbjct: 460 PQRAWGAQGQDRSFQRPESPHERLGHFSQRPWSALAGQ-----------ACSPQRAWGAQ 508
Query: 752 RGCES*RRPPAPAQS----G*RPSGAWLRGPTSRAAWALPMTGAGGTPVTQH---CRAPP 594
R S +RP +P + +P A P R AW T P ++ +PP
Sbjct: 509 RQGPSSQRPGSPPEKRSPFPQQPWSAVATQPCPRRAWTACETWEDPGPRLRNPLERPSPP 568
Query: 593 CQSP 582
Q P
Sbjct: 569 AQRP 572
>gi|16507208|ref|NP_055940.2| capicua homolog [Homo sapiens]
Length = 1608
Score = 32.0 bits (71), Expect = 0.63
Identities = 16/35 (45%), Positives = 17/35 (48%)
Frame = +2
Query: 176 STEGPLVPPEESAVGPRVPGAVFRIPESDQSTVRR 280
ST GPL PP A GP P R D +T RR
Sbjct: 566 STAGPLRPPPPGAGGPATPSKATRFLPMDPATFRR 600
>gi|42657429|ref|XP_291144.5| similar to bA110H4.2 (similar to
membrane protein) [Homo sapiens]
Length = 1607
Score = 31.6 bits (70), Expect = 0.82
Identities = 14/28 (50%), Positives = 19/28 (67%)
Frame = -3
Query: 294 ALLPALRTVLWSDSGIRKTAPGTRGPTA 211
+L+PA T SD+G+R+T PGT P A
Sbjct: 1286 SLIPAPFTAASSDAGMRRTRPGTSAPAA 1313
Score = 31.6 bits (70), Expect = 0.82
Identities = 19/55 (34%), Positives = 28/55 (50%)
Frame = -3
Query: 294 ALLPALRTVLWSDSGIRKTAPGTRGPTADSSGGTKGPSVLKNHHPRTTVGCTEAG 130
+L+PA T D+G+R+T PGT P A + PS L N R+ + + G
Sbjct: 1016 SLIPATFTAASRDAGMRRTRPGTSAPAA--AAAAPPPSTL-NPTSRSLLNAVDGG 1067
Database: RefSeq (Homo Sapiens/Amino Acid)
Posted date: Apr 12, 2004 5:34 AM
Number of letters in database: 13,743,509
Number of sequences in database: 27,437
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
length of database: 13,743,509
effective HSP length: 101
effective length of database: 10,972,372
effective search space used: 2370032352
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
Search to RefSeq (Mouse) database
Query= 20040204S-034613 (20040204S-034613) 20040204S-034613
(953 letters)
Database: RefSeq (Mus Musculus/Amino Acid)
26,199 sequences; 11,906,519 total letters
Searching...................................................done
Score E
Sequences producing significant alignments: (bits) Value
Alignment gi|41235724|ref|NP_958739.1| similar to hypothetical protein MGC... 216 2e-61
Alignment gi|38074855|ref|XP_130253.2| cytochrome b reductase 1 [Mus muscu... 91 9e-21
Alignment gi|31542436|ref|NP_031831.2| cytochrome b-561 [Mus musculus] 78 3e-17
Alignment gi|13994209|ref|NP_114083.1| WASP family 1 [Mus musculus] 35 0.049
Alignment gi|33147082|ref|NP_476512.1| HLA-B-associated transcript 3 [Mus ... 34 0.14
Alignment gi|16507202|ref|NP_082158.1| capicua homolog [Mus musculus] 33 0.18
Alignment gi|34147258|ref|NP_899023.1| hypothetical protein 5330438D12 [Mu... 33 0.31
Alignment gi|41680647|ref|NP_598619.2| RIKEN cDNA 4930504E06; cDNA sequenc... 32 0.41
Alignment gi|9055226|ref|NP_061232.1| glycoprotein 9 (platelet); platelet ... 32 0.41
Alignment gi|27369607|ref|NP_766036.1| RIKEN cDNA 4732452J19 [Mus musculus] 32 0.41
>gi|41235724|ref|NP_958739.1| similar to hypothetical protein
MGC20446 [Mus musculus]
Length = 242
Score = 216 bits (550), Expect(2) = 2e-61
Identities = 98/132 (74%), Positives = 108/132 (81%)
Frame = +2
Query: 431 MAVGWFYLSALALFSLGSMCILSTIYWMRYWHEGFA*DGTILTFNWHPVLMVTGMVVLYS 610
MA GWFYLS + L SLGSMCIL T YWM+YW GFA DGT+L FNWHPVLMV GMVVLY
Sbjct: 1 MASGWFYLSCMVLGSLGSMCILFTAYWMQYWRGGFAWDGTVLMFNWHPVLMVAGMVVLYG 60
Query: 611 AASLAYRLPQSWVGPKLPWKLGHAAMHLMAFILTVLGLAGVFNSHNHEKIPNLYSLHSWL 790
AASL YRLP SWVGP+LPWK+ HAA+HL+AF TV+GL VF HNH +I +LYSLHSWL
Sbjct: 61 AASLVYRLPSSWVGPRLPWKVLHAALHLLAFTCTVVGLIAVFRFHNHSRIAHLYSLHSWL 120
Query: 791 GITTVFLFACQW 826
GITTV LFACQW
Sbjct: 121 GITTVVLFACQW 132
Score = 38.5 bits (88), Expect(2) = 2e-61
Identities = 17/23 (73%), Positives = 18/23 (78%)
Frame = +3
Query: 846 FCCPWASVRLRSLLKPIHVLFGA 914
F PWAS LRSLLKP+HV FGA
Sbjct: 139 FLLPWASQWLRSLLKPLHVFFGA 161
>gi|38074855|ref|XP_130253.2| cytochrome b reductase 1 [Mus
musculus]
Length = 324
Score = 90.5 bits (223), Expect(2) = 9e-21
Identities = 47/150 (31%), Positives = 75/150 (50%), Gaps = 34/150 (22%)
Frame = +2
Query: 476 LGSMCILSTIYWMRYWHEGFA*DGTILTFNWHPVLMVTGMVVLYS--------------- 610
+G + ++ + W+ ++ EG +G+ L FNWHPVL VTG V +
Sbjct: 19 VGFLSVIFVLIWVLHFREGLGWNGSGLEFNWHPVLAVTGFVFIQGIGTGILEGVGGVTAR 78
Query: 611 -------------------AASLAYRLPQSWVGPKLPWKLGHAAMHLMAFILTVLGLAGV 733
+A + YRLP +W KL K HA ++ +A IL ++ + V
Sbjct: 79 WDPGRRRGFQGIHETRESYSAIIVYRLPWTWKCSKLLMKSIHAGLNAVAAILAIISVVAV 138
Query: 734 FNSHNHEKIPNLYSLHSWLGITTVFLFACQ 823
F HN +K+P++YSLHSW+G+T + L+ Q
Sbjct: 139 FEYHNVQKVPHMYSLHSWVGLTALILYIQQ 168
Score = 27.7 bits (60), Expect(2) = 9e-21
Identities = 11/22 (50%), Positives = 14/22 (63%)
Frame = +3
Query: 846 FCCPWASVRLRSLLKPIHVLFG 911
F PWA LR+++ PIHV G
Sbjct: 176 FLLPWAPPSLRAIVMPIHVYSG 197
>gi|31542436|ref|NP_031831.2| cytochrome b-561 [Mus musculus]
Length = 250
Score = 78.2 bits (191), Expect(2) = 3e-17
Identities = 44/117 (37%), Positives = 63/117 (53%)
Frame = +2
Query: 476 LGSMCILSTIYWMRYWHEGFA*DGTILTFNWHPVLMVTGMVVLYSAASLAYRLPQSWVGP 655
LG + T W+ + G A + + L FN HP+ MV GM+ L A L YR+ +
Sbjct: 22 LGLTVVAVTGAWLGLYRGGIAWESS-LQFNVHPLCMVIGMIFLQGDALLVYRVFRREA-- 78
Query: 656 KLPWKLGHAAMHLMAFILTVLGLAGVFNSHNHEKIPNLYSLHSWLGITTVFLFACQW 826
K K+ H +H+ AFI+ ++GL VF+ H + +LYSLHSW GI L+ QW
Sbjct: 79 KRTTKILHGLLHVFAFIIALVGLVAVFDYHKKKGYADLYSLHSWCGILVFVLYFVQW 135
Score = 28.1 bits (61), Expect(2) = 3e-17
Identities = 13/28 (46%), Positives = 17/28 (60%)
Frame = +3
Query: 846 FCCPWASVRLRSLLKPIHVLFGASFSLW 929
F P AS LRS +P H+ FGA+ L+
Sbjct: 142 FLFPGASFSLRSRYRPQHIFFGATIFLF 169
>gi|13994209|ref|NP_114083.1| WASP family 1 [Mus musculus]
Length = 559
Score = 35.4 bits (80), Expect = 0.049
Identities = 25/71 (35%), Positives = 31/71 (43%), Gaps = 1/71 (1%)
Frame = +1
Query: 628 PPAPVMG-RAQAALEVGPRSHAPDGLHPDCAGAGGRL*LSQPREDPQPLLPAQLAGHHHR 804
PP P G R + + V +H P GLHP + A G P P +LPA H
Sbjct: 430 PPLPPPGIRPSSPVAVAALAHPPSGLHPAPSTAPGPHAPLMPPSPPSQVLPASEPKRH-- 487
Query: 805 LPLRLPVVLGA 837
P LPV+ A
Sbjct: 488 -PSTLPVISDA 497
>gi|33147082|ref|NP_476512.1| HLA-B-associated transcript 3 [Mus
musculus]
Length = 1154
Score = 33.9 bits (76), Expect = 0.14
Identities = 18/44 (40%), Positives = 25/44 (56%)
Frame = -1
Query: 242 RQLQELGDPRQIPPVEPRALPYLKTTTPEPQWDARRLASPRPAS 111
R ++ +GDP Q P EP + + T+PEPQ R ASP P +
Sbjct: 960 RYVRRVGDPPQTLPEEPMEVQGAERTSPEPQ---RENASPAPGT 1000
>gi|16507202|ref|NP_082158.1| capicua homolog [Mus musculus]
Length = 1606
Score = 33.5 bits (75), Expect = 0.18
Identities = 16/35 (45%), Positives = 18/35 (51%)
Frame = +2
Query: 176 STEGPLVPPEESAVGPRVPGAVFRIPESDQSTVRR 280
ST PL PP A GP P R P +D +T RR
Sbjct: 565 STAVPLRPPPPGAGGPATPSKATRFPPTDSATFRR 599
>gi|34147258|ref|NP_899023.1| hypothetical protein 5330438D12 [Mus
musculus]
Length = 224
Score = 32.7 bits (73), Expect = 0.31
Identities = 33/100 (33%), Positives = 39/100 (39%), Gaps = 3/100 (3%)
Frame = +1
Query: 628 PPAPVMGRAQAALEVGPRSH--APDGLHPDCAGAGGRL*LSQPREDPQPLLPAQLAGHHH 801
P A +G +A E GPR A G+ P G L L PR P P P
Sbjct: 116 PGAARLGSERAG-ERGPRGRGEAEGGVGPPTGGRAACLCLCFPRTPP-PRAPPSGCPRSF 173
Query: 802 RLPLRLPVVLGACGLSAAPGRLCGCAASLNPST-SSLEPH 918
+R PV GACG S P C P T ++ PH
Sbjct: 174 PTLVRGPVRCGACGASVLPPPRCLRYLGALPQTPAARRPH 213
>gi|41680647|ref|NP_598619.2| RIKEN cDNA 4930504E06; cDNA sequence,
clone 2-4; NF-E2 inducible protein [Mus musculus]
Length = 459
Score = 32.3 bits (72), Expect = 0.41
Identities = 25/61 (40%), Positives = 29/61 (47%), Gaps = 2/61 (3%)
Frame = -2
Query: 565 VERE-YGAISGKAFVPVPHPID-GGEDAHGAQGEQRQGRQVKPSHSHSDQALAPP*KRSW 392
VE E + A+SG P HP D G DA GA GEQ G Q P + PP + S
Sbjct: 20 VESENHEALSG----PEKHPQDKDGADADGAAGEQEPGDQTLPPAQDGENLECPPPEASS 75
Query: 391 S 389
S
Sbjct: 76 S 76
>gi|9055226|ref|NP_061232.1| glycoprotein 9 (platelet); platelet
glycoprotein IX [Mus musculus]
Length = 177
Score = 32.3 bits (72), Expect = 0.41
Identities = 21/89 (23%), Positives = 39/89 (43%), Gaps = 3/89 (3%)
Frame = +2
Query: 500 TIYWMRYWHEGFA*DGTILTFNWHPVLMVT---GMVVLYSAASLAYRLPQSWVGPKLPWK 670
++ ++R W E + + + P L G + Y S ++LP SW P + W
Sbjct: 92 SLTYLRLWLEDHMPEALMHVYCASPDLATRRPLGQLTGYELGSCGWKLPPSWAYPGVWWD 151
Query: 671 LGHAAMHLMAFILTVLGLAGVFNSHNHEK 757
+ A+ ++ IL LAG+ N+ +
Sbjct: 152 VSLVAVAVLGLIL----LAGLLNTFTESR 176
>gi|27369607|ref|NP_766036.1| RIKEN cDNA 4732452J19 [Mus musculus]
Length = 657
Score = 32.3 bits (72), Expect = 0.41
Identities = 18/57 (31%), Positives = 26/57 (45%)
Frame = -1
Query: 248 YGRQLQELGDPRQIPPVEPRALPYLKTTTPEPQWDARRLASPRPASCHLAPEPQLPR 78
Y RQ Q G P+++P T EP ++L +P P + + P PQ PR
Sbjct: 431 YKRQFQWHGRKPGPETGIPQSMPAASHTQLEPSLPDQQLITPNPTASSMLPNPQRPR 487
Database: RefSeq (Mus Musculus/Amino Acid)
Posted date: Apr 12, 2004 5:36 AM
Number of letters in database: 11,906,519
Number of sequences in database: 26,199
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
length of database: 11,906,519
effective HSP length: 100
effective length of database: 9,286,619
effective search space used: 2015196323
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)