Animal-Genome cDNA Clone 20030531/20030531C-0/20030531C-000510


Query

>20030531C-000510 (20030531C-000510) 20030531C-000510
AGCTTTTTTG CGAGAGCCGC TGGATCAGCC GGACGTCCCC AGGCCCTGGC
CATGCGGCTC CTGCTACTTC TGGCCCCACT GGGCTGGCTG CTTCTGACCG
AAACGAAGGG TGACGCCAAA CCGGAGGACA ACCTCTTGGT GCTCACGGTG
GCCACGAAGG AGACCGAGGG GTTCCGACGC TTCAAGCGCT CAGCCCAGTT
CTTCAACTAC AAGATCCAGG CGCTGGGGCT GGGGGAGGAC TGGAATGAGA
AGGAGGCATC GTCGGGTGGA GGGCTGAAGG TTCGGCTGCT GAAGAAAGCC
CTGGAAAAGC ATGCAGACGA GAACCTGGTC ATTCTCTTCA CAGACAGCTA
TGACGTGGTG TTTGCCTCCG GGCCCCGAGA GCTGCTGAAG AAGTTCCGGC
AGGCCAAGAG CCAGGTGGTC TTCTCAGCAG AGGAGCTCAT CTACCCGGAC
CGCAGGCTGG AGGCCAAGTA CCCGGCCGTC TCCGACGGCA AGAGGTTCCT
GGGCTCTGGA GGCTTCATTG GTTATGCCCC CAACCTCAGC AAACTGGTGG
CTGAGTGGGA GGGTCAGGAC AGCGACAGCG ACCAACTCTT TTATACCAAG
ATATTCTTGG ACCCGGAGAA GAGGGAGCGG ATCAATATCA CCTTGGACCA
CCGCTGCCGT ATCTTCCAGA ATCTGGATGG AGCCTTGGAT GAGGTTGTGC
TCAAGTTCGA GATGGGCCAA GTGAGAGCGA GGAACCTGGC CTACGACACC
CTCCCGGTCC TGATTCACGG CAATGGGCCC ACCAAGCTGC AGCTGAACTA
CCTGGGCAAC TACATCCCGC GCTTCTGGAC CTTCGAGACG GGGTGCTCCG
TGTGTGATGA GGGCCTGCGC AGCCTCAAGG GCATTGGGGA TGAAGCTCTG
CCCACAGTCC TGGTCGGCCT GTTCATCGAA CAGCCCACGC CGTTCCTGTC
CCTGTTCTTC CAGCGGCTCC TGCGCCTCCA ATACCCCCGG AACGGATGCG
GCTTTTCCTT CACACCATGA GCAGCACCAC AGGCTCTAG

Search against the RefSeq (Human) database

Query= 20030531C-000510 (20030531C-000510) 20030531C-000510
         (1039 letters)

Database: RefSeq (Homo Sapiens/Amino Acid)
           19,244 sequences; 10,046,356 total letters

Searching..................................................done

                                                                               Score     E
            Sequences producing significant alignments:                        (bits)  Value

Alignment   gi|4557837|ref|NP_000293.1| (NM_000302) procollagen-lysine 5-di...   536  e-152
Alignment   gi|4505889|ref|NP_000926.1| (NM_000935) procollagen-lysine, 2-o...   337  1e-92
Alignment   gi|4505891|ref|NP_001075.1| (NM_001084) procollagen-lysine, 2-o...   327  1e-89
Alignment   gi|17986277|ref|NP_001837.1| (NM_001846) alpha 2 type IV collag...    34  0.15
Alignment   gi|15890086|ref|NP_203699.1| (NM_033380) alpha 5 type IV collag...    30  0.15
Alignment   gi|15890088|ref|NP_203700.1| (NM_033381) alpha 5 type IV collag...    29  0.15
Alignment   gi|4502955|ref|NP_000486.1| (NM_000495) alpha 5 type IV collage...    29  0.15
Alignment   gi|4506431|ref|NP_002881.1| (NM_002890) RAS p21 protein activat...    33  0.25
Alignment   gi|23110974|ref|NP_690056.1| (NM_152843) light ear protein isof...    31  1.2
Alignment   gi|23110972|ref|NP_690055.1| (NM_152842) light ear protein isof...    31  1.2


>gi|4557837|ref|NP_000293.1| (NM_000302) procollagen-lysine
           5-dioxygenase; Procollagen-lysine, 2-oxoglutarate
           5-dioxygenase (lysine hydroxylase); lysine,
           2-oxoglutarate 5-dioxygenase [Homo sapiens]
          Length = 727

 Score =  536 bits (1380), Expect = e-152
 Identities = 269/299 (89%), Positives = 277/299 (91%), Gaps = 2/299 (0%)
 Frame = +1

Query: 100 ETKGDAKPEDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWN-EKEASSGGGL 276
           E KGDAKPEDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWN EK  S+GGG
Sbjct: 17  EAKGDAKPEDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQ 76

Query: 277 KVRLLKKALEKHAD-ENLVILFTDSYDVVFASGPRELLKKFRQAKSQVVFSAEELIYPDR 453
           KVRLLKKALEKHAD E+LVILFTDSYDV+FASGPRELLKKFRQA+SQVVFSAEELIYPDR
Sbjct: 77  KVRLLKKALEKHADKEDLVILFTDSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDR 136

Query: 454 RLEAKYPAVSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKRERI 633
           RLE KYP VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKRE+I
Sbjct: 137 RLETKYPVVSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQI 196

Query: 634 NITLDHRCRIFQNLDGALDEVVLKFEMGQVRARNLAYDTLPVLIHGNGPTKLQLNYLGNY 813
           NITLDHRCRIFQNLDGALDEVVLKFEMG VRARNLAYDTLPVLIHGNGPTKLQLNYLGNY
Sbjct: 197 NITLDHRCRIFQNLDGALDEVVLKFEMGHVRARNLAYDTLPVLIHGNGPTKLQLNYLGNY 256

Query: 814 IPRFWTFETGCSVCDEGLRSLKGIGDEALPTVLVGLFIEQPTPXXXXXXXXXXXXXYPR 990
           IPRFWTFETGC+VCDEGLRSLKGIGDEALPTVLVG+FIEQPTP             YP+
Sbjct: 257 IPRFWTFETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQ 315



>gi|4505889|ref|NP_000926.1| (NM_000935) procollagen-lysine,
           2-oxoglutarate 5-dioxygenase (lysine hydroxylase) 2;
           Procollagen-lysine, 2-oxoglutarate 5-dioxygenase (lysine
           hydroxylase) [Homo sapiens]
          Length = 737

 Score =  337 bits (863), Expect = 1e-92
 Identities = 159/298 (53%), Positives = 220/298 (73%), Gaps = 3/298 (1%)
 Frame = +1

Query: 106 KGDAKPEDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNEKEA--SSGGGLK 279
           K  + P D LLV+TVATKE++GF RF +SA++FNY ++ LG GE+W   +   S GGG K
Sbjct: 30  KPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQK 89

Query: 280 VRLLKKALEKHADEN-LVILFTDSYDVVFASGPRELLKKFRQAKSQVVFSAEELIYPDRR 456
           VRL+K+ +E +AD++ LV++FT+ +DV+FA GP E+LKKF++A  +VVF+A+ +++PD+R
Sbjct: 90  VRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGILWPDKR 149

Query: 457 LEAKYPAVSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKRERIN 636
           L  KYP V  GKR+L SGGFIGYAP ++++V +W  QD+D DQLFYTK+++DP KRE IN
Sbjct: 150 LADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKVYIDPLKREAIN 209

Query: 637 ITLDHRCRIFQNLDGALDEVVLKFEMGQVRARNLAYDTLPVLIHGNGPTKLQLNYLGNYI 816
           ITLDH+C+IFQ L+GA+DEVVLKFE G+ RA+N  Y+TLPV I+GNGPTK+ LNY GNY+
Sbjct: 210 ITLDHKCKIFQTLNGAVDEVVLKFENGKARAKNTFYETLPVAINGNGPTKILLNYFGNYV 269

Query: 817 PRFWTFETGCSVCDEGLRSLKGIGDEALPTVLVGLFIEQPTPXXXXXXXXXXXXXYPR 990
           P  WT + GC++C+     L  +  +  P V +G+FIEQPTP             YP+
Sbjct: 270 PNSWTQDNGCTLCEFDTVDLSAV--DVHPNVSIGVFIEQPTPFLPRFLDILLTLDYPK 325



>gi|4505891|ref|NP_001075.1| (NM_001084) procollagen-lysine,
           2-oxoglutarate 5-dioxygenase 3; procollagen-lysine,
           2-oxoglutarate 5-dioxygenase 3v; lysyl hydroxylase 3
           [Homo sapiens]
          Length = 738

 Score =  327 bits (837), Expect = 1e-89
 Identities = 156/275 (56%), Positives = 209/275 (75%), Gaps = 3/275 (1%)
 Frame = +1

Query: 127 DNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNEKEASS--GGGLKVRLLKKA 300
           + LLV+TVAT ETEG+ RF RSA+FFNY ++ LGLGE+W   + +   GGG KVR LKK
Sbjct: 37  EKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWLKKE 96

Query: 301 LEKHAD-ENLVILFTDSYDVVFASGPRELLKKFRQAKSQVVFSAEELIYPDRRLEAKYPA 477
           +EK+AD E+++I+F DSYDV+ A  P ELLKKF Q+ S+++FSAE   +P+  L  +YP
Sbjct: 97  MEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQYPE 156

Query: 478 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKRERINITLDHRC 657
           V  GKRFL SGGFIG+A  + ++V +W+ +D D DQLFYT+++LDP  RE++++ LDH+
Sbjct: 157 VGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLDHKS 216

Query: 658 RIFQNLDGALDEVVLKFEMGQVRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFE 837
           RIFQNL+GALDEVVLKF+  +VR RN+AYDTLP+++HGNGPTKLQLNYLGNY+P  WT E
Sbjct: 217 RIFQNLNGALDEVVLKFDRNRVRIRNVAYDTLPIVVHGNGPTKLQLNYLGNYVPNGWTPE 276

Query: 838 TGCSVCDEGLRSLKGIGDEALPTVLVGLFIEQPTP 942
            GC  C++  R+L   G +  P V + +F+EQPTP
Sbjct: 277 GGCGFCNQDRRTLP--GGQPPPRVFLAVFVEQPTP 309



>gi|17986277|ref|NP_001837.1| (NM_001846) alpha 2 type IV collagen
           preproprotein; canstatin [Homo sapiens]
          Length = 1712

 Score = 34.3 bits (77), Expect = 0.15
 Identities = 20/46 (43%), Positives = 22/46 (47%)
 Frame = +3

Query: 39  PGPGHAAPATSGPTGLAASDRNEG*RQTGGQPLGAHGGHEGDRGVP 176
           PGP      + GP GL   D   G R   G+ LGA  G  GD GVP
Sbjct: 749 PGPD----GSPGPIGLPGPDGPPGERGLPGEVLGAQPGPRGDAGVP 790



>gi|15890086|ref|NP_203699.1| (NM_033380) alpha 5 type IV collagen
            isoform 2, precursor; collagen IV, alpha-5 polypeptide;
            collagen of basement membrane, alpha-5 chain [Homo
            sapiens]
          Length = 1691

 Score = 29.6 bits (65), Expect = 3.6
 Identities = 20/52 (38%), Positives = 25/52 (47%), Gaps = 3/52 (5%)
 Frame = +3

Query: 39   PGPG--HAAPATSGPTGLAASDRNEG*RQTGGQP-LGAHGGHEGDRGVPTLQ 185
            PGP      P   GP GL  +   +G +   GQP L    G +GD+G P LQ
Sbjct: 1263 PGPTGFQGLPGPEGPPGLPGNGGIKGEKGNPGQPGLPGLPGLKGDQGPPGLQ 1314


 Score = 29.3 bits (64), Expect(2) = 0.15
 Identities = 36/134 (26%), Positives = 50/134 (36%), Gaps = 2/134 (1%)
 Frame = +3

Query: 366 LRAPRAAEEVPAGQ-EPGGLLSRGAHLPGPQAGGQVPGRLRRQEVPGLWRLHWLCPXXXX 542
           L+ P     +P  + EPG ++   + LPGP+     PG    Q +PG
Sbjct: 142 LQGPPGPPGIPGMKGEPGSIIM--SSLPGPKGNPGYPGPPGIQGLPG------------- 186

Query: 543 XXXXXXXXXXXXRPTLLYQDILGPGEEGADQYHLGPP-LPYLPESGWSLG*GCAQVRDGP 719
                        PT +   I  PG  G     +GPP  P LP    ++G      +
Sbjct: 187 -------------PTGIPGPIGPPGPPGL----MGPPGPPGLPGPKGNMG---LNFQGPK 226

Query: 720 SESEEPGLRHPPGP 761
            E  E GL+ PPGP
Sbjct: 227 GEKGEQGLQGPPGP 240


 Score = 23.5 bits (49), Expect(2) = 0.15
 Identities = 9/25 (36%), Positives = 13/25 (52%)
 Frame = +3

Query: 735 PGLRHPPGPDSRQWAHQAAAELPGQ 809
           PG+R PPGP   +   +     PG+
Sbjct: 272 PGIRGPPGPPGGEKGEKGEQGEPGK 296



>gi|15890088|ref|NP_203700.1| (NM_033381) alpha 5 type IV collagen
           isoform 3, precursor; collagen IV, alpha-5 polypeptide;
           collagen of basement membrane, alpha-5 chain [Homo
           sapiens]
          Length = 1688

 Score = 29.3 bits (64), Expect(2) = 0.15
 Identities = 36/134 (26%), Positives = 50/134 (36%), Gaps = 2/134 (1%)
 Frame = +3

Query: 366 LRAPRAAEEVPAGQ-EPGGLLSRGAHLPGPQAGGQVPGRLRRQEVPGLWRLHWLCPXXXX 542
           L+ P     +P  + EPG ++   + LPGP+     PG    Q +PG
Sbjct: 142 LQGPPGPPGIPGMKGEPGSIIM--SSLPGPKGNPGYPGPPGIQGLPG------------- 186

Query: 543 XXXXXXXXXXXXRPTLLYQDILGPGEEGADQYHLGPP-LPYLPESGWSLG*GCAQVRDGP 719
                        PT +   I  PG  G     +GPP  P LP    ++G      +
Sbjct: 187 -------------PTGIPGPIGPPGPPGL----MGPPGPPGLPGPKGNMG---LNFQGPK 226

Query: 720 SESEEPGLRHPPGP 761
            E  E GL+ PPGP
Sbjct: 227 GEKGEQGLQGPPGP 240


 Score = 28.5 bits (62), Expect = 8.0
 Identities = 21/52 (40%), Positives = 26/52 (49%), Gaps = 3/52 (5%)
 Frame = +3

Query: 39   PG-PG-HAAPATSGPTGLAASDRNEG*RQTGGQP-LGAHGGHEGDRGVPTLQ 185
            PG PG    P   GP GL  +   +G +   GQP L    G +GD+G P LQ
Sbjct: 1260 PGRPGFQGLPGPEGPPGLPGNGGIKGEKGNPGQPGLPGLPGLKGDQGPPGLQ 1311


 Score = 23.5 bits (49), Expect(2) = 0.15
 Identities = 9/25 (36%), Positives = 13/25 (52%)
 Frame = +3

Query: 735 PGLRHPPGPDSRQWAHQAAAELPGQ 809
           PG+R PPGP   +   +     PG+
Sbjct: 272 PGIRGPPGPPGGEKGEKGEQGEPGK 296



>gi|4502955|ref|NP_000486.1| (NM_000495) alpha 5 type IV collagen
           isoform 1, precursor; collagen IV, alpha-5 polypeptide;
           collagen of basement membrane, alpha-5 chain [Homo
           sapiens]
          Length = 1685

 Score = 29.3 bits (64), Expect(2) = 0.15
 Identities = 36/134 (26%), Positives = 50/134 (36%), Gaps = 2/134 (1%)
 Frame = +3

Query: 366 LRAPRAAEEVPAGQ-EPGGLLSRGAHLPGPQAGGQVPGRLRRQEVPGLWRLHWLCPXXXX 542
           L+ P     +P  + EPG ++   + LPGP+     PG    Q +PG
Sbjct: 142 LQGPPGPPGIPGMKGEPGSIIM--SSLPGPKGNPGYPGPPGIQGLPG------------- 186

Query: 543 XXXXXXXXXXXXRPTLLYQDILGPGEEGADQYHLGPP-LPYLPESGWSLG*GCAQVRDGP 719
                        PT +   I  PG  G     +GPP  P LP    ++G      +
Sbjct: 187 -------------PTGIPGPIGPPGPPGL----MGPPGPPGLPGPKGNMG---LNFQGPK 226

Query: 720 SESEEPGLRHPPGP 761
            E  E GL+ PPGP
Sbjct: 227 GEKGEQGLQGPPGP 240


 Score = 23.5 bits (49), Expect(2) = 0.15
 Identities = 9/25 (36%), Positives = 13/25 (52%)
 Frame = +3

Query: 735 PGLRHPPGPDSRQWAHQAAAELPGQ 809
           PG+R PPGP   +   +     PG+
Sbjct: 272 PGIRGPPGPPGGEKGEKGEQGEPGK 296



>gi|4506431|ref|NP_002881.1| (NM_002890) RAS p21 protein activator 1
           isoform 1; RAS p21 protein activator (GTPase activating
           protein); GTPase activating protein;
           triphosphatase-activating protein [Homo sapiens]
          Length = 1047

 Score = 33.5 bits (75), Expect = 0.25
 Identities = 25/68 (36%), Positives = 28/68 (40%), Gaps = 1/68 (1%)
 Frame = +3

Query: 582 PTLLYQDILGPGEEGADQYHLGPPLPYLPESGWSLG*-GCAQVRDGPSESEEPGLRHPPG 758
           PT L  + LGPG      +   PP PYLP  G  LG        DGP   EE
Sbjct: 120 PTSLLAETLGPG----GGFPPLPPPPYLPPLGAGLGTVDEGDSLDGPEYEEEEVAIPLTA 175

Query: 759 PDSRQWAH 782
           P + QW H
Sbjct: 176 PPTNQWYH 183



>gi|23110974|ref|NP_690056.1| (NM_152843) light ear protein isoform
           d [Homo sapiens]
          Length = 528

 Score = 31.2 bits (69), Expect = 1.2
 Identities = 15/47 (31%), Positives = 25/47 (52%), Gaps = 1/47 (2%)
 Frame = -2

Query: 354 VIAVCEENDQVLV-CMLFQGFLQQPNLQPSTRRCLLLIPVLPQPQRL 217
           ++  C+ +  +L  C+L++G +    L PS    +LL    PQ QRL
Sbjct: 179 ILQTCQRSPHILAGCILYKGLIVSTQLPPSLTAKVLLHRTAPQEQRL 225



>gi|23110972|ref|NP_690055.1| (NM_152842) light ear protein isoform
           e [Homo sapiens]
          Length = 232

 Score = 31.2 bits (69), Expect = 1.2
 Identities = 15/47 (31%), Positives = 25/47 (52%), Gaps = 1/47 (2%)
 Frame = -2

Query: 354 VIAVCEENDQVLV-CMLFQGFLQQPNLQPSTRRCLLLIPVLPQPQRL 217
           ++  C+ +  +L  C+L++G +    L PS    +LL    PQ QRL
Sbjct: 174 ILQTCQRSPHILAGCILYKGLIVSTQLPPSLTAKVLLHRTAPQEQRL 220


  Database: RefSeq (Homo Sapiens/Amino Acid)
    Posted date:  Jul 1, 2003  6:07 AM
  Number of letters in database: 10,046,356
  Number of sequences in database:  19,244

Lambda     K      H
   0.318    0.135    0.401

Gapped
Lambda     K      H
   0.267   0.0410    0.140


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 21965494
Number of Sequences: 19244
Number of extensions: 599143
Number of successful extensions: 2852
Number of sequences better than 10.0: 40
Number of HSP's better than 10.0 without gapping: 1687
Number of HSP's successfully gapped in prelim test: 87
Number of HSP's that attempted gapping in prelim test: 519
Number of HSP's gapped (non-prelim): 2404
length of query: 346
length of database: 10,046,356
effective HSP length: 47
effective length of query: 298
effective length of database: 9,141,888
effective search space: 2724282624
effective search space used: 2724282624
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (28.1 bits)

Search against the RefSeq (Mouse) database

Query= 20030531C-000510 (20030531C-000510) 20030531C-000510
         (1039 letters)

Database: RefSeq (Mus Musculus/Amino Acid)
           15,330 sequences; 7,155,815 total letters

Searching..................................................done

                                                                               Score     E
            Sequences producing significant alignments:                        (bits)  Value

Alignment   gi|6755106|ref|NP_035252.1| (NM_011122) procollagen-lysine, 2-o...   509  e-145
Alignment   gi|6755108|ref|NP_036091.1| (NM_011961) procollagen lysine, 2-o...   334  4e-92
Alignment   gi|6755110|ref|NP_036092.1| (NM_011962) procollagen-lysine, 2-o...   332  1e-91
Alignment   gi|9506945|ref|NP_062275.1| (NM_019402) poly(A) binding protein...    35  0.060
Alignment   gi|30410760|ref|NP_082542.2| (NM_028266) procollagen, type XVI,...    35  0.079
Alignment   gi|14719436|ref|NP_149030.1| (NM_033041) hairy and enhancer of ...    32  0.39
Alignment   gi|28893065|ref|NP_796092.1| (NM_177118) RIKEN cDNA A830073O21 ...    32  0.51
Alignment   gi|13899211|ref|NP_081632.1| (NM_027356) RIKEN cDNA 2700038N03;...    32  0.51
Alignment   gi|6679271|ref|NP_032841.1| (NM_008815) ets variant gene 4 (E1A...    32  0.67
Alignment   gi|31982382|ref|NP_038891.3| (NM_013863) Bcl2-associated athano...    31  0.87


>gi|6755106|ref|NP_035252.1| (NM_011122) procollagen-lysine,
           2-oxoglutarate 5-dioxygenase 1; procollagen-lysine,
           2-oxoglutarate 5-dioxygenase; lysyl hydroxylase 1 [Mus
           musculus]
          Length = 728

 Score =  509 bits (1311), Expect = e-145
 Identities = 256/300 (85%), Positives = 272/300 (90%), Gaps = 3/300 (1%)
 Frame = +1

Query: 100 ETKGDAKPEDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNEK--EASSGGG 273
           + K DAK EDNLLVLTVATKETEGFRRFKRSAQFFNYKIQ+LGLGEDW+     A++GGG
Sbjct: 17  QAKDDAKLEDNLLVLTVATKETEGFRRFKRSAQFFNYKIQSLGLGEDWSVDGGPAAAGGG 76

Query: 274 LKVRLLKKALEKHAD-ENLVILFTDSYDVVFASGPRELLKKFRQAKSQVVFSAEELIYPD 450
            KVRLLKKALEKHAD E+LVILF DSYDVVFASGPRELLKKF+QAKSQVVFSAEE IYPD
Sbjct: 77  QKVRLLKKALEKHADKEDLVILFVDSYDVVFASGPRELLKKFQQAKSQVVFSAEEHIYPD 136

Query: 451 RRLEAKYPAVSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKRER 630
           RRLEAKYP V DGKRFLGSGGFIGYAP+LSKLVAEWEGQDSDSDQLFYTKIFL+PEKRE+
Sbjct: 137 RRLEAKYPTVPDGKRFLGSGGFIGYAPSLSKLVAEWEGQDSDSDQLFYTKIFLNPEKREQ 196

Query: 631 INITLDHRCRIFQNLDGALDEVVLKFEMGQVRARNLAYDTLPVLIHGNGPTKLQLNYLGN 810
           INI+LDHRCRIFQNLDGALDEVVLKFEMG VRARNLAYDTLPV++HGNGPTKLQLNYLGN
Sbjct: 197 INISLDHRCRIFQNLDGALDEVVLKFEMGHVRARNLAYDTLPVVVHGNGPTKLQLNYLGN 256

Query: 811 YIPRFWTFETGCSVCDEGLRSLKGIGDEALPTVLVGLFIEQPTPXXXXXXXXXXXXXYPR 990
           YIPRFWTFETGC+VCDEGLRSLKGIGDEALPTVLVG+FIEQPTP             YP+
Sbjct: 257 YIPRFWTFETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFLSLFFLRLLRLRYPQ 316



>gi|6755108|ref|NP_036091.1| (NM_011961) procollagen lysine,
           2-oxoglutarate 5-dioxygenase 2; lysyl hydroxylase 2 [Mus
           musculus]
          Length = 737

 Score =  334 bits (857), Expect = 4e-92
 Identities = 163/293 (55%), Positives = 212/293 (71%), Gaps = 3/293 (1%)
 Frame = +1

Query: 121 PEDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNEKEA--SSGGGLKVRLLK 294
           P D LLV+TVATKE +GF RF  SA++FNY ++ LG G++W   +   S GGG KVRLLK
Sbjct: 35  PADKLLVITVATKENDGFHRFMNSAKYFNYTVKVLGQGQEWRGGDGMNSIGGGQKVRLLK 94

Query: 295 KALEKHAD-ENLVILFTDSYDVVFASGPRELLKKFRQAKSQVVFSAEELIYPDRRLEAKY 471
           +A+E +A  E+LVILFT+ +DVVFA GP E+LKKF++   ++VF+A+ L++PD+RL  KY
Sbjct: 95  EAMEHYASQEDLVILFTECFDVVFAGGPEEVLKKFQKTNHKIVFAADGLLWPDKRLADKY 154

Query: 472 PAVSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKRERINITLDH 651
           P V  GKR+L SGGFIGYAP +S+LV +W  QD+D DQLFYTK+++DP KRE  NITLDH
Sbjct: 155 PVVHIGKRYLNSGGFIGYAPYISRLVQQWNLQDNDDDQLFYTKVYIDPLKREAFNITLDH 214

Query: 652 RCRIFQNLDGALDEVVLKFEMGQVRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWT 831
           +C+IFQ L+GA DEVVLKFE G+ R +N  Y+TLPV I+GNGPTK+ LNY GNY+P  WT
Sbjct: 215 KCKIFQALNGATDEVVLKFENGKSRVKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWT 274

Query: 832 FETGCSVCDEGLRSLKGIGDEALPTVLVGLFIEQPTPXXXXXXXXXXXXXYPR 990
            E GC++CD     L  +  +  P V +G+FIEQPTP             YP+
Sbjct: 275 QENGCALCDVDTIDLSTV--DVPPKVTLGVFIEQPTPFLPRFLNLLLTLDYPK 325



>gi|6755110|ref|NP_036092.1| (NM_011962) procollagen-lysine,
           2-oxoglutarate 5-dioxygenase 3; lysyl hydroxylase 2;
           lysyl hydroxylase 3 [Mus musculus]
          Length = 741

 Score =  332 bits (852), Expect = 1e-91
 Identities = 158/275 (57%), Positives = 212/275 (76%), Gaps = 3/275 (1%)
 Frame = +1

Query: 127 DNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNEKEASS--GGGLKVRLLKKA 300
           D LLV+TVAT ETEG+RRF +SA+FFNY ++ LGLG++W   + +   GGG KVR LKK
Sbjct: 40  DKLLVITVATAETEGYRRFLQSAEFFNYTVRTLGLGQEWRGGDVARTVGGGQKVRWLKKE 99

Query: 301 LEKHADE-NLVILFTDSYDVVFASGPRELLKKFRQAKSQVVFSAEELIYPDRRLEAKYPA 477
           +EK+AD+ +++I+F DSYDV+ AS P ELLKKF Q+ S ++FSAE   +P+  L  +YP
Sbjct: 100 MEKYADQKDMIIMFVDSYDVILASSPTELLKKFVQSGSHLLFSAESFCWPEWGLAEQYPE 159

Query: 478 VSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKRERINITLDHRC 657
           V  GKRFL SGGFIG+AP + ++V +W  +D D DQLFYT+++LDP  RE++ ++LDH+
Sbjct: 160 VGMGKRFLNSGGFIGFAPTIHQIVRQWNYKDDDDDQLFYTQLYLDPGLREKLKLSLDHKS 219

Query: 658 RIFQNLDGALDEVVLKFEMGQVRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFE 837
           RIFQNL+GALDEV+LKF+  +VR RN+AYDTLPV++HGNGPTKLQLNYLGNY+P  WT +
Sbjct: 220 RIFQNLNGALDEVILKFDQNRVRIRNVAYDTLPVVVHGNGPTKLQLNYLGNYVPNGWTPQ 279

Query: 838 TGCSVCDEGLRSLKGIGDEALPTVLVGLFIEQPTP 942
            GC  C++ LR+L   G +  P VL+ +F+EQPTP
Sbjct: 280 GGCGFCNQTLRTLP--GGQPPPRVLLAVFVEQPTP 312



>gi|9506945|ref|NP_062275.1| (NM_019402) poly(A) binding protein,
           nuclear 1; poly(A) binding protein II [Mus musculus]
          Length = 302

 Score = 35.0 bits (79), Expect = 0.060
 Identities = 28/79 (35%), Positives = 33/79 (41%), Gaps = 1/79 (1%)
 Frame = +3

Query: 273 AEGSAAEESPGKACRREPGHSLH-RQL*RGVCLRAPRAAEEVPAGQEPGGLLSRGAHLPG 449
           A G A E  PG A   + G+ L   +L  G  L  P   EE P  + P G        PG
Sbjct: 31  AGGEAGEGDPGGA--GDYGNGLESEELEPGELLPEPEPEEEPPRPRAPPGA-------PG 81

Query: 450 PQAGGQVPGRLRRQEVPGL 506
           P  G   PG    +E PGL
Sbjct: 82  PGPGSGAPGSQEEEEEPGL 100



>gi|30410760|ref|NP_082542.2| (NM_028266) procollagen, type XVI, alpha
            1; [a]1 (XVI) collagen; similar to collagen alpha 1(XVI)
            chain precursor [Mus musculus]
          Length = 1580

 Score = 34.7 bits (78), Expect = 0.079
 Identities = 48/168 (28%), Positives = 56/168 (32%), Gaps = 12/168 (7%)
 Frame = +3

Query: 39   PGP-GHAAPATSGPTGLAASDRNEG*RQTGGQPLGAHGGHEGDRGVPTLQALSPVLQLQD 215
            PGP G   P   G  GL       G + T G+P     G EG +G P  Q L     L
Sbjct: 759  PGPEGVGHPGKPGQPGLPGVQGPPGPKGTQGEPGPPGTGAEGPQGEPGTQGLPGTQGLPG 818

Query: 216  PXXXXXXXXXXXX--------XIVGWRAEGSAAEESPG-KACRREPGHSLHRQL*RGVCL 368
            P                     I      G+     PG K  R E G         G C
Sbjct: 819  PRGPPGSAGEKGAQGSPGPKGAIGPMGPPGAGVSGPPGQKGSRGEKGEP-------GEC- 870

Query: 369  RAPRAAEEVPAGQEPG--GLLSRGAHLPGPQAGGQVPGRLRRQEVPGL 506
              P   E + +G  PG  GL    +  PGPQ    VPG      +PGL
Sbjct: 871  SCPSRGEPIFSGM-PGAPGLWMGSSSQPGPQGPPGVPGPPGPPGMPGL 917



>gi|14719436|ref|NP_149030.1| (NM_033041) hairy and enhancer of
           split 7; bHLH factor Hes7 [Mus musculus]
          Length = 225

 Score = 32.3 bits (72), Expect = 0.39
 Identities = 23/67 (34%), Positives = 27/67 (39%)
 Frame = +3

Query: 603 ILGPGEEGADQYHLGPPLPYLPESGWSLG*GCAQVRDGPSESEEPGLRHPPGPDSRQWAH 782
           ILGP        H GPP P L    WS     ++  D  + +   GL  PP P  RQ
Sbjct: 153 ILGPALHQRPPVHQGPPSPRL---AWSPSHCSSRAGDSGAPAPLTGLLPPPPPPYRQDGA 209

Query: 783 QAAAELP 803
             A  LP
Sbjct: 210 PKAPPLP 216



>gi|28893065|ref|NP_796092.1| (NM_177118) RIKEN cDNA A830073O21 gene
           [Mus musculus]
          Length = 243

 Score = 32.0 bits (71), Expect = 0.51
 Identities = 21/43 (48%), Positives = 24/43 (54%), Gaps = 4/43 (9%)
 Frame = +3

Query: 396 PAGQEPGGLLSRGAHLPGPQAGGQVPGRLR--RQEV--PGLWR 512
           P G  PGGL  RG H  G + G ++P RLR  RQE   PG  R
Sbjct: 113 PGGLHPGGLSPRGWHPGGLRRGLRLPRRLRDARQEAAEPGRTR 155



>gi|13899211|ref|NP_081632.1| (NM_027356) RIKEN cDNA 2700038N03; DNA
            segment, Chr 5, ERATO Doi 655, expressed [Mus musculus]
          Length = 217

 Score = 32.0 bits (71), Expect = 0.51
 Identities = 24/92 (26%), Positives = 35/92 (37%), Gaps = 1/92 (1%)
 Frame = -2

Query: 1023 CSWCEGKAASVPGVLEAQEPLEEQGQERRGLFDEQADQDCGQ-SFIPNALEAAQALITHG 847
            CSW  G+++ VPG+   Q  LE  G +  G    +    C + S         Q  + H
Sbjct: 62   CSWPGGQSSGVPGLPALQGALEAMGDKPPGFRGSRNWIGCVEASLCLEHFGGPQGRLCH- 120

Query: 846  APRLEGPEARDXXXXXXXXXLGGPIAVNQDRE 751
             PR  G    +          GGP+ V  D +
Sbjct: 121  LPRGVGLRGEEERLYSHFTTGGGPVMVGGDAD 152



>gi|6679271|ref|NP_032841.1| (NM_008815) ets variant gene 4 (E1A
           enhancer binding protein, E1AF); polyomavirus enhancer
           activator 3 [Mus musculus]
          Length = 555

 Score = 31.6 bits (70), Expect = 0.67
 Identities = 26/73 (35%), Positives = 33/73 (44%)
 Frame = +3

Query: 273 AEGSAAEESPGKACRREPGHSLHRQL*RGVCLRAPRAAEEVPAGQEPGGLLSRGAHLPGP 452
           A+ ++   SP       PGH+ H           P AA   PA Q PG  +S  A  PGP
Sbjct: 24  AQAASLRPSPATLVVSSPGHAEH-----------PPAA---PA-QTPGPQVSASARGPGP 68

Query: 453 QAGGQVPGRLRRQ 491
            AGG   GR+ R+
Sbjct: 69  VAGGS--GRMERR 79



>gi|31982382|ref|NP_038891.3| (NM_013863) Bcl2-associated athanogene
            3; Bcl-2-interacting death supressor [Mus musculus]
          Length = 577

 Score = 31.2 bits (69), Expect = 0.87
 Identities = 24/70 (34%), Positives = 28/70 (39%), Gaps = 7/70 (10%)
 Frame = -2

Query: 1002 AASVPGVLE------AQEPLEEQGQERRGLFDE-QADQDCGQSFIPNALEAAQALITHGA 844
            A S PGV        A  P   Q   R G+ +  Q D+ CGQ        AAQ    HG
Sbjct: 115  AYSQPGVQRFRTEAAAATPQRSQSPLRGGMTEAAQTDKQCGQMPATATTAAAQPPTAHGP 174

Query: 843  PRLEGPEARD 814
             R + P A D
Sbjct: 175  ERSQSPAASD 184


  Database: RefSeq (Mus Musculus/Amino Acid)
    Posted date:  Jul 1, 2003  6:07 AM
  Number of letters in database: 7,155,815
  Number of sequences in database:  15,330

Lambda     K      H
   0.318    0.135    0.401

Gapped
Lambda     K      H
   0.267   0.0410    0.140


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15237570
Number of Sequences: 15330
Number of extensions: 392396
Number of successful extensions: 1664
Number of sequences better than 10.0: 36
Number of HSP's better than 10.0 without gapping: 1377
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1649
length of query: 346
length of database: 7,155,815
effective HSP length: 47
effective length of query: 298
effective length of database: 6,435,305
effective search space: 1917720890
effective search space used: 1917720890
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (27.7 bits)
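
Note on the search-space figures

The trailer statistics of the two searches can be cross-checked against each other. The sketch below assumes that the effective database length is the total number of letters minus the effective HSP length once per sequence, and that the effective search space is the product of the effective query length and the effective database length. Those two relations are an assumption about how the figures were derived, not something the report states, although they do reproduce the reported values. The function names are hypothetical, and the effective query length (298) is taken directly from the report rather than recomputed.

```python
# Cross-check of the trailer statistics printed by the two searches.
# Assumption (not stated in the report): the effective database length is
# the raw letter count minus one effective HSP length per sequence, and the
# effective search space is effective query length * effective db length.

def effective_db_length(db_letters: int, n_sequences: int, hsp_length: int) -> int:
    """Database letters left after trimming one HSP length per sequence."""
    return db_letters - n_sequences * hsp_length


def effective_search_space(eff_query_length: int, eff_db_length: int) -> int:
    """Product of the effective query and database lengths."""
    return eff_query_length * eff_db_length


# Values copied from the "Database" and trailer lines of each search.
REPORTS = {
    "human": {
        "db_letters": 10_046_356,
        "n_sequences": 19_244,
        "hsp_length": 47,
        "eff_query_length": 298,
        "reported_db": 9_141_888,
        "reported_space": 2_724_282_624,
    },
    "mouse": {
        "db_letters": 7_155_815,
        "n_sequences": 15_330,
        "hsp_length": 47,
        "eff_query_length": 298,
        "reported_db": 6_435_305,
        "reported_space": 1_917_720_890,
    },
}

if __name__ == "__main__":
    for name, r in REPORTS.items():
        eff_db = effective_db_length(r["db_letters"], r["n_sequences"], r["hsp_length"])
        space = effective_search_space(r["eff_query_length"], eff_db)
        print(name, eff_db == r["reported_db"], space == r["reported_space"])
```

Because the mouse database is smaller, its effective search space is smaller too, so the same bit score corresponds to a somewhat lower E value there than in the human search.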