Animal-Genome cDNA Clone 20030531/20030531C-0/20030531C-003029


Query

>20030531C-003029 (20030531C-003029) 20030531C-003029
TCCTTTTGTA AGAAGAGCTC AGAATCACAA GAGGGAAACT TGCCCCAAAG
ATGAGCCTCG CATGTACAAC CTTGGCCAGC TTCCTCCTGA TTTTCATTGT
TTCCAACAAA GGTACAGTCT CTCAAATTAC TGAGGTTGTC TGGGGCATCG
TGGATCAAGA CATCAACCTG GACATTCCTG AACTTTCAAA ACATGATAAC
GTAGATCATA TACGATGGCA GAAGAATGAA AACAAGATCG CAGAATTTAA
AAAAAACAAA GAAACTCACC CTGTGAAAGA CACATACATG ATGTTACCAA
ATGGAACTCT GAGAATTAAA GATCTGAAGA GAGATGATGA GGGTATCTAC
AAGGTAACTG TCTATGCTAC GGATGGAAAA CACATGCTGG AGAGAAAATT
TGATTTGCAG ATTCTAGATG GGGTCTCAAA ACCTGTAATC TCCTGGAGCT
GTGCCAACAA AACGGTGACC TGTGAGGTAG CAGAAGGAAG TGACCCTAAG
TTAAAACTGT ATGTAAATAA GTCCACTGCC AGAGAAGGTC GTCAGAAGGT
CATCCTGTGG AAGTGGAACA CCAAATGGAG CACATTATTC AAGTGTGTGG
CCAGTAACAA CGCCAGTGAG CAAATCAGCA TGGTGACCAT CAGTTGTACG
GGGCAAGGTC TGGATATCTA TCTCATCATC GGCATCTGCG GAGGAGGCAC
CGTATTCCTC ATCTTCATAG CACTACTCAT ATTCTACACC AGCAAGAGGA
AAAAGCAGAG CAGCAGGAGA AATGATGAGG AGCTGGAGAT AAGAGCGAAG
AGAGCGAGCC CCGAGGAAAG GGGCCGGAAG CCCCATCCAT TTCCAGGCTC
AACTCCTCAA AATCCGGTCA TTTCCCAAAC TCCTCCAATG CCTGGTCATC
GTTCTCAGGC ACCTACTCCT CGTCCCGGGC CTCCTGGCCC CCGTGTCCAG
CACCCACAGA AGAAGAGGCC TCCTCCTACC CCGGGCACAC AAGTTCACCA
GCAAAAAGGC CCTCCCCTCC CCAAGCCTCG AGTTCAAACA AAACCTACCC
ATGAGGCCAA AGAAAACTCA TAACTCTGTC TCCTGATGCG GTTGTCCCTT
CCCTCTAATT AAAAGAGGAC AGAAACTGTC CTTTCCTGTA AAAGGCACTG
TGGAGTTTCT CCACTCCAGG CGCGCATGCT AGCCGCTTCC ATCAAGCGTG
CTCTGCAGGG GGACCGGGAC AGCGCCCAAG CCCAGGAGCC ACAGCCCTGT
CTGTACCATC TAACTCAGCC CCGAGGCCTG ACATCTGGAG TCTCTGGCCT
CCTCCACCAC TGCAGGAAGG GAAAAACCGT CAAAGAGTAC GGATTAGGAT
GATGGGTGAC CGAGCACAAA ATCCTGGAGA TCTCTCAGCC TCTGTCTCAC
ACCATGTGCA GGCATCAGAT ATCAAGTGGA GGGTGTGGCC GTGTCTTGTT
GCAAGCAACC TGCCTGCTGG AGACATGCTT GTCATTTGCT CACCTTATGG
GGAGTAGAAG TGAAATAAAA GGCTTGACCT GACACCAGTG TTTACTCATT
GTGGAAAGGA GCCCAGCATC CAGCTACGGA ATC

Search against the RefSeq (Human) database

Query= 20030531C-003029 (20030531C-003029) 20030531C-003029
         (1583 letters)

Database: RefSeq (Homo Sapiens/Amino Acid)
           19,244 sequences; 10,046,356 total letters

Searching..................................................done

                                                                               Score     E
            Sequences producing significant alignments:                        (bits)  Value

Alignment   gi|4502653|ref|NP_001758.1| (NM_001767) CD2 antigen (p50), shee...   385  e-107
Alignment   gi|22749435|ref|NP_689935.1| (NM_152722) hypothetical protein F...    52  1e-06
Alignment   gi|19923572|ref|NP_067004.3| (NM_021181) 19A24 protein; novel L...    46  6e-05
Alignment   gi|5032119|ref|NP_005830.1| (NM_005839) Ser/Arg-related nuclear...    40  0.004
Alignment   gi|31377595|ref|NP_653205.2| (NM_144604) hypothetical protein B...    39  0.012
Alignment   gi|4502687|ref|NP_003865.1| (NM_003874) CD84 antigen (leukocyte...    39  0.012
Alignment   gi|13194197|ref|NP_056069.1| (NM_015254) kinesin family member ...    38  0.016
Alignment   gi|7662234|ref|NP_055515.1| (NM_014700) KIAA0665 gene product; ...    38  0.016
Alignment   gi|21361571|ref|NP_001769.2| (NM_001778) CD48 antigen (B-cell m...    38  0.016
Alignment   gi|21464113|ref|NP_653087.1| (NM_144504) F11 receptor isoform a...    37  0.036


>gi|4502653|ref|NP_001758.1| (NM_001767) CD2 antigen (p50), sheep red
            blood cell receptor; lymphocyte-function antigen-2 [Homo
            sapiens]
          Length = 351

 Score =  385 bits (988), Expect = e-107
 Identities = 198/346 (57%), Positives = 243/346 (70%), Gaps = 6/346 (1%)
 Frame = +3

Query: 51   MSLACTTLASFLLIFIVSNKGTVS-QITEVV--WGIVDQDINLDIPELSKHDNVDHIRWQ 221
            MS  C  +ASFLLIF VS+KG VS +IT  +  WG + QDINLDIP     D++D I+W+
Sbjct: 1    MSFPCKFVASFLLIFNVSSKGAVSKEITNALETWGALGQDINLDIPSFQMSDDIDDIKWE 60

Query: 222  K--NENKIAEFKKNKETHPVKDTYMMLPNGTLRIKDLKRDDEGIYKVTVYATDGKHMLER 395
            K  ++ KIA+F+K KET   KDTY +  NGTL+IK LK DD+ IYKV++Y T GK++LE+
Sbjct: 61   KTSDKKKIAQFRKEKETFKEKDTYKLFKNGTLKIKHLKTDDQDIYKVSIYDTKGKNVLEK 120

Query: 396  KFDLQILDGVSKPVISWSCANKTVTCEVAEGSDPKLKLYVNKSTAREGRQKVILWKWNTK 575
             FDL+I + VSKP ISW+C N T+TCEV  G+DP+L LY +    +   Q+VI  KW T
Sbjct: 121  IFDLKIQERVSKPKISWTCINTTLTCEVMNGTDPELNLYQDGKHLKLS-QRVITHKWTTS 179

Query: 576  WSTLFKCVASNNASEQISMVTISCTGQGLDIYLIIGICGGGTVFLIFIALLIFYTSKRKK 755
             S  FKC A N  S++ S+  +SC  +GLDIYLIIGICGGG++ ++F+ALL+FY +KRKK
Sbjct: 180  LSAKFKCTAGNKVSKESSVEPVSCPEKGLDIYLIIGICGGGSLLMVFVALLVFYITKRKK 239

Query: 756  QSSRRNDEELEIRAKRASPEERGRKPHPFPGSTPQNPVISQ-TPPMPGHRSQAXXXXXXX 932
            Q SRRNDEELE RA R + EERGRKP   P STPQNP  SQ  PP PGHRSQA
Sbjct: 240  QRSRRNDEELETRAHRVATEERGRKPQQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPP 299

Query: 933  XXXXVQHPQKKRPPPTPGTQVHQQKGPPLPKPRVQTKPTHEAKENS 1070
                VQH  +KRPP   GTQVHQQKGPPLP+PRVQ KP H A ENS
Sbjct: 300  PGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPKPPHGAAENS 345



>gi|22749435|ref|NP_689935.1| (NM_152722) hypothetical protein
            FLJ25530 [Homo sapiens]
          Length = 416

 Score = 51.6 bits (122), Expect = 1e-06
 Identities = 70/332 (21%), Positives = 124/332 (37%), Gaps = 57/332 (17%)
 Frame = +3

Query: 72   LASFLLIFIVSNKG----TVSQITEVVWGIVDQDINLDIPELSKHDNVDHIRWQKNENKI 239
            LA F+ + ++         ++    ++ G V +   L +   S   +   ++WQ   +K
Sbjct: 17   LAPFVYLLLIQTDPLEGVNITSPVRLIHGTVGKSALLSVQYSSTSSDRPVVKWQLKRDKP 76

Query: 240  AEFKKNKETHPV-------KDTYMMLPNGTLRIKDLKRDDEGIYKVTVYATDGKHMLERK 398
                ++  T  +       +D   +  NG+L + DL+  DEG Y+V +  TD     E+
Sbjct: 77   VTVVQSIGTEVIGTLRPDYRDRIRLFENGSLLLSDLQLADEGTYEVEISITDDTFTGEKT 136

Query: 399  FDLQILDGVSKPVISWSCANK-------TVTCEVAEGSDPKLK-LYVNKSTAREGR---- 542
             +L +   +S+P +  +           T+ C    G+ P    L   K    + R
Sbjct: 137  INLTVDVPISRPQVLVASTTVLELSEAFTLNCSHENGTKPSYTWLKDGKPLLNDSRMLLS 196

Query: 543  --QKVI-LWKWNTKWSTLFKCVASNNASEQISMVTISCTGQGLDIYLIIGICGGGTVFLI 713
              QKV+ + +   +   L+ CV  N  S+  S+       +   +Y+I+   G   +  +
Sbjct: 197  PDQKVLTITRVLMEDDDLYSCVVENPISQGRSLPVKITVYRRSSLYIILSTGGIFLLVTL 256

Query: 714  FIALLIFYTSKRK-----KQSS-------------------------RRNDEELEIRAKR 803
                  +  SKRK     KQ+S                         R+N   L I   +
Sbjct: 257  VTVCACWKPSKRKQKKLEKQNSLEYMDQNDDRLKPEADTLPRSGEQERKNPMALYILKDK 316

Query: 804  ASPE-ERGRKPHPFPGSTPQNPVISQTPPMPG 896
             SPE E    P P   + P  P  S +P +PG
Sbjct: 317  DSPETEENPAPEPRSATEPGPPGYSVSPAVPG 348



>gi|19923572|ref|NP_067004.3| (NM_021181) 19A24 protein; novel LY9
           (lymphocyte antigen 9) like protein; CD2-like receptor
           activating cytotoxic cells [Homo sapiens]
          Length = 335

 Score = 46.2 bits (108), Expect = 6e-05
 Identities = 56/239 (23%), Positives = 99/239 (40%), Gaps = 23/239 (9%)
 Frame = +3

Query: 111 GTVSQITEVVWGIVDQDINLDIPELSKHDNVDHIRWQKNENKIAEFKKNKETHPVKDTY- 287
           G V ++   V G V        P  SK   VD I W  N   +   +    T  V
Sbjct: 24  GPVKELVGSVGGAVT------FPLKSKVKQVDSIVWTFNTTPLVTIQPEGGTIIVTQNRN 77

Query: 288 ---MMLPNG--TLRIKDLKRDDEGIYKVTVYATDGKHMLERKFDLQILDGVSKPVISWSC 452
              +  P+G  +L++  LK++D GIY V +Y++  +    +++ L + + +SKP ++
Sbjct: 78  RERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSKPKVTMGL 137

Query: 453 -ANK------TVTCEVAEGSDPKL----KLYVNKSTAREGRQKVILWKWNTKWSTLFKCV 599
            +NK       +TC +  G +  +     L    + +  G    I W+W     T F CV
Sbjct: 138 QSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMT-FICV 196

Query: 600 ASNNASEQISMVTIS---CTGQGLD---IYLIIGICGGGTVFLIFIALLIFYTSKRKKQ 758
           A N  S   S   ++   C G   D     +++ +     +  +F+  L  +  KR++Q
Sbjct: 197 ARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQ 255



>gi|5032119|ref|NP_005830.1| (NM_005839) Ser/Arg-related nuclear
            matrix protein (plenty of prolines 101-l; Ser/Arg-related
            nuclear matrix protein (plenty of prolines 101-like)
            [Homo sapiens]
          Length = 820

 Score = 40.0 bits (92), Expect = 0.004
 Identities = 49/182 (26%), Positives = 66/182 (35%), Gaps = 5/182 (2%)
 Frame = +3

Query: 744  KRKKQSSRRNDEELEIRAKRASPEERGRKPHPFPGS----TPQNPVISQTP-PMPGHRSQ 908
            KR+K++S R       R+    P  R R P P P      TP  P   +TP P P  RS
Sbjct: 533  KRQKETSPRGRRR---RSPSPPPTRRRRSPSPAPPPRRRRTPTPPPRRRTPSPPPRRRSP 589

Query: 909  AXXXXXXXXXXXVQHPQKKRPPPTPGTQVHQQKGPPLPKPRVQTKPTHEAKENS*LCLLM 1088
            +              P ++R  P+P  +      PP PK R    P  + + +
Sbjct: 590  SPRRYSP--------PIQRRYSPSPPPKRRTASPPPPPKRRASPSPPPKRRVSHSPPPKQ 641

Query: 1089 RLSLPSN*KRTETVLSCKRHCGVSPLQARMLAASIKRALQGDRDSAQAQEPQPCLYHLTQ 1268
            R S  +  KR    LS K   G SP                 R + +A+ PQP   H
Sbjct: 642  RSSPVT--KRRSPSLSSKHRKGSSP----------------SRSTREARSPQPNKRHSPS 683

Query: 1269 PR 1274
            PR
Sbjct: 684  PR 685


 Score = 31.2 bits (69), Expect = 2.0
 Identities = 28/117 (23%), Positives = 41/117 (34%), Gaps = 8/117 (6%)
 Frame = +3

Query: 744  KRKKQSSRRNDEELEIRAKRA---SPEER--GRKPHPFPGS---TPQNPVISQTPPMPGH 899
            K +  S RR    +    KR+   SP  R   R P P P     TP+ P  S     P
Sbjct: 176  KSRSPSPRRRSSPVRRERKRSHSRSPRHRTKSRSPSPAPEKKEKTPELPEPSVKVKEPSV 235

Query: 900  RSQAXXXXXXXXXXXVQHPQKKRPPPTPGTQVHQQKGPPLPKPRVQTKPTHEAKENS 1070
            +                 P+ K P P   ++  Q+K    P+ R ++K     +  S
Sbjct: 236  QEATSTSDILKVPKPEPIPEPKEPSPEKNSKKEQEKEKTRPRSRSRSKSRSRTRSRS 292


 Score = 28.9 bits (63), Expect = 9.8
 Identities = 22/76 (28%), Positives = 27/76 (34%)
 Frame = +2

Query: 809  PRGKGPEAPSISRLNSSKSGHFPNSSNAWSSFSGTYSSSRASWPPCPAPTEEEASSYPGH 988
            P+ +   +P   R  S        SS      S + SS           T E  S  P
Sbjct: 619  PKRRASPSPPPKRRVSHSPPPKQRSSPVTKRRSPSLSSKHRKGSSPSRSTREARSPQPNK 678

Query: 989  TSSPAKRPSPPQASSS 1036
              SP+ RP  PQ SSS
Sbjct: 679  RHSPSPRPRAPQTSSS 694



>gi|31377595|ref|NP_653205.2| (NM_144604) hypothetical protein
            BC001584 [Homo sapiens]
          Length = 953

 Score = 38.5 bits (88), Expect = 0.012
 Identities = 34/102 (33%), Positives = 44/102 (42%), Gaps = 16/102 (15%)
 Frame = +2

Query: 794  SEESEPRGKGPEAPSISRLNSSKSGHFPNSSNAWSSFSGTYSSSRA-SWPPCPAPTEE-- 964
            S  S   G G    S SR  SS    + + S+  SSFSG+ S SR+ S  P P+PT
Sbjct: 559  SRSSSYSGSGS---SRSRSRSSSYSSYSSRSSRHSSFSGSRSRSRSFSSSPSPSPTPSPH 615

Query: 965  --------EASSYPGHTSS-----PAKRPSPPQASSSNKTYP 1051
                    E +  PG         PA  P+PPQA+ +    P
Sbjct: 616  RPSIRTKGEPAPPPGKAGEKSVKKPAPPPAPPQATKTTAPVP 657



>gi|4502687|ref|NP_003865.1| (NM_003874) CD84 antigen (leukocyte
           antigen); leukocyte antigen CD84 [Homo sapiens]
          Length = 328

 Score = 38.5 bits (88), Expect = 0.012
 Identities = 41/181 (22%), Positives = 72/181 (39%), Gaps = 19/181 (10%)
 Frame = +3

Query: 144 GIVDQDINLDIPELSKHDNVDHIRWQKNENKIAEFKKNKETHPV---------KDTYMML 296
           GI+ + +   +  + +   V  I W    +       + ET PV         +  + +
Sbjct: 31  GILGESVTFPV-NIQEPRQVKIIAWTSKTSVAYVTPGDSETAPVVTVTHRNYYERIHALG 89

Query: 297 PNGTLRIKDLKRDDEGIYKVTVYATDGKHMLERKFDLQILDGVSKPVISW-------SCA 455
           PN  L I DL+ +D G YK  +      +   ++++LQI   + KP I+        S
Sbjct: 90  PNYNLVISDLRMEDAGDYKADINTQADPYTTTKRYNLQIYRRLGKPKITQSLMASVNSTC 149

Query: 456 NKTVTCEVAEGSDPKLKLYVNKSTAREGRQKVILWKWNTKWSTLFKCVASN---NASEQI 626
           N T+TC V +    +  +  N S   E    + +++        + C A N   N S+ I
Sbjct: 150 NVTLTCSVEK---EEKNVTYNWSPLGEEGNVLQIFQTPEDQELTYTCTAQNPVSNNSDSI 206

Query: 627 S 629
           S
Sbjct: 207 S 207



>gi|13194197|ref|NP_056069.1| (NM_015254) kinesin family member 13B;
            guanylate kinase associated kinesin; kinesin 13B [Homo
            sapiens]
          Length = 1826

 Score = 38.1 bits (87), Expect = 0.016
 Identities = 47/176 (26%), Positives = 72/176 (40%), Gaps = 8/176 (4%)
 Frame = +2

Query: 821  GPEAPSISRLNSSKSGHFPNSSNAWSSFSGTYSSSRASWPPCPAPTEEEASSYPGHTSSP 1000
            GP +P    L+ + SG+F +S +  +          A+ PP   PT  EA        +P
Sbjct: 1556 GPPSP----LSEASSGYFSHSVSTATLSDALGPGLDAAAPPGSMPTAPEAEP-----EAP 1606

Query: 1001 AKRPSPPQASSSNKTYP*GQRKLITLS---PDAVVPSL*LKEDRNCPFL*KA-----LWS 1156
               P PP A  + +  P G ++L++     PD   P+         PF  +      L S
Sbjct: 1607 ISHPPPPTAVPAEE--PPGPQQLVSPGRERPDLEAPA------PGSPFRVRRVRASELRS 1658

Query: 1157 FSTPGAHASRFHQACSAGGPGQRPSPGATALSVPSNSAPRPDIWSLWPPPPLQEGK 1324
            FS   A        CS G  G  P+PGA   ++ S+S    ++     P  L+EG+
Sbjct: 1659 FSRMLAG----DPGCSPGAEGNAPAPGAGGQALASDSEEADEV-----PEWLREGE 1705



>gi|7662234|ref|NP_055515.1| (NM_014700) KIAA0665 gene product;
            rab11-family interacting protein 3 [Homo sapiens]
          Length = 756

 Score = 38.1 bits (87), Expect = 0.016
 Identities = 54/224 (24%), Positives = 75/224 (33%), Gaps = 17/224 (7%)
 Frame = +2

Query: 803  SEPRGKGPE--------------APSISRLNSSKSGHFPNSSNAWSSFSGTYSSSRASWP 940
            SEP G  PE               P+  RL +   G  P S        G  +   A W
Sbjct: 12   SEPPGPDPEPGGPDGPGAAQLAPGPAELRLGAPVGGPDPQSPGLDEPAPGAAADGGARWS 71

Query: 941  PCPAPTEEEASSYPGHTSSPAKRPSPPQASSSNKTYP*GQRKLITLSPDAVVPSL*LKED 1120
              PAP  E     PG ++ P +     Q +S +   P G R    L P+        +E
Sbjct: 72   AGPAPGLEGGPRDPGPSAPPPRSGPRGQLASPDAPGP-GPRSEAPL-PELDPLFSWTEEP 129

Query: 1121 RNC-PFL*KALWSFSTPGAHAS-RFHQACSAGGPGQRPSPGATALSVPSNSAPRP-DIWS 1291
              C P        F   G+ +S R         P   P+ G  AL     S P+P D+
Sbjct: 130  EECGPASCPESAPFRLQGSSSSHRARGEVDVFSPFPAPTAGELALEQGPGSPPQPSDLSQ 189

Query: 1292 LWPPPPLQEGKNRQRVRIRMMGDRAQNPGDLSASVSHHVQASDI 1423
              P P    G      R+R + D     GD    +   +Q + +
Sbjct: 190  THPLPSEPVGSQEDGPRLRAVFDALDGDGDGFVRIEDFIQFATV 233



>gi|21361571|ref|NP_001769.2| (NM_001778) CD48 antigen (B-cell
           membrane protein) [Homo sapiens]
          Length = 243

 Score = 38.1 bits (87), Expect = 0.016
 Identities = 56/217 (25%), Positives = 88/217 (39%), Gaps = 20/217 (9%)
 Frame = +3

Query: 60  ACTTLASFLL---IFIVSNKGTVSQITEVVWGIVDQDINLDIPELSKHDNVDHIRWQKN- 227
           +C  L   LL   + + S +G +  +T V    V  +I+  +PE     N   + W
Sbjct: 8   SCLALELLLLPLSLLVTSIQGHLVHMTVVSGSNVTLNISESLPE-----NYKQLTWFYTF 62

Query: 228 ENKIAEFKKNKETH---PVKDTYMMLP-NGTLRIKDLKRDDEGIYKVTVYATDGKHMLER 395
           + KI E+   K  +     K    + P +G L I  ++++D   Y + V    G    E
Sbjct: 63  DQKIVEWDSRKSKYFESKFKGRVRLDPQSGALYISKVQKEDNSTYIMRVLKKTGNEQ-EW 121

Query: 396 KFDLQILDGVSKPVISW--------SCANKTVTCEVAEGSDPKLKLYVNKSTAREGRQKV 551
           K  LQ+LD V KPVI          +C  K ++C V  G       Y +K    +  Q
Sbjct: 122 KIKLQVLDPVPKPVIKIEKIEDMDDNCYLK-LSC-VIPGESVNYTWYGDKRPFPKELQNS 179

Query: 552 IL--WKWNTKWSTLFKCVASNNASEQISMVTIS--CT 650
           +L        +S  + C  SN+ S +   V +S  CT
Sbjct: 180 VLETTLMPHNYSRCYTCQVSNSVSSKNGTVCLSPPCT 216



>gi|21464113|ref|NP_653087.1| (NM_144504) F11 receptor isoform a
           precursor; platelet F11 receptor; platelet adhesion
           molecule; junctional adhesion molecule 1 [Homo sapiens]
          Length = 299

 Score = 37.0 bits (84), Expect = 0.036
 Identities = 29/86 (33%), Positives = 38/86 (43%), Gaps = 5/86 (5%)
 Frame = +3

Query: 255 NKETHPVKDTYMMLPNGTLRIKDLKRDDEGIYKVTVYATDGKHMLERKFDLQILDGVSKP 434
           NK T   +D    LP G +  K + R+D G Y   V    G    E K  L +L   SKP
Sbjct: 77  NKITASYEDRVTFLPTG-ITFKSVTREDTGTYTCMVSEEGGNSYGEVKVKLIVLVPPSKP 135

Query: 435 VI----SWSCANKTV-TCEVAEGSDP 497
            +    S +  N+ V TC   +GS P
Sbjct: 136 TVNIPSSATIGNRAVLTCSEQDGSPP 161


  Database: RefSeq (Homo Sapiens/Amino Acid)
    Posted date:  Jul 1, 2003  6:07 AM
  Number of letters in database: 10,046,356
  Number of sequences in database:  19,244

Lambda     K      H
   0.318    0.135    0.401

Gapped
Lambda     K      H
   0.267   0.0410    0.140


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 36448921
Number of Sequences: 19244
Number of extensions: 1067080
Number of successful extensions: 6352
Number of sequences better than 10.0: 203
Number of HSP's better than 10.0 without gapping: 2730
Number of HSP's successfully gapped in prelim test: 299
Number of HSP's that attempted gapping in prelim test: 1310
Number of HSP's gapped (non-prelim): 5082
length of query: 527
length of database: 10,046,356
effective HSP length: 49
effective length of query: 478
effective length of database: 9,103,400
effective search space: 4351425200
effective search space used: 4351425200
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.9 bits)
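
The statistics above are linked by simple arithmetic, and the sketch below reproduces it as a sanity check. It assumes the usual Karlin-Altschul conventions: the translated query length is the nucleotide length divided by three, the effective HSP length is subtracted once from the query and once per database sequence, and E = search space x 2^(-bit score). The function names are hypothetical and are not part of any BLAST distribution.

```python
# Sanity-check sketch for the BLAST statistics block (hypothetical helper names).

def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Effective query length times effective database length."""
    eff_query = query_len - hsp_len           # 527 - 49 = 478
    eff_db = db_letters - db_seqs * hsp_len   # 10,046,356 - 19,244 * 49
    return eff_query * eff_db


def expect_from_bits(bit_score, search_space):
    """Expected number of chance hits for a given bit score."""
    return search_space * 2.0 ** (-bit_score)


# 1583 nucleotides translate to 527 residues per reading frame.
human_space = effective_search_space(1583 // 3, 10_046_356, 19_244, 49)
print(human_space)  # 4351425200, matching "effective search space"

# Second hit in the human table: 51.6 bits, reported Expect = 1e-06.
print(f"{expect_from_bits(51.6, human_space):.0e}")
```

Run with the mouse search's figures (7,155,815 letters, 15,330 sequences, effective HSP length 47), the same function gives the search space reported for that search.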

Search against the RefSeq (Mouse) database

Query= 20030531C-003029 (20030531C-003029) 20030531C-003029
         (1583 letters)

Database: RefSeq (Mus Musculus/Amino Acid)
           15,330 sequences; 7,155,815 total letters

Searching..................................................done

                                                                               Score     E
            Sequences producing significant alignments:                        (bits)  Value

Alignment   gi|7304949|ref|NP_038514.1| (NM_013486) CD2 antigen [Mus musculus]   285  4e-77
Alignment   gi|31541977|ref|NP_653122.2| (NM_144539) RIKEN cDNA 4930560D03;...    55  7e-08
Alignment   gi|11037800|ref|NP_067623.1| (NM_021610) glycoprotein A33 (tran...    47  2e-05
Alignment   gi|31982610|ref|NP_083888.2| (NM_029612) CD2 antigen family, me...    42  8e-04
Alignment   gi|21311861|ref|NP_083027.1| (NM_028751) RIKEN cDNA 0610041D19 ...    38  0.015
Alignment   gi|27777681|ref|NP_766094.1| (NM_172506) brother of CDO; Biregi...    38  0.015
Alignment   gi|31560788|ref|NP_065254.2| (NM_020508) bromodomain containing...    37  0.020
Alignment   gi|30794258|ref|NP_821172.1| (NM_178029) hypothetical protein L...    37  0.033
Alignment   gi|21746185|ref|NP_653115.1| (NM_144532) RIKEN cDNA 2410038D05 ...    37  0.033
Alignment   gi|7949115|ref|NP_058079.1| (NM_016799) Ser/Arg-related nuclear...    35  0.074


>gi|7304949|ref|NP_038514.1| (NM_013486) CD2 antigen [Mus musculus]
          Length = 344

 Score =  285 bits (729), Expect = 4e-77
 Identities = 144/332 (43%), Positives = 202/332 (60%), Gaps = 2/332 (0%)
 Frame = +3

Query: 57   LACTTLASFLLIFIVSNKGTVSQITEVVWGIVDQDINLDIPELSKHDNVDHIRWQKNENK 236
            + C  L SF L+F +S KG   +  E +WG++   I L+IP     D++D +RW +
Sbjct: 1    MKCKFLGSFFLLFSLSGKGADCRDNETIWGVLGHGITLNIPNFQMTDDIDEVRWVRRGTL 60

Query: 237  IAEFKKNKETHPVKDTYMMLPNGTLRIKD-LKRDDEGIYKVTVYATDGKHMLERKFDLQI 413
            +AEFK+ K    + +TY +L NG+L+IK  + R+D G Y V VY T+G   LE+  D++I
Sbjct: 61   VAEFKRKKPPFLISETYEVLANGSLKIKKPMMRNDSGTYNVMVYGTNGMTRLEKDLDVRI 120

Query: 414  LDGVSKPVISWSCANKTVTCEVAEGSDPKLKLYVNKSTAREGRQKVILWKWNTKWSTLFK 593
            L+ VSKP+I W C N T+TC V +G+D +LKLY  ++      QK + ++W T  +  FK
Sbjct: 121  LERVSKPMIHWECPNTTLTCAVLQGTDFELKLYQGETLLNSLPQKNMSYQW-TNLNAPFK 179

Query: 594  CVASNNASEQISMVTISCTGQGLDIYLIIGICGGGTVFLIFIALLIFYTSKRKKQSSRRN 773
            C A N  S++  M  ++C  +GL  Y+ +G+  GG + ++ +AL IF   KR+K++ RR
Sbjct: 180  CEAINPVSKESKMEVVNCPEKGLSFYVTVGVGAGGLLLVLLVALFIFCICKRRKRNRRRK 239

Query: 774  DEELEIRAKRASPEERGRKPHPFPGSTPQNPVISQTPPMPGHRSQAXXXXXXXXXXXV-Q 950
            DEELEI+A R S  ERG KPH  P +  QN V  Q PP PGH  Q              +
Sbjct: 240  DEELEIKASRTSTVERGPKPHSTPAAAAQNSVALQAPPPPGHHLQTPGHRPLPPGHRTRE 299

Query: 951  HPQKKRPPPTPGTQVHQQKGPPLPKPRVQTKP 1046
            H QKKRPPP+ GTQ+HQQKGPPLP+PRVQ KP
Sbjct: 300  HQQKKRPPPS-GTQIHQQKGPPLPRPRVQPKP 330



>gi|31541977|ref|NP_653122.2| (NM_144539) RIKEN cDNA 4930560D03;
           novel Ly9 [Mus musculus]
          Length = 333

 Score = 55.5 bits (132), Expect = 7e-08
 Identities = 58/250 (23%), Positives = 108/250 (43%), Gaps = 21/250 (8%)
 Frame = +3

Query: 96  IVSNKGTVSQITEVVWGIVDQDINLDIPELSKHDNVDHIRWQKNENKIAEFKKNKETHPV 275
           + +  GT+ ++   + G V     L+I E+     VD++ W  N   +A  KK+  T
Sbjct: 19  VTAASGTLKKVAGALDGSVT--FTLNITEIK----VDYVVWTFNTFFLAMVKKDGVTSQS 72

Query: 276 KDTY-MMLPNG--TLRIKDLKRDDEGIYKVTVYATDGKHMLERKFDLQILDGVSKPVISW 446
            +   ++ P+G  ++++  LK++D G Y+  +Y+T  +  L +++ L +   +S+P ++
Sbjct: 73  SNKERIVFPDGLYSMKLSQLKKNDSGAYRAEIYSTSSQASLIQEYVLHVYKHLSRPKVTI 132

Query: 447 S-CANKTVTCEV----AEGSDPKLKLYVNKSTAR------EGRQKVILWKWNTKWSTLFK 593
              +NK  TC +    +   D +   Y  K+  +      +G    I W+   K   L
Sbjct: 133 DRQSNKNGTCVINLTCSTDQDGENVTYSWKAVGQGDNQFHDGATLSIAWRSGEKDQAL-T 191

Query: 594 CVASNNASEQISMVTIS---CTGQGLDIYLIIGI----CGGGTVFLIFIALLIFYTSKRK 752
           C+A N  S   S        C     D+  + GI    C    + L  + L IF+T   K
Sbjct: 192 CMARNPVSNSFSTPVFPQKLCEDAATDLTSLRGILYILCFSAVLILFAVLLTIFHTMWIK 251

Query: 753 KQSSRRNDEE 782
           K      D++
Sbjct: 252 KGKGCEEDKK 261



>gi|11037800|ref|NP_067623.1| (NM_021610) glycoprotein A33
           (transmembrane); A33 antigen [Mus musculus]
          Length = 319

 Score = 47.4 bits (111), Expect = 2e-05
 Identities = 47/200 (23%), Positives = 85/200 (42%), Gaps = 21/200 (10%)
 Frame = +3

Query: 294 LPNGTLRIKDLKRDDEGIYKVTV-YATDGKHMLERKFDLQILDGVSKPVISWSCA----- 455
           L N ++ I  L  DD G Y+ +V   +D     + +  L +L   SKP  S
Sbjct: 97  LSNASITIDQLTMDDNGTYECSVSLMSDQDVNAKSRVRLLVLVPPSKPDCSIQGEMVIGN 156

Query: 456 NKTVTCEVAEGSDPKLKLYVNKSTAREGR--------QKVILWKWNTKWSTLFKCVASNN 611
           N  +TC  AEGS      + + +   + R        + ++L   +T+ +  + C +SN+
Sbjct: 157 NIQLTCHSAEGSPSPQYSWKSYNAQNQQRPLTQPVSGEPLLLKNISTETAGYYICTSSND 216

Query: 612 ASEQISMVTISCTGQGLDIYLIIGICGGGTVFLIFIALLIFYTSKRKK-------QSSRR 770
              +   +T++     ++I L  GI G   V LI I ++++    R+K       + +R
Sbjct: 217 VGIESCNITVAPRPPSMNIALYAGIAGSVFVALIIIGVIVYCCCCREKDDKDQDREDARP 276

Query: 771 NDEELEIRAKRASPEERGRK 830
           N    ++  K      RGR+
Sbjct: 277 NRAAYQVPKKEQKEISRGRE 296



>gi|31982610|ref|NP_083888.2| (NM_029612) CD2 antigen family, member
           10 [Mus musculus]
          Length = 285

 Score = 42.0 bits (97), Expect = 8e-04
 Identities = 42/183 (22%), Positives = 79/183 (42%), Gaps = 21/183 (11%)
 Frame = +3

Query: 132 EVVWGIVDQDINLDIPELSKHDNVDHIRWQKNENKIAEFKKNKETHP-----VKDTY--- 287
           E V G++ + INL + E+  ++ + HI W   +N IA  K  K+  P     V   Y
Sbjct: 26  EEVIGVLQESINLSL-EIPSNEEIKHIDWLF-QNNIAIVKPGKKGQPAVIMAVDPRYRGR 83

Query: 288 --MMLPNGTLRIKDLKRDDEGIYKVTVYATDGKHMLERKFDLQILDGVSKPVISWS---- 449
             +   + +L I +L  +D G+Y   V     +  + + + L++   +SKP I+ +
Sbjct: 84  VSISESSYSLHISNLTWEDSGLYNAQVNLKTSESHITKSYHLRVYRRLSKPHITVNSNIS 143

Query: 450 ---CANKTVTCEVAEGSDPKLKLYVNK----STAREGRQKVILWKWNTKWSTLFKCVASN 608
                N ++TC +         ++++     +T+ EG      W+   K +  + C  SN
Sbjct: 144 EEGVCNISLTCSIERAGMDVTYIWLSSQDSTNTSHEGSVLSTSWRPGDK-APSYTCRVSN 202

Query: 609 NAS 617
             S
Sbjct: 203 PVS 205



>gi|21311861|ref|NP_083027.1| (NM_028751) RIKEN cDNA 0610041D19 [Mus
            musculus]
          Length = 539

 Score = 37.7 bits (86), Expect = 0.015
 Identities = 51/169 (30%), Positives = 69/169 (40%), Gaps = 21/169 (12%)
 Frame = +2

Query: 863  SGHFPNSSNAWSSFSGTYSS-SRASWPP---CPAPTEEEASSYPGHTSSPAKRPSPPQAS 1030
            SG  P S  A  S +G   S S A   P    P P  ++ S YP         PSPP
Sbjct: 259  SGGDPASPPAPGSPNGECCSVSTAGGSPEEELPLPAFDKLSPYP--------TPSPP--- 307

Query: 1031 SSNKTYP*GQRKLITLSPDAV-VPSL*LKEDRNCPFL*KALWSFSTPGAHASRFHQ---- 1195
              +  YP   RK+I  S D + +P        NC +  +   S S     + R H+
Sbjct: 308  --HPLYP--GRKVIEFSEDKIRIPRN--SPLPNCTYATRQAISLSLVEDGSERAHRSSVP 361

Query: 1196 ---ACSAGGPGQRPSPGATALSVPSNSA-PRPDIWSLW--------PPP 1306
               A + G P  +PSP  +ALS P++SA    D+ + W        PPP
Sbjct: 362  SSPASAQGSPHHQPSPAPSALSAPASSASSEEDLLASWQRAFVDRTPPP 410


 Score = 29.3 bits (64), Expect = 5.3
 Identities = 17/48 (35%), Positives = 22/48 (45%)
 Frame = +1

Query: 937  APVSSTHRRRGLLLPRAHKFTSKKALPSPSLEFKQNLPMRPKKTHNSV 1080
            AP  S HR RGL+LP           P    E   NLP+ P++   S+
Sbjct: 438  APPPSPHRERGLVLPA----EPDSGFPQDEEEEMLNLPVSPEEERQSL 481



>gi|27777681|ref|NP_766094.1| (NM_172506) brother of CDO; Biregional
           Cdon binding protein [Mus musculus]
          Length = 1110

 Score = 37.7 bits (86), Expect = 0.015
 Identities = 22/91 (24%), Positives = 43/91 (47%), Gaps = 6/91 (6%)
 Frame = +3

Query: 111 GTVSQITEVVWGIVDQDINLDIPELSKHDNVD------HIRWQKNENKIAEFKKNKETHP 272
           G V+ +   V     QD  LD+  + + D  +      H+     + ++    K +
Sbjct: 108 GAVASVPATVTLANLQDFKLDVQHVIEVDEGNTAVIACHLPESHPKAQVRYSVKQEWLEA 167

Query: 273 VKDTYMMLPNGTLRIKDLKRDDEGIYKVTVY 365
            +D Y+++P+G L+I +  ++DEG+YK   Y
Sbjct: 168 SRDNYLIMPSGNLQIVNASQEDEGMYKCAAY 198



>gi|31560788|ref|NP_065254.2| (NM_020508) bromodomain containing 4;
            bromodomain-containing 5; bromodomain-containing 4 [Mus
            musculus]
          Length = 1400

 Score = 37.4 bits (85), Expect = 0.020
 Identities = 26/111 (23%), Positives = 41/111 (36%)
 Frame = +3

Query: 738  TSKRKKQSSRRNDEELEIRAKRASPEERGRKPHPFPGSTPQNPVISQTPPMPGHRSQAXX 917
            TS+     S  ++ E+  ++K+     R +K H         P  +  P  P
Sbjct: 707  TSESSSSDSEDSETEMAPKSKKKGHTGRDQKKHHHHHHPQMQPAPAPVPQQPP------- 759

Query: 918  XXXXXXXXXVQHPQKKRPPPTPGTQVHQQKGPPLPKPRVQTKPTHEAKENS 1070
                        P  ++PPP P  Q  QQ+ PP P P    + T  A ++S
Sbjct: 760  ------------PPPQQPPPPPPPQQQQQQPPPPPPPPSMPQQTAPAMKSS 798


 Score = 28.9 bits (63), Expect = 7.0
 Identities = 21/68 (30%), Positives = 26/68 (37%), Gaps = 2/68 (2%)
 Frame = +3

Query: 831  PHPFPGSTPQNPVISQTPPMPGHRSQAXXXXXXXXXXXVQHPQKKRPPPT--PGTQVHQQ 1004
            P P     PQ  +  + PP P   S             +Q  QK +PP    P  +V  Q
Sbjct: 903  PQPPMAQPPQVLLEDEEPPAPPLTSMQMQLY-------LQQLQKVQPPTPLLPSVKVQSQ 955

Query: 1005 KGPPLPKP 1028
              PPLP P
Sbjct: 956  PPPPLPPP 963



>gi|30794258|ref|NP_821172.1| (NM_178029) hypothetical protein
            LOC233904 [Mus musculus]
          Length = 849

 Score = 36.6 bits (83), Expect = 0.033
 Identities = 24/104 (23%), Positives = 36/104 (34%)
 Frame = +3

Query: 738  TSKRKKQSSRRNDEELEIRAKRASPEERGRKPHPFPGSTPQNPVISQTPPMPGHRSQAXX 917
            +S      S   +EE       ASP     +P P P   P+   +  +P MP    +
Sbjct: 196  SSSSSSSESSSEEEEQSAVIPSASPPREVPEPLPAPDEKPETDGLVDSPVMPLSEKETLP 255

Query: 918  XXXXXXXXXVQHPQKKRPPPTPGTQVHQQKGPPLPKPRVQTKPT 1049
                        P ++ PP  P        GPP   PR+  +P+
Sbjct: 256  TQPAG-------PAEEPPPSVPQPPAEPPAGPPDAAPRLDERPS 292


 Score = 35.8 bits (81), Expect = 0.057
 Identities = 44/202 (21%), Positives = 69/202 (33%), Gaps = 17/202 (8%)
 Frame = +2

Query: 794  SEESEPRGKGPEAPSISRLNSSKSGHFPNSSNAWSSFSGTYSSSRASWPPCPAPTEE--- 964
            + +SE       + S S  +SS S    +     S+   + S  R    P PAP E+
Sbjct: 178  TSDSESGSSSSSSSSSSSSSSSSSSESSSEEEEQSAVIPSASPPREVPEPLPAPDEKPET 237

Query: 965  ------------EASSYPGHTSSPAKRPSP--PQASSSNKTYP*GQRKLITLSPDAVVPS 1102
                        E  + P   + PA+ P P  PQ  +     P      +   P + +P
Sbjct: 238  DGLVDSPVMPLSEKETLPTQPAGPAEEPPPSVPQPPAEPPAGPPDAAPRLDERPSSPIPL 297

Query: 1103 L*LKEDRNCPFL*KALWSFSTPGAHASRFHQACSAGGPGQRPSPGATALSVPSNSAPRPD 1282
            L   + R       A      P    +   QA S+G P  R  P     ++ +
Sbjct: 298  LPPPKKRRKTVSFSAAEEAPVPEPSTAAPLQAKSSG-PVSRKVPRVVERTIRNLPLDHAS 356

Query: 1283 IWSLWPPPPLQEGKNRQRVRIR 1348
            +   WP    + G+NR   R+R
Sbjct: 357  LVKSWPEEVARGGRNRAGGRVR 378


 Score = 29.3 bits (64), Expect = 5.3
 Identities = 45/187 (24%), Positives = 69/187 (36%), Gaps = 5/187 (2%)
 Frame = +2

Query: 791  KSEESEPRGKGPEAPSISRLN-----SSKSGHFPNSSNAWSSFSGTYSSSRASWPPCPAP 955
            K E     G+  ++ S S+ +       ++G   +S +  SS S + SSS +S     +
Sbjct: 146  KKEAEASDGEDEDSDSSSQCSLYADSDGENGSTSDSESGSSSSSSSSSSSSSSSSSSESS 205

Query: 956  TEEEASSYPGHTSSPAKRPSPPQASSSNKTYP*GQRKLITLSPDAVVPSL*LKEDRNCPF 1135
            +EEE  S    ++SP +    P  +   K    G      L    V+P   L E    P
Sbjct: 206  SEEEEQSAVIPSASPPREVPEPLPAPDEKPETDG------LVDSPVMP---LSEKETLP- 255

Query: 1136 L*KALWSFSTPGAHASRFHQACSAGGPGQRPSPGATALSVPSNSAPRPDIWSLWPPPPLQ 1315
                    + P   A           P   P P A   + P ++APR D     P P L
Sbjct: 256  --------TQPAGPAEE--------PPPSVPQPPAEPPAGPPDAAPRLDERPSSPIPLLP 299

Query: 1316 EGKNRQR 1336
              K R++
Sbjct: 300  PPKKRRK 306



>gi|21746185|ref|NP_653115.1| (NM_144532) RIKEN cDNA 2410038D05 [Mus
            musculus]
          Length = 271

 Score = 36.6 bits (83), Expect = 0.033
 Identities = 25/81 (30%), Positives = 37/81 (44%), Gaps = 7/81 (8%)
 Frame = +2

Query: 827  EAPSISRLNSSKSGHFPNSSNA----WSSFSGTYSSSRASWPPCPAPTEEEASSYPGHTS 994
            E P+++R  S K    P S  A     SS  G+ +S  +  PP     +EE SS P   +
Sbjct: 28   EGPALTRRRSKKESWHPGSQKASSGDQSSSQGSEASGSSKHPPRTKVGQEEPSSAPARPA 87

Query: 995  S---PAKRPSPPQASSSNKTY 1048
            S     +  S PQ  ++ +TY
Sbjct: 88   SHRHSHRHRSDPQQDAAQRTY 108



>gi|7949115|ref|NP_058079.1| (NM_016799) Ser/Arg-related nuclear
            matrix protein; plenty-of-prolines-101; serine/arginine
            repetitive matrix protein 1 [Mus musculus]
          Length = 897

 Score = 35.4 bits (80), Expect = 0.074
 Identities = 38/181 (20%), Positives = 68/181 (36%), Gaps = 4/181 (2%)
 Frame = +3

Query: 744  KRKKQSSRRNDEELEIRAKRASPEERGRKPHPFPGSTPQNPVISQTPPMPGHRSQAXXXX 923
            KR+K++S R       ++       R R P P P    ++P  +  PP P    +
Sbjct: 533  KRQKETSPRMQMGKRWQSPVTKSSRRRRSPSPPPARRRRSPSPAPPPPPPPPPPRRRRSP 592

Query: 924  XXXXXXXVQHPQKKRPPPTP---GTQVHQQKGP-PLPKPRVQTKPTHEAKENS*LCLLMR 1091
                      P  +R  P+P      + ++  P P PK R  + P    +  S
Sbjct: 593  TPPPRRRTPSPPPRRRSPSPRRYSPPIQRRYSPSPPPKRRTASPPPPPKRRAS------- 645

Query: 1092 LSLPSN*KRTETVLSCKRHCGVSPLQARMLAASIKRALQGDRDSAQAQEPQPCLYHLTQP 1271
             S P   + + +    +R   V+  ++  L++  ++     R + +A+ PQP   H   P
Sbjct: 646  PSPPPKRRVSHSPPPKQRSPTVTKRRSPSLSSKHRKGSSPGRSTREARSPQPNKRHSPSP 705

Query: 1272 R 1274
            R
Sbjct: 706  R 706


  Database: RefSeq (Mus Musculus/Amino Acid)
    Posted date:  Jul 1, 2003  6:07 AM
  Number of letters in database: 7,155,815
  Number of sequences in database:  15,330

Lambda     K      H
   0.318    0.135    0.401

Gapped
Lambda     K      H
   0.267   0.0410    0.140


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 25742142
Number of Sequences: 15330
Number of extensions: 733272
Number of successful extensions: 4019
Number of sequences better than 10.0: 137
Number of HSP's better than 10.0 without gapping: 2252
Number of HSP's successfully gapped in prelim test: 110
Number of HSP's that attempted gapping in prelim test: 497
Number of HSP's gapped (non-prelim): 3455
length of query: 527
length of database: 7,155,815
effective HSP length: 47
effective length of query: 480
effective length of database: 6,435,305
effective search space: 3088946400
effective search space used: 3088946400
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (28.5 bits)