Drosophila melanogaster Meigen (fruit fly) [DME]

Amino acids sequence

        10         20         30         40         50         60
MGVTGLWKLI EPCGKPVPVE TLEGKILAVD ISIWLHQVVK GFQDNKGSAL SNAHLLGLFH
        70         80         90        100        110        120
RLCKLLYYRV RPVFIFDGCV PQLKRDTIAR RQQQRNKLSN EADRIQALLL QSLAKEKVVQ
       130        140        150        160        170        180
QALGKNAELL LKSPVKRPPP AKKNDEDDLF KLPELPAASV QDNQDESEQD TSASASDSSF
       190        200        210        220        230        240
DESTARHSYN SSLQAIDVKS QHFRNLPADV RHEILTDIKE TRKQSSWGRL HELPARSDDF
       250        260        270        280        290        300
CSFQMKRLLK RRAVQESLEQ AEQEMGGHTL TYAELCDFFN EEGILTPTAI EQCTRQISSD
       310        320        330        340        350        360
EHTRFLLVRD LKKKAMESTK QEVKMEMIEE VPAEEEDEKP STSTKKEAVK SVDLGTEFDE
       370        380        390        400        410        420
DLAKALSMSM EETKVYDEKD YEYDSDQELR LNRAQTKQLR HAAKGPARAY MIEYGGMNDE
       430        440        450        460        470        480
EVGNIMEATQ FNDTQSLEKL LEITTVPTDM ADNSIEEAKL ISQAIEESKQ LSQAIEESKK
       490        500        510        520        530        540
NLNEDKVEIV DTDTDSDLEE VMEVQELDKG KKNLEICVDI TGQADSNDLF ADIFEDGEAN
       550        560        570        580        590        600
KIEKTISVEE DDDFIEVKDS EELKLDTEDE NKPITNKSIK ESNEVKPFID EIIEVKDSQE
       610        620        630        640        650        660
AVPAEPNLKP DLESILNDLK KQTAAVKDIQ LNVNEEEKPK PKVEISSILD ELKVKMADVK
       670        680        690        700        710        720
NITLDNVKLS NSVPIILSSD DESTLKSSKI VPKQELIELC DSDDNKNNRL SPNKTPSKNK
       730        740        750        760        770        780
SIKDFFETSY VVKRTPDKSQ ASNETSPGTP KTPKPFFRKR TPKSGRKRAS DANEDSDEEV
       790        800        810        820        830        840
SPTKRSSKAS KSLFEPKEPE EEKTVDPEEI IKDAAEALKS QKTSEELQEL ATNLAQERKE
       850        860        870        880        890        900
LEIERNRQDR MGMSISQRMS IDCQELLRLF GIPYIVAPME AEAQCAFLNA TDLTHGTITD
       910        920        930        940        950        960
DSDIWLFGGR TVYKNFFAQN KHVMEFRAEQ IEQTFNCNRG KLIQLACLVG SDYTTGIHGI
       970        980        990       1000       1010       1020
GAVTALEILA SFSGQDANGP GICNQSVLQT LIKFRDWWQA HKCSNLPPGS SARLALRKKL
      1030       1040       1050       1060       1070       1080
KNIELHEGFP SGAVVEAYLA PTIDDNRDAF SWGTPDVESI REFTRKSFGW TTSKTDDILM
      1090       1100       1110       1120       1130       1140
PVMKKINEKK IQGSIRNYFT AKSALRVQQP HVSKRVQLAI DKMSGKIDET PEKPKKVTRT
      1150       1160       1170       1180       1190       1200
RRAKAAPPTD DDLAIADVAT KAARPKRGKR KAAPESVVLD GELPSTSQSI PKPEKCPRIP
      1210       1220       1230
SSVEVIPQRE KDLEQMRLNK AKAAEILKNS AKANRK  

Encoded by mus201 gene

FULL NAME: mutagen-sensitive 201


OTHER NAME(S):
55A11T
CG10890
CG32956
Dmel\CG10890
DmXPG
ESTS:55A11T
mus(2)201
mus-201
rad202
XPG
xpg/mus201
XPG[Dm]
XPG[DM]


Nucleic acid sequence

        10         20         30         40         50         60
atgggagtta cgggcttgtg gaagctcatt gagccgtgcg gcaagccagt ccccgtagag
        70         80         90        100        110        120
actttggagg gcaaaatcct agcagtggat atatcgattt ggttgcatca ggttgtgaag
       130        140        150        160        170        180
ggctttcagg acaacaaggg atcggccctg agtaatgccc atttactggg tctattccat
       190        200        210        220        230        240
cgtctctgta aattgctata ctatcgtgtg cgaccggttt tcattttcga tggatgcgtg
       250        260        270        280        290        300
ccgcagctta aaagggacac tattgcacgt cgccagcagc agaggaataa gctcagcaac
       310        320        330        340        350        360
gaagctgatc gcattcaggc tttgctcctt caatccctgg ccaaagaaaa agtagtgcag
       370        380        390        400        410        420
caagcgctgg gcaagaatgc agagctactg ttgaaatccc cggttaagcg accgcctcca
       430        440        450        460        470        480
gctaagaaga acgacgagga tgacttgttt aagcttcccg aactgccggc tgcatcggtg
       490        500        510        520        530        540
caagataatc aagatgaaag cgagcaggac accagtgcca gtgcctcaga cagctccttt
       550        560        570        580        590        600
gacgagtcaa ccgctcgaca ttcttacaac tctagcttac aggccatcga tgtaaagagt
       610        620        630        640        650        660
cagcattttc ggaatctgcc agccgatgtg cgacacgaga tccttacaga catcaaggag
       670        680        690        700        710        720
acgcgcaagc aatcgtcgtg gggccgtctg cacgagttgc ccgcccgcag cgatgacttc
       730        740        750        760        770        780
tgctccttcc agatgaagag actcctcaag cgccgagccg ttcaggaaag cttagagcag
       790        800        810        820        830        840
gcggaacagg agatgggcgg acatacgctt acctatgcag agttgtgtga ctttttcaat
       850        860        870        880        890        900
gaggagggca ttctcacccc aacggccatt gaacagtgca cccgacaaat tagctcggat
       910        920        930        940        950        960
gaacatacac gattcctgct ggtcagggat cttaaaaaga aagcaatgga gagcaccaaa
       970        980        990       1000       1010       1020
caagaggtta aaatggagat gattgaggag gtacccgctg aggaggagga tgagaagcca
      1030       1040       1050       1060       1070       1080
agcacttcca ccaagaaaga ggctgtgaaa agtgtggacc taggcacaga gttcgatgaa
      1090       1100       1110       1120       1130       1140
gatttggcca aagcactgtc tatgtcaatg gaggagacca aggtgtacga tgaaaaggac
      1150       1160       1170       1180       1190       1200
tacgagtacg actcggacca agagttgcgc ctcaatcgag ctcaaaccaa gcaattgcgc
      1210       1220       1230       1240       1250       1260
catgcggcca agggacctgc acgggcctat atgattgaat acggcggaat gaacgacgag
      1270       1280       1290       1300       1310       1320
gaagttggca acatcatgga ggccactcag ttcaatgaca ctcaaagcct tgaaaagttg
      1330       1340       1350       1360       1370       1380
ctagagatca cgacagtccc gaccgacatg gctgacaact cgattgagga ggccaaactt
      1390       1400       1410       1420       1430       1440
atttcacaag ctattgaaga gagcaaacaa ctatcccaag caattgagga aagcaagaag
      1450       1460       1470       1480       1490       1500
aatcttaacg aggacaaggt ggagattgta gatactgata ccgactcaga cttggaggaa
      1510       1520       1530       1540       1550       1560
gtaatggaag ttcaggagct ggataaaggc aagaagaatc ttgagatttg tgttgatatt
      1570       1580       1590       1600       1610       1620
actggccaag cggattcgaa tgatctgttt gcggatattt ttgaggatgg agaggcaaat
      1630       1640       1650       1660       1670       1680
aaaatagaga aaactataag cgtcgaggaa gacgatgact ttatagaagt gaaagacagt
      1690       1700       1710       1720       1730       1740
gaggaattaa aattggatac tgaagatgaa aataaaccaa taacgaataa gagtattaaa
      1750       1760       1770       1780       1790       1800
gaaagtaatg aagtcaagcc gttcattgat gaaattatcg aagtgaaaga tagccaagaa
      1810       1820       1830       1840       1850       1860
gcggttcctg cagaacctaa tctcaaaccc gatctggaat caattttaaa tgatctgaaa
      1870       1880       1890       1900       1910       1920
aaacaaacgg ctgcggttaa agatattcag ctaaacgtaa atgaagaaga aaaaccaaaa
      1930       1940       1950       1960       1970       1980
ccaaaggtgg aaatcagttc tatattggat gagctaaaag taaagatggc cgacgttaaa
      1990       2000       2010       2020       2030       2040
aacatcactc tagataacgt gaaattaagc aatagtgtcc ctatcatact atcctccgat
      2050       2060       2070       2080       2090       2100
gacgaaagta ccttgaaatc ctcgaaaata gttccgaagc aagagctaat tgagttgtgc
      2110       2120       2130       2140       2150       2160
gacagcgacg ataataaaaa caatcgccta tcaccgaaca aaacgccaag caaaaacaag
      2170       2180       2190       2200       2210       2220
tccattaagg acttttttga gaccagttac gtggtcaagc gaactcccga caaatctcaa
      2230       2240       2250       2260       2270       2280
gcgtcaaacg aaacttcgcc gggaacacca aagaccccga agcccttctt cagaaagaga
      2290       2300       2310       2320       2330       2340
accccgaagt ccggacggaa acgagctagt gatgccaatg aggacagtga tgaggaagtt
      2350       2360       2370       2380       2390       2400
tcgcccacta aaaggtccag taaggcctcg aagtctctct tcgagccaaa ggagccggaa
      2410       2420       2430       2440       2450       2460
gaagaaaaga ctgtagatcc ggaggagatt atcaaagatg ctgcagaggc tcttaaatct
      2470       2480       2490       2500       2510       2520
caaaagacct cagaagaact gcaggagttg gctacaaact tagctcagga gcgaaaggaa
      2530       2540       2550       2560       2570       2580
cttgaaatcg agcggaatcg gcaagaccga atgggcatgt ccatcagcca gcgcatgagc
      2590       2600       2610       2620       2630       2640
atcgattgcc aggagctgct gcgtcttttc ggcattccgt acattgtggc tcccatggag
      2650       2660       2670       2680       2690       2700
gcagaggcgc agtgcgcctt tctcaatgcc acagatctta cccacggcac catcacggat
      2710       2720       2730       2740       2750       2760
gatagtgata tctggctttt tggtggtcga actgtctaca agaacttctt tgcacaaaac
      2770       2780       2790       2800       2810       2820
aagcacgtga tggaattccg ggcggaacag atcgagcaaa cgtttaactg caacaggggt
      2830       2840       2850       2860       2870       2880
aaactgatcc agttggcctg tttggtgggc agtgactaca ctacaggaat tcatggcatt
      2890       2900       2910       2920       2930       2940
ggtgctgtaa cggccctgga gattttggcc tccttttccg gacaggatgc gaatgggcca
      2950       2960       2970       2980       2990       3000
ggtatctgca atcaatcggt gcttcaaacg ctaatcaagt tccgcgactg gtggcaagca
      3010       3020       3030       3040       3050       3060
cacaagtgca gcaatcttcc acctggcagc tcggctcgcc ttgctctgcg caaaaaactt
      3070       3080       3090       3100       3110       3120
aagaacatcg aactgcacga gggttttccc agcggtgcag tggtggaggc atatttagcg
      3130       3140       3150       3160       3170       3180
cccacgatcg acgacaatcg ggatgcattt agttggggca caccggatgt ggaatcaatc
      3190       3200       3210       3220       3230       3240
agagaattca cgcgaaaatc tttcggctgg actacttcaa aaacggacga cattctgatg
      3250       3260       3270       3280       3290       3300
cccgtgatga aaaaaattaa cgagaagaag atacagggtt ccatacgcaa ctactttacg
      3310       3320       3330       3340       3350       3360
gcaaagagtg ctctgcgagt tcaacaaccg cacgtcagca aacgtgtcca attggcaatt
      3370       3380       3390       3400       3410       3420
gacaagatgt ccggaaagat cgatgagacg ccggagaaac cgaagaaggt gacacgcaca
      3430       3440       3450       3460       3470       3480
agacgtgcaa aggcagctcc gccaacagat gatgatttgg ccatcgcgga tgtcgcgaca
      3490       3500       3510       3520       3530       3540
aaagctgccc gtcccaaacg cggcaaacga aaagctgcac ctgaatcggt agttttggat
      3550       3560       3570       3580       3590       3600
ggggaacttc cttccacatc gcagtcgata ccgaagcctg aaaaatgtcc tcgaatacct
      3610       3620       3630       3640       3650       3660
agcagcgttg aagttatacc tcaaagggaa aaggatctgg agcagatgcg cctcaataaa
      3670       3680       3690       3700       3710
gcaaaggcgg cagagattct taaaaattca gcgaaagcca ataggaaata a

Last modification date: Oct. 2, 2011