Drosophila melanogaster Meigen (fruit fly) [DME]


RELATED PATHWAY(S):
base excision repair (BER)


Amino acid sequence

        10         20         30         40         50         60
MQEESGSTPL LSSSFSYTPT VAPLVLKIAL IQAARSFAAS KYTIKVLLSL NLALSEQAIV
        70         80         90        100        110        120
QKAVKPICTV MASEVDASSG PEDGTVPTLM SLTPYITNLE HGNNESGKYV SGLPNNRKRQ
       130        140        150        160        170        180
KLSILEHNTI QNNDVDNTEV EPNMDPKSKM SDTKSNKYSE MEKHLNDNSK IVIEGTISIG
       190        200        210        220        230        240
NSKRKSPEKL VPYDNDCYMP PEQALVTSVE LVVPAPQTHS SNTPGRQSEE PNLSTLGESS
       250        260        270        280        290        300
TTPSSTIDNK LQYSTAGLYN STSSTSILAN DKIVGCANDS SNLNLRIPTK LVVTTASGDI
       310        320        330        340        350        360
LIDDRRASLW TPHHDESGQR QQRTGTASSD TKQEPMNVSS ELSYHHQNRH SELLLQIEKE
       370        380        390        400        410        420
SSGSFLQASP IPLQDNHNNA ASGQFGQTEE TSNIDSQSHN NFYAQMMQPQ HLLHNQHQQS
       430        440        450        460        470        480
MHEHSPRHQQ PASYSGYITH YQNPPMFGAH QSEHHQRLNQ QQQPLQHLLD CHGHLEQSTP
       490        500        510        520        530        540
ISQQNQHHLS QQIHQHQHQQ THQRLPLREN YHDIIMDDFH EEPSHAFKLT LSPSNTKPEN
       550        560        570        580        590        600
QDDGYETSAG DVLTPNSHSS STHSITPQHQ MQHSNIVLMT QNQKKSDDLQ LTKVTLSGEA
       610        620        630        640        650        660
HTDPNACSSN SSQGQVLASQ SHLELSEGTR CSSHASVVDP YSFMGEELHM HSPSHRHLDA
       670        680        690        700        710        720
VTTGPGRYGI LVSNDTPECL SREMYRHSQQ STTVLEQTDS SSCGINFKPM PKKRGRKKKL
       730        740        750        760        770        780
VAVNADTSQM TTPVDQQKVS AGRADCEDGG GDQAAKPKER KKHDRFNGMS EEEVIKRTIP
       790        800        810        820        830        840
DHLCDNLDIV IVGINPGLFA AYKGHHYAGP GNHFWKCLYL AGLTQEQMSA DEDHKLIKQG
       850        860        870        880        890        900
IGFTNMVARA TKGSADLTRK EIKEGSRILL EKLQRFRPKV AVFNGKLIFE VFSGKKEFHF
       910        920        930        940        950        960
GRQPDRVDGT DTFIWVMPSS SARCAQLPRA ADKVPFYAAL KKFRDFLNGQ IPHIDESECV
       970        980        990       1000       1010       1020
FTDQRIRLCS AQQQVDIVGK INKTHQPPLG DHPSSLTVVS NCSGPIAGDA ECGIVAEESD
      1030       1040       1050       1060       1070       1080
QVQSEKMIPQ MDPTVPSSSN ATDGKSFSYT AENTPLLPVS NHNPSINENN YLSVMGSQQP
      1090       1100       1110       1120       1130       1140
LSQQPLEKKK RGRPKKIKGQ DIIDHSVGGK ASIAGQHIPS HDFNNILNLS VMSGGGTIET
      1150       1160       1170       1180       1190       1200
PKKKRGRPKK LKPAIDNIMT VKQLQHGNNN LNTTAGLSAS SMHPISMEHI AASPQSSHQM
      1210       1220       1230       1240       1250       1260
PPSLYNTPPP SHLLYTASAS PMASPALNCN YTQVHGHGTP PVGQVASVAQ GSSPVIDTQN
      1270       1280       1290       1300       1310       1320
DHLAQQKQSH HGNLGAGLDM RDHPHLGETP PPSSPNMCST VDFDPPDEHS GSQVGSRVQN
      1330       1340       1350       1360       1370       1380
KAVELDHQHP QIMEKVQYDS PVPNTEANPA HPHENYQQWL SPHPHQSNQP AQKLTHRQQH
      1390       1400       1410       1420       1430       1440
PPMHHFHQEQ TENWQRYEEQ NSNPYMVISA HHQHLSPRLG NQTHQNSSPS GHISSDVAHK
      1450       1460       1470       1480       1490       1500
SLCGLESLVD QIPAIREQDC SNIPLATVAA AAAAVESRIL SLQHQHQHPL QPHQQNQQNQ
      1510       1520       1530       1540       1550       1560
QQQLKQCKQE NSAHRESCRP TSENSNVSNS NFSVSSLAAS ASSARTDNAI YGNGETKGNN
      1570       1580       1590       1600       1610       1620
ESSHHNSCDT NIDYPIHNQS AYHHTPHLIG SALGTNVNNS EPNLHTISHP HPPHPHPHSM
      1630       1640       1650       1660       1670       1680
YVDQAHHMAH IPSVNVNSMY GPAYGSHPQH TTGEYPGTHG HYSLGGSVQT AVPTSSATLH
      1690       1700       1710       1720       1730
VPSPNYPFGH HPYGHTPPQA NYPSYTHPHT HHHHSHPSHH LTVFDHLKPS DISGYGGF

Encoded by the Thd1 gene


OTHER NAME(S):
1981
CG1981
Dmel\CG1981
G/T-mismatch-specific-thymine-DNA-glycosylase


Nucleic acid sequence

        10         20         30         40         50         60
atgcaagagg aaagtggaag cactccactg ttaagtagtt ccttctctta cacgcctaca
        70         80         90        100        110        120
gtagcgccat tagtcttaaa aatagctctt atacaagcag ctagatcatt tgctgcctct
       130        140        150        160        170        180
aaatatacca ttaaagttct gctttcttta aacttagctt tgtctgagca agcaatagtt
       190        200        210        220        230        240
cagaaggcag taaaacctat ttgcaccgta atggcatcag aggtggatgc gagttcggga
       250        260        270        280        290        300
ccagaagatg ggaccgtacc aactttgatg tcattaacac catacataac taatttagaa
       310        320        330        340        350        360
catggcaaca atgaatcagg caagtatgtc tctggcctgc ccaataatcg caaacgacag
       370        380        390        400        410        420
aagctgtcca ttcttgaaca caacaccatt cagaacaatg atgtggataa tacggaagtc
       430        440        450        460        470        480
gagcccaata tggatccgaa atcaaaaatg tcagacacca agtcaaataa atattccgaa
       490        500        510        520        530        540
atggagaagc atttgaatga caacagtaag atcgtcattg aagggacaat atctataggc
       550        560        570        580        590        600
aactcaaaaa gaaaaagtcc agaaaaatta gtgccatatg ataacgattg ctatatgccc
       610        620        630        640        650        660
ccagagcagg cactggtcac tagcgtggag ttagtcgttc ctgcccctca aacacattca
       670        680        690        700        710        720
tcgaacacac cgggcaggca atctgaggag cccaatctta gcactctggg cgaatcctcg
       730        740        750        760        770        780
actacaccgt catcgactat tgataacaaa ttgcaataca gtacggctgg tctgtataat
       790        800        810        820        830        840
tctactagtt ccacaagtat attagcaaat gataaaatcg ttggttgtgc aaatgactct
       850        860        870        880        890        900
tccaatttga acttaagaat tcctactaag ttagtagtaa cgacagcatc aggtgatatt
       910        920        930        940        950        960
ttgattgatg atcgaagggc ttctttatgg acacctcacc atgacgaatc cggccaaagg
       970        980        990       1000       1010       1020
cagcaacgca caggaacagc atcgagtgac acgaagcaag agcccatgaa tgtaagctca
      1030       1040       1050       1060       1070       1080
gagctgtcat accaccacca gaatcgtcac tccgaacttt tgctacaaat cgagaaagaa
      1090       1100       1110       1120       1130       1140
agctctggtt catttttaca agcaagtcca attccattgc aggataacca taataatgca
      1150       1160       1170       1180       1190       1200
gcatcgggcc agtttggcca aacagaagag acttccaata ttgacagtca gtcgcacaat
      1210       1220       1230       1240       1250       1260
aatttttatg cccaaatgat gcagccgcag caccttcttc ataatcagca tcaacaaagc
      1270       1280       1290       1300       1310       1320
atgcatgaac acagtcctcg ccatcagcag cctgcaagtt attctggcta tatcactcat
      1330       1340       1350       1360       1370       1380
tatcaaaatc ctccgatgtt tggtgctcat caaagtgagc atcatcaacg tcttaaccaa
      1390       1400       1410       1420       1430       1440
cagcagcaac cacttcagca tctattggat tgccacgggc atttagagca atcgactcct
      1450       1460       1470       1480       1490       1500
atttcccaac aaaatcaaca tcatttgtca cagcagatcc atcagcacca acatcagcag
      1510       1520       1530       1540       1550       1560
acacaccaac gattgccatt gcgcgaaaac tatcatgata ttattatgga tgatttccat
      1570       1580       1590       1600       1610       1620
gaagaaccca gccatgcctt caaattaacg ctctctccaa gcaacacaaa acctgagaat
      1630       1640       1650       1660       1670       1680
caagatgatg gttatgagac aagtgctggc gacgttttaa caccaaattc gcatagttcg
      1690       1700       1710       1720       1730       1740
tccacacact ccattactcc tcagcatcaa atgcaacatt cgaatatcgt gcttatgacg
      1750       1760       1770       1780       1790       1800
caaaaccaga agaagtctga tgatttgcaa ttgacaaaag ttacactctc cggggaagct
      1810       1820       1830       1840       1850       1860
cacactgacc ccaatgcttg ctcatcaaac agtagtcaag gacaggttct cgcctcccaa
      1870       1880       1890       1900       1910       1920
tctcatctag aactgagtga gggcactcga tgctcaagtc atgcgtctgt tgtcgatccc
      1930       1940       1950       1960       1970       1980
tacagtttta tgggtgagga gttgcacatg cattcaccat cacatcgtca tttggatgct
      1990       2000       2010       2020       2030       2040
gtcacaactg ggccagggcg gtatgggatt ttagtatcaa atgatactcc cgaatgtcta
      2050       2060       2070       2080       2090       2100
agcagagaaa tgtatcgaca cagtcaacaa agtacaactg tattggaaca aactgatagc
      2110       2120       2130       2140       2150       2160
tcttcgtgcg gcataaattt caagccaatg ccaaaaaaaa gaggccgcaa aaagaagcta
      2170       2180       2190       2200       2210       2220
gtagctgtta atgctgatac ctcacaaatg acaacgccag tcgatcaaca aaaggtgtct
      2230       2240       2250       2260       2270       2280
gctggcagag cagattgtga agatggtggt ggcgatcaag ctgcaaaacc caaggaacgc
      2290       2300       2310       2320       2330       2340
aaaaaacatg acagatttaa tggaatgtcc gaggaggaag tgattaaacg gacaattcca
      2350       2360       2370       2380       2390       2400
gatcatctat gtgataacct tgatatcgtt atagtcggga taaatcctgg gttatttgca
      2410       2420       2430       2440       2450       2460
gcatacaagg gacaccacta cgcaggtcca ggcaatcact tctggaagtg tctttactta
      2470       2480       2490       2500       2510       2520
gcaggactta ctcaggagca gatgagtgct gatgaagatc acaagctgat aaagcaagga
      2530       2540       2550       2560       2570       2580
ataggattta ccaatatggt tgctcgagcc acgaaaggat ctgctgacct tacaagaaag
      2590       2600       2610       2620       2630       2640
gaaataaaag agggcagccg aattttgcta gaaaaacttc agaggtttcg cccaaaagta
      2650       2660       2670       2680       2690       2700
gccgttttta atggcaaact aatatttgaa gtgttttcgg gaaaaaagga atttcacttt
      2710       2720       2730       2740       2750       2760
ggtcgccaac ctgatcgtgt tgatggcacg gacacattca tttgggtgat gccctcatct
      2770       2780       2790       2800       2810       2820
tcagcacgat gcgcacagtt gccgcgcgcc gctgacaaag taccgtttta tgcggctcta
      2830       2840       2850       2860       2870       2880
aaaaagtttc gcgacttttt gaacggacag ataccccata tcgatgaatc agagtgcgtg
      2890       2900       2910       2920       2930       2940
tttacggatc aaagaatccg tctatgtagc gcacagcaac aggtggatat cgttggtaaa
      2950       2960       2970       2980       2990       3000
attaataaaa cacatcaacc tcctcttggc gatcatccat ccagtttaac agtagtaagt
      3010       3020       3030       3040       3050       3060
aactgtagtg gtccaatcgc aggggatgcg gagtgtggaa ttgtcgccga ggagtcagat
      3070       3080       3090       3100       3110       3120
caggttcaat cggagaaaat gattccccaa atggatccca cagtgccatc ttcaagtaat
      3130       3140       3150       3160       3170       3180
gcaactgatg ggaaatcgtt ttcctatacg gcagaaaata cacccttact cccagtgtct
      3190       3200       3210       3220       3230       3240
aatcataatc cttcaattaa tgaaaacaat tacttatccg tgatgggttc tcaacagccc
      3250       3260       3270       3280       3290       3300
ttgtcgcagc aaccactaga gaagaaaaaa cggggccgtc cgaagaaaat aaagggacaa
      3310       3320       3330       3340       3350       3360
gatattattg atcattccgt cggcggaaaa gcttcaattg ccggacagca tatacccagt
      3370       3380       3390       3400       3410       3420
catgatttca ataatattct taacctttcg gtgatgtctg gcggtggcac cattgaaact
      3430       3440       3450       3460       3470       3480
ccaaaaaaaa agagaggcag accaaagaaa cttaaacccg caattgacaa cataatgaca
      3490       3500       3510       3520       3530       3540
gttaaacaac ttcaacatgg taataacaat ttaaacacaa cagcagggtt gtcggcaagc
      3550       3560       3570       3580       3590       3600
tcaatgcatc caatatccat ggagcatatt gcagcgtctc cgcaaagcag tcatcaaatg
      3610       3620       3630       3640       3650       3660
ccgccaagtt tgtacaatac gcccccaccg tctcaccttt tgtacactgc atcggcatca
      3670       3680       3690       3700       3710       3720
ccgatggcct ctcctgccct taactgcaat tacactcaag tacatggaca tggaactcca
      3730       3740       3750       3760       3770       3780
ccggtagggc aagtggcttc tgttgctcag ggctcgtcac ctgtcatcga tacgcaaaac
      3790       3800       3810       3820       3830       3840
gaccatcttg ctcaacaaaa gcaatcccac catggaaatc taggcgctgg actagatatg
      3850       3860       3870       3880       3890       3900
cgggatcatc cgcacttggg tgaaacccct ccgccgagtt cgccaaatat gtgttcaaca
      3910       3920       3930       3940       3950       3960
gtcgacttcg atccaccaga cgaacactca ggctctcaag tgggctcacg ggtgcaaaat
      3970       3980       3990       4000       4010       4020
aaagcagttg aattggatca tcaacaccct caaataatgg agaaggtcca gtatgatagc
      4030       4040       4050       4060       4070       4080
cctgtaccta ataccgaggc gaacccagct catcctcacg aaaattacca gcaatggctt
      4090       4100       4110       4120       4130       4140
tcgcctcatc ctcaccaaag taatcaacca gcacaaaaac tgacacatcg gcaacaacat
      4150       4160       4170       4180       4190       4200
ccacctatgc accatttcca tcaggagcag acggagaact ggcagcgtta cgaagaacag
      4210       4220       4230       4240       4250       4260
aattcaaatc cctatatggt gatctctgct caccaccagc atttgagtcc acgacttgga
      4270       4280       4290       4300       4310       4320
aatcaaaccc atcagaattc ctccccatct ggtcatatct cctcagatgt tgctcataaa
      4330       4340       4350       4360       4370       4380
agcttatgtg gactcgaatc gctggttgat caaataccag ccatcagaga acaagattgc
      4390       4400       4410       4420       4430       4440
agcaatatac cattagcgac agtagcggca gctgcggctg ctgtggaaag tcgtatcttg
      4450       4460       4470       4480       4490       4500
agcttgcagc accagcatca acatcccctc caaccacatc aacaaaacca acaaaatcaa
      4510       4520       4530       4540       4550       4560
cagcagcaac taaagcaatg caaacaagag aactcggcgc atagagaatc ttgtagacct
      4570       4580       4590       4600       4610       4620
acaagtgaga atagcaatgt aagcaatagc aatttttcag taagcagtct agccgcttct
      4630       4640       4650       4660       4670       4680
gcttcgtctg caagaactga caacgctata tatggaaatg gagaaacaaa gggcaacaat
      4690       4700       4710       4720       4730       4740
gaaagtagcc atcataacag ttgtgacacc aatattgact atcccattca taatcaatcg
      4750       4760       4770       4780       4790       4800
gcttatcatc atacacctca tttgattggt agcgccctgg gcacgaatgt taataattct
      4810       4820       4830       4840       4850       4860
gagcctaatc ttcatacgat ttcccatcct catccacctc acccacatcc gcattcaatg
      4870       4880       4890       4900       4910       4920
tatgtggacc aagcgcacca catggcacat atcccttccg taaatgtaaa ttcaatgtac
      4930       4940       4950       4960       4970       4980
ggccctgcgt atggatccca tccacagcat actacaggag aatacccagg aacacacggt
      4990       5000       5010       5020       5030       5040
cactatagtc tgggcggtag cgttcagact gctgtgccta caagctctgc aactttacat
      5050       5060       5070       5080       5090       5100
gttccgagtc caaattatcc atttggacac cacccatatg gtcatacgcc accccaagca
      5110       5120       5130       5140       5150       5160
aactatccaa gctatactca tccacatact caccatcacc atagccatcc ctcacaccat
      5170       5180       5190       5200       5210
ttgaccgtgt ttgatcactt aaagccgtcc gacataagtg gatacggcgg tttttga
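
The two listings in this record use the same blocked layout: a ruler line of positions, then a line of residues in groups of ten. A minimal Python sketch of how such a listing might be read back into a plain string and how the coding sequence can be compared against the amino acid sequence follows. The function names `parse_blocked_sequence` and `translate` are hypothetical and not part of this record, and the sketch assumes the standard genetic code (NCBI translation table 1) and a sequence made up only of a, c, g and t.

```python
import re

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record's coding sequence is translated with this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def parse_blocked_sequence(text):
    """Join a blocked listing into one string, skipping the position rulers."""
    residues = []
    for line in text.splitlines():
        if re.fullmatch(r"[\d\s]*", line):
            continue  # ruler line (digits and spaces only) or blank line
        residues.append("".join(line.split()))
    return "".join(residues)


def translate(cds):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on any codon containing a character other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ruler line and first residue line of the nucleic acid listing above.
first_block = """\
        10         20         30         40         50         60
atgcaagagg aaagtggaag cactccactg ttaagtagtt ccttctctta cacgcctaca
"""
print(translate(parse_blocked_sequence(first_block)))  # MQEESGSTPLLSSSFSYTPT
```

The printed residues match the first 20 positions of the amino acid listing. Applied to the full listings, the same check should relate the 5217 nucleotides (1738 codons plus the terminal tga stop codon) to the 1738 amino acids.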

Last modification date: Oct. 2, 2011