Drosophila melanogaster Meigen (fruit fly) [DME]

FULL NAME: Exonuclease 1


DESCRIPTION:
5'->3' double-stranded DNA exonuclease which may also contain a cryptic 3'->5' double-stranded DNA exonuclease activity. Also exhibits endonuclease activity against 5'-overhanging flap structures similar to those generated by displacement synthesis when DNA polymerase encounters the 5'-end of a downstream Okazaki fragment. Required for DNA mismatch repair (MMR) (By similarity).

STRUCTURE SIMILARITY:
Belongs to the XPG/RAD2 endonuclease family. EXO1 subfamily.


RELATED PATHWAY(S):
mismatch repair (MMR)


Amino acids sequence

        10         20         30         40         50         60
MGITGLIPFV GKASSQLHLK DIRGSTVAVD TYCWLHKGVF GCAEKLARGE DTDVYIQYCL
        70         80         90        100        110        120
KYVNMLLSYD IKPILVFDGQ HLPAKALTEK RRRDSRKQSK ERAAELLRLG RIEEARSHMR
       130        140        150        160        170        180
RCVDVTHDMA LRLIRECRSR NVDCIVAPYE ADAQMAWLNR ADVAQYIITE DSDLTLFGAK
       190        200        210        220        230        240
NIIFKLDLNG SGLLVEAEKL HLAMGCTEEK YHFDKFRRMC ILSGCDYLDS LPGIGLAKAC
       250        260        270        280        290        300
KFILKTEQED MRIALKKIPS YLNMRNLEVD DDYIENFMKA EATFRHMFIY NPLERRMQRL
       310        320        330        340        350        360
CALEDYETDE RYCSNAGTLL EDSEQALHLA LGNLNPFSMK RLDSWTPEKA WPTPKNVKRS
       370        380        390        400        410        420
KHKSIWQTNF QSENTHTPKK ENPCALFFKK VDFVGKTLNE EIEANQRLEQ AKQTEAELFN
       430        440        450        460        470        480
MYSFKAKRRR SPSREDSVDQ ERTPPPSPVH KSRHNPFAKE RTGEEANQRS PVVCENASLL
       490        500        510        520        530        540
RLLSPKKASP LDGEAGVKKV DSLKRSIFAK EQVQIRSRFF ATQDEQTRLQ REHLRDTEND
       550        560        570        580        590        600
DMDEQKLSSH SGHKKLRLVC KDIPGKNPIR QRCSSQISDG ETDTDTTASS LLESQDKGVP
       610        620        630        640        650        660
SPLESQEDLN NSQPQIPTEG NTNSTTIRIK SLDLLLENSP EPTQESDRNN NDAIILLSDD
       670        680        690        700        710        720
SCSSDQRASS TSSSSQQRQN FLPTSKRRVG LSKPSTAKKG TPKSRTNGKL GAVSQNQTKL
       730
SMFGFQTKPV LK    

Encoded by tos gene

FULL NAME: tosca


OTHER NAME(S):
CG10387
Dmel\CG10387
DmTosca
Tosca


Nucleic acid sequence

        10         20         30         40         50         60
atgggcatta ccggcctaat tcccttcgtg ggcaaggcct cctcgcagct gcatctaaaa
        70         80         90        100        110        120
gacattcgcg gcagcacagt ggccgtggac acatattgct ggctacacaa gggagtcttc
       130        140        150        160        170        180
ggctgtgcgg agaagttggc ccgcggcgag gatacggatg tttatataca atactgcttg
       190        200        210        220        230        240
aagtatgtga acatgctgct gtcctacgac atcaagccca ttctggtctt cgatgggcag
       250        260        270        280        290        300
cacttgccgg ccaaggcttt gaccgaaaag cggagaaggg actccaggaa gcagagcaaa
       310        320        330        340        350        360
gagcgggcgg cggaactcct tcgattgggt cgcatcgagg aggcccgatc ccatatgcga
       370        380        390        400        410        420
cgctgcgtgg atgtcaccca cgacatggcg ttgcggttga tccgggaatg ccggagccgg
       430        440        450        460        470        480
aatgttgact gcattgtggc gccctacgag gccgatgccc aaatggcctg gctgaataga
       490        500        510        520        530        540
gcagatgtgg cccagtacat catcaccgag gactcggact taacgctttt tggagccaag
       550        560        570        580        590        600
aacatcatat tcaagctgga cctcaacggc agcggcctgc tggtggaggc ggaaaaactt
       610        620        630        640        650        660
cacctggcca tgggctgcac ggaggagaag taccactttg acaagttccg gcgcatgtgc
       670        680        690        700        710        720
atcctatccg gctgtgatta cctggactca ctgcctggca tcggactggc caaggcgtgc
       730        740        750        760        770        780
aaatttatac taaaaacgga acaggaagac atgcgaatag cattaaaaaa gattccaagt
       790        800        810        820        830        840
tacctcaata tgcgcaatct tgaggtagat gacgactata ttgaaaactt catgaaagcg
       850        860        870        880        890        900
gaggccacct tcaggcacat gtttatctac aacccgctag agcgtcgcat gcagcggttg
       910        920        930        940        950        960
tgtgccctcg aagattatga aactgatgag cgctactgca gcaatgctgg caccttgctg
       970        980        990       1000       1010       1020
gaggatagcg agcaggcttt gcacttggcc ctaggcaact tgaacccctt ctctatgaag
      1030       1040       1050       1060       1070       1080
cgactggact cttggacacc ggaaaaggca tggccaacgc cgaagaacgt gaaacgatcc
      1090       1100       1110       1120       1130       1140
aagcacaaaa gtatttggca aacgaatttt caaagcgaaa acactcacac gccgaagaaa
      1150       1160       1170       1180       1190       1200
gaaaatccgt gtgccttgtt cttcaaaaaa gtggattttg tgggcaaaac tctaaacgag
      1210       1220       1230       1240       1250       1260
gaaatcgaag ctaatcagcg actggaacag gctaaacaaa cggaggccga gttgtttaac
      1270       1280       1290       1300       1310       1320
atgtacagtt tcaaagccaa aaggaggaga agtccgagca gggaagattc tgtagatcaa
      1330       1340       1350       1360       1370       1380
gaacgtacac ctcctccgtc gccggtgcac aaaagtcggc acaatccatt tgccaaggaa
      1390       1400       1410       1420       1430       1440
aggactggag aagaagctaa tcagcgatcc ccagtagttt gtgagaatgc ctctttgctg
      1450       1460       1470       1480       1490       1500
cgtttgctta gtccgaaaaa ggcaagtccg ttggatggag aagctggtgt aaaaaaggtt
      1510       1520       1530       1540       1550       1560
gattcgctta aaagaagcat attcgccaag gaacaagttc agatccgaag tcgcttcttc
      1570       1580       1590       1600       1610       1620
gccacgcagg atgaacagac aaggcttcaa agggagcact taagagatac ggaaaatgac
      1630       1640       1650       1660       1670       1680
gatatggatg agcagaagct gagctctcac tcaggccata agaaactaag actagtatgt
      1690       1700       1710       1720       1730       1740
aaagacattc cagggaagaa cccaataaga caacgttgta gttcacaaat cagtgacggt
      1750       1760       1770       1780       1790       1800
gaaacagata cagataccac agcctcttct cttttggaat cacaagataa aggtgttcct
      1810       1820       1830       1840       1850       1860
tcgcctttag aatcccaaga agatctcaac aactctcaac cccaaatacc aactgaaggc
      1870       1880       1890       1900       1910       1920
aataccaatt caacaactat tcgcattaag tcactggatt tacttctgga aaactctccg
      1930       1940       1950       1960       1970       1980
gaacccactc aggaatccga caggaacaat aatgatgcca tcattctgct atcggacgac
      1990       2000       2010       2020       2030       2040
agctgtagct cggatcagag agcatcatct acctcatcct ccagccagca acgccagaac
      2050       2060       2070       2080       2090       2100
tttttgccaa ccagcaaacg aagagtgggg ctgagtaaac cctccactgc caagaagggc
      2110       2120       2130       2140       2150       2160
actcctaaat ctaggacgaa tggaaaactg ggtgccgtga gccagaatca aaccaagctc
      2170       2180       2190
agcatgtttg gcttccagac gaaacctgtc ctcaaatag  

Last modification date: Oct. 2, 2011