Mus musculus L. (mouse) [MMU]

FULL NAME: DNA mismatch repair protein Msh2


DESCRIPTION:
Component of the post-replicative DNA mismatch repair system (MMR). Forms two different heterodimers: MutS alpha (MSH2-MSH6 heterodimer) and MutS beta (MSH2-MSH3 heterodimer), which bind to DNA mismatches, thereby initiating DNA repair. When bound, the heterodimers bend the DNA helix and shield approximately 20 base pairs. MutS alpha recognizes single base mismatches and dinucleotide insertion-deletion loops (IDL) in the DNA. MutS beta recognizes larger insertion-deletion loops up to 13 nucleotides long. After mismatch binding, MutS alpha or beta forms a ternary complex with the MutL alpha heterodimer, which is thought to be responsible for directing the downstream MMR events, including strand discrimination, excision, and resynthesis. ATP binding and hydrolysis play a pivotal role in mismatch repair functions. The ATPase activity associated with MutS alpha regulates binding in the manner of a molecular switch: mismatched DNA provokes ADP-->ATP exchange, resulting in a discernible conformational transition that converts MutS alpha into a sliding clamp capable of hydrolysis-independent diffusion along the DNA backbone. This transition is crucial for mismatch repair. MutS alpha may also play a role in DNA homologous recombination repair. In melanocytes, may modulate both UV-B-induced cell cycle regulation and apoptosis (By similarity).

STRUCTURE SIMILARITY:
Belongs to the DNA mismatch repair mutS family.


POST-TRANSLATIONAL MODIFICATION:
Phosphorylated upon DNA damage, probably by ATM or ATR (By similarity).


RELATED PATHWAY(S):
mismatch repair (MMR)


Amino acid sequence

        10         20         30         40         50         60
MAVQPKETLQ LEGAAEAGFV RFFEGMPEKP STTVRLFDRG DFYTAHGEDA LLAAREVFKT
        70         80         90        100        110        120
QGVIKYMGPA GSKTLQSVVL SKMNFESFVK DLLLVRQYRV EVYKNKAGNK ASKENEWYLA
       130        140        150        160        170        180
FKASPGNLSQ FEDILFGNND MSASVGVMGI KMAVVDGQRH VGVGYVDSTQ RKLGLCEFPE
       190        200        210        220        230        240
NDQFSNLEAL LIQIGPKECV LPGGETTGDM GKLRQVIQRG GILITERKRA DFSTKDIYQD
       250        260        270        280        290        300
LNRLLKGKKG EQINSAALPE MENQVAVSSL SAVIKFLELL SDDSNFGQFE LATFDFSQYM
       310        320        330        340        350        360
KLDMAAVRAL NLFQGSVEDT TGSQSLAALL NKCKTAQGQR LVNQWIKQPL MDRNRIEERL
       370        380        390        400        410        420
NLVEAFVEDS ELRQSLQEDL LRRFPDLNRL AKKFQRQAAN LQDCYRLYQG INQLPSVIQA
       430        440        450        460        470        480
LEKYEGRHQA LLLAVFVTPL IDLRSDFSKF QEMIETTLDM DQVENHEFLV KPSFDPNLSE
       490        500        510        520        530        540
LREVMDGLEK KMQSTLINAA RGLGLDPGKQ IKLDSSAQFG YYFRVTCKEE KVLRNNKNFS
       550        560        570        580        590        600
TVDIQKNGVK FTNSELSSLN EEYTKNKGEY EEAQDAIVKE IVNISSGYVE PMQTLNDVLA
       610        620        630        640        650        660
HLDAIVSFAH VSNAAPVPYV RPVILEKGKG RIILKASRHA CVEVQDEVAF IPNDVHFEKD
       670        680        690        700        710        720
KQMFHIITGP NMGGKSTYIR QTGVIVLMAQ IGCFVPCESA EVSIVDCILA RVGAGDSQLK
       730        740        750        760        770        780
GVSTFMAEML ETASILRSAT KDSLIIIDEL GRGTSTYDGF GLAWAISDYI ATKIGAFCMF
       790        800        810        820        830        840
ATHFHELTAL ANQIPTVNNL HVTALTTEET LTMLYQVKKG VCDQSFGIHV AELANFPRHV
       850        860        870        880        890        900
IACAKQKALE LEEFQNIGTS LGCDEAEPAA KRRCLEREQG EKIILEFLSK VKQVPFTAMS
       910        920        930
EESISAKLKQ LKAEVVAKNN SFVNEIISRI KAPAP  
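
The sequence above is laid out in blocks of 10 residues with a position ruler above each row, so it has to be flattened before it can be used in most tools. The following is a minimal sketch of one way to do that, not a reference parser for this record format. The function name `parse_numbered_sequence` is hypothetical, and the sketch assumes that ruler rows contain only numbers and that sequence rows contain only letters separated by spaces. The excerpt reuses the first and last rows shown above.

```python
def parse_numbered_sequence(block: str) -> str:
    """Flatten a ruler-annotated sequence block into one contiguous string.

    Assumption: a row made up only of integers is a position ruler and is
    skipped; every other non-empty row holds sequence chunks.
    """
    chunks = []
    for line in block.splitlines():
        tokens = line.split()
        # Skip blank rows and ruler rows such as "10  20  30 ...".
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        chunks.extend(tokens)
    return "".join(chunks)


# Excerpt only: first and last rows of the amino acid sequence above.
excerpt = """
        10         20         30         40         50         60
MAVQPKETLQ LEGAAEAGFV RFFEGMPEKP STTVRLFDRG DFYTAHGEDA LLAAREVFKT
       910        920        930
EESISAKLKQ LKAEVVAKNN SFVNEIISRI KAPAP
"""

sequence = parse_numbered_sequence(excerpt)
print(sequence)
```

The same function can be applied to the nucleic acid block further down, since it uses the same layout.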

Encoded by Msh2 gene

FULL NAME: mutS homolog 2 (E. coli)


OTHER NAME(S):
AI788990


Nucleic acid sequence

        10         20         30         40         50         60
atggcggtgc agcctaagga gacgctgcag ttggaaggcg cggccgaggc gggcttcgtg
        70         80         90        100        110        120
cgcttctttg agggcatgcc ggagaagccg agcaccacgg tgcgcctctt cgaccgcggc
       130        140        150        160        170        180
gacttttaca cggcgcacgg agaggacgcg ctgctggcgg cccgcgaggt gttcaagacc
       190        200        210        220        230        240
cagggcgtga tcaagtacat ggggccggca gggagtaaga ccctgcagag tgttgtgctt
       250        260        270        280        290        300
agcaagatga actttgagtc tttcgtgaaa gatcttcttc tggttcgcca gtatcgagtt
       310        320        330        340        350        360
gaagtttata agaataaagc tggaaataag gcgtctaagg agaatgagtg gtatctggca
       370        380        390        400        410        420
tttaaggctt ctcccggcaa tctttctcag tttgaagaca tcctgtttgg taacaatgac
       430        440        450        460        470        480
atgtcagctt ccgttggcgt tatgggtatt aaaatggccg tggttgatgg tcaaagacat
       490        500        510        520        530        540
gttggagttg ggtatgtgga ttccacccag aggaagctag gcttgtgtga gttccccgag
       550        560        570        580        590        600
aatgatcagt tctccaatct cgaggctctt ctgattcaga ttggaccaaa ggaatgcgtt
       610        620        630        640        650        660
ttaccaggag gagagactac tggagacatg gggaaactga ggcaggttat ccagagagga
       670        680        690        700        710        720
gggattctga tcacagaaag aaagagagcc gacttttcca ctaaagacat ttatcaggat
       730        740        750        760        770        780
ctcaaccggt tactgaaagg caaaaaagga gaacagataa atagtgctgc cctaccagag
       790        800        810        820        830        840
atggagaatc aggttgcagt ttcatcacta tctgcagtaa tcaagttttt agaactctta
       850        860        870        880        890        900
tcagacgatt caaattttgg gcagtttgaa ctggccactt ttgacttcag ccagtacatg
       910        920        930        940        950        960
aagttggaca tggcagcagt tagagccctc aaccttttcc agggttctgt tgaagacacc
       970        980        990       1000       1010       1020
actggctctc agtctctggc cgcattattg aataaatgca aaactgctca aggacaaaga
      1030       1040       1050       1060       1070       1080
ttggttaacc agtggatcaa gcagccgctc atggatagga acaggataga ggagaggtta
      1090       1100       1110       1120       1130       1140
aatttagtgg aagcttttgt cgaggattca gaactgaggc agagtttaca ggaggatttg
      1150       1160       1170       1180       1190       1200
cttcgccggt ttccagacct taaccggctt gccaagaaat tccagagaca agcagcgaat
      1210       1220       1230       1240       1250       1260
ttacaagact gttaccgact gtatcagggt attaaccagc tccccagcgt catccaggct
      1270       1280       1290       1300       1310       1320
ctggagaaat acgaaggaag acaccaggca ctgttgttgg cagtttttgt gactcctctt
      1330       1340       1350       1360       1370       1380
attgatcttc gttctgattt ttcaaaattt caagaaatga tagaaacaac tttagatatg
      1390       1400       1410       1420       1430       1440
gatcaggtgg aaaaccacga gttccttgta aaaccttcat ttgatcctaa cctgagtgaa
      1450       1460       1470       1480       1490       1500
ctaagagaag tcatggatgg cctggagaag aagatgcagt ccaccttaat aaatgcagcc
      1510       1520       1530       1540       1550       1560
cgggggctcg gattggatcc tggcaaacag attaaattgg actccagtgc acagtttgga
      1570       1580       1590       1600       1610       1620
tattatttcc gtgtaacctg caaggaagag aaagtgcttc gcaacaacaa gaacttcagc
      1630       1640       1650       1660       1670       1680
acagtggaca tccagaagaa tggcgtgaag tttaccaaca gtgaattgtc ctctttaaat
      1690       1700       1710       1720       1730       1740
gaagaatata ctaagaacaa aggcgagtat gaagaggccc aggatgccat tgttaaagaa
      1750       1760       1770       1780       1790       1800
attgtcaata tttcctcagg ctacgtagag ccaatgcaga cgctcaacga tgtgctggct
      1810       1820       1830       1840       1850       1860
cacttagacg ccattgttag cttcgctcat gtgtcaaacg cagcacccgt tccttatgta
      1870       1880       1890       1900       1910       1920
cgaccagtca tcttggagaa aggaaaaggg agaattatat tgaaagcctc caggcatgct
      1930       1940       1950       1960       1970       1980
tgtgttgaag ttcaagatga agttgcattt attccaaatg acgtgcactt tgaaaaagat
      1990       2000       2010       2020       2030       2040
aaacagatgt tccacatcat tactggtccc aatatgggag gtaaatcaac atacattcgt
      2050       2060       2070       2080       2090       2100
cagaccgggg tgattgtact catggcccaa atcgggtgtt ttgtgccctg tgagtcggca
      2110       2120       2130       2140       2150       2160
gaagtgtcca ttgtggattg catccttgct cgagtcgggg ctggtgacag tcaactgaaa
      2170       2180       2190       2200       2210       2220
ggcgtctcca cattcatggc tgaaatgctg gagactgctt ccatcctcag gtcagcaacc
      2230       2240       2250       2260       2270       2280
aaagactcct taataatcat tgatgagctg ggaagaggaa cctctaccta tgatggattt
      2290       2300       2310       2320       2330       2340
gggttagcat gggctatatc agattacatt gcaacgaaga ttggtgcctt ttgcatgttt
      2350       2360       2370       2380       2390       2400
gccacccatt ttcatgaact tactgctttg gccaaccaaa taccaactgt taataatcta
      2410       2420       2430       2440       2450       2460
catgtcacag cgctcactac tgaggagacc ctaactatgc tttaccaagt gaaaaaaggt
      2470       2480       2490       2500       2510       2520
gtctgtgatc agagtttcgg gattcacgtg gctgagctcg ctaacttccc gaggcacgtg
      2530       2540       2550       2560       2570       2580
atagcgtgcg ccaagcagaa ggctctagag cttgaagaat ttcagaacat tggaacctcg
      2590       2600       2610       2620       2630       2640
ctgggatgtg acgaagccga gccggctgca aagagacgct gcctggaaag agagcaaggt
      2650       2660       2670       2680       2690       2700
gagaaaatta ttctggagtt cctgtcgaag gtcaagcagg tgccctttac tgccatgtcg
      2710       2720       2730       2740       2750       2760
gaggagagca tctccgcgaa gctgaagcaa ctgaaagccg aggtggtcgc aaagaacaac
      2770       2780       2790       2800
agcttcgtaa acgagatcat ttcacggata aaggctccgg ctccgtga 
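
As a consistency check between the two sequences in this record, the coding sequence can be translated and compared with the amino acid sequence. The sketch below assumes the standard genetic code and a reading frame that starts at position 1. The names `CODON_TABLE` and `translate` are illustrative and not part of the record. Only the first 30 nucleotides and the final row are translated here.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop the 10-base spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 nucleotides (positions 1-30) and the final row (positions 2761-2808).
first_residues = translate("atggcggtgc agcctaagga gacgctgcag")
last_residues = translate("agcttcgtaa acgagatcat ttcacggata aaggctccgg ctccgtga")

print(first_residues)  # MAVQPKETLQ
print(last_residues)   # SFVNEIISRIKAPAP*
```

Both translated fragments agree with the corresponding ends of the amino acid sequence, and the trailing `*` corresponds to the terminal tga stop codon.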

Last modification date: Oct. 2, 2011