Caenorhabditis elegans Maupas (nematode) [CEL]


STRUCTURE SIMILARITY:
Belongs to the DNA mismatch repair mutS family.


RELATED PATHWAY(S):
mismatch repair (MMR)


Amino acids sequence

        10         20         30         40         50         60
MSGGKDEASD KALLKILKSK SPNTIAIFSR GEYFSVYGDD ATFVATNIFK SDVCVKTFTL
        70         80         90        100        110        120
STDNSQQMKY ISVNRGQYEK VVRETIVLLR CSVELYSSEQ GEWKMTKRGS PGNTVDFEQE
       130        140        150        160        170        180
IGVSDQAPIL AIYIHPGDDD NRVTLCAWDA GNVRIVISEY IDTPSFSQTE QCIFGLCPTE
       190        200        210        220        230        240
YILVNEGSVA PKAKKIASMF TRMEVHNKQQ LKPKSQWSDV IESVHLDYKD EAEKQNENIK
       250        260        270        280        290        300
ECLQILHSNA ADEYSISEKY SIFNYGTHGN MLIDSCAVEA LELFQLNYNY LEKSNNLTLY
       310        320        330        340        350        360
NVLNKCKTLP GEKLLRDWLS RPLCQIDHIN ERLDIVEALF ENQTIRQKLR DSILARMPDC
       370        380        390        400        410        420
SQLARRLMRK CTLQDLNRFY QAATLLETVE MQLIQLSEAE QFAPSINRLL KSEITEILKK
       430        440        450        460        470        480
VERFQVLCDE FFDFDYEKEN KEIRVRVDFV PEIQEISEKL EQMERVAEKL RKKYSAKFEC
       490        500        510        520        530        540
DNLKLDKNSQ YGFYFRVTLK EEKSIRKKDV HILETTKGSG VKFSVGELSD INDEFLEFHL
       550        560        570        580        590        600
KYTRAEEEVI SMLCKKAEEF IPLIPAMAQL IATLDVFVSL STFAATSSGI YTRPNLLPLG
       610        620        630        640        650        660
SKRLELKQCR HPVIEGNSEK PFIPNDVVLD KCRLIILTGA NMGGKSTYLR SAALSILLAQ
       670        680        690        700        710        720
IGSFVPCSSA TISVVDGIFT RVGASDKQSQ GISTFMAEML DCSAILQRAT KNSFVVIDEL
       730        740        750        760        770        780
GRGTSTFDGF GIASAIAQDI LNRIQCLSIF ATHFHEMGKL AEQPGAVALQ MGVQIENNEI
       790        800        810        820        830        840
HMLYKVFEGV AQCSFGLQVA KMVGIDENVI NKAAQLLEGL EKKLVIDSKK KKELLESADI

RQAILSLVK     

Encoded by msh-2 gene

FULL NAME: MSH (MutS Homolog) family


Nucleic acid sequence

        10         20         30         40         50         60
atgagtggag gaaaagacga agccagcgac aaagcgctcc taaaaatcct aaaatctaaa
        70         80         90        100        110        120
tcaccgaaca caattgccat tttctcacga ggcgaatact tttctgtgta tggcgatgac
       130        140        150        160        170        180
gccacctttg tcgctactaa cattttcaag agtgatgttt gcgtcaaaac gttcacttta
       190        200        210        220        230        240
tcaaccgaca attctcaaca aatgaagtac atttccgtaa atcgtggaca atacgagaag
       250        260        270        280        290        300
gtcgttcgtg aaaccatcgt actgctcaga tgctccgttg agctatactc gagtgaacag
       310        320        330        340        350        360
ggagagtgga aaatgacgaa gcgtggatct ccaggaaata ctgtggattt tgagcaagaa
       370        380        390        400        410        420
attggagtgt cagatcaggc tccgattctt gcaatctata ttcatccagg agatgatgat
       430        440        450        460        470        480
aatcgagtaa cattgtgcgc ttgggatgct ggaaatgttc gcattgtgat cagtgaatac
       490        500        510        520        530        540
atcgatactc cttctttttc tcaaacggaa caatgcattt ttggactatg ccccacagaa
       550        560        570        580        590        600
tatattctag tgaatgaagg atcagttgca ccgaaggcta agaaaatagc aagcatgttc
       610        620        630        640        650        660
acgagaatgg aggttcacaa caagcaacag ctgaagccaa agtcacaatg gagcgatgta
       670        680        690        700        710        720
atcgagtcgg ttcatctgga ttataaagat gaagccgaaa aacaaaacga aaacatcaag
       730        740        750        760        770        780
gaatgccttc aaattctaca ctcgaacgca gccgatgaat acagtatttc tgagaagtat
       790        800        810        820        830        840
agtattttca attacggaac tcatggaaat atgctgattg attcatgtgc tgtggaggct
       850        860        870        880        890        900
ctggagcttt tccaattgaa ttataattat ttggagaaat cgaataattt gacgctctac
       910        920        930        940        950        960
aatgttttaa acaaatgcaa gactcttcct ggagagaagt tgctccgtga ttggctttct
       970        980        990       1000       1010       1020
cgtccacttt gccagatcga tcacatcaac gagcgactgg atattgttga agctttgttc
      1030       1040       1050       1060       1070       1080
gagaatcaga caatccgtca gaagctccgc gattccattc ttgcaagaat gccagattgc
      1090       1100       1110       1120       1130       1140
tctcaactag ctcgtcgtct catgagaaaa tgcactcttc aggatctcaa tcgattctac
      1150       1160       1170       1180       1190       1200
caagctgcaa ctcttctcga gactgtcgaa atgcaattga ttcaactttc agaagctgaa
      1210       1220       1230       1240       1250       1260
cagtttgccc catcgattaa tcgtcttctt aaatcagaaa tcaccgaaat tctgaagaaa
      1270       1280       1290       1300       1310       1320
gtcgaacgat tccaagtgct ctgtgatgag ttcttcgatt tcgactatga gaaggagaac
      1330       1340       1350       1360       1370       1380
aaagagattc gtgttcgcgt cgattttgtt ccagagatcc aggagattag tgagaagctg
      1390       1400       1410       1420       1430       1440
gagcaaatgg agagagtcgc ggagaagttg cgcaagaaat attccgcgaa attcgaatgc
      1450       1460       1470       1480       1490       1500
gacaatttga agttggataa gaattctcag tatggatttt atttccgagt cactttgaag
      1510       1520       1530       1540       1550       1560
gaagaaaagt cgatccgcaa aaaagatgtt catattctcg aaacaacaaa gggcagtggc
      1570       1580       1590       1600       1610       1620
gtgaagtttt cggttggaga gctcagcgac attaatgatg agttcctcga attccatctg
      1630       1640       1650       1660       1670       1680
aaatacacca gggcagaaga ggaggttatc tcgatgctgt gcaagaaagc cgaagaattt
      1690       1700       1710       1720       1730       1740
attccattga ttcctgctat ggctcagctc atcgcaactc tcgacgtctt cgtctcactt
      1750       1760       1770       1780       1790       1800
tccacatttg ccgccacctc atccgggatc tatactcgac cgaatcttct tcctcttgga
      1810       1820       1830       1840       1850       1860
tcgaagcgtc tcgagttgaa acaatgtaga catccggtta ttgaaggaaa tagtgaaaaa
      1870       1880       1890       1900       1910       1920
cctttcattc cgaatgatgt ggttcttgat aaatgtcgct tgatcattct caccggagcc
      1930       1940       1950       1960       1970       1980
aacatgggtg gaaagagcac ttacctccga tcagcagctc tttccattct tctcgcccaa
      1990       2000       2010       2020       2030       2040
atcggatcgt tcgttccatg ctcctccgct acaatttcgg tcgttgatgg gattttcaca
      2050       2060       2070       2080       2090       2100
cgagttggtg catccgacaa gcaatcgcaa ggaatttcca catttatggc tgaaatgctc
      2110       2120       2130       2140       2150       2160
gattgctctg cgattcttca aagagccact aagaattcat tcgtggtaat tgatgaactt
      2170       2180       2190       2200       2210       2220
ggaagaggaa cttcgacgtt tgatggattt ggaatagcat cggctattgc acaggacatt
      2230       2240       2250       2260       2270       2280
ttgaatcgta tccaatgtct atccatcttc gccacacatt tccatgagat gggaaaactc
      2290       2300       2310       2320       2330       2340
gctgaacagc cgggagctgt tgcccttcaa atgggagttc aaatcgagaa taatgaaatt
      2350       2360       2370       2380       2390       2400
cacatgctct acaaggtttt cgagggagtt gcacagtgct ccttcggcct ccaagttgcc
      2410       2420       2430       2440       2450       2460
aaaatggttg gaatcgatga aaatgtcatt aataaagcgg cccaacttct ggaaggcctc
      2470       2480       2490       2500       2510       2520
gagaagaagc ttgtcatcga tagcaagaag aagaaggaac ttctcgaatc agcagatatc
      2530       2540       2550
cgacaagcca ttctcagcct tgtcaaataa   

Last modification date: Oct. 2, 2011