Identification
Name:Type I restriction enzyme EcoKI R protein
Synonyms:
  • R.EcoKI
Gene Name:hsdR
Enzyme Class:
Biological Properties
General Function:Involved in DNA binding
Specific Function:The EcoKI enzyme recognizes 5'-AACN(6)GTGC-3'. Subunit R is required for both nuclease and ATPase activities, but not for modification
Cellular Location:Cytoplasmic
SMPDB Pathways:Not Available
KEGG Pathways:Not Available
Metabolites:
ECMDB IDNameView
GO Classification:
Function
adenyl nucleotide binding
adenyl ribonucleotide binding
ATP binding
binding
catalytic activity
DNA binding
endonuclease activity
helicase activity
hydrolase activity
hydrolase activity, acting on acid anhydrides
hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides
hydrolase activity, acting on ester bonds
nuclease activity
nucleic acid binding
nucleoside binding
nucleoside-triphosphatase activity
purine nucleoside binding
pyrophosphatase activity
Process
cellular macromolecule metabolic process
DNA metabolic process
DNA modification
macromolecule metabolic process
metabolic process
Gene Properties
Blattner:b4350
Gene OrientationCounterclockwise
Centisome Percentage:98.74
Left Sequence End4581272
Right Sequence End4584784
Gene Sequence:
>3513 bp
ATGATGAATAAATCCAATTTTGAATTCCTGAAGGGCGTCAACGACTTCACTTATGCCATC
GCCTGTGCGGCGGAAAATAACTACCCGGATGATCCCAACACGACGCTGATTAAAATGCGT
ATGTTTGGCGAAGCCACAGCGAAACATCTTGGTCTGTTACTCAACATCCCCCCTTGTGAG
AATCAACACGATCTCCTGCGTGAACTCGGCAAAATCGCCTTTGTTGATGACAACATCCTC
TCTGTATTTCACAAATTACGCCGCATTGGTAACCAGGCGGTGCACGAATATCATAACGAT
CTCAACGATGCCCAGATGTGCCTGCGACTCGGGTTCCGCCTGGCTGTCTGGTACTACCGT
CTGGTCACTAAAGATTATGACTTCCCGGTGCCGGTGTTTGTGTTGCCGGAACGTGGTGAA
AACCTCTATCACCAGGAAGTGCTGACGCTAAAACAACAGCTTGAACAGCAGGTGCGAGAA
AAAGCGCAGACTCAGGCAGAAGTCGAAGCGCAACAGCAGAAGCTGGTTGCCCTGAACGGC
TATATCGCCATTCTGGAAGGCAAACAGCAGGAAACCGAAGCGCAAACCCAGGCTCGCCTT
GCGGCACTGGAAGCACAGCTCGCCGAGAAGAACGCGGAACTGGCAAAACAGACCGAACAG
GAACGTAAGGCTTACCACAAAGAAATTACCGATCAGGCCATCAAGCGCACACTCAACCTT
AGCGAAGAAGAGAGTCGCTTCCTGATTGATGCGCAACTGCGTAAAGCAGGCTGGCAGGCC
GACAGCAAAACCCTGCGCTTCTCCAAAGGCGCACGTCCGGAACCCGGCGTCAATAAAGCC
ATTGCCGAATGGCCGACCGGAAAAGATGAAACGGGTAATCAGGGCTTTGCGGATTATGTG
CTGTTTGTCGGCCTCAAACCCATCGCGGTGGTAGAGGCGAAACGTAACAATATCGACGTT
CCCGCCAGGCTCAATGAGTCGTATCGCTACAGTAAATGTTTCGATAATGGCTTCCTGCGG
GAAACCTTGCTTGAGCACTACTCACCGGATGAAGTGCATGAAGCAGTGCCAGAGTATGAA
ACCAGCTGGCAGGACACCAGCGGCAAACAACGGTTTAAAATCCCCTTCTGCTACTCGACC
AACGGGCGCGAATACCGCGCAACAATGAAGACCAAAAGCGGCATCTGGTATCGCGACGTG
CGTGATACCCGCAATATGTCGAAAGCCTTACCCGAGTGGCACCGCCCGGAAGAGCTGCTG
GAAATGCTCGGCAGCGAACCGCAAAAACAGAATCAGTGGTTTGCCGATAACCCTGGCATG
AGCGAGCTGGGCCTGCGTTATTATCAGGAAGATGCCGTCCGCGCGGTTGAAAAGGCAATC
GTCAAGGGGCAACAAGAGATCCTGCTGGCGATGGCGACCGGTACCGGTAAAACCCGTACG
GCAATCGCCATGATGTTCCGCCTGATCCAGTCCCAGCGTTTTAAACGCATTCTCTTCCTT
GTCGACCGCCGTTCTCTTGGCGAACAGGCGCTGGGCGCGTTTGAAGATACGCGTATTAAC
GGCGACACCTTCAACAGCATTTTCGACATTAAAGGGCTGACGGATAAATTCCCGGAAGAC
AGCACCAAAATTCACGTTGCCACCGTACAGTCGCTGGTGAAACGCACCCTGCAATCAGAT
GAACCGATGCCGGTGGCCCGTTACGACTGTATCGTCGTTGACGAAGCGCATCGCGGCTAT
ATTCTCGATAAAGAGCAGACCGAAGGCGAACTGCAGTTCCGCAGCCAGCTGGATTACGTC
TCTGCCTACCGTCGCATTCTCGATCACTTCGATGCGGTAAAAATCGCTCTCACCGCCACC
CCGGCGCTACATACTGTGCAGATTTTCGGCGAGCCGGTTTACCGTTATACCTACCGTACC
GCGGTTATCGACGGTTTTCTGATCGACCAGGATCCGCCTATTCAGATCATCACCCGCAAC
GCGCAGGAGGGGGTTTATCTCTCCAAAGGCGAGCAGGTAGAGCGCATCAGCCCGCAGGGA
GAAGTGATCAATGACACCCTGGAAGACGATCAGGATTTTGAAGTCGCCGACTTTAACCGT
GGCCTGGTGATCCCGGCGTTTAACCGCGCCGTCTGTAACGAACTCACCAATTATCTTGAC
CCGACCGGATCGCAAAAAACGCTGGTCTTCTGCGTCACCAATGCCCATGCCGATATGGTG
GTGGAAGAGCTGCGTGCCGCGTTCAAGAAAAAGTATCCGCAACTGGAGCACGACGCGATC
ATCAAGATCACCGGTGATGCCGATAAAGACGCGCGCAAAGTGCAGACCATGATCACCCGC
TTCAATAAAGAGCGGCTGCCCAATATCGTGGTAACCGTCGACCTGCTGACGACCGGCGTC
GATATTCCGTCGATCTGTAATATCGTGTTCCTGCGTAAAGTACGCAGCCGCATTCTGTAC
GAACAGATGAAAGGCCGCGCCACGCGCTTATGCCCGGAGGTGAATAAAACCAGCTTTAAG
ATTTTTGACTGTGTCGATATCTACAGCACGCTGGAGAGCGTCGACACCATGCGTCCGGTG
GTGGTGCGCCCGAAGGTGGAACTGCAAACGCTGGTCAATGAAATTACCGATTCAGAAACC
TATAAAATCACCGAAGCGGATGGCCGCAGTTTTGCCGAGCACAGCCATGAACAACTGGTG
GCGAAGCTCCAGCGTATCATCGGTCTGGCCACGTTTAACCGTGACCGCAGCGAAACGATA
GATAAACAGGTGCGTCGTCTGGATGAGCTATGCCAGGACGCGGCGGGCGTGAACTTTAAC
GGCTTCGCCTCGCGCCTGCGGGAAAAAGGGCCGCACTGGAGCGCCGAAGTCTTTAACAAA
CTGCCTGGCTTTATCGCCCGTCTGGAAAAGCTGAAAACGGACATCAACAACCTGAATGAT
GCGCCGATCTTCCTCGATATCGACGATGAAGTGGTGAGTGTAAAATCGCTGTACGGTGAT
TACGACACGCCGCAGGATTTCCTCGAAGCCTTTGACTCGCTGGTGCAACGTTCCCCGAAC
GCGCAACCGGCATTGCAGGCAGTTATTAATCGCCCGCGCGATCTCACCCGTAAAGGGCTG
GTCGAGCTACAGGAGTGGTTTGACCGCCAGCACTTTGAGGAATCTTCCCTGCGCAAAGCA
TGGAAAGAGACGCGCAATGAAGATATCGCCGCCCGGCTGATTGGTCATATTCGCCGCGCT
GCGGTGGGCGATGCGCTGAAACCGTTTGAGGAACGTGTCGATCACGCGCTGACGCGCATT
AAGGGCGAAAACGACTGGAGCAGCGAGCAATTAAGCTGGCTCGATCGTTTAGCGCAGGCG
CTGAAAGAGAAAGTGGTGCTCGACGACGATGTCTTCAAAACCGGCAACTTCCACCGTCGC
GGCGGGAAGGCGATGCTGCAAAGAACCTTTGACGATAATCTCGATACCCTGCTGGGCAAA
TTCAGCGATTATATCTGGGACGAGCTGGCCTGA
Protein Properties
Pfam Domain Function:
Protein Residues:1170
Protein Molecular Weight:134094
Protein Theoretical pI:6
Signaling Regions:
  • None
Transmembrane Regions:
  • None
Protein Sequence:
>Type I restriction enzyme EcoKI R protein
MMNKSNFEFLKGVNDFTYAIACAAENNYPDDPNTTLIKMRMFGEATAKHLGLLLNIPPCE
NQHDLLRELGKIAFVDDNILSVFHKLRRIGNQAVHEYHNDLNDAQMCLRLGFRLAVWYYR
LVTKDYDFPVPVFVLPERGENLYHQEVLTLKQQLEQQVREKAQTQAEVEAQQQKLVALNG
YIAILEGKQQETEAQTQARLAALEAQLAEKNAELAKQTEQERKAYHKEITDQAIKRTLNL
SEEESRFLIDAQLRKAGWQADSKTLRFSKGARPEPGVNKAIAEWPTGKDETGNQGFADYV
LFVGLKPIAVVEAKRNNIDVPARLNESYRYSKCFDNGFLRETLLEHYSPDEVHEAVPEYE
TSWQDTSGKQRFKIPFCYSTNGREYRATMKTKSGIWYRDVRDTRNMSKALPEWHRPEELL
EMLGSEPQKQNQWFADNPGMSELGLRYYQEDAVRAVEKAIVKGQQEILLAMATGTGKTRT
AIAMMFRLIQSQRFKRILFLVDRRSLGEQALGAFEDTRINGDTFNSIFDIKGLTDKFPED
STKIHVATVQSLVKRTLQSDEPMPVARYDCIVVDEAHRGYILDKEQTEGELQFRSQLDYV
SAYRRILDHFDAVKIALTATPALHTVQIFGEPVYRYTYRTAVIDGFLIDQDPPIQIITRN
AQEGVYLSKGEQVERISPQGEVINDTLEDDQDFEVADFNRGLVIPAFNRAVCNELTNYLD
PTGSQKTLVFCVTNAHADMVVEELRAAFKKKYPQLEHDAIIKITGDADKDARKVQTMITR
FNKERLPNIVVTVDLLTTGVDIPSICNIVFLRKVRSRILYEQMKGRATRLCPEVNKTSFK
IFDCVDIYSTLESVDTMRPVVVRPKVELQTLVNEITDSETYKITEADGRSFAEHSHEQLV
AKLQRIIGLATFNRDRSETIDKQVRRLDELCQDAAGVNFNGFASRLREKGPHWSAEVFNK
LPGFIARLEKLKTDINNLNDAPIFLDIDDEVVSVKSLYGDYDTPQDFLEAFDSLVQRSPN
AQPALQAVINRPRDLTRKGLVELQEWFDRQHFEESSLRKAWKETRNEDIAARLIGHIRRA
AVGDALKPFEERVDHALTRIKGENDWSSEQLSWLDRLAQALKEKVVLDDDVFKTGNFHRR
GGKAMLQRTFDDNLDTLLGKFSDYIWDELA
References
External Links:
ResourceLink
Uniprot ID:P08956
Uniprot Name:T1RK_ECOLI
GenBank Gene ID:U00096
Genebank Protein ID:226510991
Ecogene ID:EG10459
Ecocyc:EG10459
ColiBase:b4350
Kegg Gene:b4350
EchoBASE ID:EB0454
CCDB:T1RK_ECOLI
BacMap:226524764
General Reference:
  • Blattner, F. R., Plunkett, G. 3rd, Bloch, C. A., Perna, N. T., Burland, V., Riley, M., Collado-Vides, J., Glasner, J. D., Rode, C. K., Mayhew, G. F., Gregor, J., Davis, N. W., Kirkpatrick, H. A., Goeden, M. A., Rose, D. J., Mau, B., Shao, Y. (1997). "The complete genome sequence of Escherichia coli K-12." Science 277:1453-1462. Pubmed: 9278503
  • Burland, V., Plunkett, G. 3rd, Sofia, H. J., Daniels, D. L., Blattner, F. R. (1995). "Analysis of the Escherichia coli genome VI: DNA sequence of the region from 92.8 through 100 minutes." Nucleic Acids Res 23:2105-2119. Pubmed: 7610040
  • Hayashi, K., Morooka, N., Yamamoto, Y., Fujita, K., Isono, K., Choi, S., Ohtsubo, E., Baba, T., Wanner, B. L., Mori, H., Horiuchi, T. (2006). "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110." Mol Syst Biol 2:2006.0007. Pubmed: 16738553
  • Loenen, W. A., Daniel, A. S., Braymer, H. D., Murray, N. E. (1987). "Organization and sequence of the hsd genes of Escherichia coli K-12." J Mol Biol 198:159-170. Pubmed: 3323532
  • Waite-Rees, P. A., Keating, C. J., Moran, L. S., Slatko, B. E., Hornstra, L. J., Benner, J. S. (1991). "Characterization and expression of the Escherichia coli Mrr restriction system." J Bacteriol 173:5207-5219. Pubmed: 1650347