Coarse-grained molecular simulations
Simplifying complex molecular systems — Lennard-Jones, Gay-Berne, and arbitrary molecular surfaces — so AI can accelerate material design and drug discovery.
Coarse-grained molecular simulations are a class of approximation methods for computationally modeling the behavior of complex systems using their simplified representations. Coarse-grained molecular simulations are often used to study large biomolecules or a large ensemble of molecules that would otherwise be intractable using all-atom force-field methods. Depending on the specific situation and goal, a molecule may be represented by a spherical particle, an elliptical particle, or a soft body in an arbitrary shape. For mathematics, I have written an article to describe various approaches we have developed for coarse-grained modeling to get the concept across with higher clarity. Because of their lower computational costs, coarse-grained models are being used in our work to demonstrate how artificial intelligence can be used to speed up material design and drug discovery. For instance, they can be trained using ab initio or experimental data of accurate structures and then used to predict some bulk properties of the materials they model after.
Lennard-Jones particles
The simplest particle that exhibits basic attractive-repulsive features of intermolecular interactions is probably the Lennard-Jones particle. The following animation shows the transformation of a Lennard-Jones system from a condensed state into a gaseous state. A Lennard-Jones particle can be used to model an inert-gas atom, a molecule that can be approximated as a sphere (e.g., a globular protein), or even a macroscopic particle (sometimes called a discrete element when it is used in this way).
Lennard-Jones particles are useful for creating educational simulations. For instance, we can use them to represent solute and solvent molecules. The following simulation shows the process of distillation.
Gay-Berne particles
Lennard-Jones particles model only spherical objects; their interactions with one another depend only on distance. Gay-Berne particles are a generalization to elliptical shapes, which introduces orientational dependence. The following simulation shows the motions of and interactions among these elliptical particles, with electrostatic forces among charges and dipoles.
Molecular surfaces
Although Gay-Berne particles can simulate non-spherical interacting particles, they are limited to elliptical shapes. To build useful coarse-grained models for molecular biology without resorting to full ball-and-stick representations of macromolecules, interacting objects with arbitrary shapes are needed. The following simulation shows a 2D illustration of docking between a substrate and an enzyme. According to p. 93, Molecular Biology of the Cell by Alberts et al., "Despite being weakened by water and salt, ionic bonds are very important in biological systems; an enzyme that binds a positively charged substrate will often have a negatively charged amino-acid chain at the appropriate place."
Polymers can be represented by rod-shaped molecular surfaces, as shown in the following simulations that compare the thermal resistance of linear and branched polymers. The branched polymer disintegrates more easily when temperature increases.
Mixed representations for multiscale models
We can mix atomistic and coarse-grained representations in a single model. Such a mix may be appropriate for multiscale models. The following image uses a full-atom model for water molecules and a coarse-grained model for a solute molecule. Color shading shows the distribution of electrostatic charges (pink is positive, blue is negative). Note the difference in orientation of water molecules near different parts of the solute.