Dutch scientists develop artificial molecules that behave like real ones

Scientists from the Radboud University in Nijmegen, the Netherlands have developed synthetic molecules that resemble real organic molecules. A collaboration of researchers, led by Alex Khajetoorians and Daniel Wegner, can now simulate the behavior of real molecules by using artificial molecules. In this way, they can tweak the properties of molecules in ways that are normally difficult or unrealistic, and they can understand much better how molecules change.

Emil Sierda, who was in charge of conducting the experiments at Radboud University: "A few years ago we had this crazy idea to build a quantum simulator. We wanted to create artificial molecules that resembled real molecules. So we developed a system in which we trapped electrons. Electrons surround a molecule like a cloud, and we used those trapped electrons to build an artificial molecule." The results the team found were astonishing. Sierda: ‘The resemblance between what we built and real molecules was uncanny."

Changing molecules

Alex Khajetoorians, head of the Scanning Probe Microscopy (SPM) department at Radboud University: "Making molecules is difficult enough. What is often harder, is to understand how certain molecules react, for example how they change when they are twisted or altered." How molecules change and react is the basis of chemistry, and leads to chemical reactions, like the formation of water from hydrogen and oxygen.

"We wanted to simulate molecules, so we could have the ultimate toolkit to bend them and tune them in ways that are nearly impossible with real molecules. In that way, we can say something about real molecules, without making them, or without having to deal with the challenges they present, like their constantly changing shape."

Benzene

Using this simulator, the researchers created an artificial version of one of the basic organic molecules in chemistry: benzene. Benzene is the starting component for a vast amount of chemicals, like styrene, which is used to make polystyrene. Khajetoorians: "By making benzene, we simulated a textbook organic molecule, and built a molecule that is made up of elements that are not organic." Above that: the molecules are ten times bigger than their real counterparts, which makes them easier to work with.

Practical uses

The uses of this new technique are endless. Daniel Wegner, assistant professor within the SPM department: "We have only begun to imagine what we can use this for. We have so many ideas that it is hard to decide where to start."

By using the simulator, scientists can understand molecules and their reactions much better, which will help in every scientific field imaginable. Wegner: "New materials for future computer hardware are really hard to make, for instance. By making a simulated version, we can look for the novel properties and functionalities of certain molecules and evaluate whether it will be worth making the real material."

In the far future, all kinds of things may be possible: understanding chemical reactions step by step like in a slow-motion video, or making artificial single-molecule electronic devices, like shrinking the size of a transistor on a computer chip. Quantum simulators are even suggested to perform as quantum supercomputers. Sierda: "But that’s a long way to go, for now, we can start by beginning to understand molecules in a way we never understood before."

The research was conducted by a Radboud University collaboration between the groups of Malte Rösner (Theory of Condensed Matter), Mikhail Katsnelson (Theory of Condensed Matter), Gerrit Groenenboom (Theoretical Chemistry), Daniel Wegner (SPM), and Alex Khajetoorians (SPM).

Caption:Researchers can screen more than 100 million compounds in a single day — much more than any existing model. Credits:Image: iStock
Caption:Researchers can screen more than 100 million compounds in a single day — much more than any existing model. Credits:Image: iStock

MIT CSAIL modeling offers a way to speed up drug discovery

By applying a language model to protein-drug interactions, researchers can quickly screen large libraries of potential drug compounds.

Huge libraries of drug compounds may hold potential treatments for a variety of diseases, such as cancer or heart disease. Ideally, scientists would like to experimentally test each of these compounds against all possible targets, but doing that kind of screen is prohibitively time-consuming.

In recent years, researchers have begun using computational methods to screen those libraries in hopes of speeding up drug discovery. However, many of those methods also take a long time, as most of them calculate each target protein’s three-dimensional structure from its amino-acid sequence, then use those structures to predict which drug molecules it will interact with.

Researchers at MIT and Tufts University have now devised an alternative computational approach based on a type of artificial intelligence algorithm known as a large language model. These models — one well-known example is ChatGPT — can analyze huge amounts of text and figure out which words (or, in this case, amino acids) are most likely to appear together. The new model, known as ConPLex, can match target proteins with potential drug molecules without performing the computationally intensive step of calculating the molecules’ structures.

Using this method, the researchers can screen more than 100 million compounds in a single day — much more than any existing model.

“This work addresses the need for efficient and accurate in silico screening of potential drug candidates, and the scalability of the model enables large-scale screens for assessing off-target effects, drug repurposing, and determining the impact of mutations on drug binding,” says Bonnie Berger, the Simons Professor of Mathematics, head of the Computation and Biology group in MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL), and one of the senior authors of the new study.

Lenore Cowen, a professor of computer science at Tufts University, is also a senior author of the paper. Rohit Singh, a CSAIL research scientist, and Samuel Sledzieski, an MIT graduate student, are the lead authors of the paper, and Bryan Bryson, an associate professor of biological engineering at MIT and a member of the Ragon Institute of MGH, MIT, and Harvard, is also an author. In addition to the paper, the researchers have made their model available online for other scientists to use.

Making predictions

In recent years, computational scientists have made great advances in developing models that can predict the structures of proteins based on their amino-acid sequences. However, using these models to predict how a large library of potential drugs might interact with a cancerous protein, for example, has proven challenging, mainly because calculating the three-dimensional structures of the proteins requires a great deal of time and computing power.

An additional obstacle is that these kinds of models don’t have a good track record for eliminating compounds known as decoys, which are very similar to a successful drug but don’t interact well with the target.

“One of the longstanding challenges in the field has been that these methods are fragile, in the sense that if I gave the model a drug or a small molecule that looked almost like the true thing, but it was slightly different in some subtle way, the model might still predict that they will interact, even though it should not,” Singh says.

Researchers have designed models that can overcome this kind of fragility, but they are usually tailored to just one class of drug molecules, and they aren’t well-suited to large-scale screens because the computations take too long. 

The MIT team decided to take an alternative approach, based on a protein model they first developed in 2019. Working with a database of more than 20,000 proteins, the language model encodes this information into meaningful numerical representations of each amino-acid sequence that capture associations between sequence and structure.

“With these language models, even proteins that have very different sequences but potentially have similar structures or similar functions can be represented similarly in this language space, and we're able to take advantage of that to make our predictions,” Sledzieski says.

In their new study, the researchers applied the protein model to the task of figuring out which protein sequences will interact with specific drug molecules, both of which have numerical representations that are transformed into a common, shared space by a neural network. They trained the network on known protein-drug interactions, which allowed it to learn to associate specific features of the proteins with drug-binding ability, without having to calculate the 3D structure of any of the molecules.

“With this high-quality numerical representation, the model can short-circuit the atomic representation entirely, and from these numbers predict whether or not this drug will bind,” Singh says. “The advantage of this is that you avoid the need to go through an atomic representation, but the numbers still have all of the information that you need.”

Another advantage of this approach is that it takes into account the flexibility of protein structures, which can be “wiggly” and take on slightly different shapes when interacting with a drug molecule.

High affinity

To make their model less likely to be fooled by decoy drug molecules, the researchers also incorporated a training stage based on the concept of contrastive learning. Under this approach, the researchers give the model examples of “real” drugs and imposters and teach it to distinguish between them.

The researchers then tested their model by screening a library of about 4,700 candidate drug molecules for their ability to bind to a set of 51 enzymes known as protein kinases.

From the top hits, the researchers chose 19 drug-protein pairs to test experimentally. The experiments revealed that of the 19 hits, 12 had a strong binding affinity (in the nanomolar range), whereas nearly all of the many other possible drug-protein pairs would have no affinity. Four of these pairs bound with extremely high, sub-nanomolar affinity (so strong that a tiny drug concentration, on the order of parts per billion, will inhibit the protein).

While the researchers focused mainly on screening small-molecule drugs in this study, they are now working on applying this approach to other types of drugs, such as therapeutic antibodies. This kind of modeling could also prove useful for running toxicity screens of potential drug compounds, to make sure they don’t have any unwanted side effects before testing them in animal models.

“Part of the reason why drug discovery is so expensive is because it has high failure rates. If we can reduce those failure rates by saying upfront that this drug is not likely to work out, that could go a long way in lowering the cost of drug discovery,” Singh says.

This new approach “represents a significant breakthrough in drug-target interaction prediction and opens up additional opportunities for future research to further enhance its capabilities,” says Eytan Ruppin, chief of the Cancer Data Science Laboratory at the National Cancer Institute, who was not involved in the study. “For example, incorporating structural information into the latent space or exploring molecular generation methods for generating decoys could further improve predictions.”

The research was funded by the National Institutes of Health, the National Science Foundation, and the Phillip and Susan Ragon Foundation.

Left: An antiferromagnet can function as “parallel electrical circuits” carrying Néel spin currents. Right: A tunnel junction based on the antiferromagnets hosting Néel spin currents can be regarded as “electrical circuits” with the two ferromagnetic tunnel junctions connected in parallel. (Image by SHAO Dingfu)
Left: An antiferromagnet can function as “parallel electrical circuits” carrying Néel spin currents. Right: A tunnel junction based on the antiferromagnets hosting Néel spin currents can be regarded as “electrical circuits” with the two ferromagnetic tunnel junctions connected in parallel. (Image by SHAO Dingfu)

Chinese physicists discover 'parallel circuits' of spin currents in antiferromagnets

A group of physicists at Hefei Institutes of Physical Science (HFIPS) of the Chinese Academy of Sciences (CAS) revealed a secret of antiferromagnets, which could accelerate spintronics, a next-gen data storage and processing technology for overcoming the bottleneck of modern digital electronics.

Spintronics is a vigorously developing field employing the spin of electrons within magnetic materials to encode information. Spin-polarized electric currents play a central role in spintronics, due to the capabilities of manipulation and detection of magnetic moment directions for writing and reading 1s and 0s. Currently, most spintronic devices are based on ferromagnets, where the net magnetizations can efficiently spin-polarized electric currents. Antiferromagnets, with opposite magnetic moments aligned alternately, are less investigated but may promise even faster and smaller spintronic devices. However, antiferromagnets have zero net magnetization and thus are commonly believed to carry solely spin-neutral currents useless for spintronics. While antiferromagnets consist of two antiparallel aligned magnetic sublattices, their properties are deemed to be "averaged out" over the sublattices making them spin-independent.

Prof. SHAO Ding-Fu, who led the team, has a different point of view on this research. He envisioned that collinear antiferromagnets can function as "electrical circuits" with the two magnetic sublattices connected in parallel. With this simple intuitive picture in mind, Prof. SHAO and his collaborators theoretically predicted that magnetic sublattices could polarize the electric current locally, thus resulting in the staggered spin currents hidden within the globally spin-neutral current.

He dubbed these staggered spin currents as "Néel spin currents" after Louis Néel, a Nobel laureate, who won the prize due to his fundamental work and discoveries concerning antiferromagnetism.

The Néel spin currents are a unique nature of antiferromagnets that has never been recognized. It is capable to generate useful spin-dependent properties which have been previously considered incompatible with antiferromagnets, such as a spin-transfer torque and tunneling magnetoresistance in antiferromagnetic tunnel junctions, crucial for electrical writing and reading of information in antiferromagnetic spintronics.

"Our work uncovered a previously unexplored potential of antiferromagnets, and offered a straightforward solution to achieve efficient reading and writing for antiferromagnetic spintronics," said Prof. SHAO Ding-Fu.