Princeton Researchers Develop New Tool to Narrow the Search for Ideal Material Structures

Insider Brief

  • Princeton researchers said they built a machine-learning tool that predicts the free energy of metal-organic frameworks, or MOFs, in seconds to help screen for stable, synthesizable candidates.
  • The team said it created a machine-readable sequence representation, generated representations for about one million MOFs, and trained a bespoke language model that reached 97% accuracy when tested on roughly 65,000 MOFs with known free energy values.
  • The researchers said prior work by collaborator Diego Gómez-Gualdrón set a 4.4 kJ/mol free energy threshold for synthesizability, and they reported the study was published Dec. 16 in the Journal of the American Chemical Society with support from the National Science Foundation and a Schmidt Sciences AI2050 Fellowship.

PRESS RELEASE — Princeton researchers have developed a new tool to speed the discovery of advanced materials known as metal organic frameworks, or MOFs.

MOFs are an emerging class of materials that form microscopic sponge-like structures with vast interior surface area. That quality promises to transform how society traps, absorbs and filters substances at the molecular level. The researchers say this could lead to better battery chemistry, more efficient carbon capture and improved access to clean water.

But scientists face a problem of choice. MOFs are highly modular, consisting of metal-ion nodes and organic molecules that link the nodes into large networks. The researchers say there are trillions of possible chemical combinations. But not all combinations are equally useful, and some are not even feasible to make in a lab.

Now, a team led by Adji Bousso Dieng has developed a method using machine learning to predict which MOF structures are good candidates, avoiding the need for researchers to wade through countless useless structures.

“Our tool takes seconds to produce a prediction versus between seven hours to more than two days for traditional molecular simulations,” said Dieng, assistant professor of computer science, and associated faculty member in the Princeton Materials Institute.

Specifically, the team used a measure that describes the stability of a molecular structure — called free energy — to make the predictions.

They published a paper detailing the work on December 16 in the Journal of the American Chemical Society.

Developing the tool involved three major steps. First, they had to convert key physical and chemical characteristics of MOFs into sequences that a machine could read. Second, they assembled a database of MOFs on which to train their model. Finally, they ran their model multiple times to make predictions about specific materials.

Turning physical and chemical characteristics related to the free energy of the individual atoms and units of a MOF into a machine-readable representation was arduous, and key to this research.

“The sequence representation we came up with is really what unlocked everything,” said Dieng.

Using this system, they generated representations for one million MOFs.

The team then trained a bespoke language model to predict the free energy values of all one million MOFs in their database, calibrated using a simpler characteristic that is closely related to free energy. When they tested a sample of the database, using around 65,000 materials with known free energy values, the model’s predictions were accurate 97 percent of the time.

Dieng’s collaborator, Diego Gómez-Gualdrón from the Colorado School of Mines, had previously determined that, below a certain free energy value (4.4 kilojoules per mole), a MOF is considered stable and can be feasibly synthesized in a lab.

“If you have a new MOF, you can predict its free energy, and you can also predict whether it is synthesizable or not,” said Dieng.

The team is now working on streamlining their sequence representations, reducing the computational overhead incurred with some structures. They are also using that system to add a search function to their tool to help find stable MOFs.

“We are lifting the problem where now you can compute the sequence representation itself very quickly, very cheaply,” said Dieng. “This technology allows researchers to focus resources on promising candidates for practical applications in carbon capture, energy storage, catalysis and gas separation.”

The article, Highly accurate and fast prediction of MOF free energy via machine learning, was published on December 16 in the Journal of the American Chemical Society. Besides Dieng and Gómez-Gualdrón, authors include Andre Niyongabo Rubungo from Princeton University and Fernando Fajardo-Rojas from the Colorado School of Mines. Support was provided by the National Science Foundation and the Schmidt Sciences AI2050 Fellowship.

Image credit:: David Kelly Crow/Princeton

Need Deeper Intelligence on the AI Market?

AI Insider's Market Intelligence platform tracks funding rounds, competitive landscapes, and technology trends across the global AI ecosystem in real time. Get the data and insights your organization needs to make informed decisions.

Related Articles

Runway Launches $10M Fund to Expand AI Video and World Model Ecosystem

Runway has announced the launch of a $10 million venture fund to support early-stage startups building across AI, media, and simulation, as the company expands

Mantis Biotech Announces $7.4M in Funding to Advance AI-Driven Digital Twin Models for Biomedical Research

Mantis Biotech has secured $7.4 million in seed funding led by Decibel VC, with participation from Y Combinator, Liquid 2, and angel investors, as it

ScaleOps Closes $130M Series C to Advance Autonomous AI Infrastructure Management

ScaleOps has secured $130 million in Series C funding led by Insight Partners, with participation from Lightspeed Venture Partners, NFX, Glilot Capital Partners, and Picture

Stay Updated with AI Insider

Get the latest AI funding news, market intelligence, and industry insights delivered to your inbox weekly.

Subscribe today for the latest news about the AI landscape