NM Group OpenDay

Introduction: The Magic of Machine Learning in Science

In the world of science and technology, Machine Learning (ML) has become a game-changer. It allows computers to find patterns in vast amounts of data and make predictions, often leading to new scientific insights. This year’s Nobel Prize in Chemistry was awarded to Demis Hassabis, John Jumper, and David Baker for their groundbreaking work on predicting protein structures using ML. Their work revolutionized how we understand proteins, which are critical to all life forms. Read more about the Chemistry Nobel Prize here.

Additionally, the Nobel Prize in Physics was awarded to John Hopfield and Geoffrey Hinton for their foundational contributions to artificial neural networks. These neural networks are a key component of modern ML systems, forming the backbone of innovations across various fields, including chemistry and physics. Learn more about the Physics Nobel Prize here.

What if we told you that machines can help us uncover secrets about chemical structures and reactions? Today, you’ll see how ML helps us solve complex problems in computational chemistry, such as predicting the stability of materials like zeolites, which are used in catalysis and energy applications.

Interactive Example: Train Your Own Machine Learning Model

Let’s start with an example you can try yourself! Head over to Google's Teachable Machine and build a simple ML model in minutes.

How to Train Your Model:

Open the Teachable Machine.
Select "Image Project" and capture/upload different objects around you (such as your hand, a pencil, or a book).
Train the model using at least two categories.
Once trained, see how well your model can distinguish between the objects you uploaded.

This fun activity gives you a taste of how machine learning works. In chemistry, we use similar processes with much more complex data to solve big scientific problems.

Example: Zeolites and Their Properties

Zeolites are fascinating materials used in many fields like catalysis and gas separation. Their performance often depends on their structure, which in turn is influenced by the arrangement of atoms. In this example, we explore how Germanium (Ge) atoms distribute within a zeolite structure and how that affects their stability.

Figure 1: Plot showing the possible arrangements of Germanium atoms (in blue) in the UTL zeolite..

Energetics-Stability: The stability of zeolite structures can be understood from an energetic standpoint, where lower energy configurations indicate greater stability.
Ge-O-Ge Counts: The counts of Ge-O-Ge bonds serve as a structural parameter, influencing the overall stability and performance of the zeolite material.

Visualization: Check out the 3D models of different zeolite structures with varying Ge atom distributions.

For more information on this topic, check out the following research paper: Zeolite structure-property relationships.

Interactive Combinations Calculator

Calculate Possible Combinations

UTL zeolite has 38 T-sites, we replaced 12 of them with Ge, check the possible combinations?

Enter the total number of sites and the number of sites you want to replace:

ML Potentials for Possible Combinations

In the field of computational chemistry and material science, the exploration of possible structures is immense. With zeolites, for example, the number of potential configurations can reach millions, owing to variations in atom arrangements and compositions. Machine Learning (ML) techniques offer powerful tools for sampling these vast possibilities and identifying meaningful distributions based on energy stability.

To understand the significance of this sampling, consider that for every change in atomic positions, new energy calculations are required. This makes manual exploration impractical, as we are faced with a combinatorial explosion of potential structures. By leveraging ML models, we can effectively narrow down the search space and focus on the most promising candidates for further investigation.

For example, using ML algorithms, we can identify patterns and correlations within the data, allowing us to predict the energies of untested configurations. This not only speeds up the research process but also enhances our understanding of structure-property relationships.

Figure 2: Using Machine Learning method number of strucutures considred for distribution of Ge in various zeolites.

Through advanced modeling and simulations, researchers can explore this expansive design space to optimize zeolite structures for applications in catalysis, gas separation, and other critical technologies.

Real Data Visualization: Plots of Energy and Ge-O-Ge Count

In this section, we delve into the diverse data associated with one of the zeolite structures called UTL. Using Histogram and UMAP, we visualize the various energy levels and Ge-O-Ge counts across various configurations of UTL zeolites.

Figure 3: The variation of relative energy across different UTL zeolite configurations.

Figure 4: Illustrating the Ge-O-Ge count distribution in various UTL zeolite structures.

This visualization illustrates the rich diversity in the structural arrangements of Ge atoms within UTL zeolites, highlighting how variations in the Ge-O-Ge count and Relative energy of these materials correlate with stability and efficiency. The next step is to analyze the underlying patterns and trends, enabling us to identify configurations that maximize stability and efficiency.

Interactive Regression Model

Interactive Code: Build a Simple Regression Model

Enter the X value to predict the Y value based on a regression model.

Data Input

To update the regression model, enter X values and corresponding Y values below:

Data Table

X Value	Y Value	Action

Slope (m): 1

Intercept (c): 0

Machine Learning Insights: Energy Prediction

In the context of zeolite structures, the number of Ge-O-Ge bonds plays a crucial role in enhancing energetic stabilization. By analyzing real data, we can observe how variations in these atomic arrangements influence the stability of zeolite frameworks.

Through machine learning techniques, we can predict the stability of new zeolite structures even before they are synthesized in the lab. By training models on existing data, particularly focusing on the Ge-O-Ge bond counts, we gain insights into how these configurations contribute to energy efficiency and overall stability.

Structure-Propert Relationship

Figure 5: Highlighting the arrangement of Ge-O-Ge bonds and their contribution to the overall stability of the framework.

The regression model exemplifies this approach. In advanced research, models like reactive neural networks can adapt and learn from changes in atomic bonding and bond-breaking patterns. This allows them to study real-life scenarios like catalysis, such as the optimization of reaction pathways or the development of more efficient catalysts.