Calculating the "fingerprints" of molecules with artificial intelligence

The graphical neural network GNN receives small molecules as input with the task of determining their spectral responses. By matching them with the known spectra, the GNN programme learns to calculate spectra reliably.

The graphical neural network GNN receives small molecules as input with the task of determining their spectral responses. By matching them with the known spectra, the GNN programme learns to calculate spectra reliably. © K. Singh, A. Bande/HZB

With conventional methods, it is extremely time-consuming to calculate the spectral fingerprint of larger molecules. But this is a prerequisite for correctly interpreting experimentally obtained data. Now, a team at HZB has achieved very good results in significantly less time using self-learning graphical neural networks.

"Macromolecules but also quantum dots, which often consist of thousands of atoms, can hardly be calculated in advance using conventional methods such as DFT," says PD Dr. Annika Bande at HZB. With her team she has now investigated how the computing time can be shortened by using methods from artificial intelligence.

The idea: a computer programme from the group of "graphical neural networks" or GNN receives small molecules as input with the task of determining their spectral responses. In the next step, the GNN programme compares the calculated spectra with the known target spectra (DFT or experimental) and corrects the calculation path accordingly. Round after round, the result becomes better. The GNN programme thus learns on its own how to calculate spectra reliably with the help of known spectra.

"We have trained five newer GNNs and found that enormous improvements can be achieved with one of them, the SchNet model: The accuracy increases by 20% and this is done in a fraction of the computation time," says first author Kanishka Singh. Singh participates in the HEIBRiDS graduate school and is supervised by two experts from different backgrounds: computer science expert Prof. Ulf Leser from Humboldt University Berlin and theoretical chemist Annika Bande.

"Recently developed GNN frameworks could do even better," she says. "And the demand is very high. We therefore want to strengthen this line of research and are planning to create a new postdoctoral position for it from summer onwards as part of the Helmholtz project "eXplainable Artificial Intelligence for X-ray Absorption Spectroscopy"."

 

Annotation:

The work was carried out within the framework of the HEIBRiDS graduate school and is being supported by the Helmholtz project "eXplainable Artificial Intelligence for X-ray Absorption Spectroscopy" (XAI-4-XAS).

The core of the project is to extend GNN, as used at HZB, to very large molecules in combination with the probabilistic analysis of molecular motifs developed at HEREON. It is used to capture only the relevant part of the configuration phase space of the molecules, which is necessary for the accurate prediction of X-ray spectra. The results of the ML predictions allow a rigorous interpretation of XAS experiments, so that characteristic parts of the spectrum of an extended material can be assigned 1:1 to its specific structural subgroups.

 

arö

  • Copy link

You might also be interested in

  • Surprising insights into the chemistry of hydroxyl radicals at BESSY II
    Science Highlight
    09.04.2026
    Surprising insights into the chemistry of hydroxyl radicals at BESSY II
    How do radicals form in aqueous solutions when exposed to UV light? This question is important for health research and environmental protection, for example with regard to the overfertilisation of water bodies by intensive agriculture. A team at BESSY II has now developed a new method of investigating hydroxyl radicals in solution. By using a clever trick, the scientists gained surprising insights into the reaction pathway.
  • AI-driven Catalyst Discovery: €30 million funding for German consortium
    News
    30.03.2026
    AI-driven Catalyst Discovery: €30 million funding for German consortium
    Six partners from research and industry, including Helmholtz-Zentrum Berlin (HZB), the Fritz-Haber-Institute of the Max Planck Society (FHI), BASF, Dunia Innovations, Siemens Energy, and the Technical University Berlin are launching a joint project to accelerate the catalyst discovery. The German Federal Ministry for Science, Technology and Space (BMFTR) is providing €30 million in funding for ASCEND (Accelerated Solutions for Catalysis using Emerging Nanotechnology and Digital Innovation). The research initiative targets the defossilisation of energy-intensive industries while safeguarding industrial competitiveness, with a focus on the chemical sector. The five-year project will start on 1st April 2026.
  • Kick-off for a new data and AI centre in Berlin
    News
    27.03.2026
    Kick-off for a new data and AI centre in Berlin
    By establishing a new data and AI centre in Berlin, the Zuse Institute Berlin (ZIB) and the Helmholtz-Zentrum Berlin (HZB) are laying the foundations for a scalable and sovereign data infrastructure in the capital. The project strengthens the scientific capabilities of Berlin’s research community whilst making an important contribution to research security, resilience and technological independence.