AI in Chemistry: Study Highlights Strengths and Weaknesses

Computing power in the chemistry lab: Kevin Jablonka (left) and his team at HIPOLE Jena. Photo: Renzo Paulus

Computing power in the chemistry lab: Kevin Jablonka (left) and his team at HIPOLE Jena. Photo: Renzo Paulus

How well does artificial intelligence perform compared to human experts? A research team at HIPOLE Jena set out to answer this question in the field of chemistry. Using a newly developed evaluation method called “ChemBench,” the researchers compared the performance of modern language models such as GPT-4 with that of experienced chemists. 

The study has recently been published in the journal Nature Chemistry (DOI 10.1038/s41557-025-01815-x).

More than 2,700 chemistry tasks from research and education were tested—ranging from fundamental knowledge to complex problems. In areas such as reaction prediction or the analysis of large datasets, AI models often excelled with high efficiency. However, a critical weakness became apparent: the models also produced confident answers even when they were factually incorrect. Human chemists, by contrast, were more cautious and questioned their own assessments.

“Our study shows that AI can be a valuable tool—but it is no substitute for human expertise,” says Dr. Kevin M. Jablonka, lead author of the study. The findings offer important insights for the responsible use of AI in chemical research and education.

HIPOLE Jena (Helmholtz Institute for Polymers in Energy Applications Jena) is an institute of HZB in cooperation with Friedrich Schiller University Jena (FSU Jena).

ma

  • Copy link

You might also be interested in

  • A New Era in Catalysis: ASCEND Launch in Berlin, €30 Million in Funding
    News
    12.06.2026
    A New Era in Catalysis: ASCEND Launch in Berlin, €30 Million in Funding
    On 11 June 2026, the Helmholtz-Zentrum Berlin (HZB) in Adlershof hosted the launch of ASCEND (Accelerated Solutions for Catalysis using Emerging Nanotechnology and Digital Innovation). The event took place in the presence of the Minister of Research, Dorothee Bär, President of the Helmholtz Association, Prof. Dr. Martin Keller, and President of the Max Planck Society, Prof. Dr. Patrick Cramer. Bringing together leading partners from industry and research, ASCEND is supported by BMFTR with €30 million in funding and officially started on 1 April 2026. The initiative aims to accelerate the discovery of next-generation catalysts and enable more sustainable chemical processes.
  • X-ray analysis reveals overpainted fascist symbols
    Science Highlight
    08.06.2026
    X-ray analysis reveals overpainted fascist symbols
    Erich Mercker was a successful painter during the Nazi era and in the years that followed. After 1945, he covered up Nazi symbols in at least one of his paintings. With an interdisciplinary team, physicist Dr Ioanna Mantouvalou reports on this study in the Nature Journal Heritage Science.
  • Magnon momentum microscopy: A new window into nanoscale spin-waves
    Science Highlight
    08.06.2026
    Magnon momentum microscopy: A new window into nanoscale spin-waves
    An international team lead by the Max Born Institute has developed a new type of momentum microscopy to image magnons — the quanta of collectively excited spins — directly in two-dimensional reciprocal space using soft X-rays. Measurements have taken place at BESSY II and PETRA III, first author ist the HZB physicist Steffen Wittrock. Owing to its remarkable sensitivity, simplicity, and access to nanometer-scale wavelengths, this novel technique establishes a powerful and versatile platform for exploring nonlinear magnon interactions, which are promising for future computing schemes.