Skip to the content.

About - Registration - Schedule - Keynote Speakers - Keynote Abstracts - Accepted Contributions - Important Dates - Call for Papers

About

This will be the 5th-year edition of the ML4Molecules workshop organized by the ELLIS research program on “Machine Learning for Molecule Discovery”. This research program brings together a pan-European community dedicated to accelerating molecular discovery through cutting-edge AI. The meeting will serve as a key opportunity for members and collaborators to present their latest research in the area, exchange insights, coordinate research efforts, and align on strategic goals. ML4Molecules 2025 will focus on the latest developments in generative models, large language models (LLMs), and scalable AI systems for applications in chemistry, drug discovery, and materials science. We will explore topics such as: 1) Foundation and diffusion models for molecule generation, 2) Cross-modal and cross-domain learning (e.g., text, structure, experimental data), 3) AI-driven synthesis planning and reaction prediction, 4) Interpretable and data-efficient ML approaches. 5) Practical considerations in deploying ML in real-world molecular pipelines, etc. In addition to technical sessions, this edition will host the annual meeting of the ELLIS Program on Machine Learning for Molecule Discovery. Building on the progress of previous editions — from foundational methods (2021), critical assessments (2022–2023), to the LLM and foundation model revolution (2024) — ML4Molecules 2025 will foster rigorous, interdisciplinary dialogue and chart new directions for impactful molecular AI research.

Registration

Schedule (subject to changes)

CET Event Speakers Title
08:00 - 09:00 Registration    
09:00 - 09:30 Invited Talk Rocio Mercado Generative AI for Molecular Design: From Drugs to Sustainable Materials
09:30 - 10:00 Invited Talk Daniel Probst Of graphs, sets, and molecules
10:00 - 10:15 Contributed Talk Nawaf Alampara Task Alignment Outweighs Framework Choice in Scientific LLM Agents
10:15 - 10:30 Contributed Talk Riccardo Tedoldi WEISS: Wasserstein efficient sampling strategy for LLMs in drug design
10:30 - 11:00 Coffee Break    
11:00 - 11:30 Invited Talk Marwin Segler Deep Learning for Molecules: The First Decade
11:30 - 11:45 Contributed Talk Nikhil Branson AntiDIF: Accurate and Diverse Antibody Specific Inverse Folding with Discrete Diffusion
11:45 - 12:00 Contributed Talk Yujia Guo Bridging Data-Driven and Expert Knowledge for Interpretable Evaluation of Synthetic Routes
12:00 - 12:30 Panel Discussion Günter Klambauer, José Miguel Hernández Lobato, TBD  
12:30 - 13:30 Lunch    
13:30 - 14:00 Invited Talk Nadine Schneider Applying AI/ML to Accelerate the DMTA Cycle in Drug Discovery
14:00 - 14:15 Contributed Talk Julian Cremer FLOWR.root: A flow matching based foundation model for joint multi-purpose structure-aware 3D ligand generation and affinity prediction
14:15 - 14:30 Contributed Talk Mikkel Jordahn Semi-Supervised Learning for Molecular Graphs via Ensemble Consensus
14:30 - 14:45 Contributed Talk Xuan Vu Nguyen Synthelite: Chemist-aligned and feasibility-aware synthesis planning with LLMs
14:45 - 15:00 Contributed Talk Marcel Hiltscher Explaining What Matters: Faithfulness in Molecular Deep Learning
15:00 - 15:30 Coffee Break    
15:30 - 16:00 ELLIS UnConference Welcome Remarks    
16:00 - 16:30 Retreat of Program Fellows    
16:00 - 18:00 Poster session    
18:00 - 20:00 Reception    

Invited Speakers

Accepted contributions

LLMs, chemical language models & agents

ID Title Authors
5 Task Alignment Outweighs Framework Choice in Scientific LLM Agents Nawaf Alampara, Martiño Ríos-García, Chandan Gupta, Sajid Mannan, Santiago Miret, N M Anoop Krishnan, Kevin Maik Jablonka
8 MolecularIQ: Characterizing Chemical Reasoning Capabilities Through Symbolic Verification on Molecular Graphs Christoph Bartmann, Johannes Schimunek, Mykyta Ielanskyi, Philipp Seidl, Günter Klambauer, Sohvi Luukkonen
9 WEISS: Wasserstein efficient sampling strategy for LLMs in drug design Riccardo Tedoldi, Junyong Li, Ola Engkvist, Andrea Passerini, Annie Westerlund, Alessandro Tibo
11 Integration over Isolation: DFT descriptors boost Chemical Language Model predictive performance Gian-Michele Cherchi, Robert Pollice, Davi Mattoso, Hannes Hovorka
41 Exploring Temperature and Molecular Representation Effects in LLMs for Chemistry Laura van Weesep, Jens Sjölund, Ola Engkvist, Samuel Genheden

Generative models for molecules & proteins

ID Title Authors
1 Look the Other Way: Designing ‘Positive’ Molecules with Negative Data via Task Arithmetic Rıza Özçelik, Sarah de Ruiter, Francesca Grisoni
10 Controllable Molecular Generation with Fine-tuned Flow-matching Model Kunyu Wang, Jon Paul Janet, Alessandro Tibo
14 AntiDIF: Accurate and Diverse Antibody Specific Inverse Folding with Discrete Diffusion Nikhil Branson, Charlotte Deane
16 Auto-Encoding Molecules: Graph-Matching Capabilities Matter Magnus Cunow, Gerrit Großmann, Verena Wolf, Sebastian Josef Vollmer
31 FLOWR.root: A flow matching based foundation model for joint multi-purpose structure-aware 3D ligand generation and affinity prediction Julian Cremer, Tuan Le, Mohammad M. Ghahremanpour, Emilia Sługocka, Filipe Menezes, Djork-Arné Clevert

Molecular property prediction, graphs & omics

ID Title Authors
2 Exploring alignment of scRNA-seq models with prior knowledge Charlotte Claye, Pierre Marschall, Wassila Ouerdane, CELINE HUDELOT, Julien Duquesne
24 Improving Molecular Property Prediction with Score-Based Models Julien Horwood, José Miguel Hernández-Lobato, Dino Oglic
26 Semi-Supervised Learning for Molecular Graphs via Ensemble Consensus Rasmus Hannibal Tirsgaard, Marisa Wodrich, Laurits Fredsgaard, Mikkel Jordahn, Mikkel N. Schmidt
32 Robust Mechanism-of-Action Identification Through In-Context Multi-Source Domain Adaptation Ana Sanchez-Fernandez, Werner Zellinger, Günter Klambauer
34 Unified Bayesian Modelling of Bioactivities Across ChEMBL Michael Backenköhler, Joschka Groß, Andrea Volkamer
39 Formalising Hybrid Modelling Approaches for Molecular Property Prediction Adem R N Aouichaoui, Paul Seghers, Jens Abildskov

Structure-based modelling, protein–ligand binding & MD

ID Title Authors
3 STRIPES: a Novel Spatio-Temporal Language to Encode Dynamic Protein-Ligand Binding Emanuele Criscuolo, Rıza Özçelik, Francesca Grisoni
6 Unifying Structure- and Ligand-based Drug Design via Contrastive Geometric Learning Lisa Schneckenreiter, Sohvi Luukkonen, Lukas Friedrich, Daniel Kuhn, Günter Klambauer
23 LaM-SLidE: Latent Space Modeling of Spatial Dynamical Systems via Linked Entities Florian Sestak, Artur P. Toshev, Andreas Fürst, Günter Klambauer, Andreas Mayr, Johannes Brandstetter
25 Physics-Informed Surrogates in a Verified Agentic Pipeline for Robust Molecular Simulation David Scott Lewis, Enrique Zueco, Enrique Concha
37 FlashMD reloaded: extending the capabilities of a universal, machine-learned direct molecular dynamics propagator Sanggyu Chong, Filippo Bigi, Michele Ceriotti
40 MDAgent: A Modular Multi-Agent Framework for Autonomous Protein-Ligand Molecular Dynamics Simulations Cassandra Masschelein, Salomé Guilbert, Jeremy Goumaz, Bohdan Naida, Philippe Schwaller
42 B-Shapes: a completeness-first, rotation-invariant representation connecting spaces of ligands and pockets Radoslav Krivak, Christos Feidakis, Jiří Vondrášek

Synthesis planning, reactions & mechanistic reasoning

ID Title Authors
7 SynthStrategy: Programmatic Distillation of Latent Chemical Knowledge in Large Language Models Daniel P Armstrong, Zlatko Jončev, Andres M Bran, Philippe Schwaller
19 fragSMILES is inclined to well express chemical fragments and chirality for synthesis planning Fabrizio Mastrolorito
21 Bridging Data-Driven and Expert Knowledge for Interpretable Evaluation of Synthetic Routes Yujia Guo, Mikhail Kabeshov, Tat Hong Duong Le, Marco Vinicio Mijangos Linares, Samuel Genheden, Giulia Bergonzini, Ola Engkvist, Samuel Kaski
22 Grounding Large Language Models in Reaction Knowledge Graphs for Synthesis Retrieval Olga Bunkova, Lorenzo Di Fruscia, Sophia Rupprecht, Artur M. Schweidtmann, Marcel Reinders, Jana Marie Weber
30 Teaching Language Models Mechanistic Explainability Through Arrow-Pushing Théo A. Neukomm, Zlatko Jončev, Philippe Schwaller
35 Synthelite: Chemist-aligned and feasibility-aware synthesis planning with LLMs Xuan Vu Nguyen, Daniel P Armstrong, Milena Wiegand, Andres M Bran, Zlatko Jončev, Philippe Schwaller

Evaluation, interpretability, benchmarks & dataset infrastructure

ID Title Authors
12 Evaluating 3D generative models for molecular design: an industry perspective Alex T. Müller, Astrid Stroobants, Jessica Lanini, Paula Torren-Peraire, Balmiki Ghosh, Finton Sirockin, Nikolas Fechner, Nadine Schneider
13 Quantifying Multi-Objective Optimization in Generative Chemistry for Molecule Design Paula Torren-Peraire, Nadine Schneider, Raquel Rodriguez-Perez, Robin A. Fairhurst, Jessica Lanini
20 Assay-Based Machine Learning: Rethinking Evaluation in Drug Discovery Michael Backenköhler, Joschka Groß, Andrea Volkamer
28 MolForge and MolBox: End-to-End Automation for Reproducible Molecular Machine Learning Datasets Luke Rossen, Francesca Grisoni
29 Explaining What Matters: Faithfulness in Molecular Deep Learning Marcel Hiltscher, Marc Bianciotto, Francesca Grisoni
36 Measuring AI Progress in Drug Discovery: A Reproducible Leaderboard for the Tox21 Challenge Antonia Ebner, Christoph Bartmann, Sonja Topf, Sohvi Luukkonen, Johannes Schimunek, Günter Klambauer

Applications to materials, phase behaviour & large-scale screening

ID Title Authors
4 Automated navigation of condensate phase behavior with active machine learning Yannick Leurs, Willem van den Hout, Andrea Gardin, Joost L.J. van Dongen, Andoni Rodriguez-Abetxuko, Nadia Erkamp, Jan van Hest, Francesca Grisoni, Luc Brunsveld
18 Extended Abstract: Surfactant Simulation and Prediction Through Machine Learning Richard Beckmann, Robert S. Jordan, Marisa Gliege, Santiago Miret, Vijay Kris Narasimhan, Rocío Mercado
33 Accelerated Learning on Large Scale Screens using Generative Library Models Eli N Weinstein, Andrei Slabodkin, Mattia G Gollub, Xiao-Bing Cui, Kerry Dobbs, Fang Zhang, Kristina Gurung, Amira J Bailey, Elizabeth Baker Wood

Important dates

Call for papers

We invite submissions on machine learning for molecules and materials with a special focus on generative models and large language models (LLMs) — from representation and property prediction to end‑to‑end discovery workflows and lab integration. This workshop is part of the ELLIS UnConference on December 2, 2025 (Copenhagen), co‑located with NeurIPS week.

Scope & Topics (include but not limited to)

Submission Instructions

Presentation & Publication Policy

Sponsors

gen ai logo gen ai

Organizing Committee and Contact

Chairs: Nadine Schneider, Francesca Grisoni, and Jose Miguel Hernandez Lobato

Contact: ml4molecules@ml.jku.at