
Ian Dunn, Liv Toft, Tyler Katz, Juhi Gupta, Riya Shah, Ramith Hettiarachchi, David Ryan Koes
NeurIPS MLSB Workshop December 2025
Structure-based drug design (SBDD) focuses on designing small-molecule ligands that bind to specific protein pockets. We propose a unified approach in OMTRA, a multi-modal flow matching model that flexibly performs many tasks relevant to SBDD, including de novo design, molecular docking, and pharmacophore-conditioned generation. OMTRA obtains state-of-the-art performance on pocket-conditioned de novo design and docking. Additionally, we curate a dataset of 500M 3D molecular conformers, complementing protein-ligand data and expanding the chemical diversity available for training.
Ian Dunn, Liv Toft, Tyler Katz, Juhi Gupta, Riya Shah, Ramith Hettiarachchi, David Ryan Koes
NeurIPS MLSB Workshop December 2025
Structure-based drug design (SBDD) focuses on designing small-molecule ligands that bind to specific protein pockets. We propose a unified approach in OMTRA, a multi-modal flow matching model that flexibly performs many tasks relevant to SBDD, including de novo design, molecular docking, and pharmacophore-conditioned generation. OMTRA obtains state-of-the-art performance on pocket-conditioned de novo design and docking. Additionally, we curate a dataset of 500M 3D molecular conformers, complementing protein-ligand data and expanding the chemical diversity available for training.

Ian Dunn, David Ryan Koes
Preprint September 2025
We present FlowMol3, a flow matching model that pushes the state of the art in unconditional 3D de novo small-molecule generation. FlowMol3 achieves nearly 100% molecular validity for drug-like molecules with explicit hydrogens, more accurately reproduces the functional group composition and geometry of its training data, and does so with an order of magnitude fewer learnable parameters than comparable methods.
Ian Dunn, David Ryan Koes
Preprint September 2025
We present FlowMol3, a flow matching model that pushes the state of the art in unconditional 3D de novo small-molecule generation. FlowMol3 achieves nearly 100% molecular validity for drug-like molecules with explicit hydrogens, more accurately reproduces the functional group composition and geometry of its training data, and does so with an order of magnitude fewer learnable parameters than comparable methods.

Emma L. Flynn, Riya Shah, Ian Dunn, Rishal Aggarwal, David Ryan Koes
Frontiers in Bioinformatics September 2025
We present a machine learning approach to enhance structure-based drug discovery by generating three-dimensional pharmacophores using diffusion models. Rather than producing individual molecules, our method creates pharmacophore queries—spatial representations of protein-ligand interaction points—that can rapidly screen existing molecular databases for valid, commercially available compounds.
Emma L. Flynn, Riya Shah, Ian Dunn, Rishal Aggarwal, David Ryan Koes
Frontiers in Bioinformatics September 2025
We present a machine learning approach to enhance structure-based drug discovery by generating three-dimensional pharmacophores using diffusion models. Rather than producing individual molecules, our method creates pharmacophore queries—spatial representations of protein-ligand interaction points—that can rapidly screen existing molecular databases for valid, commercially available compounds.

Ian Dunn, David Ryan Koes
NeurIPS 2024 Workshop on Machine Learning for Sturctural Biology November 2024
In this work we benchmark the performance of existing discrete flow matching methods for 3D de novo small molecule generation and provide explanations of their differing behavior. As a result we present FlowMol-CTMC, an open-source model that achieves state of the art performance for 3D de novo design with fewer learnable parameters than existing methods. Additionally, we propose the use of metrics that capture molecule quality beyond local chemical valency constraints and towards higher-order structural motifs.
Ian Dunn, David Ryan Koes
NeurIPS 2024 Workshop on Machine Learning for Sturctural Biology November 2024
In this work we benchmark the performance of existing discrete flow matching methods for 3D de novo small molecule generation and provide explanations of their differing behavior. As a result we present FlowMol-CTMC, an open-source model that achieves state of the art performance for 3D de novo design with fewer learnable parameters than existing methods. Additionally, we propose the use of metrics that capture molecule quality beyond local chemical valency constraints and towards higher-order structural motifs.

Ian Dunn, Somayeh Pirhadi, Yao Wang, Smmrithi Ravindran, Carter Concepcion, David Ryan Koes
Journal of Chemical Information and Modeling December 2024
We describe our winning submission to the first Critical Assessment of Computational Hit-Finding Experiments (CACHE) challenge. Our screening campaign was largely built around gnina, an open-source molecular docking program developed by the Koes lab that uses a deep-learning based scoring function. Our resulting best hit series tied for first place when evaluated by a panel of expert judges.
Ian Dunn, Somayeh Pirhadi, Yao Wang, Smmrithi Ravindran, Carter Concepcion, David Ryan Koes
Journal of Chemical Information and Modeling December 2024
We describe our winning submission to the first Critical Assessment of Computational Hit-Finding Experiments (CACHE) challenge. Our screening campaign was largely built around gnina, an open-source molecular docking program developed by the Koes lab that uses a deep-learning based scoring function. Our resulting best hit series tied for first place when evaluated by a panel of expert judges.

Ian Dunn, David Ryan Koes
Preprint April 2024
We develop FlowMol: the first flow matching model that samples the geometric and topological structure of organic molecules. We also develop a method for flow matching on discrete data that we call SimplexFlow.
Ian Dunn, David Ryan Koes
Preprint April 2024
We develop FlowMol: the first flow matching model that samples the geometric and topological structure of organic molecules. We also develop a method for flow matching on discrete data that we call SimplexFlow.

Ian Dunn, David Ryan Koes
NeurIPS 2024 Workshop on Generative AI for Biology October 2023 Spotlight
We present a novel method for learning compressed geometric representations of protein structure and use this novel protein encoder to accelerate inference in diffusion models.
Ian Dunn, David Ryan Koes
NeurIPS 2024 Workshop on Generative AI for Biology October 2023 Spotlight
We present a novel method for learning compressed geometric representations of protein structure and use this novel protein encoder to accelerate inference in diffusion models.