Assessing Methods and Obstacles in Chemical Space Exploration

Shawn Reeves; Benjamin DiFrancesco; Vijay Shahani; Stephen MacKinnon; Andreas Windemuth; Andrew Brereton

doi:10.22541/au.159769801.18927367

loading page

Assessing Methods and Obstacles in Chemical Space Exploration

Shawn Reeves,
Benjamin DiFrancesco,
Vijay Shahani,
Stephen MacKinnon,
Andreas Windemuth,
Andrew Brereton

Abstract

Benchmarking the performance of generative methods for drug design is complex and multifaceted. In this report, we propose a separation of concerns for de novo drug design, categorizing the task into three main categories: generation, discrimination, and exploration. We demonstrate that changes to any of these three concerns impacts benchmark performance for drug design taks. In this report we present Deriver, an open-source Python package that acts as a modular framework for molecule generation, with a focus on integrating multiple generative methods. Using Deriver, we demonstrate that changing parameters related to each of these three concerns impacts chemical space traversal significantly, and that the freedom to independently adjust each is critical to real-world applications having conflicting priorities. We find that combining multiple generative methods can improve optimization of molecular properties, and lower the chance of becoming trapped in local minima. Additionally, filtering molecules for drug-likeness (based on physicochemical properties and SMARTS pattern matching) before they are scored can hinder exploration, but can improve the quality of the final molecules. Finally, we demonstrate that any given task has an exploration algorithm best suited to it, though in practice linear probabilistic sampling generally results in the best outcomes, when compared to Monte Carlo sampling or greedy sampling. We intend that Deriver, which is being made freely available, will be helpful to others interested in collaboratively improving existing methods in de novo drug design centered around inheritance of molecular structure, modularity, extensibility, and separation of concerns.

17 Aug 2020Submitted to Applied AI Letters

Show details

Hide details

17 Aug 2020Submission Checks Completed

17 Aug 2020Assigned to Editor

19 Aug 2020Reviewer(s) Assigned

21 Sep 2020Review(s) Completed, Editorial Evaluation Pending

22 Sep 2020Editorial Decision: Revise Minor

14 Oct 20201st Revision Received

16 Oct 2020Submission Checks Completed

16 Oct 2020Assigned to Editor

28 Oct 2020Reviewer(s) Assigned

04 Nov 2020Review(s) Completed, Editorial Evaluation Pending

04 Nov 2020Editorial Decision: Accept

Abstract

Peer review status:ACCEPTED