Machine learning meets volcano plots: computational discovery of cross-coupling catalysts.
Journal article

Machine learning meets volcano plots: computational discovery of cross-coupling catalysts.

  • Meyer B Laboratory for Computational Molecular Design , Institute of Chemical Sciences and Engineering , École Polytechnique Fédérale de Lausanne (EPFL) , CH-1015 Lausanne , Switzerland . Email: clemence.corminboeuf@epfl.ch.
  • Sawatlon B Laboratory for Computational Molecular Design , Institute of Chemical Sciences and Engineering , École Polytechnique Fédérale de Lausanne (EPFL) , CH-1015 Lausanne , Switzerland . Email: clemence.corminboeuf@epfl.ch.
  • Heinen S Institute of Physical Chemistry , Department of Chemistry , University of Basel , Klingelbergstrasse 80 , CH-4056 Basel , Switzerland . Email: anatole.vonlilienfeld@unibas.ch.
  • von Lilienfeld OA Institute of Physical Chemistry , Department of Chemistry , University of Basel , Klingelbergstrasse 80 , CH-4056 Basel , Switzerland . Email: anatole.vonlilienfeld@unibas.ch.
  • Corminboeuf C Laboratory for Computational Molecular Design , Institute of Chemical Sciences and Engineering , École Polytechnique Fédérale de Lausanne (EPFL) , CH-1015 Lausanne , Switzerland . Email: clemence.corminboeuf@epfl.ch.
  • 2018-10-13
Published in:
  • Chemical science. - 2018
English The application of modern machine learning to challenges in atomistic simulation is gaining attraction. We present new machine learning models that can predict the energy of the oxidative addition process between a transition metal complex and a substrate for C-C cross-coupling reactions. In turn, this quantity can be used as a descriptor to estimate the activity of homogeneous catalysts using molecular volcano plots. The versatility of this approach is illustrated for vast libraries of organometallic catalysts based on Pt, Pd, Ni, Cu, Ag, and Au combined with 91 ligands. Out-of-sample machine learning predictions were made on a total of 18 062 compounds leading to 557 catalyst candidates falling into the ideal thermodynamic window. This number was further refined by searching for candidates with an estimated price lower than 10 US$ per mmol. The 37 catalyst finalists are dominated by palladium phosphine ligand combinations but also include the earth abundant transition metal (Cu) with less common ligands. Our results indicate that modern statistical learning techniques can be applied to the computational discovery of readily available and promising catalyst candidates.
Language
  • English
Open access status
gold
Identifiers
Persistent URL
https://sonar.ch/global/documents/288945
Statistics

Document views: 50 File downloads: