Speaker: Dr. Jules SCHLEINITZ

Institution: École Normale Supérieure PSL, Paris

Hosted By: Professor Haibin SU

Co-Host: Professor Zhenyang LIN

Zoom Link: https://hkust.zoom.us/j/95150435343?pwd=Y2g0L3BpRmZWeStVZGVjOHUxalBFUT09

 

Abstract

Synthetic yield prediction using machine learning is intensively studied. Previous work has focused on two categories of data sets: high-throughput experimentation data, as an ideal case study, and data sets extracted from proprietary databases, which are known to have a strong reporting bias toward high yields. However, predicting yields using published reaction data remains elusive. To fill the gap, we built a data set on nickel-catalyzed cross-couplings extracted from organic reaction publications, including scope and optimization information. We demonstrate the importance of including optimization data as a source of failed experiments and emphasize how publication constraints shape the exploration of the chemical space by the synthetic community. While machine learning models still fail to perform out-of-sample predictions, this work shows that adding chemical knowledge enables fair predictions in a low-data regime. Eventually, we hope that this unique public database will foster further improvements of machine learning methods for reaction yield prediction in a more realistic context.

 

About the speaker

Jules Schleinitz completed a bachelor in chemistry and physics and then a master in theoretical chemistry at the École Normale Supérieure in Paris, then was recruited for a three year PhD at École Normale Supérieure under a teaching contract. He will defend his Ph.D thesis entitled "Mechanistic Analysis and Machine Learning" in October. In November he will start a postdoc for Computer Assisted Synthesis in Sarah E. Reisman's group at Caltech.

5 Sep 2022
2:00pm - 3:30pm
Where
Online
Speakers/Performers
Organizer(S)
Department of Chemistry
Contact/Enquiries
Payment Details
Audience
PG students, Faculty and staff
Language(s)
English
Other Events
16 Jun 2026
Seminar, Lecture, Talk
IAS / School of Science Joint Lecture - Shaping Tumor Cell Plasticity and Therapy Resistance in Glioblastoma
Abstract Tumor heterogeneity fueled by plasticity and genetic diversification of cancer cells is key to therapy failure of malignant glioma. The speaker's team implemented spatial and genetic p...
11 May 2026
Seminar, Lecture, Talk
IAS / School of Science Joint Lecture - Regioselective Pyridine C-H-Functionalization and Skeletal Editing
Abstract Pyridines belong to the most abundant heteroarenes in medicinal chemistry and in agrochemical industry. In the lecture, highly regioselective pyridine C-H functionalization through a d...