Machine Learning applied to the Chemical Synthesis and Biocatalysis

16 Apr

Prof. Andrea Mattevi, Università degli Studi di Pavia
Dr. Emanuele Casali, Università degli Studi di Pavia
EMAIL: andrea.mattevi@unipv.it; emanuele.casali@unipv.it

April 8– June 14, 2024

18 h, 3 CFU

This Ph.D. course provides a foundational understanding of machine learning and artificial intelligence in chemistry. Covering basic to advanced models, it emphasizes real-world applications and critical thinking. Practical tools, including Python and RDKit, are explored for model development, focusing on descriptors from simple chemical features to modern Sterimols. The course guides students in using these models to predict reactivities, design catalysts, and apply modern retrosynthesis strategies. The goal is to empower students to make groundbreaking advancements in chemistry through the efficient integration of machine learning and AI.

Examination: Oral interview on topics from the course and a paper from the literature of the student’s choice.

Topics:

  1. Machine Learning and Artificial Intelligence
  2. Qualitative and quantitative predictive models based on descriptors and energy
  3. Predictive models based on ML (MLR, SVM, NN, RF)
  4. Fundamentals of Python and RDKit for Machine Learning
  5. Train set, validation set, test set and overfitting (LassoCV and k-fold cross-validation)
  6. Chemical descriptors (orbitals, dipole moment, atomic charges, bond order, NMR shielding tensors)
  7. Sterical descriptors (Sterimols)
  8. (PythiaCHEM)
  9. Recalls of the basic concepts of organic retrosynthesis
  10. Usage of IBM-Rxn, AiZynthFinder and [Chematica (Syntia)] in automated retrosynthesis
  11. The revolution of protein structure prediction by AI