Machine learning in chemistry : the impact of artificial intelligence /
This book provides practical examples of machine learning applied to science to help researchers make an informed choice about using the method in chemistry.
| Other Authors: | |
|---|---|
| Format: | eBook |
| Language: | English |
| Published: |
Cambridge :
Royal Society of Chemistry,
2020.
|
| Series: | Theoretical and computational chemistry ;
17. |
| Subjects: | |
| Online Access: | Connect to the full text of this electronic book |
Table of Contents:
- Intro
- Title
- Copyright
- Preface
- Contents
- Chapter 1 Computers as Scientists
- 1.1 What Is Computational Science?
- 1.2 What Is Artificial Intelligence?
- 1.3 What Is Machine Learning?
- References
- Chapter 2 How Do Machines Learn?
- 2.1 Framing the Question
- 2.2 Gathering Data
- 2.3 Setting up the Algorithm
- 2.4 The Training Procedure
- 2.5 Overcoming Flaws
- 2.6 Deploying the Algorithms
- Note on References
- References
- Chapter 3 MedChemInformatics: An Introduction to Machine Learning for Drug Discovery
- 3.1 Introduction
- 3.1.1 Artificial Intelligence vs. Machine Learning
- 3.1.2 Supervised vs. Unsupervised Learning
- 3.2 Forewarned Is Forearmed
- 3.2.1 Dataset Assembly and Curation
- 3.2.2 Model Building
- 3.2.3 Principal Component Analysis
- 3.2.4 When Machines Learn in 3D, Alignment IS EVERYTHING
- 3.2.5 Activity Range and Distribution of Activities for QSAR Modelling
- 3.2.6 Outliers
- 3.3 Deep Learning and Neural Networks
- 3.4 Underlying Mathematics
- 3.4.1 "Observables" and "Features"
- 3.4.1.1 Molecular Descriptors and Fingerprints
- 3.4.2 Comparing Molecules Based on Their Descriptors
- 3.4.2.1 Tanimoto Similarity
- 3.4.2.2 Tversky Similarity
- 3.4.2.3 Dice Similarity
- 3.4.2.4 Which Similarity Metric Works Best?
- 3.4.3 Model Quality and Statistics
- 3.4.3.1 Coefficients of Determination
- 3.4.3.2 Kendall's Tau
- 3.4.4 Overtraining and Characteristics of Good Descriptors
- 3.5 Machine Learning Methods
- 3.5.1 k-Nearest Neighbours
- 3.5.2 Linear Regression
- 3.5.2.1 Ordinary or Partial Least Squares?
- 3.5.3 Decision Trees and Random Forests
- 3.5.3.1 Information
- 3.5.3.2 Algorithms
- 3.5.3.3 Decision Tree Ensembles
- 3.5.4 Support Vector Machines
- 3.5.4.1 Hyperplanes
- 3.5.4.2 Kernels
- 3.5.4.3 Support Vectors
- 3.6 Summary
- Acknowledgements
- References
- Chapter 4 Machine Learning for Nonadiabatic Molecular Dynamics
- 4.1 Introduction
- 4.2 Methods
- 4.2.1 Machine Learning (ML) Models
- 4.2.1.1 Linear Model (LR)
- 4.2.1.2 Kernel Ridge Regression (KRR)
- 4.2.1.3 Support Vector Regression (SVR)
- 4.2.1.4 Neural Networks (NNs)
- 4.2.1.5 Training Process
- 4.2.1.6 Descriptors
- 4.2.1.7 Distance-matrix Based Descriptors
- 4.2.1.8 FCHL: Faber-Christensen-Huang-Lilienfeld Representation
- 4.2.2 Training Set Generation
- 4.2.2.1 Initial Training Set
- 4.2.2.2 Adaptive Sampling for Excited States
- 4.2.3 Wave Function Phases
- 4.2.4 Phase Correction Algorithm
- 4.2.5 Surface Hopping Dynamics
- 4.3 Example: Methylenimmonium Cation
- 4.3.1 ML Surface Hopping Dynamics
- 4.3.2 Energy Conservation
- 4.3.3 Further Tools of ML Models
- 4.4 Conclusion and Outlook
- References
- Chapter 5 Machine Learning in Science
- A Role for Mechanical Sympathy?
- 5.1 Introduction
- 5.1.1 A Little History
- 5.1.2 Challenges
- 5.2 Problems and Solutions
- 5.2.1 How Many Samples do I Need to Train an AI?