Chapter 7: Domain Applications in Chemistry, Biology, Physics and Geoscience

https://leanpub.com/generativeaiforscience

Introduction: Generative AI Across the Sciences

https://leanpub.com/generativeaiforscience

About This Chapter: Audience and Prerequisites

https://leanpub.com/generativeaiforscience

Part I: Chemistry & Materials Science

https://leanpub.com/generativeaiforscience

Scientific Foundations: Molecular Representations

https://leanpub.com/generativeaiforscience

What is SMILES?

https://leanpub.com/generativeaiforscience

From SMILES to Molecular Graphs

https://leanpub.com/generativeaiforscience

Molecular Representation Pipeline

https://leanpub.com/generativeaiforscience

Key Molecular Properties

https://leanpub.com/generativeaiforscience

Molecular Graph Learning and Diffusion Models for Drug Discovery

https://leanpub.com/generativeaiforscience

Full Working Code in Google Colab

https://leanpub.com/generativeaiforscience

1. Molecular Graph Learning for Property Prediction

https://leanpub.com/generativeaiforscience

GNN Message Passing Illustrated

https://leanpub.com/generativeaiforscience

Data Sources and Setup

https://leanpub.com/generativeaiforscience

Installing RDKit

https://leanpub.com/generativeaiforscience

Delaney ESOL Dataset (Aqueous Solubility)

https://leanpub.com/generativeaiforscience

Example: Loading the Dataset

https://leanpub.com/generativeaiforscience

1.1 Code: Molecular Graph Property Predictor

https://leanpub.com/generativeaiforscience

2. Molecular Generation with Diffusion Models

https://leanpub.com/generativeaiforscience

2.1 Graph Diffusion for Molecules

https://leanpub.com/generativeaiforscience

2.2 Conditional Generation

https://leanpub.com/generativeaiforscience

3. Geometry-Aware Molecular Generation

https://leanpub.com/generativeaiforscience

4. Crystal Structure Prediction with Diffusion

https://leanpub.com/generativeaiforscience

4.1 Crystal Diffusion Model

https://leanpub.com/generativeaiforscience

4.2 Training on Materials Project Data

https://leanpub.com/generativeaiforscience

4.3 Real-World Results and Limitations

https://leanpub.com/generativeaiforscience

4.4 Production-Ready Improvements

https://leanpub.com/generativeaiforscience

4.5 Active Learning Pipeline

https://leanpub.com/generativeaiforscience

5. Reaction Outcome Prediction with Transformers

https://leanpub.com/generativeaiforscience

6. Pitfalls and Extensions

https://leanpub.com/generativeaiforscience

Common Pitfalls

https://leanpub.com/generativeaiforscience

Extensions for Production

https://leanpub.com/generativeaiforscience

Summary

https://leanpub.com/generativeaiforscience

References

https://leanpub.com/generativeaiforscience

Part II: Biology & Biomedicine

https://leanpub.com/generativeaiforscience

AI in Healthcare: The Broader Landscape

https://leanpub.com/generativeaiforscience

A Landmark Milestone: AI-Designed Drug Succeeds in Phase 2a

https://leanpub.com/generativeaiforscience

The AI Drug Discovery Pipeline

https://leanpub.com/generativeaiforscience

Why This Chapter Focuses on Drug Discovery

https://leanpub.com/generativeaiforscience

Scientific Foundations: Proteins and Sequences

https://leanpub.com/generativeaiforscience

What is a Protein?

https://leanpub.com/generativeaiforscience

Amino Acids and Sequences

https://leanpub.com/generativeaiforscience

Protein Structure Hierarchy

https://leanpub.com/generativeaiforscience

Key Concepts for AI Models

https://leanpub.com/generativeaiforscience

1. Protein Structure Prediction

https://leanpub.com/generativeaiforscience

The Protein Folding Problem

https://leanpub.com/generativeaiforscience

ESM Architecture Overview

https://leanpub.com/generativeaiforscience

1.1 Using Pre-Trained ESMFold

https://leanpub.com/generativeaiforscience

1.2 Code: ESMFold Structure Prediction Wrapper

https://leanpub.com/generativeaiforscience

1.3 Using Smaller Models for Learning

https://leanpub.com/generativeaiforscience

2. Protein Sequence Generation

https://leanpub.com/generativeaiforscience

2.1 The Challenge and Realistic Expectations

https://leanpub.com/generativeaiforscience

2.2 Code: Protein Sequence Generation with ESM-2

https://leanpub.com/generativeaiforscience

2.3 Production-Grade Alternatives

https://leanpub.com/generativeaiforscience

3. Genomic Variant Analysis

https://leanpub.com/generativeaiforscience

3.1 The Variant Effect Prediction Problem

https://leanpub.com/generativeaiforscience

3.2 Code: Simplified Variant Effect Predictor

https://leanpub.com/generativeaiforscience

3.3 Production-Grade Variant Prediction

https://leanpub.com/generativeaiforscience

4. Clinical Trial Optimization

https://leanpub.com/generativeaiforscience

4.1 Code: Clinical Trial Design Assistant

https://leanpub.com/generativeaiforscience

4.2 Ethical and Regulatory Considerations

https://leanpub.com/generativeaiforscience

Summary: Biology & Biomedicine

https://leanpub.com/generativeaiforscience

References

https://leanpub.com/generativeaiforscience

Part III: Physics & Engineering

https://leanpub.com/generativeaiforscience

Learning Objectives

https://leanpub.com/generativeaiforscience

1. Machine Learning for Physical Systems

https://leanpub.com/generativeaiforscience

1.1 Domain-Specific Challenges

https://leanpub.com/generativeaiforscience

2. Particle Jet Classification with Graph Neural Networks

https://leanpub.com/generativeaiforscience

2.1 The Jet Tagging Problem

https://leanpub.com/generativeaiforscience

2.2 Code: Graph Attention Network for Jet Classification

https://leanpub.com/generativeaiforscience

2.3 Physics-Informed Features

https://leanpub.com/generativeaiforscience

2.4 Production Considerations for HEP

https://leanpub.com/generativeaiforscience

3. Spectroscopy Analysis with 1D CNNs

https://leanpub.com/generativeaiforscience

3.1 The Materials Characterization Problem

https://leanpub.com/generativeaiforscience

3.2 Code: 1D CNN for Spectroscopy Classification

https://leanpub.com/generativeaiforscience

3.3 Data Augmentation for Spectroscopy

https://leanpub.com/generativeaiforscience

Summary: Physics & Engineering

https://leanpub.com/generativeaiforscience

References

https://leanpub.com/generativeaiforscience

Part IV: Geoscience & Climate Applications

https://leanpub.com/generativeaiforscience

Section Overview

https://leanpub.com/generativeaiforscience

4.1 Hurricane Intensity Prediction

https://leanpub.com/generativeaiforscience

4.1.1 Motivation and Challenges

https://leanpub.com/generativeaiforscience

4.1.2 Data Sources and Features

https://leanpub.com/generativeaiforscience

4.1.3 Neural Network Architecture

https://leanpub.com/generativeaiforscience
Image Encoder (Satellite Data)
https://leanpub.com/generativeaiforscience
SST Encoder
https://leanpub.com/generativeaiforscience
Atmospheric Encoder
https://leanpub.com/generativeaiforscience
Fusion and Prediction
https://leanpub.com/generativeaiforscience

4.1.4 The Saffir-Simpson Scale

https://leanpub.com/generativeaiforscience

4.1.5 Training Considerations

https://leanpub.com/generativeaiforscience

4.1.6 Operational Considerations

https://leanpub.com/generativeaiforscience

4.2 Climate Downscaling with Super-Resolution

https://leanpub.com/generativeaiforscience

4.2.1 The Resolution Gap Problem

https://leanpub.com/generativeaiforscience

4.2.2 Super-Resolution CNN Architecture

https://leanpub.com/generativeaiforscience

4.2.3 Physics-Informed Constraints

https://leanpub.com/generativeaiforscience

4.2.4 Training Data and Evaluation

https://leanpub.com/generativeaiforscience

4.3 Seismic Event Detection with 1D CNNs

https://leanpub.com/generativeaiforscience

4.3.1 Earthquake Detection Problem

https://leanpub.com/generativeaiforscience

4.3.2 Seismic Waveform Detection Architecture

https://leanpub.com/generativeaiforscience

4.3.3 Training Strategy

https://leanpub.com/generativeaiforscience

4.4 Best Practices for Geoscience Deep Learning

https://leanpub.com/generativeaiforscience

4.4.1 Incorporating Physical Constraints

https://leanpub.com/generativeaiforscience

4.4.2 Data Challenges

https://leanpub.com/generativeaiforscience

4.4.3 Interpretability

https://leanpub.com/generativeaiforscience

4.4.4 Uncertainty Quantification

https://leanpub.com/generativeaiforscience

4.5 Exercises

https://leanpub.com/generativeaiforscience

Conceptual Questions

https://leanpub.com/generativeaiforscience

Coding Exercises

https://leanpub.com/generativeaiforscience

Research Extensions

https://leanpub.com/generativeaiforscience

Summary: Geoscience & Climate Applications

https://leanpub.com/generativeaiforscience

References

https://leanpub.com/generativeaiforscience

Part V: Cross-Cutting Applications in Deep Learning

https://leanpub.com/generativeaiforscience

5.1 Video Analysis for Biological Systems

https://leanpub.com/generativeaiforscience

5.1.1 Introduction: Behavior Recognition from Temporal Data

https://leanpub.com/generativeaiforscience

5.2 Baseline Model: Behavior from Perfect Poses

https://leanpub.com/generativeaiforscience

5.2.1 The Oracle Assumption

https://leanpub.com/generativeaiforscience

5.2.2 Model Architecture

https://leanpub.com/generativeaiforscience

5.2.3 Training Protocol

https://leanpub.com/generativeaiforscience

5.2.4 Results and Analysis

https://leanpub.com/generativeaiforscience

5.3 Realistic Model: Behavior from Raw Video

https://leanpub.com/generativeaiforscience

5.3.1 The Real-World Challenge

https://leanpub.com/generativeaiforscience

5.3.2 End-to-End Video Architecture

https://leanpub.com/generativeaiforscience

5.3.3 Training Results (from actual Colab)

https://leanpub.com/generativeaiforscience

5.4 Transfer Learning for Scientific Domains

https://leanpub.com/generativeaiforscience

5.4.1 The Transfer Learning Paradigm

https://leanpub.com/generativeaiforscience

5.4.2 Domain Adaptation with Adversarial Training

https://leanpub.com/generativeaiforscience

5.5 Multi-Task Learning

https://leanpub.com/generativeaiforscience

5.5.1 Benefits for Scientific Applications

https://leanpub.com/generativeaiforscience

5.5.2 Multi-Task Architecture

https://leanpub.com/generativeaiforscience

Summary: Cross-Cutting Applications

https://leanpub.com/generativeaiforscience

References

https://leanpub.com/generativeaiforscience