Kick off your book project in 2 hours, get started with GhostAI in 2 hours, or do both! Free live workshops, on Zoom. You’ll leave with a real book project and a clear plan to keep going. Saturday, June 27, 2026.

Leanpub Header

Skip to main content

Information Theory and Artificial Intelligence VOL-1

This book is 100% completeLast updated on 2026-06-12

What is intelligence?

At its core, intelligence is the ability to process information.

But what exactly is information?

How do neural networks compress knowledge?

Why does cross-entropy dominate machine learning?

How do Variational Autoencoders learn latent representations?

What role does entropy play in regularization, generalization, and modern AI systems?

Information Theory and Artificial Intelligence: Entropy, Coding, Regularization, and Generative Models (Volume I) provides a comprehensive exploration of the mathematical foundations that power modern artificial intelligence.

Inside this volume, you will discover:

✓ Shannon Entropy and Measures of Information

✓ Mutual Information and Feature Learning

✓ Cross-Entropy and Machine Learning Loss Functions

✓ Kullback–Leibler (KL) Divergence

✓ Source Coding and Data Compression

✓ Huffman, Arithmetic, and Lempel–Ziv Coding

✓ Channel Capacity and Communication Limits

✓ Information-Theoretic Learning Principles

✓ Information Bottleneck Theory

✓ Entropy-Based Regularization

✓ Neural Networks as Information Processing Systems

✓ Error-Correcting Codes and AI Robustness

✓ Neural Communication Systems

✓ Variational Inference and Variational Autoencoders (VAEs)

Whether you are a student, researcher, educator, or AI professional, this book provides the conceptual and mathematical foundation necessary to understand how information drives learning, intelligence, and modern AI systems.

Learn not just how AI works—but why it works.

Minimum price

$9.99

$19.99

You pay

Author earns

$

Also available for 1 book credit with a Reader Membership

PDF
EPUB
About

About

About the Book

Information Theory and Artificial Intelligence: Entropy, Coding, Regularization, and Generative Models (Volume I)

Artificial Intelligence is fundamentally a science of information. Every machine learning model, neural network, reinforcement learning agent, and generative system operates by acquiring, compressing, transforming, transmitting, and generating information. Yet the mathematical principles governing these processes are often studied separately from modern AI.

Information Theory and Artificial Intelligence: Entropy, Coding, Regularization, and Generative Models (Volume I) bridges this gap by presenting a unified framework that connects classical information theory with contemporary artificial intelligence and machine learning.

Beginning with the foundational concepts introduced by Claude Shannon, the book develops a deep understanding of entropy, uncertainty, information measures, coding theory, communication limits, and probabilistic reasoning. These concepts are then linked directly to modern AI systems, revealing how information-theoretic principles shape learning algorithms, neural networks, optimization methods, regularization techniques, representation learning, and generative models.

Readers will explore Shannon entropy, conditional entropy, mutual information, cross-entropy, KL divergence, source coding, data compression, channel capacity, information bottlenecks, entropy-based loss functions, coding theory, adversarial robustness, neural communication systems, and variational inference. The book demonstrates how these concepts influence model generalization, learning efficiency, representation quality, and AI system reliability.

Unlike traditional textbooks that treat information theory and AI as independent disciplines, this volume presents them as interconnected components of a single scientific framework. The result is a deeper understanding of why machine learning systems work, how they process information, and how future intelligent systems may evolve.

Designed for students, researchers, educators, and industry professionals, the book combines mathematical foundations with practical AI applications, making complex concepts accessible without sacrificing academic rigor.

Key Features

• Comprehensive introduction to information theory for AI and machine learning

• Detailed coverage of entropy, mutual information, KL divergence, and coding theory

• Information-theoretic interpretation of neural networks and deep learning

• Entropy-based loss functions and regularization techniques

• Information Bottleneck and Minimum Description Length principles

• Source coding, data compression, and communication theory for AI systems

• Coding theory and neural approaches to error correction

• Variational inference and Variational Autoencoders (VAEs)

• AI applications in communication systems, federated learning, and distributed intelligence

• Mathematical rigor combined with practical relevance

This volume establishes the theoretical foundation required for understanding advanced topics explored in Volume II, including representation learning, GANs, reinforcement learning, large language models, information geometry, explainable AI, federated intelligence, and quantum information theory.

Bundle

Bundles that include this book

Author

About the Author

Anshuman Mishra

Anshuman Kumar Mishra, M.Tech (Computer Science) Assistant Professor, Doranda College, Ranchi University

Prolific Author of 50+ Books on AI, Machine Learning & Computer Science | 20+ Years Experience

Anshuman Kumar Mishra is a dedicated educator, researcher, and highly prolific author with over 20 years of experience in Computer Science and Information Technology. Holding an M.Tech in Computer Science from BIT Mesra, he brings a rare combination of academic depth and practical teaching expertise.

Currently serving as Assistant Professor at Doranda College under Ranchi University, he has mentored thousands of students, helping them build strong foundations in programming, data science, and artificial intelligence. His student-centric teaching style emphasizes conceptual clarity, hands-on practice, and real-world application.

Anshuman is a prolific author with more than 50 books published across a wide spectrum of computer science and emerging technology domains. From foundational programming languages to advanced topics in Artificial Intelligence, Machine Learning, Reinforcement Learning, Decision Theory, and Computer Vision — his books are widely appreciated by students, educators, and professionals for their clear explanations, strong theoretical foundation, and practical approach.

His extensive body of work reflects his deep commitment to making complex subjects accessible and meaningful for learners at all levels. He is particularly recognized for creating well-structured learning paths that help readers progress from beginner to advanced levels with confidence.

Driven by the mission to democratize quality technical education, Anshuman continues to write and update books that bridge the gap between academic theory and industry practice.

When not teaching or writing, he actively follows and explores new developments in AI, Quantum Machine Learning, and Ethical Intelligence systems.

Contents

Table of Contents

Information Theory and Artificial Intelligence: Entropy, Coding, Regularization, and Generative Models VOL-1 ________________________________________ PART I — Foundations of Information Theory Chapter 1: Introduction to Information Theory 1-27 1.1 History and evolution of information theory 1.2 Uncertainty and probabilistic reasoning 1.3 Information and intelligence 1.4 Data, representation, and compression 1.5 Importance of information theory in AI 1.6 Applications across modern machine learning 1.7 Relationship between statistics, probability, and information theory Chapter 2: Entropy and Measures of Information 28-54 2.1 Shannon entropy 2.2 Joint and conditional entropy 2.3 Cross-entropy 2.4 Kullback–Leibler divergence 2.5 Mutual information and multi-information 2.6 Differential entropy for continuous signals 2.7 Applications in feature selection and dimensionality reduction Chapter 3: Source Coding and Data Compression 55-87 3.1 Lossless coding fundamentals 3.2 Huffman coding 3.3 Arithmetic coding 3.4 Lempel–Ziv (LZ77/LZ78) compression 3.5 Rate–distortion theory 3.6 Compression constraints in AI systems 3.7 Neural compression and autoencoder-based coding Chapter 4: Channel Capacity and Communication Limits 88-119 4.1 Shannon’s noisy channel coding theorem 4.2 Channel capacity computation 4.3 Error bounds and reliable communication 4.4 Information transfer in distributed AI systems 4.5 Federated learning communication bottlenecks 4.6 Rate constraints in multi-agent AI systems ________________________________________ PART II — Information Theory in Machine Learning Chapter 5: Information-Theoretic Learning 120-148 5.1 Empirical risk minimization and uncertainty 5.2 Minimum Description Length (MDL) principle 5.3 Information Bottleneck (IB) framework 5.4 Neural representation compression 5.5 Complexity–accuracy trade-offs Chapter 6: Entropy-Based Loss Functions 149-175 6.1 Cross-entropy loss in classification 6.2 KL divergence in generative modeling 6.3 Maximum entropy models 6.4 Entropy regularization in reinforcement learning 6.5 Role of entropy in probabilistic learning Chapter 7: Regularization Techniques Through Information Lens 176-193 7.1 L1 and L2 regularization and MDL 7.2 Dropout as Bayesian approximation 7.3 Noise injection and information dropout 7.4 PAC-Bayes generalization bounds 7.5 Sharp minima, flat minima, and entropy-based optimization Chapter 8: Information Theory in Deep Neural Networks 194-219 8.1 Neural networks as lossy compression systems 8.2 Information propagation across layers 8.3 Mutual Information Neural Estimation (MINE) 8.4 Representation complexity and generalization 8.5 Overfitting explained via entropy ________________________________________ PART III — Coding Theory and AI Systems Chapter 9: Fundamentals of Error-Correcting Codes 220-244 9.1 Linear codes and generator matrices 9.2 Hamming codes 9.3 BCH and Reed–Solomon codes 9.4 Soft decoding and syndrome decoding 9.5 AI applications in communications and storage Chapter 10: Neural Approaches to Coding 245-273 10.1 Neural decoders 10.2 Deep iterative decoding 10.3 End-to-end learned communication systems 10.4 Joint source–channel coding with deep learning 10.5 Performance evaluation metrics Chapter 11: Robustness, Adversarial Noise, and Coding Theory 274-302 11.1 Noise models in AI 11.2 Adversarial examples and information metrics 11.3 Error correction principles for AI robustness 11.4 Code-based defenses for neural networks 11.5 Information-theoretic analysis of vulnerabilities ________________________________________ PART IV — Variational Inference and Probabilistic Deep Learning Chapter 12: Variational Autoencoders (VAEs) 303-327 12.1 Variational inference basics 12.2 Evidence Lower Bound (ELBO) 12.3 KL divergence and latent space structure 12.4 β-VAE 12.5 InfoVAE and ∆-VAE 12.6 Disentanglement via mutual information

The Leanpub 60 Day 100% Happiness Guarantee

Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.

See full terms...

Earn $8 on a $10 Purchase, and $16 on a $20 Purchase

We pay 80% royalties on purchases of $7.99 or more, and 80% royalties minus a 50 cent flat fee on purchases between $0.99 and $7.98. You earn $8 on a $10 sale, and $16 on a $20 sale. So, if we sell 5000 non-refunded copies of your book for $20, you'll earn $80,000.

(Yes, some authors have already earned much more than that on Leanpub.)

In fact, authors have earned over $15 million writing, publishing and selling on Leanpub.

Learn more about writing on Leanpub

Free Updates. DRM Free.

If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).

Most Leanpub books are available in PDF (for computers) and EPUB (for phones, tablets and Kindle). The formats that a book includes are shown at the top right corner of this page.

Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.

Learn more about Leanpub's ebook formats and where to read them

Write and Publish on Leanpub

You can use Leanpub to easily write, publish and sell in-progress and completed ebooks and online courses!

Leanpub is a powerful platform for serious authors, combining a simple, elegant writing and publishing workflow with a store focused on selling in-progress ebooks.

Leanpub is a magical typewriter for authors: just write in plain text, and to publish your ebook, just click a button. (Or, if you are producing your ebook your own way, you can even upload your own PDF and/or EPUB files and then publish with one click!) It really is that easy.

Learn more about writing on Leanpub