The Leanpub 60 Day 100% Happiness Guarantee
Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.
See full terms...
Kick off your book project in 2 hours, get started with GhostAI in 2 hours, or do both! Free live workshops, on Zoom. You’ll leave with a real book project and a clear plan to keep going. Saturday, June 27, 2026.
Building custom inference engines, fine-tuning local models (LoRA), and leveraging Unified Memory directly from Swift.
Transform your Apple devices into AI powerhouses with native MLX Swift and local LLMs. Master "Metal-to-Model" workflows, leveraging unified memory for lightning-fast, zero-copy inference. Implement LoRA fine-tuning, 4-bit quantization, and real-time streaming for a superior user experience. Build the next generation of privacy-centric, offline-first AI applications directly in Swift 6.
Minimum price
$16.99
$25.99
Buying multiple copies for your team? See below for a discount!
About the Book
Unlock the full power of Apple Silicon with the definitive guide to MLX Swift and Local LLMs.
The future of Artificial Intelligence is local. In Volume 6 of the Swift & AI Masterclass, author Edgar Milvus takes you deep into the architecture of Apple's MLX framework, a high-performance array library designed for the "Metal-to-Model" experience. This isn't just about calling APIs; it's about building custom inference engines and fine-tuning models directly on your Mac, iPhone, and iPad.
What’s inside this volume:
Bridging the gap between Python-based research and Swift-based production, this book provides the theoretical foundations and the production-ready code needed to build the next generation of privacy-centric, offline-first AI applications. Whether you are an experienced iOS developer or a Machine Learning engineer, this masterclass is your roadmap to AI excellence on Apple platforms.
Note: This book requires a Mac with Apple Silicon for the code examples.
Table of contents
Chapter 1: Intro to MLX — Apple’s Array Framework for Swift
Chapter 2: MLX Swift vs. Core ML — When to Use Which
Chapter 3: Unified Memory Architecture and Tensor Operations
Chapter 4: Building Neural Networks with MLX NN in Swift
Chapter 5: Optimization Techniques with MLX Optimizers
Chapter 6: Porting Weights — Converting HuggingFace Models to MLX
Chapter 7: Implementing Transformer Architectures in MLX Swift
Chapter 8: Quantization (4-bit/8-bit) for On-Device LLMs
Chapter 9: Streaming Token Inference with MLX Swift
Chapter 10: Performance Profiling MLX vs. llama.cpp
Chapter 11: Local Fine-Tuning with LoRA and QLoRA in Swift
Chapter 12: Memory Management for Large Models on Mac
Chapter 13: Building Agentic Loops with MLX-powered LLMs
Chapter 14: Tool Calling and Function Injection with MLX
Chapter 15: Deploying MLX Swift to macOS and iOS (The Future)
Chapter 16: Designing the MLX-based Model Service Actor
Chapter 17: Real-time UI with SwiftUI and Token Streaming
Chapter 18: Implementing Local Memory with MLX Embeddings
Chapter 19: Hardware Monitoring (GPU/NPU usage) in-app
Chapter 20: Optimization, Sandboxing, and Distribution
If printed, this ebook would span over 600 pages. Each chapter is structured into theoretical foundations, an annotated basic example, an annotated advanced example, and five coding exercises based on real-world scenarios with complete solutions.
Team Discounts
Get a team discount on this book!
Up to 3 members
Up to 5 members
Up to 10 members
Up to 15 members
Up to 25 members
About the Author
A veteran software engineer with 20 years of experience, I have dedicated my career to the art of automation. My philosophy is simple: programming should eliminate repetitive chores to unlock human creativity. This journey began early on with the development of custom code-generation tools and has evolved into a deep mastery of LLMs and their APIs. Today, I specialize in architecting AI-driven solutions that handle everything from complex coding and security tasks to advanced knowledge retrieval, transforming the way we interact with technology
Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.
See full terms...
We pay 80% royalties on purchases of $7.99 or more, and 80% royalties minus a 50 cent flat fee on purchases between $0.99 and $7.98. You earn $8 on a $10 sale, and $16 on a $20 sale. So, if we sell 5000 non-refunded copies of your book for $20, you'll earn $80,000.
(Yes, some authors have already earned much more than that on Leanpub.)
In fact, authors have earned over $15 million writing, publishing and selling on Leanpub.
Learn more about writing on Leanpub
If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).
Most Leanpub books are available in PDF (for computers) and EPUB (for phones, tablets and Kindle). The formats that a book includes are shown at the top right corner of this page.
Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.
Learn more about Leanpub's ebook formats and where to read them
You can use Leanpub to easily write, publish and sell in-progress and completed ebooks and online courses!
Leanpub is a powerful platform for serious authors, combining a simple, elegant writing and publishing workflow with a store focused on selling in-progress ebooks.
Leanpub is a magical typewriter for authors: just write in plain text, and to publish your ebook, just click a button. (Or, if you are producing your ebook your own way, you can even upload your own PDF and/or EPUB files and then publish with one click!) It really is that easy.