Multimodal Models Systems Playbook: Build Vision-Language-Speech Apps with Agent Workflows, RAG, Evaluation & Deployment

Author: Reid Harper
Publisher: Independently Published
ISBN:

9798242295773

Pages: 268
Publication Date: 02 January 2026
Format: Paperback
Availability: Available To Order

We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately.

Our Price $44.85 Quantity:

Share |

Multimodal Models Systems Playbook: Build Vision-Language-Speech Apps with Agent Workflows, RAG, Evaluation & Deployment

Overview

Most books explain what multimodal AI is. This playbook shows you how to actually build and deploy it. As AI systems move beyond text into images, speech, and actions, many teams struggle with fragile pipelines, hallucinations, broken RAG setups, and demos that fail in production. This book fixes that. Multimodal Models Systems Playbook is a practical, systems-first guide for engineers building real multimodal AI applications-using vision, language, and speech models together with agent workflows and retrieval pipelines. Inside, you'll learn how to: Design reliable vision → language pipelines Build voice and speech systems that go beyond transcription Implement multimodal RAG across text, images, and audio Create agent workflows that route tasks by modality Evaluate multimodal systems for grounding, latency, and cost Deploy production-ready systems with fallbacks and observability Each chapter includes clear explanations, failure modes, production checklists, and hands-on mini-labs. Who this book is for: Engineers, AI builders, and teams shipping multimodal systems. Not for: academic theory or vendor-locked tutorials. If you want to move from multimodal demos to production systems, this playbook shows you how.

Full Product Details

Author: Reid Harper
Publisher: Independently Published
Imprint: Independently Published
Dimensions: Width: 15.20cm , Height: 1.40cm , Length: 22.90cm
Weight: 0.363kg
ISBN:

9798242295773

Pages: 268
Publication Date: 02 January 2026
Audience: General/trade , General
Format: Paperback
Publisher's Status: Active
Availability: Available To Order

We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately.

Reviews

Author Information

Tab Content 6

Author Website:

Countries Available

All regions

Latest Reading Guide

Shopping Cart

Your cart is empty

Mailing List

Multimodal Models Systems Playbook: Build Vision-Language-Speech Apps with Agent Workflows, RAG, Evaluation & Deployment

9798242295773

Availability Information

Overview

Full Product Details

9798242295773

Table of Contents

Reviews

Author Information

Tab Content 6

Countries Available

Sign up now