LLMs in Production

Engineering AI Applications

Reinhören

Artikel konnten nicht hinzugefügt werden

Leider können wir den Artikel nicht hinzufügen, da Ihr Warenkorb bereits seine Kapazität erreicht hat.

Der Titel konnte nicht zum Warenkorb hinzugefügt werden.

Bitte versuchen Sie es später noch einmal

Der Titel konnte nicht zum Merkzettel hinzugefügt werden.

Bitte versuchen Sie es später noch einmal

„Von Wunschzettel entfernen“ fehlgeschlagen.

Bitte versuchen Sie es später noch einmal

„Podcast folgen“ fehlgeschlagen

„Podcast nicht mehr folgen“ fehlgeschlagen

Zeitlich begrenztes Angebot

3 Monate Audible Standard kostenlos testen

3 Monate Audible Standard kostenlos testen, danach 6,99 €/Monat. Monatlich kündbar.

Jetzt abonnieren

Das Angebot endet am 15. Juli 2026 23:59 Uhr. Dieses Angebot sichern!

Weitere Angebote

LLMs in Production

Von: Christopher Brousseau, Matt Sharp

Gesprochen von: Christopher Kendrick

Jetzt abonnieren

3 Monate Audible Standard für 0,99 €/Monat, danach 6,99 €/Monat. Monatlich kündbar. Angebot gültig bis zum 15. Juli 2026 um 23:59 Uhr.

Für 22,09 € kaufen

Unlock the potential of Generative AI with this Large Language Model production-ready playbook for seamless deployment, optimization, and scaling. This hands-on guide takes you beyond theory, offering expert strategies for integrating LLMs into real-world applications using retrieval-augmented generation (RAG), vector databases, PEFT, LoRA, and scalable inference architectures. Whether you're an ML engineer, data scientist, or MLOps practitioner, you’ll gain the technical know-how to operationalize LLMs efficiently, reduce compute costs, and ensure rock-solid reliability in production.

What You’ll Learn:

Master LLM Fundamentals – Understand tokenization, transformer architectures, and the evolution linguistics to the creation of foundation models.
RAG & Vector Databases – Augment model capabilities with real-time retrieval and memory-optimized embeddings.
Training vs Fine-tuning – Learn how to train your own model as well as cutting edge techniques like Distillation, RLHF, PEFT, LoRA, and QLoRA for cost-effective adaptation.
Prompt Engineering – Discover the quickly evolving world of prompt engineering and go beyond simple prompt and pray methods and learn how to implement structured outputs, complex workflows, and LLM agents.
Scaling & Cost Optimization – Deploy LLMs into your favorite cloud of choice, on commodity hardware, Kubernetes clusters, and edge devices.
Securing AI Workflows – Implement guardrails for hallucination mitigation, adversarial testing, and compliance monitoring.
MLOps for LLMs – Learn all about LLMOps, automate model lifecycle management, retraining pipelines, and continuous evaluation.

Hands-on Projects Include:

• Training a custom LLM from scratch – Build and optimize an industry-specific model.

• AI-Powered VSCode Extension – Use LLMs to enhance developer productivity with intelligent code completion.

• Deploying on Edge Devices – Run a lightweight LLM on a Raspberry Pi or Jetson Nano for real-world AI applications.

PLEASE NOTE: When you purchase this title, the accompanying PDF will be available in your Audible Library along with the audio.

Christopher BrousseauMatt SharpChristopher KendrickManning Publications

Informatik

Programmieren & Softwareentwicklung

Informatik Programmieren & Softwareentwicklung

Am relevantesten

Good introduction in the topic, speakers voices clear and dynamic. Gameification examples are curious and feel misplaced somehow. But what bothers me the most: no code references at all, what's the point? See code here, see code there, I see no code at all. Just a small PDF with references would be enough. C'mon? They might wanna try to force listeners to buy the book also, I have no clue...

see this code...

Ein Fehler ist aufgetreten. Bitte versuche es in ein paar Minuten noch einmal.

the potential that LLM seem to have ist not critically reviewed. this "book" seems more like a commercial for investing in LLM developement

overoptimistic halfknowledge

Ein Fehler ist aufgetreten. Bitte versuche es in ein paar Minuten noch einmal.