Tell your friends about this item:
Hands-On LLM Serving and Optimization: Hosting LLMs at Scale Chi Wang
Hands-On LLM Serving and Optimization: Hosting LLMs at Scale
Chi Wang
As the demand for real-time AI applications grows, along comes this comprehensive guide to the complexities of deploying and optimizing LLMs at scale. The authors take a real-world approach backed by practical examples and code, and assemble essential strategies for designing infrastructures that are equal to the demands of modern AI applications.
| Media | Books Paperback Book (Book with soft cover and glued back) |
| To be released | April 30, 2026 |
| ISBN13 | 9798341621497 |
| Publishers | O'Reilly Media |
| Pages | 300 |
| Dimensions | 150 × 220 × 10 mm · 601 g (Weight (estimated)) |