Free shipping on orders over $99
Building Generative AI Services with FastAPI

Building Generative AI Services with FastAPI

A Practical Approach to Developing Context Rich Generative AI Applications

by Ali Parandeh
Paperback
Publication Date: 31/05/2025

Share This Book:

51%
OFF
RRP  $133.00

RRP means 'Recommended Retail Price' and is the price our supplier recommends to retailers that the product be offered for sale. It does not necessarily mean the product has been offered or sold at the RRP by us or anyone else.

$65.80
or 4 easy payments of $16.45 with
afterpay

Ready to build applications using generative AI? This practical book outlines the process necessary to design and build production grade AI services with a FastAPI web server that communicate seamlessly with databases, payment systems, and external APIs. You'll learn how to develop autonomous generative AI agents that stream outputs in real-time and interact with other models. Web developers, data scientists, and DevOps engineers will learn to implement end-to-end production-ready services that leverage generative AI.

You'll learn design patterns to manage software complexity, implement FastAPI lifespan for AI model integration, handle long-running generative tasks, perform content filtering, cache outputs, implement retrieval augmented generation (RAG) with a vector database, implement usage/cost monitoring and tracking, protect services with your own authentication and authorization mechanisms, and effectively control stream outputs directly from GenAI models. You'll explore efficient testing methods for AI outputs, validation against databases, and deployment patterns using Docker for robust microservices in the cloud.

  • Build generative services that interact with databases, external APIs, and more
  • Learn how to load AI models into a FastAPI lifecycle memory
  • Monitor and log model requests and responses within services
  • Use authentication and authorization patterns hooked with generative models
  • Handle and cache long-running inference tasks
  • Stream model outputs via streaming events and WebSockets into browsers or files
  • Automate the retraining process of generative models by exposing event-driven endpoints

Ali Parandeh is a Chartered Engineer with the UK Engineering Council and a Microsoft and Google certified developer, data engineer, and data scientist.

ISBN:
9781098160302
9781098160302
Category:
Web services
Format:
Paperback
Publication Date:
31-05-2025
Language:
English
Publisher:
O'Reilly Media, Incorporated
Country of origin:
United States
Dimensions (mm):
250x150x15mm
Weight:
0.67kg

This item is In Stock in our Sydney warehouse and should be sent from our warehouse within 1-2 working days.

Once sent we will send you a Shipping Notification which includes online tracking.

Please check the estimated delivery times below for your region, for after your order is despatched from our warehouse:

ACT Metro  2 working days

NSW Metro  2 working days

NSW Rural  2 - 3 working days

NSW Remote  2 - 5 working days

NT Metro  3 - 6 working days

NT Remote  4 - 10 working days

QLD Metro  2 - 4 working days

QLD Rural  2 - 5 working days

QLD Remote  2 - 7 working days

SA Metro  2 - 5 working days

SA Rural  3 - 6 working days

SA Remote  3 - 7 working days

TAS Metro  3 - 6 working days

TAS Rural  3 - 6 working days

VIC Metro  2 - 3 working days

VIC Rural  2 - 4 working days

VIC Remote  2 - 5 working days

WA Metro  3 - 6 working days

WA Rural  4 - 8 working days

WA Remote  4 - 12 working days

 

Express Post is available if ALL items in your Shopping Cart are listed as 'In Stock'.

Reviews

Be the first to review Building Generative AI Services with FastAPI.