How Omilia scales foundation model inference on Amazon SageMaker AI
As organizations increasingly rely on custom foundation models (FMs) for generative AI applications, the challenge shifts from model training and experimentation to delivering low-latency, high-throughput, cost-efficient inference at production scale.
In this session, learn how Omilia uses Amazon SageMaker AI to deploy and scale custom FMs for enterprise conversational AI, optimizing for reliability, performance, and cost.
