This talk explores building a complete self-hosted LLM stack in Rust: Paddler, a distributed load balancer for serving LLMs at scale, and Poet, a static site generator that consumes those LLMs for AI-powered content features.
In this talk, we’ll re-create the core ideas of Karpathy’s micrograd, but entirely in Rust.
During this talk we'll build a basic, working async runtime using nothing more than a standard library. The point? To see it's approachable for mere mortals.
The talk explores how Rust’s type system and memory safety can be leveraged to enforce mandatory guardrails at the infrastructure level, where traditional frameworks often fall short.
In this talk, we'll dive deep into what makes concurrency coordination costly, and explore some pathways to mitigate that cost.