A comprehensive multi-agent travel planning system built with LangChain, LangGraph, FastAPI, and React. This full-stack application coordinates specialized AI agents to create complete travel ...
A production-minded FastAPI sidecar for serving Gemma 4 31B on vLLM with Gemma 4 Multi-Token Prediction (MTP) speculative decoding. It keeps the raw vllm serve process private and adds ...