Model Serving work

Evidence of model-serving work: local inference, API wrappers, batch jobs, review queues, and safe operator control.

Related projects

AI / Automation | Private lab proof

Local AI Model Gateway

Self-hosted LLM and diffusion workflow experiments around local inference, quantized models, API wrappers, prompt pipelines, queues, and human review boundaries.

Python, FastAPI, Docker, local LLMs, Qwen-style open-weight models, GGUF/quantized inference, SDXL-style diffusion workflows, API wrappers

Shows practical AI infrastructure work: running models locally, wrapping them into usable APIs, routing jobs through queues, and keeping operator approval around external actions.
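The queue-plus-approval pattern described above can be illustrated with a minimal Python sketch. It is not the project's actual code: the names `ReviewGateway`, `Job`, `generate`, and `send` are hypothetical, the model call is a stand-in for whatever local inference backend is used, and only the standard library is assumed. It shows one possible way to queue jobs, run inference, and block any external action until an operator approves the output.

```python
from dataclasses import dataclass
from enum import Enum
from queue import Queue
from typing import Callable, Dict, Optional


class Status(Enum):
    PENDING_REVIEW = "pending_review"
    APPROVED = "approved"
    REJECTED = "rejected"


@dataclass
class Job:
    job_id: int
    prompt: str
    output: Optional[str] = None
    status: Status = Status.PENDING_REVIEW


class ReviewGateway:
    """Queues prompts, runs them through a model, and holds results for review."""

    def __init__(self, generate: Callable[[str], str]) -> None:
        # `generate` is a hypothetical callable wrapping a local model.
        self._generate = generate
        self._inbox: Queue = Queue()      # jobs waiting for inference
        self._jobs: Dict[int, Job] = {}   # all jobs, keyed by id
        self._next_id = 1

    def submit(self, prompt: str) -> int:
        """Enqueue a prompt and return its job id."""
        job = Job(job_id=self._next_id, prompt=prompt)
        self._next_id += 1
        self._jobs[job.job_id] = job
        self._inbox.put(job)
        return job.job_id

    def run_pending(self) -> int:
        """Drain the queue, running inference on each job; return the count."""
        processed = 0
        while not self._inbox.empty():
            job = self._inbox.get()
            job.output = self._generate(job.prompt)
            processed += 1
        return processed

    def review(self, job_id: int, approve: bool) -> Job:
        """Record the operator's decision for a job that has output."""
        job = self._jobs[job_id]
        if job.output is None:
            raise ValueError("job has not been run yet")
        job.status = Status.APPROVED if approve else Status.REJECTED
        return job

    def release(self, job_id: int, send: Callable[[str], None]) -> bool:
        """Perform the external action only if the job was approved."""
        job = self._jobs[job_id]
        if job.status is not Status.APPROVED or job.output is None:
            return False
        send(job.output)
        return True
```

In this sketch the external action (`send`) is passed in at release time, so nothing leaves the gateway unless `review` has marked the job as approved. A real deployment would likely put an HTTP layer such as FastAPI in front of `submit` and `review`, but that wiring is left out here.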