Tools
Quantized Inference projects, skills, and proof
Quantized inference proof around practical local deployment, model selection, memory limits, and performance constraints.
Related projects
1
AI / Automation
Private lab proof
Local AI Model Gateway
Self-hosted LLM and diffusion workflow experiments around local inference, quantized models, API wrappers, prompt pipelines, queues, and human review boundaries.
Python, FastAPI, Docker, local LLMs, Qwen-style open-weight models, GGUF/quantized inference, SDXL-style diffusion workflows, API wrappers
Shows practical AI infrastructure work: running models locally, wrapping them into usable APIs, routing jobs through queues, and keeping operator approval around external actions.