Tools

Quantized Inference projects, skills, and proof

Quantized inference proof around practical local deployment, model selection, memory limits, and performance constraints.

Related projects

AI / Automation Private lab proof

Local AI Model Gateway

Self-hosted LLM and diffusion workflow experiments around local inference, quantized models, API wrappers, prompt pipelines, queues, and human review boundaries.

Python, FastAPI, Docker, local LLMs, Qwen-style open-weight models, GGUF/quantized inference, SDXL-style diffusion workflows, API wrappers

Shows practical AI infrastructure work: running models locally, wrapping them into usable APIs, routing jobs through queues, and keeping operator approval around external actions.

Python FastAPI Docker AI Workflow Systems LLM Workflow Local LLM Model Serving

View detail

Related skills

AI / Workflow

Local LLM / model serving

Local model proof

Self-hosted and local LLM workflows using open-weight models, quantized builds, API wrappers, prompt pipelines, and operator review boundaries.

Private lab / workflow proof

AI Workflow Systems LLM Workflow Local LLM Model Serving Quantized Inference Human-in-the-Loop Automation

Backend / Crawling

Python / FastAPI / crawling systems

Portfolio proof

FastAPI crawler and decision systems with raw/rendered audits, worker queues, evidence APIs, market evidence, exports, and technical SEO diagnostics.

ILCrawler / MarketEngine

Python FastAPI Docker Technical SEO Crawl / Indexing Lighthouse

Relevant problem?

Send the URL, symptom, and what needs to be changed or verified.

Contact