Recipes
Gpodz-curated LoRA recipes across Qwen3.5 (8 variants, Image-Text-to-Text), Gemma 4 (4 variants, Any-to-Any / Image-Text-to-Text), Gemma 3 No-Think (1 variant, latency-optimized), and DeepSeek V4 Flash (2 reserved-booking variants, text-only). Pick the base that fits your VRAM budget and archetype; Gpodz handles adapter composition, eval gating, and serving warm-up.
15 pinned recipes · indicative pricing from docs/21 §2 · billing starts only when your GPU lane passes readiness.
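As a rough illustration of matching a recipe to a VRAM budget, the sketch below filters a handful of recipes by their minimum lane. The lane ids are copied from the cards on this page. The helper names (`lane_memory_gb`, `recipes_within`) are hypothetical, and reading the number in a lane id as the lane's memory in GB is an assumption, not a documented Gpodz API.

```python
# Minimum lane ids copied from a few cards on this page.
MIN_LANES = {
    "Qwen Starter 2B": "shared-12gb",
    "Qwen Starter 9B": "shared-16gb",
    "Qwen Pro 27B": "isolated-45gb",
    "Qwen Mega 122B": "dedicated-80gb",
    "DeepSeek Flash Reserved": "dedicated-180gb",
}


def lane_memory_gb(lane_id: str) -> int:
    """Read the memory figure out of a lane id such as 'shared-12gb'.

    Assumes the '<tier>-<N>gb' pattern seen on the cards.
    """
    size = lane_id.rsplit("-", 1)[1]      # e.g. '12gb'
    return int(size.removesuffix("gb"))   # e.g. 12


def recipes_within(budget_gb: int) -> list[str]:
    """Return recipe names whose minimum lane fits the given budget."""
    return [
        name
        for name, lane_id in MIN_LANES.items()
        if lane_memory_gb(lane_id) <= budget_gb
    ]


print(recipes_within(16))
```

This only compares minimum lane sizes. Whether a given lane tier (Shared, Isolated, Dedicated) is available to you is a separate question that the sketch does not model.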
Available now — mvp
Qwen Starter 2B
Qwen/Qwen3.5-2B · 2B · dense
mvp · Modality: Image-Text-to-Text
Min lane: Shared · shared-12gb
Training (indicative): $5 setup + $0.45/GPU-hr
Serving (indicative): $0.30/hr (Quick)
Compact 2B-class Qwen recipe for cheap dev iteration, agent prototyping, and the Toolkit-Chat dogfood path.
Qwen Starter 4B
Qwen/Qwen3.5-4B · 4B · dense
mvp · Modality: Image-Text-to-Text
Min lane: Shared · shared-12gb
Training (indicative): $9 setup + $0.90/GPU-hr
Serving (indicative): $0.60/hr (Quick)
Curated Qwen3.5 4B recipe for first adapters, support bots, and simple agents on Shared or 23GB Isolated GPU capacity.
Qwen Starter 9B
Qwen/Qwen3.5-9B · 9B · dense
mvp · Modality: Image-Text-to-Text
Min lane: Shared · shared-16gb
Training (indicative): $9 setup + $0.90/GPU-hr
Serving (indicative): $1.05/hr (Quick)
Curated Qwen3.5 9B recipe for stronger first adapters, support bots, and small agents on 16GB Shared or 23GB Isolated capacity.
Qwen Pro 27B
Qwen/Qwen3.5-27B · 27B · dense
mvp · Modality: Image-Text-to-Text
Min lane: Isolated · isolated-45gb
Training (indicative): $29 setup + $2.40/GPU-hr
Serving (indicative): $1.80/hr (Quick)
Curated Qwen3.5 27B recipe for domain workflows and serious agents on 45GB Isolated capacity.
Qwen Max 35B
Qwen/Qwen3.5-35B-A3B · 35B-A3B · MoE (3B active)
mvp · Modality: Image-Text-to-Text
Min lane: Isolated · isolated-45gb
Training (indicative): $49 setup + $3.60/GPU-hr
Serving (indicative): $2.40/hr (Quick)
Curated Qwen3.5 35B-A3B MoE recipe for production agents and codebase, workflow, and domain adapters on 45GB Isolated capacity.
Qwen Mega 122B
Qwen/Qwen3.5-122B-A10B · 122B-A10B · MoE (10B active)
mvp · Modality: Image-Text-to-Text
Min lane: Dedicated · dedicated-80gb
Serving (indicative): Dedicated GPU rate — contact for pricing
Qwen3.5 MoE mega model at 122B total (10B active) for heavy inference on dedicated Hopper/Blackwell hardware.
Gemma Lite E4B
google/gemma-4-E4B-it · ~4B · efficient/edge — Any-to-Any
mvp · Modality: Any-to-Any (text · image · audio · video)
Min lane: Shared · shared-12gb
Training (indicative): $9 setup + $0.90/GPU-hr
Serving (indicative): $0.60/hr (Quick)
Curated Gemma 4 E4B recipe for compact assistants and lower-cost text experiments on Shared or 23GB Isolated GPU capacity.
Gemma Edge E2B
google/gemma-4-E2B · ~2B · efficient/edge — multimodal
mvp · Modality: Image-Text-to-Text
Min lane: Shared · shared-12gb
Serving (indicative): Shared GPU rate — contact for pricing
Gemma 4 E2B — smallest Gemma 4 variant for latency-sensitive T2/T3 lanes.
Gemma No-Think 27B
google/gemma-3-27b-it · 27B · dense — Gemma 3 (chain-of-thought disabled)
mvp · Modality: Image-Text-to-Text
Min lane: Isolated · isolated-45gb
Serving (indicative): Isolated GPU rate — contact for pricing
Gemma 3 27B IT — chain-of-thought disabled for latency-sensitive inference paths.
Beta — validation in progress
Beta recipes are available but carry a beta banner in the training wizard. Multimodal and long-context probes are still completing.
Gemma Vision 26B
google/gemma-4-26B-A4B-it · 26B-A4B · MoE (4B active) — Image-Text-to-Text
beta · Modality: Image-Text-to-Text
Min lane: Isolated · isolated-45gb
Training (indicative): $49 setup + $3.60/GPU-hr
Serving (indicative): $2.40/hr (Quick)
Curated Gemma 4 26B-A4B recipe for multimodal and domain adapters on 45GB Isolated capacity. Multimodal validation in progress.
Gemma Max 31B
google/gemma-4-31B-it · 31B · dense — Image-Text-to-Text
beta · Modality: Image-Text-to-Text
Min lane: Isolated · isolated-45gb
Training (indicative): $79 setup + $4.80/GPU-hr
Serving (indicative): $2.40/hr (Quick)
Curated Gemma 4 31B recipe for higher-quality text and vision adapters on 45GB Isolated capacity (90GB lane added after validation).
Reserved booking required
These recipes are not self-serve. They require a pre-arranged reservation on dedicated B200 or H200 capacity. Contact Gpodz to schedule. Idle-billing applies on hot-pool reservations.
DeepSeek Flash Reserved
deepseek-ai/DeepSeek-V4-Flash · Flash · dense (FP8)
reserved booking required · Modality: Text-only
Min lane: Dedicated · dedicated-180gb
Reserved booking required — $7.20/hr (indicative). Contact Gpodz to schedule a dedicated block.
Reserved DeepSeek V4 Flash serving block for long-context reasoning and code or document analysis on a dedicated B200 or H200. 4-hour minimum.
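A worked example of the indicative figures on this card, assuming the 4-hour minimum acts as a billing floor (a shorter request is charged as 4 hours). That reading is an assumption, as is the hypothetical function name. The authoritative rate comes from the billing engine, not from this sketch.

```python
from decimal import Decimal

RESERVED_RATE_PER_HR = Decimal("7.20")  # indicative rate from the card
MINIMUM_HOURS = Decimal("4")            # 4-hour minimum from the card


def reserved_block_cost(hours: Decimal) -> Decimal:
    """Indicative cost of a reserved block, with the minimum applied as a floor."""
    billable_hours = max(hours, MINIMUM_HOURS)
    return RESERVED_RATE_PER_HR * billable_hours


print(reserved_block_cost(Decimal("2")))  # billed as the 4-hour minimum
print(reserved_block_cost(Decimal("6")))
```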
DeepSeek Flash Hot
deepseek-ai/DeepSeek-V4-Flash · Flash · dense (FP8)
reserved booking required · Modality: Text-only
Min lane: Dedicated · dedicated-180gb
Reserved booking required — daily rate, contact Gpodz. Base model is billable while idle (LEGAL-8).
DeepSeek V4 Flash hot pool for low-latency reserved serving on a dedicated B200 or H200. Base model stays resident and is billable while idle.
Internal / pipeline use
These recipes carry launch_status: manual_review and are NOT available to tenant principals. They require the internal:pipeline scope on an API key (CLAUDE.md §8). Shown here for operator visibility only.
Qwen Frontier 397B
Qwen/Qwen3.5-397B-A17B · 397B-A17B · MoE (17B active) — multi-GPU tensor-parallel
manual_review · Modality: Image-Text-to-Text
Requires internal:pipeline scope. Qwen3.5-397B-A17B MoE frontier model — reserved for multi-GPU tensor-parallel serving. Phase 1 does not ship this publicly.
Qwen Starter Pipeline (0.8B)
Qwen/Qwen3.5-0.8B · 0.8B · dense — pipeline test only
manual_review · Modality: Image-Text-to-Text
Requires internal:pipeline scope. Pipeline-test recipe for end-to-end platform validation on the smallest Qwen-line model. Internal only. Never invoiced.
Failed readiness ⇒ no charge. Billing starts only when your GPU lane passes readiness. See trust page for the Gate-4 billing proof.
Indicative pricing shown on each card. Final rates are the authoritative billing-engine values per docs/21 §2 — not this YAML hint. All 15 recipes are pinned to real HuggingFace revision SHAs verified 2026-05-14.
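To make the "setup + GPU-hr" figures on the cards concrete, here is a minimal estimate under stated assumptions: the indicative training cost is the setup fee plus the hourly rate times GPU-hours, and a failed readiness check results in no charge at all, setup fee included. The function name and the 10 GPU-hour job length are illustrative only. Final amounts come from the billing engine.

```python
from decimal import Decimal


def estimate_training_cost(
    setup_usd: str,
    rate_per_gpu_hr: str,
    gpu_hours: str,
    readiness_passed: bool = True,
) -> Decimal:
    """Indicative training cost: setup fee + hourly rate * GPU-hours.

    Returns 0 when the lane never passed readiness, following the
    "failed readiness => no charge" statement on this page.
    """
    if not readiness_passed:
        return Decimal("0")
    return Decimal(setup_usd) + Decimal(rate_per_gpu_hr) * Decimal(gpu_hours)


# Indicative figures copied from the Qwen Starter 4B and Qwen Pro 27B cards,
# for a hypothetical 10 GPU-hour job.
qwen_4b = estimate_training_cost("9", "0.90", "10")
qwen_27b = estimate_training_cost("29", "2.40", "10")
failed = estimate_training_cost("29", "2.40", "10", readiness_passed=False)

print(qwen_4b, qwen_27b, failed)
```

Decimal arithmetic is used instead of floats so that currency amounts do not pick up binary rounding noise.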