Your fastest path to production for AI apps.
Cheap inference is built into every container. Define your stack in YAML, fork a block, deploy with one command. Zero ops, zero surprises.
From terminal to running tool in minutes.
Same idea Render popularised, with free inference baked into every container so you don't have to wire up API keys or worry about per-token cost.
Pick a deployment type
Tools, workflows, chats, live APIs, cron, or workers — same platform, six runtimes.
Connect code or fork a block
Push your repo with a gonkablocks.yaml, or fork any public block and edit from the browser.
Gonkablocks does the rest
Image build, scoped key, public URL, autoscaling, logs, metrics, billing — wired by default.
Whatever you're shipping, there's a runtime for it.
Six block types share one platform: same observability, same secrets system, same free inference baked in.
Tools
One-shot jobs. Take inputs, run a container, return outputs. Perfect for translation, summarization, image generation.
Workflows
Visual DAGs that wire blocks together. Outputs flow as inputs. Run any block as a node — no glue code.
Chats
Long-lived sessions with shared state, streamed token-by-token. OpenAI-compatible /v1/chat/completions API.
Live APIs
Always-on HTTP endpoints fronted by autoscaled containers. Per-route auth, scoped keys, audit logs.
Cron
Scheduled runs of any block. Use it for daily summaries, periodic ingest, drift checks. Same observability as one-off runs.
Workers
Persistent background processes. Watch a queue, listen on a webhook, keep a model warm.
Define your stack in one file.
Wire up multiple blocks, model defaults, autoscaling, and cron schedules in a single gonkablocks.yaml. Validated on every push. Version-controlled with your code. Forkable in one click.
Validated
Manifests are linted server-side before any container starts. Bad input schemas fail fast, not at run time.
Version-controlled
Lives in your repo, reviews on PRs, rolls back like normal code. No clicking through dashboards to undo a mistake.
Forkable
Any public block on the platform can be forked into your account by someone reading the manifest. The community remix loop, by default.
Free, multi-model, no keys.
Other platforms make you BYO inference key. We bundle a decentralized GPU network into the runtime so a fresh container is already wired to talk to a model — and you don't pay for any of it while we're getting started.
Free for everyone, right now
Every run, every model call — sponsored by Gonka Labs. We'll let you know if/when this changes; for now there's nothing to wire up, no card to enter.
Open weights, real models
Qwen3 235B today, Kimi K2.6 next. Same OpenAI-compatible API surface across every model.
No API keys to manage
Each run gets a scoped key minted on the fly. No .env handling, no rotation, no leaks to grep for.
One command from terminal to running tool.
gonkablocks deploy packages your block, builds the image, mints a scoped key, and gives you a public URL. No browser steps.
$ curl -fsSL https://blocks.gonka.gg/install.sh | sh
Requires Node.js 20+. Two binaries get installed: gonkablocks and the short alias gbk.
The boring stuff, handled.
The platform ships with the things you'd otherwise have to wire up yourself.
Sandboxed Docker
Every block runs isolated. Optional gVisor runtime for full hard-multitenancy.
OpenAI-compatible
Drop-in /v1/chat/completions and /v1/embeddings. Use any client SDK you already have.
Streaming + logs
Server-sent stdout, structured run events, and per-call inference traces in one viewer.
Secrets vault
Per-user encrypted secrets, auto-matched to block input names, never exposed to client code.
Per-run scoped keys
Each run gets a fresh sk-run-* key bounded by a spend cap. Zero standing credentials in containers.
First-class CLI
`gonkablocks deploy`, `exec`, `connect`. Same primitives as the dashboard, scriptable.
Deploy your first block in five minutes.
gonkablocks deploy — that's the whole onboarding.