DarkGPU Documentation

Distributed inference marketplace for NVIDIA GPUs. OpenAI-compatible API, 40-60% cheaper than OpenRouter, operators keep 95%.

Authentication

Sign up at console.darkgpu.ai ($10 free credits). Create an API key on the API Keys page. Use Authorization: Bearer dgpu-....

First request

curl https://api.darkgpu.ai/v1/chat/completions \
  -H "Authorization: Bearer $DGPU_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"qwen3.5-7b","messages":[{"role":"user","content":"Hi"}]}'

Python (OpenAI SDK):

from openai import OpenAI
client = OpenAI(base_url="https://api.darkgpu.ai/v1", api_key="dgpu-...")
r = client.chat.completions.create(
    model="qwen3.5-7b",
    messages=[{"role":"user","content":"Hello"}],
)
print(r.choices[0].message.content)

POST /v1/chat/completions

OpenAI-compatible chat completions. Supports streaming via "stream": true.

Response headers include x-darkgpu-provider (provider ID that served the request) and x-darkgpu-request-id.

GET /v1/models

Returns the enabled model catalog with pricing. See pricing reference.

Pricing reference

Model IDInput /1MOutput /1MMin VRAM
qwen3.5-7b$0.015$0.068 GB
llama-3.3-8b$0.02$0.088 GB
mistral-nemo-12b$0.030$0.1212 GB
gemma-4-12b$0.035$0.1412 GB
qwen3.5-14b$0.030$0.1516 GB
qwen3.5-27b$0.060$0.3032 GB
llama-3.3-70b$0.100$0.5048 GB
deepseek-v3$0.270$1.10160 GB

Rate limits

Errors

Errors follow the OpenAI shape: {"error":{"message":"...","type":"..."}}.

HTTPtypeMeaning
400invalid_requestBad input
401authMissing / invalid API key
402billingOut of credits
404not_foundUnknown route or model
429rate_limitToo many requests
503service_unavailableNo provider online for this model

Provider install

Linux with NVIDIA driver 535+, CUDA 12+, Python 3.10+:

curl -fsSL https://darkgpu.ai/install.sh | bash
darkgpu auth
darkgpu serve

The installer detects your GPU, installs vLLM if missing, writes a config to ~/.config/darkgpu/provider.toml, and optionally installs a systemd unit.

Configuration

api_url         = "https://api.darkgpu.ai"
price_floor_pct = 0        # 0 = accept catalog prices
idle_detection  = true     # pause when GPU is in active use

[[schedule]]
days  = ["mon", "tue", "wed", "thu", "fri"]
start = "22:00"
end   = "08:00"

Provider earnings

Three tiers — the calculator on the landing page defaults to conservative. Actual revenue depends on demand for your model and your provider reputation.

Payouts are weekly, every Monday UTC, via Stripe Connect (USD). Minimum payout: $10.

Provider security

The agent is open source Rust. Only outbound WebSocket — no inbound ports, works behind NAT/CGNAT.

Each agent generates an X25519 keypair on first run, stored at ~/.config/darkgpu/x25519.key. The public key is advertised via /v1/providers/:id/attestation.

Trust tiers:

Economics

Revenue split: 95% provider, 5% platform. We don't pretend 0% is sustainable — 5% funds Stripe fees, Cloudflare infra, and the free-tier subsidy.

Consumer free tier: $10 of credits on signup, no card required.

See economics.md for the full breakdown.

FAQ

See the FAQ on the landing page for common questions: earnings reality, hardware wear, crypto, trust, latency.