Pick how you want to use cocompute. You can do both.
You have a GPU sitting idle. Install the host binary, join the pool, and get access to other people's GPUs in return.
You want LLM inference but don't want to buy hardware. Sign up, get an API key, point your apps at our OpenAI-compatible endpoint.
Three steps. About 2 minutes if you have a curl handy.
Create a free account.
From your dashboard, find the pool you want to use and click New API Key. Copy the key. It looks like a long random string. You'll only see it once.
API keys are scoped to a pool. They only have access to the hosts in that pool.
Drop your key in and call the OpenAI-compatible endpoint:
curl https://cocompute.ai/v1/chat/completions \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model":"gemma4:31b","messages":[{"role":"user","content":"hello"}]}'Or list which models the pool has available:
curl https://cocompute.ai/v1/models -H "Authorization: Bearer YOUR_API_KEY"
Works with any client that speaks the OpenAI API spec (the official OpenAI SDK, openwebui, llama.cpp clients). Just change the base URL.
Open an issue on GitHub or read the source. cocompute is AGPL, every line is yours to inspect.