My understanding is that the coding agents people use can be modified to plug in...

social_quotient · 2025-08-01T22:55:10 1754088910

Exactly! You can use tools like https://github.com/musistudio/claude-code-router which let you use other LLMs.

The way I would use this $50 Cerebras offering is as a delegate for some high token count items like documentation, lint fixing, and other operations as a way not only to speed up the workflow but to release some back pressure on Anthropic/claude so you don’t hit your limits as quickly… especially with the new weekly throttle coming. This $50 dollar jump seems very reasonable, now for the 1k completions a day, id really want to see and get a feel for how chatty it is.

I suppose thats how it starts but id the model is competent and fast, the speed alone might force you a bit to delegate more to it. (Maybe sub agent tasks)

pxc · 2025-08-01T22:51:55 1754088715

You can still get it pay-as-you-go on OpenRouter, afaict, and the billing section of the Cerebras Cloud account I just created has a section for Qwen3-Coder-480B as well.

sophia01 · 2025-08-01T23:18:56 1754090336

Yeah just checked apparently it is available as a preview (not on main models/pricing page).

baq · 2025-08-01T22:43:57 1754088237

define 'crazy'.

it's two kilotokens per second. that's fast.

bangaladore · 2025-08-01T23:20:35 1754090435

It's more than 10x faster than the fastest alternative. And roughly 50x the average alternative.

Certainly, somewhere between fast and crazy.

amelius · 2025-08-01T23:37:58 1754091478

It generates code faster than I can inspect it.

In other words, it's needlessly fast.

pxc · 2025-08-02T02:18:54 1754101134

You might be able to use the extra time to have it do things like run some formatters, linters, run the code in a VM before you inspect it, or modify it for compliance with a style guide that you've written, and continually revise it for up to 5 tries until the conditions are met, something like that.

So maybe there's something useful to do with the extra speed. But it does seem more "useful" for vibe coding than for writing usable/good code.

ttoinou · 2025-08-01T22:46:46 1754088406

I’d say super fast