That's temporary. Cerebras speeds up everything, so if Nemotron is good quality, it's just a matter of time until they add it.
[1] https://inference-docs.cerebras.ai/models/overview
Nemotron, on the other hand, is a hybrid (Transformer + Mamba-2), so it will be more challenging to compile for Cerebras/Groq chips.
(Methinks Nvidia is purposefully picking an architecture + FP4 combination that is easy to ship on Nvidia chips but harder to deploy on TPUs or Cerebras/Groq.)