Svelte Hacker News logo
  • top
  • new
  • show
  • ask
  • jobs
  • about

Compiler optimizations for 5.8ms GPT-OSS-120B inference (not on GPUs)

furiosa.ai

1 points by olibaw 18 hours ago