Welcoming the hfendpoint-images - we are on a mission to build all high performance and scalable inference runners under one roof.

Starting today you can run:

  1. Any Whisper model via vLLM
  2. Any LLM via SGLang

What would you like to see next? Popover to the discussion here