Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Groq inference provider #36353

Open
VladOS95-cyber opened this issue Feb 23, 2025 · 0 comments
Open

Groq inference provider #36353

VladOS95-cyber opened this issue Feb 23, 2025 · 0 comments
Labels
Feature request Request for a new feature

Comments

@VladOS95-cyber
Copy link
Contributor

Feature request

Since several inference provider APIs were added to HF. It would be beneficial to add Groq inference provider as well to expand a list of supported providers.

Motivation

Groq is a cutting-edge AI inference provider that has recently gained significant attention for its exceptional performance and speed. The company's Language Processing Unit (LPU) Inference Engine offers several key benefits and advantages:

  1. Groq's unique approach to AI inference is based on several innovative features like SRAM onlz architecture, co-located compute and memory, kernel-less compiler.
  2. Over 500,000 developers are currently using Groq's API keys.
  3. It achieved up to 18x faster output tokens throughput compared to other cloud-based inference providers in the Anyscale LLMPerf Leaderboard.
  4. Models support: Llama 3 series (8B, 70B, vision models), Mixtral 8x7B, Gemma 2 9B, Whisper Large V3 and so on.

Your contribution

I could try to dive deep into this topic and provide implementation myself.

@VladOS95-cyber VladOS95-cyber added the Feature request Request for a new feature label Feb 23, 2025
@huggingface huggingface deleted a comment from marthos1 Feb 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Feature request Request for a new feature
Projects
None yet
Development

No branches or pull requests

1 participant