add deepseek-ai/deepseek-r1 on Replicate #1151

zeke · 2025-01-29T06:29:59Z

This PR adds support for https://replicate.com/deepseek-ai/deepseek-r1

https://x.com/ClementDelangue/status/1884274486905827431

zeke · 2025-01-29T06:30:34Z

packages/inference/src/providers/replicate.ts

@@ -5,6 +5,9 @@ export const REPLICATE_API_BASE_URL = "https://api.replicate.com";
 type ReplicateId = string;

 export const REPLICATE_SUPPORTED_MODEL_IDS: ProviderMapping<ReplicateId> = {
+	conversational: {


I chose conversational here as the task type based on the existing Together AI code.

zeke · 2025-01-29T06:40:24Z

packages/inference/test/HfInference.spec.ts

@@ -930,6 +930,19 @@ describe.concurrent("HfInference", () => {
 				expect(res).toBeInstanceOf(Blob);
 			});

+			it("conversational unversioned", async () => {


Could use some help getting this test right.

Wauplin · 2025-01-29T09:26:04Z

Hi @zeke, it seems like Replicate does not support the chat completion API (or at least it's not documented on https://replicate.com/deepseek-ai/deepseek-r1/api/schema). By "chat completion" I mean the standard API introduced by OpenAI and now supported by many providers: https://platform.openai.com/docs/api-reference/chat/create. The main difference between this API and a classic "text-generation" API is that the user can pass a list of messages instead of a pre-formatted prompt. Users don't need to play with the chat template client side.

So in order to have support for DeepSeek R1 on Replicate via HF, you should either set it as a "text-generation" task (but that wouldn't make it appear in the Widget on https://huggingface.co/deepseek-ai/DeepSeek-R1) or support the Chat Completion API on Replicate.

(Note: "Message API", "Conversational", "Chat completion" all refer to the same API schema)

zeke · 2025-01-29T17:44:24Z

Thanks for the context @Wauplin, that's very helpful. Let me take this back to the team and see what we can do.

julien-c · 2025-01-29T18:53:30Z

if it's public info, what are you running backend-wise to power R1 inference? might have an OpenAI compatible server

add replicate deepseek-ai/deepseek-r1

6198022

zeke requested review from julien-c, hanouticelina, SBrandeis and coyotte508 as code owners January 29, 2025 06:30

zeke commented Jan 29, 2025

View reviewed changes

add a test for replicate conversational model

5e1391f

zeke commented Jan 29, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add deepseek-ai/deepseek-r1 on Replicate #1151

add deepseek-ai/deepseek-r1 on Replicate #1151

zeke commented Jan 29, 2025 •

edited

Loading

zeke Jan 29, 2025

zeke Jan 29, 2025 •

edited

Loading

Wauplin commented Jan 29, 2025

zeke commented Jan 29, 2025

julien-c commented Jan 29, 2025

add deepseek-ai/deepseek-r1 on Replicate #1151

Are you sure you want to change the base?

add deepseek-ai/deepseek-r1 on Replicate #1151

Conversation

zeke commented Jan 29, 2025 • edited Loading

zeke Jan 29, 2025

Choose a reason for hiding this comment

zeke Jan 29, 2025 • edited Loading

Choose a reason for hiding this comment

Wauplin commented Jan 29, 2025

zeke commented Jan 29, 2025

julien-c commented Jan 29, 2025

zeke commented Jan 29, 2025 •

edited

Loading

zeke Jan 29, 2025 •

edited

Loading