Skip to content

[Roadmap] Preliminary Vision for COMB #2

@shijuzhao

Description

@shijuzhao

Roadmap

We are not content with merely publishing a paper; our goal is to drive this project into production. Here is a list of our pending tasks.

Integration

  • SGLang support (@DerekHJH is working on it)
  • LMCache support

Model Support

User Interface

  • Support chat template only and tokenize inside COMB (@shijuzhao is working on it)
  • ...

Feature

  • Batching (@shijuzhao is working on it)
  • Support chunk token and cross-attention mask
  • ...

Hardware Support

  • AMD ROCm
  • ...

Distributed

  • Encoder-decoder disaggregation
  • TP, PP

Optimization

  • Enable CUDA graph through custom operator (@shijuzhao is working on it)
  • Separated PIC process
  • ...

Metadata

Metadata

Labels

roadmapPlan or future work

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions