OpenDCAI

Define the future of Data-centric AI together

We are dedicated to advancing research and open-source tools in Data-Centric Artificial Intelligence (DCAI).

Our goal is to develop effective and efficient DCAI systems and algorithms that support and enhance the performance of AI models and applications.

Newly Released Works

🔥 2025/6/29: Our DCAI system DataFlow has been released!

Pinned

  1. DataFlow Public

    Easy Data Preparation with the latest LLM-based Operators and Pipelines.

    Python · 1.2k stars · 79 forks

  2. MyScaleDB Public

    Forked from OriginHubAI/MyScaleDB

    AI Database for unified, scalable SQL + vector data management, search and analytics

    C++ · 37 stars

Repositories

Showing 10 of 14 repositories
  • DataFlow-Doc Public

    Documentation for DataFlow, a data-centric AI system for LLMs.

    Python · 8 stars · 21 forks · 4 issues · 1 pull request · Updated Sep 5, 2025
  • DataFlow-MM Public

    Development repository for the multimedia version of DataFlow.

    Python · 4 stars · Apache-2.0 · 8 forks · 1 issue · 0 pull requests · Updated Sep 2, 2025
  • DataFlow Public

    Easy Data Preparation with the latest LLM-based Operators and Pipelines.

    Python · 1,189 stars · Apache-2.0 · 79 forks · 9 issues · 3 pull requests · Updated Sep 2, 2025
  • SciReasoner Public
    Python · 1 star · GPL-3.0 · 0 forks · 0 issues · 0 pull requests · Updated Aug 26, 2025
  • vts-v Public
    Python · 10 stars · 0 forks · 0 issues · 0 pull requests · Updated Aug 11, 2025
  • MyScaleDB Public Forked from OriginHubAI/MyScaleDB

    AI Database for unified, scalable SQL + vector data management, search and analytics

    C++ · 37 stars · Apache-2.0 · 30 forks · 0 issues · 0 pull requests · Updated Jul 19, 2025
  • Dataflow-backend
    Python · 0 stars · 1 fork · 0 issues · 0 pull requests · Updated Jul 16, 2025
  • LLaMA-Flex Public Forked from hiyouga/LLaMA-Factory

    Python · 0 stars · Apache-2.0 · 7,490 forks · 0 issues · 0 pull requests · Updated Jul 16, 2025
  • Awesome_MLLMs_Reasoning
    104 stars · 4 forks · 0 issues · 1 pull request · Updated Jul 7, 2025
  • DataFlow-Preview Public

    Easy Data Processing, Generation, and Evaluation with the latest LLM-based Operators and Pipelines.

    Python · 3 stars · Apache-2.0 · 12 forks · 9 issues · 0 pull requests · Updated Jun 29, 2025

Top languages

Python, C++
