Large Language Model, Code Generation, Reward Model, Table Processing.
- Renmin University of China
- Beijing
Pinned
- RUCKBReasoning/CodeRM (Public): The code of the arXiv paper "Dynamic Scaling of Unit Tests for Code Reward Modeling"
Python 14
- RUCKBReasoning/SpreadsheetBench (Public): SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation
- RUCKBReasoning/TableLLM (Public): TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios
- RUCKBReasoning/FFAEval (Public): FFAEVAL: Evaluating Dialogue System via Free-For-All Ranking
Vue