OpenAI API client library for Rust (unofficial)
-
Updated
Mar 14, 2025 - Rust
OpenAI API client library for Rust (unofficial)
A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private conversations, form alliances, and vote to eliminate each other
This benchmark tests how well LLMs incorporate a set of 10 mandatory story elements (characters, objects, core concepts, attributes, motivations, etc.) in a short creative story
Benchmark that evaluates LLMs using 601 NYT Connections puzzles extended with extra trick words
Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLMs to engage in public conversation before secretly picking a move (1, 3, or 5 steps). Whenever two or more players choose the same number, all colliding players fail to advance.
Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a small set of examples and anti-examples, then detect which item truly fits that theme among a collection of misleading candidates.
最新ChatGPT Plus合租攻略:国内最靠谱的ChatGPT Plus拼车平台推荐(每月仅27元)!可使用ChatGPT-4o图片生成功能,还支持满血版DeepSeek-R1、马斯克的Grok-3和谷歌Gemini-2 Pro!如果你无法解决科学上网的问题,或觉得每月20美元的会员费用过高,可以考虑ChatGPT Plus共享合租帐号。这种方式不仅能够降低使用成本,还免去了科学上网的复杂操作。
Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies among Large Language Models (LLMs) in a resource-sharing economic scenario. Our experiment extends the classic PGG with a punishment phase, allowing players to penalize free-riders or retaliate against others.
The GenAI API wrapper for Delphi is designed to integrate OpenAI’s latest models (GPT-4o, O1, O3 and GPT-4.5) seamlessly, offering robust features for chat interactions, text generation, vision processing, audio analysis, JSON configuration, Web search and asynchronous operations with efficient error handling and testing support.
LLM-driven software development helper.
Add a description, image, and links to the gpt-4-5 topic page so that developers can more easily learn about it.
To associate your repository with the gpt-4-5 topic, visit your repo's landing page and select "manage topics."