[NeurIPS 2025] Thinkless: LLM Learns When to Think
Updated Sep 26, 2025 - Python
FastAPI workbench for text embedding (Gemma-300m with Matryoshka) and summarization (Gemma/Gemini). Features hardware acceleration, caching, and secure endpoints for local LLM integration.