This major update elevates the Gemini AI Toolkit to a new level of versatility and power, introducing multimodal support that allows seamless integration of text, images, audio, video, documents, code and a variety of inputs. The toolkit now offers a more robust and scalable architecture, featuring enhanced error handling, a sophisticated file management system with caching, and improved user interaction across all interfaces.
Key highlights include:
- New Multimodal class for handling diverse input types in a single request
- Support for commands, like
/exit
,/clear
and/upload
- Comprehensive file handling with caching for optimized performance
- Refined error management providing clear, actionable feedback
- Expanded CLI capabilities supporting advanced multimodal interactions
- Improved safety controls and configuration options
Full Changelog: v1.2.1...v1.3