Gumbo (GUM)

General User Models (GUM) learn about you by observing your interactions with your computer. Gumbo uses this architecture to infer new propositions about a user from multimodal observations, retrieve related context, and continuously revise its understanding.

Features

Multimodal Learning: Captures and processes text and visual data (screenshots) to understand user context.
Cross-Platform: Built with Python, supports macOS (primary) and other platforms.
Privacy-First: Designed with user privacy in mind (requires user-provided API keys).
Unified AI Client: Seamlessly switches between Text (Azure/OpenAI) and Vision (OpenRouter) providers.

Installation

Prerequisites

Python 3.8+
Tesseract OCR (required for some OCR features)

Setup

Clone the repository:

git clone https://github.com/ArnavS-22/gumboapp.git
cd gumboapp

Install dependencies:

pip install -r requirements.txt
# OR
pip install .

Configuration: The application requires API keys for AI services. It will prompt you for these on first run, or you can set them as environment variables:
- OPENAI_API_KEY: For text processing
- OPENROUTER_API_KEY: For vision/multimodal processing (optional)
- AZURE_OPENAI_API_KEY & AZURE_OPENAI_ENDPOINT: For Azure OpenAI (optional)

Usage

To start the application:

python start_gum.py

Or if installed as a package:

gum

Contributing

We welcome contributions! Please see CONTRIBUTING.md for details on how to submit pull requests, report issues, and our code of conduct.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Authors

Omar Shaikh
Arnav Sharma

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
.github		.github
docs		docs
frontend		frontend
gum		gum
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
FRONTEND_INTEGRATION_GUIDE.md		FRONTEND_INTEGRATION_GUIDE.md
LICENSE		LICENSE
README.md		README.md
azure_text_client.py		azure_text_client.py
controller.py		controller.py
env.template		env.template
mkdocs.yml		mkdocs.yml
openai_text_client.py		openai_text_client.py
openrouter_vision_client.py		openrouter_vision_client.py
pyproject.toml		pyproject.toml
pyrightconfig.json		pyrightconfig.json
rate_limiter.py		rate_limiter.py
requirements.txt		requirements.txt
setup.bat		setup.bat
setup.py		setup.py
setup_wizard.py		setup_wizard.py
skypilot-tmp.yaml		skypilot-tmp.yaml
start_gum.bat		start_gum.bat
start_gum.py		start_gum.py
unified_ai_client.py		unified_ai_client.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Gumbo (GUM)

Features

Installation

Prerequisites

Setup

Usage

Contributing

License

Authors

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

ArnavS-22/GUMBO

Folders and files

Latest commit

History

Repository files navigation

Gumbo (GUM)

Features

Installation

Prerequisites

Setup

Usage

Contributing

License

Authors

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages