ZoomAI

Overview

ZoomAI generates "infinite zoom" art from a user's text and image inputs. It uses ChatGPT to turn those inputs into detailed prompts and Stable Diffusion to generate and inpaint images, producing a seamless sequence that users can explore by zooming in indefinitely. The aim is to offer a unique, immersive way to experience the generated artwork.

Image Generation and Manipulation Pipeline

User Input: The pipeline accepts two types of inputs from users:

Text Input: User-provided text describing the desired image.
Image Input: User-provided image to be used as a reference.

Image to Text Model: Uses Salesforce's blip-image-captioning-large model to convert image inputs into text descriptions.
ChatGPT API: Uses OpenAI's gpt-3.5-turbo-0125 to turn the user's input into a detailed prompt for the image generation model.
Database: Stores project information and image data, including user prompts, generated prompts, and images.
Stable Diffusion Model: Uses Stability AI's stable-diffusion-2-inpainting to generate and inpaint images from the detailed prompts.
Producer and Consumer:

Producer: Handles the image generation process.
Consumer: Manipulates and processes the generated images.
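
The stages above can be pictured as a small producer/consumer loop. The following is only a sketch under stated assumptions, not the project's actual code: caption_image, expand_prompt, and generate_frame are hypothetical placeholders standing in for the BLIP, ChatGPT, and Stable Diffusion steps, the frame dictionary is an invented data shape, and the database step is left out.

```python
import queue
import threading

SENTINEL = None  # marks the end of the frame stream


def caption_image(image_path):
    """Hypothetical placeholder for the image-to-text step (BLIP in the real pipeline)."""
    return f"caption of {image_path}"


def expand_prompt(user_text, caption):
    """Hypothetical placeholder for the ChatGPT step that writes a detailed prompt."""
    parts = [p for p in (user_text, caption) if p]
    return "Detailed prompt: " + "; ".join(parts)


def generate_frame(prompt, index):
    """Hypothetical placeholder for the Stable Diffusion inpainting step."""
    return {"index": index, "prompt": prompt, "processed": False}


def producer(prompt, n_frames, frames):
    """Generate frames one at a time and hand them to the consumer."""
    for i in range(n_frames):
        frames.put(generate_frame(prompt, i))
    frames.put(SENTINEL)


def consumer(frames, results):
    """Post-process frames as they arrive, until the sentinel is seen."""
    while True:
        frame = frames.get()
        if frame is SENTINEL:
            break
        frame["processed"] = True
        results.append(frame)


def run_pipeline(user_text, image_path=None, n_frames=3):
    """Build the prompt, then run one producer and one consumer thread."""
    caption = caption_image(image_path) if image_path else ""
    prompt = expand_prompt(user_text, caption)
    frames = queue.Queue(maxsize=2)  # bounded, so the producer cannot run far ahead
    results = []
    workers = [
        threading.Thread(target=producer, args=(prompt, n_frames, frames)),
        threading.Thread(target=consumer, args=(frames, results)),
    ]
    for worker in workers:
        worker.start()
    for worker in workers:
        worker.join()
    return results
```

Calling run_pipeline("a forest at dawn", image_path="ref.png") returns the processed frames in generation order. The bounded queue is a design choice in this sketch: it keeps slow post-processing from letting generated images pile up in memory.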

Setup

git clone https://github.com/TailUFPB/ZoomAI
cd ZoomAI

pip install -r requirements.txt

Running Instructions

Important Notes

GPU Requirement: This project requires a GPU to run.
Environment Configuration: Update common/environment.js in the frontend so that it points to the address where the backend will be running.
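
Before starting the backend, it can help to confirm that a GPU is visible. The check below is a minimal sketch that assumes an NVIDIA GPU and a PyTorch-based backend, neither of which this README states explicitly. gpu_available is a hypothetical helper, not part of the project.

```python
import importlib.util
import shutil


def gpu_available():
    """Return True if a CUDA-capable GPU appears to be usable."""
    if importlib.util.find_spec("torch") is not None:
        import torch  # imported lazily so the script still runs without PyTorch

        return bool(torch.cuda.is_available())
    # Fallback when PyTorch is not installed: the presence of the NVIDIA
    # driver tool is only a hint, not proof that a GPU can be used.
    return shutil.which("nvidia-smi") is not None


if __name__ == "__main__":
    print("GPU detected" if gpu_available() else "No GPU detected")
```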

Backend

In the project's root directory, run:

python3 backend/api.py

Frontend

Navigate to the frontend directory and install frontend dependencies:

cd frontend
npm install

Start the frontend:

npm start
