Looma AI

Setup

The Docker containers for Looma-II named "loomaweb" and "loomadb" must be running when using "loomaai". Clone Looma-II and follow the setup instructions in README for Looma-II repository
Ensure the Looma-II docker-compose is running
Clone this "loomaai" repo to your computer git clone https://github.com/looma/loomaai
Obtain an OpenAI API key and add it to a new file in this directory called .env with the following contents:

export OPENAI_API_KEY=[your-api-key-here]

Run make setup-host (start python env, import openai key, load py requirements)
Run make build (build the streamlit image) - this could take a few minutes
Run make run (start qdrant and streamlit containers)
Navigate to http://localhost:47000/loomaai to access the dashboard
Create the loomaai/data/files/textbooks folder within this folder, if it does not already exist

User Guide

Chapter Splitting

In Streamlit
- In Streamlit, click Textbook in the sidebar and select the textbooks you want.
- Click the "Split Into Chapters" button.
- A file location will be shown on-screen. That location is synced to the host machine, so the chapters will also be in data/files/textbooks within the loomaai folder.

Chapter Summaries

In Streamlit
- Navigate to "Chapter" in the sidebar
- Select the chapter to be summarized from your file explorer.
- Make sure the language selected in the options is the language the chapter is in.
- Click the Summarize button.

Other chapter functions available in StreamLit

summary - creates a file 'ch_id.summary' in the data/files folders for selected chapters
quiz - creates a file 'ch_id.quiz' in the data/files folders for selected chapters
custom prompt -enter a prompt and a file extension [e.g. 'outline'] to create 'ch_id.extension' files based on the prompt
dictionary - scan selected chapters, extract all [english] words and add them to the dictionary if not present

Translating Lessons

In StreamLit
- Lessons are displayed, marked with "AI" and date translated if translated
- Select lessons to be translated
- scans the "data" field of each lesson
- extracts the "html" fields of all "inline" text elements, translates them to Nepali, and inserts a "nepali" field next to the "html" field

Developer Guide

YOU MUST USE PYTHON 3.12, NOT PYTHON 3.13

This is because pytorch does not support python3.13

Run make setup-host to Set up your virtual environment. It does:

 python3.12 -m venv env
 source env/bin/activate
 pip3 install -r requirements.txt

Features

Embed All Activities

This requires docker-compose to be running (see "Setup"). Also you must run the following first BEFORE embedding:

sudo make split
sudo make video-captions

sudo may be required because of a permissions quirk with data being mounted as a docker volume.

make embed-all

This process will generate embeddings for all activities in the MongoDB activities collection and add the vectors to the Qdrant activities collection.
IMPORTANT: This process will DELETE all existing entries from qdrant and rebuild the entire vector database.
The Looma-II semantic search feature requires these embeddings to be generated first and the docker-compose to be running

make embed-missing

This will prevent the program from deleting existing entries. It will check each entry in mongodb and only create embeddings for the activities that are not in qdrant.
Before running this, you have to first run make embed-all without the flag at least once to initialize the qdrant collection (or create the collection some other way, for example through the qdrant dashboard http://localhost:46333/dashboard) .

Populate Chapters with Related Resources

Follow the steps in "Run Containers" and "Embed All Activities" first

make assign-chapters-resources

This process will populate the "related resources" for every chapter in Looma-II
This process is additive and will not overwrite any existing related resources in MongoDB, it will also not add duplicate relations

Translate Lessons [also available in Streamlit interface (above)]

Requires an OpenAI key (see step 4 of Setup)

make translate-lessons

This process will update all lessons in MongoDB with a new field data_np containing translated lesson data. It will overwrite the existing data_np field if present.

Generate Video Captions

Requires ffmpeg to be in PATH

This process will iterate through MongoDB "activities" collection and filter for "ft" == "video". It will download each video file from the remote looma server, transcribe the video, then save a file in data/content/video_captions/en/{fp}{fn} Note that the "../" prefix will be removed from fp, and the fn extension will be changed to vtt. These generated captions must be manually uploaded to looma website.

If a caption file is already on disk, the program will skip that video. To force a re-captioning, delete the caption file from disk.

make video-captions

Translate Video Captions

This process will iterate through all vtt files in the data/content/video_captions/en folder (and its subfolders, recursively), translate the caption track, and save it in the location found by replacing en/ with np/ in the path. These generated captions must be manually uploaded to looma website.

If a translation is already on disk, the program will skip that translation. To force a re-translation, delete the nepali vtt file in np/.

make translate-captions

More Developer Notes

When importing a common library from cli, use a relative import: from ..common.generate import generate_vectors

To run a script in CLI, run it like this from the root directory: python3 -m appai.cli.generate
- The -m flag is important, do not use the filename
When importing a common library from pages: from common.query_faiss import query

To get a terminal session in the container

% make shell
% docker exec -ti looma-streamlit /bin/bash

Now you are in the terminal in the container

If you'd like to see the logs of the running container

% make logs

Name		Name	Last commit message	Last commit date
Latest commit History 358 Commits
appai		appai
dotstreamlit		dotstreamlit
npttf2utf		npttf2utf
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
Makefile		Makefile
README.md		README.md
README_LLM.md		README_LLM.md
bootup.sh		bootup.sh
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Looma AI

Setup

User Guide

Chapter Splitting

Chapter Summaries

Other chapter functions available in StreamLit

Translating Lessons

Developer Guide

YOU MUST USE PYTHON 3.12, NOT PYTHON 3.13

Features

Embed All Activities

Populate Chapters with Related Resources

Translate Lessons [also available in Streamlit interface (above)]

Generate Video Captions

Translate Video Captions

More Developer Notes

To get a terminal session in the container

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 8

Uh oh!

Languages

looma/loomaai

Folders and files

Latest commit

History

Repository files navigation

Looma AI

Setup

User Guide

Chapter Splitting

Chapter Summaries

Other chapter functions available in StreamLit

Translating Lessons

Developer Guide

YOU MUST USE PYTHON 3.12, NOT PYTHON 3.13

Features

Embed All Activities

Populate Chapters with Related Resources

Translate Lessons [also available in Streamlit interface (above)]

Generate Video Captions

Translate Video Captions

More Developer Notes

To get a terminal session in the container

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 8

Uh oh!

Languages

Packages