chat-extract

Extract data from a screen recording of a chat conversation.

Chat-extract uses the OpenAI API to extract chat messages from a screen recording. It splits the video into frames, then passes those frames directly to a vision-enabled LLM to extract the chat messages. The extracted messages are then saved to a CSV file.

Because it uses a vision-enabled LLM, it can extract messages from any chat application, including WhatsApp, Telegram, and Discord. The downside of this is that it can only extract what is visible on the screen, so certain messaging platforms may not display things like dates or timestamps for every message.

It turns an video like this:

Into a CSV file like this:

sender	message	timestamp
Connor Smith	"Hey, where should we go for lunch today in SoHo?"
	"How about that new place, The Green Stove?"
Connor Smith	I heard it’s good! But I’m in the mood for something different. Any other suggestions?
	"What about Fable & Fork? It’s got great reviews."
Connor Smith	"Ooh, that sounds nice. What kind of food do they have?"
	"Mostly farm-to-table, a lot of seasonal options."
Connor Smith	"That’s perfect. I’m craving something fresh. Is it far from here?"
	"No, just a few blocks away! We could walk there in about 10 minutes."
Connor Smith	"Alright, but I just remembered, I’m kind of in the mood for sushi."
	"In that case, let’s hit up Sushi Sora."
Connor Smith	"I’ve been meaning to try that place!"
	"How about that new place, The Green Stove?"

Installation

Since this project is not published on PyPI, you must install it from GitHub.

Install via pipx from GitHub:

pipx install git+https://github.com/cmhac/chat-extract.git

Note: After installation, make sure the pipx binary directory is in your PATH. Run pipx ensurepath if needed and restart your terminal.

Alternative: Install from local source

If you want to install from a local clone:

# Clone the repository first
git clone https://github.com/cmhac/chat-extract.git
cd chat-extract

# Install via pipx from local source
pipx install .

Setting up OpenAI API Key

Set your OpenAI API key (required for the tool to run):

Temporary for current session:

export OPENAI_API_KEY="your_api_key_here"

Permanent (add to your shell's config file like ~/.bashrc or ~/.zshrc):

echo 'export OPENAI_API_KEY="your_api_key_here"' >> ~/.bashrc
source ~/.bashrc

You can also put it in a .env file in the folder you run the tool from:

OPENAI_API_KEY=your_api_key_here

Usage

Basic syntax:

chat-extract VIDEO_PATH --output-path OUTPUT.csv [--n FRAMESKIP]

Example with the included demo GIF:

chat-extract "docs/screen-recording-example.gif" --output-path "output.csv"

For help with all available options:

chat-extract --help

Limitations

Because this tool uses a vision-enabled LLM, it can only extract what is visible on the screen. This means that certain messaging platforms may not display things like dates or timestamps for every message. However, it does mean that it can extract messages from any chat application, including WhatsApp, Telegram, and Discord.

This tool is not perfect. It may misinterpret messages or fail to extract them altogether. It is recommended to review the extracted messages for accuracy and completeness.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github		.github
.vscode		.vscode
chat_extract		chat_extract
docs		docs
scripts		scripts
tests		tests
.coveragerc		.coveragerc
.gitignore		.gitignore
.pylintrc		.pylintrc
LICENSE		LICENSE
README.md		README.md
justfile		justfile
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

chat-extract

Installation

Install via pipx from GitHub:

Alternative: Install from local source

Setting up OpenAI API Key

Usage

Limitations

About

Uh oh!

Releases

Uh oh!

Contributors 3

Languages

License

cmhac/chat-extract

Folders and files

Latest commit

History

Repository files navigation

chat-extract

Installation

Install via pipx from GitHub:

Alternative: Install from local source

Setting up OpenAI API Key

Usage

Limitations

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Uh oh!

Contributors 3

Languages