Desktop app for analyzing images from autonomous insect monitoring stations using deep learning models
- Requires Python 3.10. Use Anaconda (or miniconda) if you need to maintain multiple versions of Python or are unfamiliar with using Python and scientific packages, it is especially helpful on Windows. PyEnv is also a popular tool for managing multiple versions of python if you are familiar with the command line.
Install (or upgrade) the package with the following command
pip install https://github.com/RolnickLab/ami-data-companion/archive/main.zip
Optionally test the installation with the following command
ami test pipeline
Create an environment just for AMI and the data companion using conda (or virtualenv)
conda create -n ami python=3.10 anaconda
Clone the repository using the command line or the GitHub desktop app.
git clone git@github.com:RolnickLab/ami-data-companion.git
Install as an editable package. This will install the dependencies and install the trapdata
console command
cd ami-data-companion
pip install -e .
Test the whole backend pipeline without the GUI using this command
python trapdata/tests/test_pipeline.py
# or
ami test pipeline
Run all other tests with:
ami test all
-
Make a directory of sample images to test & learn the whole workflow more quickly.
-
Launch the app by opening a terminal and then typing the command
ami gui
. You may need to activate your Python 3.10 environment first (conda activate ami
). -
When the app GUI window opens, it will prompt you to select the root directory with your trapdata. Choose the directory with your sample images.
-
The first time you process an image the app will download all of the ML models needed, which can take some time. The status is only visible in the console!
-
Important: Look at the text in the console/terminal/shell to see the status of the application. The GUI may appear to hang or be stuck when scanning or processing a larger number of images, but it is not. For the time being, most feedback will only appear in the terminal.
-
All progress and intermediate results are saved to a local database, so if you close the program or it crashes, the status will not be lost and you can pick up where it left off.
-
The cropped images, reports, cached models & local database are stored in the "user data" directory which can be changed in the Settings panel. By default, the user data directory is in one of the locations below, You
macOS:
/Library/Application Support/trapdata/
Linux:
~/.config/trapdata
Windows:
%AppData%/trapdata
A short video of the application in use can be seen here: https://www.youtube.com/watch?v=DCPkxM_PvdQ
Configure models and the image_base_path for the deployment images you want to process, then see the example workflow below. Help can be viewed for any of the subcommands with ami export --help
.
There are two ways to configure settings
- Using the graphic interface:
- Run
ami gui
and click Settings. This will write settings to the filetrapdata.ini
- Run
- Using environment variables
- Copy
.env.example
to.env
and edit the values, or - Export the env variables to your shell environment
- Copy
The CLI will read settings from either source, but will prioritize environment variables. The GUI only reads from trapdata.ini
.
ami --help
ami test pipeline
ami show settings
ami import --no-queue
ami show sessions
ami queue sample --sample-size 10
ami queue status
ami run
ami show occurrences
ami queue all
ami run
ami queue status --watch # Run in a 2nd shell or on another server connected to the same DB
ami show occurrences
ami export occurrences --format json --outfile denmark_sample.json --collect-images
By default both the GUI and CLI will automatically create a local sqlite database by default. It is recommended to use a PostgreSQL database to increase performance for large datasets and to process data from multiple server nodes.
You can test using PostgreSQL using Docker:
docker run -d -i --name ami-db -p 5432:5432 -e POSTGRES_HOST_AUTH_METHOD=trust -e POSTGRES_DB=ami postgres:14
docker logs ami-db --tail 100
Change the database connection string in the GUI Settings to postgresql://postgres@localhost:5432/ami
(or set it in the environment settings if only using the CLI)
Stop and remove the database container:
docker stop ami-db && docker remove ami-db
A script is available in the repo source to run the commands above.
./scrips/start_db_container.sh
-
Create a new inference class in
trapdata/ml/models/classification.py
ortrapdata/ml/models/localization.py
. All models inherit fromInferenceBaseClass
, but there are more specific classes for classification and localization and different architectures. Choose the appropriate class to inherit from. It's best to copy an existing inference class that is similar to the new model you are adding. -
Upload your model weights and category map to a cloud storage service and make sure the file is publicly accessible via a URL. The weights will be downloaded the first time the model is run. Alternatively, you can manually add the model weights to the configured
USER_DATA_PATH
directory under the subdirUSER_DATA_PATH/models/
(on macOS this is~/Library/Application Support/trapdata/models
). However the model will not be available to other users unless they also manually add the model weights. The category map json file is simply a dict of species names and their indexes in your model's last layer. See the existing category maps for examples. -
Select your model in the GUI settings or set the
SPECIES_CLASSIFICATION_MODEL
setting. If the model inherits fromSpeciesClassifier
class, it will automatically become one of the valid choices.
Remove the index of images, all detections and classifications by removing the database file. This will not remove the images themselves, only the metadata about them. The database is located in the user data directory.
On macOS:
rm ~/Library/Application\ Support/trapdata/trapdata.db
On Linux:
rm ~/.config/trapdata/trapdata.db
On Windows:
del %AppData%\trapdata\trapdata.db
The model inference pipeline can be run as a web API using FastAPI. This is what the Antenna platform uses to process images.
To run the API, use the following command:
ami api
View the interactive API docs at http://localhost:2000/
A simple web UI is also available to test the inference pipeline. This is a quick way to test models on a remote server via a web browser.
ami gradio
Use ngrok to temporarily expose localhost to the internet:
ngrok http 7861