(What is this project about?)
Trail is an open-source user analytics and event tracking platform for your sites. With Trail, you can get insights to into the userbase of your website - info on the most visited pages, info on the demographics of the users etc.
(Instructions to run locally)
- Clone the repo
- Run the MySQL, ClickHouse and Redis
If you use docker, then do$ docker-compose up clickhouse-server mysql redis
If you're are not using docker-compose, then you'll have to setup the DBs. The scripts are indatabase_scripts/
- Install the dependencies and run the server
$ virtualenv trail-venv --python=python3
$ source trail-venv/bin/activate
$ pip install -r requirements.txt
$ flask run
You can configure ports and authentication in config.py
, or by setting them in the enviroment or .flaskenv
(What did you use to build this?)
The current version is built using Flask and uses MySQL and ClickHouse for database.
Redis is used for caching.
(Some thoughts on the development and working)
Why ClickHouse? Trail was originally written completely with MySQL and then later migrated to ClickHouse.
Before looking into the database chosen, consider the nature of the data to be stored by the application
Of these, the second one is almost entirely immutable data - once entered, it cannot change, but we'll constantly be doing queries on it.
ClickHouse is a DBMS system specifically optimised and designed for exactly these type of data - immutable, but requires frequent analysis. It is an OLAP system as opposed to OLTP system (like MySQL)
So, since ClickHouse is great for analytics (and it was something new to learn), ClickHouse is used for the logs and the others are retained in MySQL
How everything fits together?
The diagram above depicts the flow of the program.
Once a user is setup and has configured their site with Trail, the site starts generating logs and it will be updated in realtime in the dashboard.
The overall flow is relatively simple. Whenever a visitor visits the site setup with Trail, it sends a events through the API, which is processed and validated by the Flask server and logs the entry into ClickHouse.
The site data gets updated in realtime with the help of Clickhouse's fast on-the-fly reporting capabilities
(Why did we work on this project?)
For fun and to learn! (and we had to submit a project for school 😅 )
You can checkout the demo hosted here
Username: test
Password: test
- Re-organise code
- Abstract the database interactions - consider using an ORM/query builder
- Expand the documentation
- Thanks to the Argon project, based on which the frontend is built.
If you have anything to say about Trail, please feel free to reach out. Contributions are always welcome, it would be great to have you contributing to this project. Please feel free to open an issue if you have something in mind.