The main objective of this project was to determine which cryptocurrencies were trading on the market and to use unsupervised machine learning to see whether they form clusters.
K-means grouped the cryptocurrencies into four clusters. In the dashboard below, the final page helps potential investors who are interested in choosing the right mix of cryptocurrencies for their investment portfolio, based on Class, Proof Type and Algorithm.
Click the image above to view a video of the dashboard.
Power BI users can view the dashboard.
- Perform dimensionality reduction using Principal Component Analysis (PCA) and clustering using K-means
- Present my findings
- Python
- scikit-learn
- Pandas
- hvPlot
- Remove all cryptocurrencies that were not trading
- Remove all cryptocurrencies that had no defined Algorithm
- Remove the IsTrading column
- Remove all cryptocurrencies with at least one null value
- Remove all cryptocurrencies without coins mined
- Create a separate dataframe with all the names of the cryptocurrencies
- Remove the CoinName column in the original crypto_df dataframe
- Create dummy variables for the Algorithm and ProofType columns
- Standardize the dataframe
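
The preprocessing steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual notebook: the sample rows are made up, and the column names `TotalCoinsMined` and `TotalCoinSupply` are assumptions (only `CoinName`, `Algorithm`, `IsTrading` and `ProofType` are named in this README).

```python
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Made-up sample standing in for the real dataset (illustrative values only).
crypto_df = pd.DataFrame(
    {
        "CoinName": ["AlphaCoin", "BetaCoin", "GammaCoin", "DeltaCoin",
                     "EpsilonCoin", "ZetaCoin", "EtaCoin"],
        "Algorithm": ["SHA-256", "Scrypt", None, "X11", "Scrypt", "X11", "X11"],
        "IsTrading": [True, True, True, False, True, True, True],
        "ProofType": ["PoW", "PoS", "PoW", "PoW", "PoW/PoS", "PoW", "PoW"],
        "TotalCoinsMined": [1000.0, 500.0, 200.0, 300.0, 0.0, 750.0, 250.0],
        "TotalCoinSupply": [2000.0, 1000.0, 400.0, 600.0, 100.0, None, 900.0],
    }
)

# Keep only coins that are trading and have a defined algorithm.
crypto_df = crypto_df[crypto_df["IsTrading"] & crypto_df["Algorithm"].notna()]

# Drop the IsTrading column, then any row with at least one null value.
crypto_df = crypto_df.drop(columns="IsTrading").dropna()

# Keep only coins that have actually been mined.
crypto_df = crypto_df[crypto_df["TotalCoinsMined"] > 0]

# Store the coin names separately, then remove them from the feature table.
coin_names = crypto_df[["CoinName"]].copy()
crypto_df = crypto_df.drop(columns="CoinName")

# One-hot encode the categorical columns and standardize every feature.
X = pd.get_dummies(crypto_df, columns=["Algorithm", "ProofType"], dtype=float)
X_scaled = StandardScaler().fit_transform(X)
```

Keeping `coin_names` on the same index as the feature table makes it easy to join the names back on after clustering.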
PCA reduced the standardized data to three principal components.
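
A reduction to three components with scikit-learn might look like the sketch below. The input is random data standing in for the standardized features, and the column labels `PC 1` to `PC 3` are hypothetical names chosen for illustration.

```python
import numpy as np
import pandas as pd
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Random placeholder data: 50 samples with 10 standardized features.
rng = np.random.default_rng(0)
X_scaled = StandardScaler().fit_transform(rng.normal(size=(50, 10)))

# Project the features onto the first three principal components.
pca = PCA(n_components=3, random_state=0)
components = pca.fit_transform(X_scaled)
pcs_df = pd.DataFrame(components, columns=["PC 1", "PC 2", "PC 3"])

# Fraction of the total variance retained by the three components.
explained = pca.explained_variance_ratio_.sum()
print(f"Variance explained by 3 components: {explained:.2%}")
```

Checking `explained_variance_ratio_` is a quick way to see how much information is lost by keeping only three components.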
K=4 was chosen as the best number of clusters.
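
One common way to arrive at a value such as K=4 is to compare the K-means inertia across candidate values of K and look for the "elbow" where it stops dropping sharply. The sketch below assumes that approach, since this README does not say how K was selected, and it uses synthetic blobs rather than the project's data.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Synthetic 3-feature data with four well-separated groups (placeholder for the PCA output).
X, _ = make_blobs(n_samples=200, centers=4, n_features=3, random_state=0)

# Fit K-means for K = 1..10 and record the inertia (within-cluster sum of squares).
inertia = []
for k in range(1, 11):
    model = KMeans(n_clusters=k, n_init=10, random_state=0)
    model.fit(X)
    inertia.append(model.inertia_)

# Fit the final model with K=4 and assign each sample a cluster label.
final_model = KMeans(n_clusters=4, n_init=10, random_state=0)
labels = final_model.fit_predict(X)
```

Plotting `inertia` against K (for example with hvPlot) gives the elbow curve; the cluster `labels` can then be stored as a `Class` column.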
3D plot with parameters hover_name="CoinName" and hover_data=["Algorithm"]