- General description
- Instructions prior to loading the analysis notebook
- Used Technology
- Project's cause and purpose
- Teaser Points of Interest
- The Dataset's Description
A project analyzing the rapid growth, development, global spread and other ongoing activities of solar energy farms on recent decades around the globe.
Feel free to check the project's landing page created by my partner: Reference>>
Before reading the analysis, make sure to download the dataset, in order for the code to work you should paste the local path to the file & make sure to keep its original name.
The Jupyter NB would require you (i.e. the reader) to install some non-builtin libraries that were used in the project.
However, notice they will only install them locally in the specific notebook you'll open, and will not affect your general used environment.
Notice the interactive plots would not be available for sight in the preview mode of the Jupyter NB from the github repo.
Python Version: 3.9.12
Packages: pandas (1.4.2), numpy, scipy, matplotlib, seaborn, plotly, missingno, geopy, itertools, translators, loguru, pathos, re, requests
When residential solar energy equipment was first made available to homeowners it was costly. The people who made the switch were usually those who wanted to make a conscious effort to reduce their carbon footprints. But they also had the means to invest in the equipment and services needed to make that switch. In the beginning, like many other new technologies, initial adopters often paid more until the technologies become more mainstream.
Yes, solar energy is increasingly becoming a viable fuel source for everyone. Today, switching to solar energy is far more affordable with the help of programs like the Investment Tax Credit (ITC). This incentive provides a tax credit of 26 percent of the cost to install solar power at your home. While this credit is in effect at 26% until the end of 2022, once it expires, solar energy will still remain a low-priced source of power as prices will most likely continue to drop. Meanwhile, oil, gas, and coal prices are likely to continue to increase, especially as sources are depleted and the costs to obtain these fuels grow.
I've explored the alternate solution to the energy problem, which is the solar market. I've researched it's growth and it's spread globally in the past years, to see how the world is running into the inevitable shifting to using solar energy as the planet's main resource.
Below are some teasers from the analysis:
Used some heavy tools like the missingno
library to explore the missing-values columns' possible connections and handling options across the 15 out of the 27 columns of the data which contained missing values.
A nullity correlation dendrogram; the closer the connected columns' tail, the stronger the appearance correlation:
Created a selective reverse geomapping function which is able to execute reverse-mapping to lat-lon
formatted coordinates according to a given query parameter to find accurate addresses, with a punctuality that moves deeply from mapping the coordinnates continent, country and state, up to the suburb and street name.
Values are available to be received both in the English language (The default), in the local language of a location, or at any other defined language available (internal).
On that project, I used that function to fill 4 different location columns, all missing-values-inclusive, ranging from 7% up to 47% missing values of each column.
Visualized a fully interactive heatmap plot showcasing the capacity of each declared solar plant on Earth using the coordinates of each solar project on the data. The heatmaplot can be used in order to identify countries and areas that invest deeper on solar energy than others, and recognize trends that aren't visible on the non-plotted analyses parts of the report.
The analyzed "Global Solar Power Tracker" data contains 27 columns describing different aspects of solar energy projects that their rows are grouped by the hosting country of each (Alphabetically).
Each of the rows represents a single solar project's (at a certain developement status stage) unique characteristics information, the main ones are:
- Project name.
- Energy capacity information.
- Developement status stage.
- The start year the dev. status stage began, and also retired year, if available.
- Coordinates (Latitude-Longitude) and location columns ranging from continental region to city of the solar project.
- Ownership / Operator names.
The dataset also contains local language names for non-English native countries for a few of the above aspects and others, as well as a Wikipedia page link for each project.