A Python program to analyze data from a 1994 census database from University of California, School of Information and Computer Science.
- pandas
Provides an answer for the following questions based on the 1994 census data
- How many people of each race are represented in this dataset? This should be a Pandas series with race names as the index labels. (race column)
- What is the average age of men?
- What is the percentage of people who have a Bachelor's degree?
- What percentage of people with advanced education (Bachelors, Masters, or Doctorate) make more than 50K?
- What percentage of people without advanced education make more than 50K?
- What is the minimum number of hours a person works per week?
- What percentage of the people who work the minimum number of hours per week have a salary of more than 50K?
- What country has the highest percentage of people that earn >50K and what is that percentage?
- Identify the most popular occupation for those who earn >50K in India.
Before you begin, ensure you have met the following requirements:
- Python 3.10 or higher installed on your system. You can download Python from python.org.
- Poetry 1.6.1 installed on your system. You can install Poetry from python-poetry.org
Follow these steps to install and set up Poetry for this project:
- Install Poetry: Poetry is a Python package manager that simplifies dependency management and virtual environments. You can install Poetry by following their guide here.
- Clone the repository
git clone git@github.com:mrarvind90/fcc-demographic-data-analyzer.git
- Change into the Project Directory
cd fcc-demographic-data-analyzer
- Install Dependencies:
poetry install
- Run the Project:
poetry run python3 main.py
This project is licensed under the MIT License. See the LICENSE file for details.
We follow the black code style for this project. You can format your code using:
black .