This repository contains projects completed as part of technical assessments for job applications. Each project showcases different data analysis skills using R and R Markdown, including data manipulation, statistical analysis, and data visualisation.
Projects Included:
- Compute Average Monthly Wage by Firm Size
- Data Quality and Staff Dashboard Visualisation
- US States Grouping and Assault Rate Prediction
Folder: Calc-Avg-Mthly-Wage
Overview:
This project demonstrates how to compute the average monthly wage paid by companies of different sizes using R. The analysis uses simulated data based on hypothetical company and employee wage tables. The code simulates data loading, defines firm sizes, merges datasets, and computes the average monthly wage for each firm size category.
Objectives:
- Simulate the computation of average monthly wages based on firm size (small, medium, large).
- Simulate merging two datasets (companies and employees) for analysis.
- Provide a clean and reproducible workflow using R Markdown.
Instructions:
- Navigate to the Calc-Avg-Mthly-Wage folder.
- Follow the instructions in the project's README.
Folder: Data-Quality_Staff-Dashboard
Overview:
This project focuses on organizing and cleaning access data from Company ABC and developing an interactive dashboard to monitor staff attendance patterns. The dashboard provides insights into staff working from home, office arrival time slots, and visitor access trends, supporting informed decision-making.
Objectives:
- Clean and process raw data from access logs to ensure data quality.
- Develop a scalable data ingestion solution for future datasets.
- Create a dashboard that provides insights into staff attendance and visitor trends.
Instructions:
- Navigate to the Data-Quality_Staff-Dashboard folder.
- Follow the instructions in the project's README.
Folder: States-Grouping_Assault-Prediction
Overview:
This project analyses socio-economic and crime data to group US states based on their characteristics and identify factors that significantly predict assault rates. The analysis includes clustering and regression techniques to understand socio-economic patterns and crime trends across the US.
Objectives:
- Group US states based on socio-economic and crime characteristics using clustering analysis.
- Identify significant predictors of assault rates through regression models.
- Provide data-driven insights into socio-economic disparities and crime rates.
Instructions:
- Navigate to the States-Grouping_Assault-Prediction folder.
- Follow the instructions in the project's README.
Prerequisites:
- R (version 4.0 or above)
- RStudio
General Dependencies:
Each project utilizes various R packages for data analysis and visualization.
Install packages using:
install.packages(c("tidyverse", "lubridate", "flexdashboard", "plotly", "cluster", "factoextra", "broom", "car", "ggcorrplot", "usmap"))
Note: Please refer to each project's README for specific dependencies and installation instructions.
Instructions:
- Clone the Repository:
git clone https://github.com/Gyres/Job-Assessments.git cd Job-Assessments
- Select a Project:
- Navigate to the desired project folder.
- Follow the instructions in that project's README to run the analysis.
- Running the Analysis:
- Open the
.Rmd
file in RStudio. - Install any missing packages if prompted.
- Run the code chunks sequentially or knit the document to generate the report.
- Open the
Job-Assessments/
│
├── Calc-Avg-Mthly-Wage/
│ ├── Calc-Avg-Mthly-Wage.Rmd
│ └── README.md
│
├── Data-Quality_Staff-Dashboard/
│ ├── Data-Quality_Staff-Dashboard.Rmd
│ ├── Data-Quality_Staff-Dashboard.md
│ └── README.md
│
├── States-Grouping_Assault-Prediction/
│ ├── States-Grouping_Assault-Prediction.Rmd
│ ├── States-Grouping_Assault-Prediction.md
│ └── README.md
│
└── README.md # This file
This repository and its projects are intended for submission as part of technical assessments. The content and code are not intended for public distribution, reproduction, or commercial use without explicit permission from the author.
Author: Ou Yang Yu