Using Satellite Images to Predict Poverty

Poverty is a multifaceted problem manifested by conditions such as malnutrition, homelessness, lack of access to clean water, low educational achievement, and the like (UN 2005). It continues to be one the world’s most pressing issues and it has only been exacerbated by the economic effects of the COVID-19 pandemic. It follows that poverty statistics are among the most important and most widely used data in the economic and policy research sphere. Ironically, the most vulnerable economies in need of extensive and up-to-date poverty data are also those who lack the capacity to compile them. Our project attempts to contribute to the existing body of research by leverging technological advances in machine learning to make use of existing satellite imagery.

Project Summary

We used day time satellite imagery for Ethiopia, Mali, Nigeria, and Malawi from 2015, combined with economic indicators from survey data, to train machine learning models. We planned to use Linear Regression, Random Forest, and Convolutional Neural Networks (CNN) to improve accuracy in poverty prediction, but also ended up including Support Vector Machines and Transfer Learning models. Our work is inspired by past projects which have pushed the evolution of this work from using only nighttime satellite imagery to a combination of the two, along with application of CNN to countries from around the world. By training multiple models on a cluster of countries in Africa (which have not been included before), we hope that our work can continue to advance this mission and encourage countries without consistent poverty data to invest in the development of this data.

Data Sources

We are using satellite images from private satellite company, Planet, which shared images from Africa for public use.
Economic indicators from the Demographic and Health Surveys (DHS) Program

Hardware and Software Requirements

This code was tested on a system with the following specifications:

GPU: 4 x NVIDIA A100 40GB HBM2
CPU: AMD EPYC 7742 (64 cores, Rome, 2.25 GHz)
RAM: 512 GB (8 x 64GB) ECC DDR4 3200 Mhz
SSD: 2,5” 3,8 TB U.2 NVMe TLC
Network: 1 x 10 GbE SFP+, 2x 1 GbE RJ45, 1 x IPMI Lan
OS: Ubuntu 20.04 LTS + Tensorflow, PyTorch and MxNet with Docker Container for Versions Management

Results

Linear Regression (3.76% score) & Random Forest (61.8%) set a rough baseline.

Despite having server issues which only allowed us to run the CNN 50 epochs, this model rendered a 57.1% accuracy rate.

SVM (60.8%) and Transfer Learning, applied to CNN (56.4%), were both included as a means to circumvent server issues.

Next Steps & Future Work

Improving current work: We saw improvement with hyper parameter tuning, further compression of files, and longer training times; and believe that with more time, the CNN's output can be greatly improved.

Future projects & applications: Taking lessons from this project, we'd like to apply this model to more up-to-date satellite images. Our ultimate goal is to use these models to support prediction efforts for countries with NO poverty data (i.e. Afghanistan, Somalia), so including countries nearby which can lead to more accurate prediction is another goal.

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
Project Proposal (Satellites against Poverty)		Project Proposal (Satellites against Poverty)
images		images
maps		maps
outcome_data		outcome_data
.DS_Store		.DS_Store
.gitignore		.gitignore
FullCodeMalawi.ipynb		FullCodeMalawi.ipynb
Johannes Halkenhaeusser_115538_0.pdf		Johannes Halkenhaeusser_115538_0.pdf
LaTeXTemplate_for_ML_Project_Proposal__Satellites_against_Poverty_.pdf		LaTeXTemplate_for_ML_Project_Proposal__Satellites_against_Poverty_.pdf
MasterCode.ipynb		MasterCode.ipynb
README.md		README.md
Satellite_Prediction_of_Poverty_Final_Project_Report.pdf		Satellite_Prediction_of_Poverty_Final_Project_Report.pdf
get_shapefile.ipynb		get_shapefile.ipynb
predictions.csv		predictions.csv
test_set_pca.csv		test_set_pca.csv
train_set_pca.csv		train_set_pca.csv
transferlearning_curves.png		transferlearning_curves.png
transferlearning_r2.png		transferlearning_r2.png
wealth_index.py		wealth_index.py
wealth_index_all.py		wealth_index_all.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Using Satellite Images to Predict Poverty

Project Summary

Data Sources

Hardware and Software Requirements

Results

Next Steps & Future Work

About

Releases

Packages

Contributors 3

Languages

janinepdevera/Poverty-Estimation-with-Satellite-Images

Folders and files

Latest commit

History

Repository files navigation

Using Satellite Images to Predict Poverty

Project Summary

Data Sources

Hardware and Software Requirements

Results

Next Steps & Future Work

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages