In 2023, AidData, an international development research lab housed at William & Mary’s Global Research Institute, released its massive dataset on projects supported by loans and grants from China.
The AidDatasGlobalChineseDevelopmentFinanceDataset_v3.0.xlsx
dataset has 126 columns and 20,985 rows. It is used for self-initiated pandas
data cleaning practice. Findings may also be included in a scoping report currently being processed.
The Jupyter notebook aid_data.ipynb
shows my first workflow using the AidData dataset. data.csv
serves as a checkpoint file after filtering the data to only Southeast Asian countries and data_petrochemical.csv
drops unnecessary columns and only keeps rows with the keyword "petrochemical".
Custer, S., Dreher, A., Elston, T.B., Escobar, B., Fedorochko, R., Fuchs, A., Ghose, S., Lin, J., Malik, A., Parks, B.C., Solomon, K., Strange, A., Tierney, M.J., Vlasto, L., Walsh, K., Wang, F., Zaleski, L., and Zhang, S. 2023. Tracking Chinese Development Finance: An Application of AidData’s TUFF 3.0 Methodology. Williamsburg, VA: AidData at William & Mary.