Computational Social Scientist | Economics + Data
I use R, Python, and SQL to turn messy social data into research that means something — from NLP pipelines on IMF policy documents to climate vulnerability scores for Spanish schools.
- MSc Computational Social Science @ Universidad Carlos III de Madrid (2026)
- Research Assistant @ Sciences Po – InclusiveParl Project (SPARQL + web scraping for European parliamentary representation data)
- MSc Thesis: BERT-based classification of IMF conditionality discourse (1980–2020) — working paper in progress
- BSc Economics and Business Economics @ Utrecht University (2022)
- Family connection: AccorneroVini, a winery in Monferrato, Piedmont — where I first got interested in market data
- Classifying 40 years of IMF conditionality using BERT — MSc thesis, SSRN working paper in progress
- Building SPARQL + scraping pipelines for cross-European parliamentary representation data at Sciences Po
- Preparing a working paper on degrowth readiness typologies across 254 EU regions
European Degrowth Typology
PCA + hierarchical and K-means clustering on 254 EU NUTS-2 regions to identify structural typologies of degrowth readiness. Validated externally against CO₂ emissions and life expectancy. Tags: R · Eurostat · ggplot2
Spanish Schools Heat Vulnerability Pipeline
Production-ready R pipeline integrating 4 public APIs (CartoCiudad, Nominatim, Catastro, AEMET) to assess climate heat vulnerability across 999 schools (261,406 observations). 90% geocoding accuracy. Tags: R · API · Policy Tool
Customer Churn ML — Telco
Supervised ML pipeline on IBM Telco dataset (7,043 records): churn classification with Random Forest, LASSO, and SVM, and total charges regression (R²=0.91). Focus on class imbalance and interpretable business insight. Tags: R · caret · glmnet
IKEA Furniture ML Models
Price prediction (R²=0.74) and discount classification on 3,694 products using LASSO and Random Forest in R. Tags: R · Machine Learning
Roma Acceptance in the EU — Multilevel Analysis
Multilevel model (lme4) of individual- and country-level drivers of Roma workplace acceptance, fit to Eurobarometer 493 data (27,438 respondents across 28 EU countries) with multiple imputation via mice. Tags: R · lme4 · Survey Methods
Changing Fortunes — FT Visualization Replication
Full replication and redesign of the Financial Times "Changing Fortunes" slope chart on US median household income inequality (1999–2014). Pew Research data. Tags: R · ggplot2 · Data Visualization
Methods: Regression · Causal Inference · Machine Learning · NLP / Text Classification · Multilevel Modeling · PCA & Clustering · Survey Analysis · Web Scraping · Data Visualization
Libraries: tidyverse · ggplot2 · lme4 · mice · caret · glmnet · randomForest · xml2 · sf · factoextra
Data Sources: Eurostat · Eurobarometer · IMF MONA · World Bank · Catastro · AEMET · CartoCiudad
- 📊 IMF Conditionality Working Paper — BERT classification of ~12,000 IMF conditions across 40 years. Targeting SSRN submission July 2026.
- 🔬 Sciences Po – InclusiveParl — Web scraping + SPARQL pipelines for European parliamentary representation databases.
Email: tommaso.accornero@outlook.com
LinkedIn: linkedin.com/in/tommaso-accornero-4084931a9
Location: Madrid, Spain
Looking for: Summer 2026 data analytics internships in Europe