Talks and Publications
Talks
- Timeseries forecasting with skrub Data Ops: EuroScipy 2025, with Guillaume Lemaitre
- Skrub – Machine learning with dataframes: PyData Paris 2025
- Skrub tutorials material: To be presented
Publications
Riccardo Cappuzzo, Gael Varoquaux, Aimee Coelho, Paolo Papotti: Retrieve, Merge, Predict: Augmenting Tables with Data Lakes TMLR2025, PDF
Riccardo Cappuzzo, Paolo Papotti, and Saravanan Thirumuruganathan: Relational Data Imputation with Graph Neural Networks EDBT2024, PDF
Riccardo Cappuzzo, Paolo Papotti, and Saravanan Thirumuruganathan: Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks. SIGMOD 2020, PDF
Riccardo Cappuzzo, Paolo Papotti, Saravanan Thirumuruganathan: EmbDI: Generating Embeddings for Relational Data Integration (Discussion Paper). SEBD 2021 PDF
Riccardo Cappuzzo, under the supervision of Paolo Papotti, Elena Baralis: Clustering of Categorical Data for Anonymization and Anomaly Detection. Master Thesis at Politecnico di Torino PDF
Riccardo Cappuzzo, under the supervision of Paolo Papotti: Deep Learning Models for Tabular Data Curation. PhD Thesis at EURECOM/Sorbonne University PDF