Posts

  • Dec 16, 2025 ‐ Arnab’s 2025 paper, MEMPHIS: Holistic Lineage-based Reuse and Memory Management for Multi-backend ML Systems, has received the prestigious SIGMOD Research Highlight Award and will appear in the March 2026 special issue of SIGMOD Record. The paper previously won the EDBT 2025 Best Research Paper Award.
  • Nov 3, 2025 ‐ Barrie has been invited to submit an extended version of his paper on Scalable Data Debugging for Neighborhood-based Recommendation with Data Shapley Values to the Special Issue on Highlights of RecSys ’25 of the ACM Transactions on Recommender Systems.
  • Sep 26, 2025 ‐ Barrie and Pierre are at RecSys in Prague this week! Barrie presents our work on Scalable Data Debugging for Neighborhood-based Recommendation with Data Shapley Values, which was selected as a spotlight oral. Pierre gives a talk on his initial ideas Towards a Real-World Aligned Benchmark for Unlearning in Recommender Systems at the Responsible recommendation workshop.
  • Jul 15, 2025 ‐ Meet Olga from our lab at ICML in Vancouver this week! She will present a research paper on scSSL-Bench: Benchmarking Self-Supervised Learning for Single-Cell Data, which was selected as a spotlight poster, as well as a second paper on Towards Cross-Modal Error Detection with Tables and Images at the DataWorld workshop.
  • Jul 14, 2025 ‐ Our external PhD student Zeyu from the University of Amsterdam is starting his internship with the Amazon Q team of AWS Berlin this week!
  • Jul 7, 2025 ‐ Our external PhD student Barrie Kersbergen (co-supervised with Maarten de Rijke) has successfully defended his PhD at the University of Amsterdam! Barrie’s research on recommender systems has been deployed to millions of users at the European e-commerce platform bol.com.
  • Jun 10, 2025 ‐ Meet our lab at the SIGMOD conference in Berlin next week! We are part of the organizing committee of the conference and co-organise the DEEM workshop as well. Furthermore, we will present a workshop paper on Towards Automated Task-Aware Data Validation and run a tutorial on Navigating Data Errors in Machine Learning Pipelines on Friday.
  • May 2, 2025 ‐ Olga and Sebastian took part in the seminar on the Challenges and Opportunities of Table Representation Learning in Dagstuhl, which aims to connect the communities of data management, machine learning, and natural language processing to discuss the future of learning on tabular data.
  • Mar 25, 2025 ‐ Zeyu gave an invited talk about the efficient utilization of language models for table data preparation at the industry event on Next-Generation Data Management Systems at EDBT 2025 in Barcelona, and subsequently presented our paper on A Deep Dive Into Cross-Dataset Entity Matching with Large and Small Language Models.
  • Jan 11, 2025 ‐ Stefan will be co-organising the workshop on Data Management for End-to-End Machine Learning (DEEM) at SIGMOD 2025 in Berlin.