TU Delft · Web Information Systems

Data management for the age of AI.

Infinidata Lab at TU Delft builds the next generation of data systems — uniting data lakes with machine learning, privacy-preserving federated learning, and quantum data management.

Led by Dr. Rihan Hai, Assistant Professor 30+ papers at SIGMOD · VLDB · ICDE NWO VENI laureate
What we work on

Three research frontiers

We rethink how data is integrated, shared, and processed — from the model lake to the quantum processor.

AI in Data & Model Lakes

Bringing data integration and machine learning together, so heterogeneous data and rich model zoos meet in one lake.

Federated & Private Learning

Training and synthetic-data generation across organisational silos — without ever sharing the raw data.

Quantum Data Management

Reimagining query optimisation, entity matching, and anomaly detection for the NISQ-era quantum processor.

Selected work

Featured projects

All projects

Model Lake

Amalur

Amalur explores the convergence of data integration and machine learning — automating how scattered training data across silos is integrated for downstream models. It is the foundation of the group’s Model Lake vision, where heterogeneous data and rich model zoos meet in one place.

IEEE TKDE 2024 Data integrationMachine learning

Synthetic data

SiloFuse

SiloFuse generates cross-silo synthetic tabular data using latent diffusion models, so organisations can share realistic data without ever exposing raw, feature-partitioned records.

ICDE 2024 DiffusionPrivacy

Time series

WaveStitch

WaveStitch performs flexible and fast conditional time-series generation with diffusion models, stitching together realistic signals under user-specified constraints.

SIGMOD 2025 DiffusionGenerative

LLM serving

TranSQL / Database-as-Runtime

TranSQL serves large language models with relational queries — compiling model inference to SQL so that LLMs can run inside a database engine, even on low-resource hardware.

SIGMOD 2025 Best demo runner-up LLM servingSQL
Latest publications

Fresh from the lab

All publications
Join us

Curious about data systems, ML, or quantum?

We're always looking for sharp PhD and MSc students. Thesis topics and open positions are posted year-round.

See open positions