Publications

Abzal Aidakhmetov, Donato Crisostomi, Tommaso Mencattini, Adrian Robert Minut, Iacopo Masi, Emanuele Rodolà

Steering Vectors are an Adversarial Attack Surface

PHALAR: Phasors for Learned Musical Audio Representations

ICML 2026

Stem retrieval, the task of matching missing stems to a given audio submix, is a key challenge currently limited by models that discard …

Davide Marincione, Michele Mancusi, Giorgio Strano, Luca Cerovaz, Donato Crisostomi, Roberto Ribuoli, Emanuele Rodolà

PHALAR: Phasors for Learned Musical Audio Representations

Model Merging: Foundations and Algorithms

PhD Thesis

Modern deep learning usually treats models as separate artifacts: trained independently, specialized for particular purposes, and …

Donato Crisostomi

Model Merging: Foundations and Algorithms

Post-AGI Science and Society Workshop

ICLR 2026 Workshop

The Post-AGI Science and Society Workshop at ICLR 2026 asks what comes next if AGI becomes ubiquitous, reliable, and affordable, and …

Donato Crisostomi, Andrea Santilli, Pratyusha Sharma, Valentina Pyatkin, Zorah Lähner, Emanuele Rodolà

Multi-objective Evolutionary Merging Enables Efficient Reasoning Models

Preprint

Reasoning models have demonstrated remarkable capabilities in solving complex problems by leveraging long chains of thought. However, …

Mario Iacobelli, Adrian Robert Minut, Tommaso Mencattini, Donato Crisostomi, Andrea Santilli, Iacopo Masi, Emanuele Rodolà

Multi-objective Evolutionary Merging Enables Efficient Reasoning Models

Multi-Way Representation Alignment

ICML 2026

The Platonic Representation Hypothesis suggests that independently trained neural networks converge to increasingly similar latent …

Akshit Achara, Tatiana Gaintseva, Mateo Mahaut, Pritish Chakraborty, Viktor Stenby Johansson, Melih Barsbey, Emanuele Rodolà, Donato Crisostomi

Membership and Dataset Inference Attacks on Large Audio Generative Models

AI4Music @ NeurIPS 2025

Generative audio models, based on diffusion and autoregressive architectures, have advanced rapidly in both quality and expressiveness. …

Jakub Proboszcz, Paweł Kochański, Karol Korszun, Katarzyna Stankiewicz, Donato Crisostomi, Giorgio Strano, Emanuele Rodolà, Kamil Deja, Jan Dubiński

Membership and Dataset Inference Attacks on Large Audio Generative Models

Model Merging Improves Zero-Shot Generalization in Bioacoustic Foundation Models

AI for Animal Communication @ NeurIPS 2025

Foundation models capable of generalizing across species and tasks represent a promising new frontier in bioacoustics, with NatureLM …

Davide Marincione, Donato Crisostomi, Roberto Dessì, Emanuele Rodolà, Emanuele Rossi

Model Merging Improves Zero-Shot Generalization in Bioacoustic Foundation Models

Language Models are Injective and Hence Invertible

ICLR 2026

Transformer components such as non-linear activations and normalization are inherently non-injective, suggesting that different inputs …

Giorgios Nikolaou, Tommaso Mencattini, Donato Crisostomi, Santilli, Yannis Panagakis, Emanuele Rodolà

Language Models are Injective and Hence Invertible

Two-Scale Latent Dynamics for Recurrent-Depth Transformers

UniReps @ NeurIPS 2025

Recurrent-depth transformers scale test-time compute by iterating latent computations before emitting tokens. We study the geometry of …

Francesco Pappone, Donato Crisostomi, Emanuele Rodolà

Two-Scale Latent Dynamics for Recurrent-Depth Transformers

On Task Vectors and Gradients

UniReps @ NeurIPS 2025

Task arithmetic has emerged as a simple yet powerful technique for model merging, enabling the combination of multiple finetuned models …

Luca Zhou, Daniele Solombrino, Donato Crisostomi, Maria Sofia Bucarelli, Giuseppe Alessio D'Inverno, Fabrizio Silvestri, Emanuele Rodolà

Implicit Inversion turns CLIP into a Decoder

ICLR 2026

CLIP is a discriminative model trained to align images and text in a shared embedding space. Due to its multimodal structure, it serves …

Antonio D'Orazio, Maria Rosaria Briglia, crisostomi, Dario Loi, rodola, Iacopo Masi

Implicit Inversion turns CLIP into a Decoder

Mergenetic: a Simple Evolutionary Model Merging Library

ACL 2025 System Demo

Model merging allows combining the capabilities of existing models into a new one - post hoc, without additional training. This has …

Adrian R. Minut, Tommaso Mencattini, Marco Santilli, Donato Crisostomi, Emanuele Rodolà

Mergenetic: a Simple Evolutionary Model Merging Library

Update Your Transformer to the Latest Release: Re-Basin of Task Vectors

ICML 2025

Foundation models serve as the backbone for numerous specialized models developed through fine-tuning. However, when the underlying …

Filippo Rinaldi, Giacomo Capitani, Lorenzo Bonicelli, Angelo Porrello, Donato Crisostomi, Federico Bolelli, Emanuele Rodolà, Elisa Ficarra, Simone Calderara

Update Your Transformer to the Latest Release: Re-Basin of Task Vectors

STAGE: Stemmed Accompaniment Generation through Prefix-Based Conditioning

ISMIR 2025 (top in Music ML)

Recent advances in generative models have made it possible to create high-quality, coherent music, with some systems delivering …

Giorgio Strano, Chiara Ballanti, Donato Crisostomi, Michele Mancusi, Luca Cosmo, Emanuele Rodolà

Efficient Generation of Multimodal Fluid Simulation Data

STAG

In this work, we introduce an efficient generation procedure to produce synthetic multi-modal datasets of fluid simulations. The …

Daniele Baieri, Donato Crisostomi, Donato Crisostomi, Stefano Esposito, Filippo Maggioli, Emanuele Rodolà

Efficient Generation of Multimodal Fluid Simulation Data

MASS: MoErging through Adaptive Subspace Selection

ICLR 2026

Model merging has recently emerged as a lightweight alternative to ensembling, combining multiple fine-tuned models into a single set …

Donato Crisostomi, Alessandro Zirilli, Antonio Andrea Gargiulo, Maria Sofia Bucarelli, Simone Scardapane, Fabrizio Silvestri, Iacopo Masi, Emanuele Rodolà

MASS: MoErging through Adaptive Subspace Selection

LoopGen: Training-Free Loopable Music Generation

ISMIR 2025 (top in Music ML)

Loops–short audio segments designed for seamless repetition–are central to many music genres, particularly those rooted in …

Davide Marincione, Giorgio Strano, Donato Crisostomi, Roberto Ribuoli, Emanuele Rodolà

LoopGen: Training-Free Loopable Music Generation

Activation Patching for Interpretable Steering in Music Generation

Preprint

Understanding how large audio models represent music, and using that understanding to steer generation, is both challenging and …

Simone Facchiano, Giorgio Strano, Donato Crisostomi, Irene Tallini, Tommaso Mencattini, Fabio Galasso, Emanuele Rodolà

Activation Patching for Interpretable Steering in Music Generation

Task Singular Vectors: Reducing Task Interference in Model Merging

CVPR 2025

Task Arithmetic has emerged as a simple yet effective method to merge models without additional training. However, by treating entire …

Antonio Andrea Gargiulo, Donato Crisostomi, Maria Sofia Bucarelli, Simone Scardapane, Emanuele Rodolà

Task Singular Vectors: Reducing Task Interference in Model Merging

MERGE³: Efficient Evolutionary Merging on Consumer-grade GPUs

ICML 2025

Evolutionary model merging enables the creation of high-performing multi-task models but remains computationally prohibitive for …

Tommaso Mencattini, Adrian Robert Minut, Donato Crisostomi, Andrea Santilli, Emanuele Rodolà

MERGE³: Efficient Evolutionary Merging on Consumer-grade GPUs

Humanity's Last Exam

Preprint

Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are …

More than 600 authors including, Donato Crisostomi, Emanuele Rodolà

ATM: Improving Model Merging by Alternating Tuning and Merging

ArXiv

Model merging has recently emerged as a cost-efficient paradigm for multi-task learning. Among current approaches, task arithmetic …

Luca Zhou, Daniele Solombrino, Donato Crisostomi, Maria Sofia Bucarelli, Fabrizio Silvestri, Emanuele Rodolà

ATM: Improving Model Merging by Alternating Tuning and Merging

C²M³: Cycle-Consistent Multi-Model Merging

NeurIPS 2024

In this paper, we present a novel data-free method for merging neural networks in weight space. Differently from most existing works, …

Donato Crisostomi, Marco Fumero, Daniele Baieri, Florian Bernard, Emanuele Rodolà

C²M³: Cycle-Consistent Multi-Model Merging

Preface of UniReps: the Second Edition of the Workshop on Unifying Representations in Neural Models

PMLR

Discover why, when and how distinct learning processes yield similar representations, and the degree to which these can be unified.

Clementine Domine, Marco Fumero, Zorah Lähner, Donato Crisostomi, Luca Moschella, Kimberly Stachenfeld

Preface of UniReps: the Second Edition of the Workshop on Unifying Representations in Neural Models

From Charts to Atlas: Merging Latent Spaces into One

NeurReps workshop @ NeurIPS 2023

Models trained on semantically related datasets and tasks exhibit comparable inter-sample relations within their latent spaces. We …

Donato Crisostomi, Irene Cannistraci, Luca Moschella, Pietro Barbiero, Marco Ciccone, Pietro Lio, Emanuele Rodolà

From Charts to Atlas: Merging Latent Spaces into One

Mitigating the Burden of Redundant Datasets via Batch-Wise Unique Samples and Frequency-Aware Losses

ACL 2023

Datasets used to train deep learning models in industrial settings often exhibit skewed distributions with some samples repeated a …

Donato Crisostomi, Andrea Caciolai, Alessandro Pedrani, Kay Rottmann, Alessandro Manzotti, Enrico Palumbo, Davide Bernardi

Mitigating the Burden of Redundant Datasets via Batch-Wise Unique Samples and Frequency-Aware Losses

AVEN-GR: Attribute Value Extraction and Normalization using product GRaphs

ACL 2023

Getting a good understanding of the user intent is vital for e-commerce applications to surface the right product to a given customer …

Donato Crisostomi, Thomas Ricatte

AVEN-GR: Attribute Value Extraction and Normalization using product GRaphs

Play música alegre: A Large-Scale Empirical Analysis of Cross-Lingual Phenomena in Voice Assistant Interactions

MMNLU workshop, EMNLP 2022

Cross-lingual phenomena are quite common in informal contexts like social media, where users are likely to mix their native language …

Donato Crisostomi, Alessandro Manzotti, Enrico Palumbo, Davide Bernardi, Sarah Campbell, Shubham Garg

Play música alegre: A Large-Scale Empirical Analysis of Cross-Lingual Phenomena in Voice Assistant Interactions

Metric Based Few-Shot Graph Classification

LoG 2022

Few-shot graph classification is a novel yet promising emerging research field that still lacks the soundness of well-established …

Donato Crisostomi, Simone Antonelli, Valentino Maiorca, Luca Moschella, Riccardo Marin, Emanuele Rodolà

Metric Based Few-Shot Graph Classification

Few-Shot Object Detection: A Survey

ACM Surveys

Deep learning approaches have recently raised the bar in many fields, from Natural Language Processing to Computer Vision, by …

Simone Antonelli, Danilo Avola, Luigi Cinque, Donato Crisostomi, Gian Luca Foresti, Fabio Galasso, Marco Raoul Marini, Alessio Mecca, Daniele Pannone