Publications

Filippo Rinaldi, Giacomo Capitani, Lorenzo Bonicelli, Angelo Porrello, Donato Crisostomi, Federico Bolelli, Emanuele Rodolà, Elisa Ficarra, Simone Calderara

Update Your Transformer to the Latest Release: Re-Basin of Task Vectors

STAGE: Stemmed Accompaniment Generation through Prefix-Based Conditioning

Preprint

Recent advances in generative models have made it possible to create high-quality, coherent music, with some systems delivering …

Giorgio Strano, Chiara Ballanti, Donato Crisostomi, Michele Mancusi, Luca Cosmo, Emanuele Rodolà

MASS: MoErging through Adaptive Subspace Selection

Preprint

Model merging has recently emerged as a lightweight alternative to ensembling, combining multiple fine-tuned models into a single set …

Donato Crisostomi, Alessandro Zirilli, Antonio Andrea Gargiulo, Maria Sofia Bucarelli, Simone Scardapane, Fabrizio Silvestri, Iacopo Masi, Emanuele Rodolà

LoopGen: Training-Free Loopable Music Generation

Preprint

Loops–short audio segments designed for seamless repetition–are central to many music genres, particularly those rooted in …

Davide Marincione, Giorgio Strano, Donato Crisostomi, Roberto Ribuoli, Emanuele Rodolà

Activation Patching for Interpretable Steering in Music Generation

Preprint

Understanding how large audio models represent music, and using that understanding to steer generation, is both challenging and …

Simone Facchiano, Giorgio Strano, Donato Crisostomi, Irene Tallini, Tommaso Mencattini, Fabio Galasso, Emanuele Rodolà

Task Singular Vectors: Reducing Task Interference in Model Merging

CVPR 2025

Task Arithmetic has emerged as a simple yet effective method to merge models without additional training. However, by treating entire …

Antonio Andrea Gargiulo, Donato Crisostomi, Maria Sofia Bucarelli, Simone Scardapane, Emanuele Rodolà

MERGE3: Efficient Evolutionary Merging on Consumer-grade GPUs

ICML 2025

Evolutionary model merging enables the creation of high-performing multi-task models but remains computationally prohibitive for …

Tommaso Mencattini, Adrian Robert Minut, Donato Crisostomi, Andrea Santilli, Emanuele Rodolà

Humanity's Last Exam

Preprint

Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are …

More than 600 authors, including Donato Crisostomi and Emanuele Rodolà

ATM: Improving Model Merging by Alternating Tuning and Merging

ArXiv

Model merging has recently emerged as a cost-efficient paradigm for multi-task learning. Among current approaches, task arithmetic …

Luca Zhou, Daniele Solombrino, Donato Crisostomi, Maria Sofia Bucarelli, Fabrizio Silvestri, Emanuele Rodolà

C2M3: Cycle-Consistent Multi-Model Merging

NeurIPS 2024

In this paper, we present a novel data-free method for merging neural networks in weight space. Differently from most existing works, …

Donato Crisostomi, Marco Fumero, Daniele Baieri, Florian Bernard, Emanuele Rodolà

From Charts to Atlas: Merging Latent Spaces into One

NeurReps workshop @ NeurIPS 2023

Models trained on semantically related datasets and tasks exhibit comparable inter-sample relations within their latent spaces. We …

Donato Crisostomi, Irene Cannistraci, Luca Moschella, Pietro Barbiero, Marco Ciccone, Pietro Liò, Emanuele Rodolà

Mitigating the Burden of Redundant Datasets via Batch-Wise Unique Samples and Frequency-Aware Losses

ACL 2023

Datasets used to train deep learning models in industrial settings often exhibit skewed distributions with some samples repeated a …

Donato Crisostomi, Andrea Caciolai, Alessandro Pedrani, Kay Rottmann, Alessandro Manzotti, Enrico Palumbo, Davide Bernardi

AVEN-GR: Attribute Value Extraction and Normalization using product GRaphs

ACL 2023

Getting a good understanding of the user intent is vital for e-commerce applications to surface the right product to a given customer …

Donato Crisostomi, Thomas Ricatte

Play música alegre: A Large-Scale Empirical Analysis of Cross-Lingual Phenomena in Voice Assistant Interactions

MMNLU workshop, EMNLP 2022

Cross-lingual phenomena are quite common in informal contexts like social media, where users are likely to mix their native language …

Donato Crisostomi, Alessandro Manzotti, Enrico Palumbo, Davide Bernardi, Sarah Campbell, Shubham Garg

Metric Based Few-Shot Graph Classification

LoG 2022

Few-shot graph classification is a novel yet promising emerging research field that still lacks the soundness of well-established …

Donato Crisostomi, Simone Antonelli, Valentino Maiorca, Luca Moschella, Riccardo Marin, Emanuele Rodolà

Few-Shot Object Detection: A Survey

ACM Computing Surveys

Deep learning approaches have recently raised the bar in many fields, from Natural Language Processing to Computer Vision, by …

Simone Antonelli, Danilo Avola, Luigi Cinque, Donato Crisostomi, Gian Luca Foresti, Fabio Galasso, Marco Raoul Marini, Alessio Mecca, Daniele Pannone