Donato Crisostomi
Publications
Task Singular Vectors: Reducing Task Interference in Model Merging
CVPR 2025
Task Arithmetic has emerged as a simple yet effective method to merge models without additional training. However, by treating entire …
Antonio Andrea Gargiulo, Donato Crisostomi, Maria Sofia Bucarelli, Simone Scardapane, Emanuele Rodolà
MERGE3: Efficient Evolutionary Merging on Consumer-grade GPUs
arXiv
Evolutionary model merging enables the creation of high-performing multi-task models but remains computationally prohibitive for …
Tommaso Mencattini, Adrian Robert Minut, Donato Crisostomi, Andrea Santilli, Emanuele Rodolà
Humanity's Last Exam
Preprint
Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are …
More than 600 authors, including Donato Crisostomi and Emanuele Rodolà
ATM: Improving Model Merging by Alternating Tuning and Merging
arXiv
Model merging has recently emerged as a cost-efficient paradigm for multi-task learning. Among current approaches, task arithmetic …
Luca Zhou, Daniele Solombrino, Donato Crisostomi, Maria Sofia Bucarelli, Fabrizio Silvestri, Emanuele Rodolà
C2M3: Cycle-Consistent Multi-Model Merging
NeurIPS 2024
In this paper, we present a novel data-free method for merging neural networks in weight space. Differently from most existing works, …
Donato Crisostomi, Marco Fumero, Daniele Baieri, Florian Bernard, Emanuele Rodolà
From Charts to Atlas: Merging Latent Spaces into One
NeurReps workshop @ NeurIPS 2023
Models trained on semantically related datasets and tasks exhibit comparable inter-sample relations within their latent spaces. We …
Donato Crisostomi, Irene Cannistraci, Luca Moschella, Pietro Barbiero, Marco Ciccone, Pietro Liò, Emanuele Rodolà
Mitigating the Burden of Redundant Datasets via Batch-Wise Unique Samples and Frequency-Aware Losses
ACL 2023
Datasets used to train deep learning models in industrial settings often exhibit skewed distributions with some samples repeated a …
Donato Crisostomi, Andrea Caciolai, Alessandro Pedrani, Kay Rottmann, Alessandro Manzotti, Enrico Palumbo, Davide Bernardi
AVEN-GR: Attribute Value Extraction and Normalization using product GRaphs
ACL 2023
Getting a good understanding of the user intent is vital for e-commerce applications to surface the right product to a given customer …
Donato Crisostomi, Thomas Ricatte
Play música alegre: A Large-Scale Empirical Analysis of Cross-Lingual Phenomena in Voice Assistant Interactions
MMNLU workshop, EMNLP 2022
Cross-lingual phenomena are quite common in informal contexts like social media, where users are likely to mix their native language …
Donato Crisostomi, Alessandro Manzotti, Enrico Palumbo, Davide Bernardi, Sarah Campbell, Shubham Garg
Metric Based Few-Shot Graph Classification
LoG 2022
Few-shot graph classification is a novel yet promising emerging research field that still lacks the soundness of well-established …
Donato Crisostomi, Simone Antonelli, Valentino Maiorca, Luca Moschella, Riccardo Marin, Emanuele Rodolà
Few-Shot Object Detection: A Survey
ACM Computing Surveys
Deep learning approaches have recently raised the bar in many fields, from Natural Language Processing to Computer Vision, by …
Simone Antonelli, Danilo Avola, Luigi Cinque, Donato Crisostomi, Gian Luca Foresti, Fabio Galasso, Marco Raoul Marini, Alessio Mecca, Daniele Pannone