This website uses cookies to ensure you get the best experience on our website.
Learn more
Got it!
Donato Crisostomi
Donato Crisostomi
Home
News
Experience
1
Update Your Transformer to the Latest Release: Re-Basin of Task Vectors
Foundation models serve as the backbone for numerous specialized models developed through fine-tuning. However, when the underlying …
Filippo Rinaldi
,
Giacomo Capitani
,
Lorenzo Bonicelli
,
Angelo Porrello
,
Donato Crisostomi
,
Federico Bolelli
,
Emanuele Rodolà
,
Elisa Ficarra
,
Simone Calderara
Cite
MERGE3: Efficient Evolutionary Merging on Consumer-grade GPUs
Evolutionary model merging enables the creation of high-performing multi-task models but remains computationally prohibitive for …
Tommaso Mencattini
,
Adrian Robert Minut
,
Donato Crisostomi
,
Andrea Santilli
,
Emanuele Rodolà
Cite
arXiv
GitHub
Task Singular Vectors: Reducing Task Interference in Model Merging
Task Arithmetic has emerged as a simple yet effective method to merge models without additional training. However, by treating entire …
Antonio Andrea Gargiulo
,
Donato Crisostomi
,
Maria Sofia Bucarelli
,
Simone Scardapane
,
Emanuele Rodolà
Cite
arXiv
GitHub
C2M3: Cycle-Consistent Multi-Model Merging
In this paper, we present a novel data-free method for merging neural networks in weight space. Differently from most existing works, …
Donato Crisostomi
,
Marco Fumero
,
Daniele Baieri
,
Florian Bernard
,
Emanuele Rodolà
Cite
arXiv
GitHub
AVEN-GR: Attribute Value Extraction and Normalization using product GRaphs
Getting a good understanding of the user intent is vital for e-commerce applications to surface the right product to a given customer …
Donato Crisostomi
,
Thomas Ricatte
Cite
URL
Mitigating the Burden of Redundant Datasets via Batch-Wise Unique Samples and Frequency-Aware Losses
Datasets used to train deep learning models in industrial settings often exhibit skewed distributions with some samples repeated a …
Donato Crisostomi
,
Andrea Caciolai
,
Alessandro Pedrani
,
Kay Rottmann
,
Alessandro Manzotti
,
Enrico Palumbo
,
Davide Bernardi
Cite
URL
Metric Based Few-Shot Graph Classification
Few-shot graph classification is a novel yet promising emerging research field that still lacks the soundness of well-established …
Donato Crisostomi
,
Simone Antonelli
,
Valentino Maiorca
,
Luca Moschella
,
Riccardo Marin
,
Emanuele Rodolà
Cite
URL
PDF
GitHub
Play música alegre: A Large-Scale Empirical Analysis of Cross-Lingual Phenomena in Voice Assistant Interactions
Cross-lingual phenomena are quite common in informal contexts like social media, where users are likely to mix their native language …
Donato Crisostomi
,
Alessandro Manzotti
,
Enrico Palumbo
,
Davide Bernardi
,
Sarah Campbell
,
Shubham Garg
Cite
URL
Cite
×