1. Decouple reception from optimization
Clients download rank r_tot under bandwidth limits, but only
train rank r_train under memory and time limits.
ICML 2026
Towards Drift-Resilient Federated Fine-tuning with Heterogeneous Resources
School of Computing Science, University of Glasgow
Technical TL;DR
FedHera decouples the downloaded inference rank from the locally trainable rank, so clients can receive a richer global LoRA basis while only optimizing the prefix that fits their memory and compute budget.
A spectrum-preserving water-filling allocator spends bandwidth on high-energy singular directions, while prefix-gated training uses the frozen tail as a forward-pass anchor to reduce truncation-induced drift.
FedHera mechanics
Clients download rank r_tot under bandwidth limits, but only
train rank r_train under memory and time limits.
The server competes singular directions across layers and spends each rank column where energy per cost is highest.
Gradient masks update only the active prefix; Adaptive Tail Warm-up gates the frozen tail as the global basis becomes reliable.
Results
Lower drift indicates that local client updates stay closer to the high-rank reference direction and remain more aligned across heterogeneous clients.
| Task | ROUGE-L gain | Loss |
|---|---|---|
| Alpaca | +0.101 | 1.140 |
| E2E NLG | +0.195 | 0.492 |
| Method | Abs. | Rel. |
|---|---|---|
| FedHera | 2.060 | 6.641 |
| FedHL | 2.072 | 6.684 |
| FlexLoRA | 2.991 | 9.647 |
Code map
BibTeX
@inproceedings{xiao2026fedhera,
title={FedHera: Towards Drift-Resilient Federated Fine-tuning with Heterogeneous Resources},
author={Xiao, Ke and Wang, Qiyuan and Anagnostopoulos, Christos and Tan, Zhuoran and Li, Wenhao},
booktitle={International Conference on Machine Learning (ICML)},
year={2026}
}