Federated Nested Learning Framework for Test-Time Adaptation

ai-technology · 2026-05-20

A novel machine learning framework named Federated Nested Learning (FedNL) has been introduced to tackle the challenges posed by non-IID client data in federated learning. This framework reinterprets federated learning as a three-tier nested optimization model, incorporating Titans-based linear attention to facilitate lightweight, zero-shot test-time adaptation through a delta rule as an online gradient step. Tests conducted on non-IID MMLU and long-context benchmarks demonstrate strong performance in short-context reasoning, enhanced long-context retrieval, and consistent streaming cross-entropy, along with stable inference memory. The research paper can be accessed on arXiv.

Key facts

FedNL reformulates FL as a three-level nested optimization system.
FedNL embeds Titans-based linear attention into FL.
Clients perform lightweight, zero-shot test-time adaptation.
A delta rule is treated as an online gradient step.
Experiments on non-IID MMLU and long-context benchmarks.
Competitive performance in short-context reasoning.
Enhanced long-context retrieval and streaming cross-entropy.
Maintains constant inference memory.

Federated Nested Learning Framework for Test-Time Adaptation

Key facts

Entities

Institutions

Sources