Federated Learning under Distributed Concept Drift (FedDrift)

This repository is the source code for our paper: Federated Learning under Distributed Concept Drift (AISTATS'23).

Federated Learning (FL) under distributed concept drift is a largely unexplored area. Although concept drift is itself a well-studied phenomenon, it poses particular challenges for FL, because drifts arise staggered in time and space (across clients). We first demonstrate that prior solutions to drift adaptation that use a single global model are ill-suited to staggered drifts, necessitating multiple-model solutions. We identify the problem of drift adaptation as a time-varying clustering problem, and we propose two new clustering algorithms for reacting to drifts based on local drift detection and hierarchical clustering. Empirical evaluation shows that our solutions achieve significantly higher accuracy than existing baselines, and are comparable to an idealized algorithm with oracle knowledge of the ground-truth clustering of clients to concepts at each time step.

This repository is built on top of a federated learning research platform, FedML.

Setup Environment

Our installation script is based on Miniconda. Please modify the script according to your package manager.

$ ./CI-install.sh

Our expeirments are tested with Python 3.7.4 and PyTorch 10.2. You can also install the GPU-enabled PyTorch by modifying the script above.

The above script will create a conda environment named feddrift and install all the required packages. Please activate the environment before running any experiments.

$ conda activate feddrift

Running Experiments

We use Weights & Biases to log experiments. Please create an account and log in before running an experiment.

$ wandb login

To run an experiment, please use the following command:

$ cd fedml_experiments/distributed/fedavg_cont_ens
$ ./run_fedavg_distributed_pytorch.sh ${CLIENT_NUM} ${WORKER_NUM} ${SERVER_NUM} ${GPU_PER_SERVER} ${MODEL} ${DATA_DIST} ${ROUND} ${EPOCH} ${BATCH_SIZE} ${LR} ${DATASET} ${DATA_DIR} ${SAMPLE_NUM} ${NOISE_PROB} ${ONLY_TEST_ONE_CLIENT} ${TOTAL_ITER} ${CONCEPT_NUM} ${RESET_MODEL} ${DRIFT_TOGETHER} ${DRIFT_ALGO} ${DRIFT_ALGO_ARG} ${TIME_STRETH} ${DUMMY_ARG} ${CHANGE_POINT}

For example, running FedDrift for SEA-4 dataset and change point A:

$ ./run_fedavg_distributed_pytorch.sh 10 10 1 4 fnn homo 200 5 500 0.01 sea "./../../../data/" 100 0 0 10 4 0 0 softcluster H_A_C_1_10_0 1 0 A

The major algorithms we implemented are listed here (in the form of ${DRIFT_ALGO} + optionally ${DRIFT_ALGO_ARG}):

win-1: Window method (one time step)
win-2: Window method (two time step)
all: Oblivous method (using all training data)
lin: Weighted (linear decay)
exp: Weighted (exponential decay)
dsurf: DriftSurf
aue: Accuracy Updated Ensemble
kue: Kappa Updated Ensemble
softcluster + cfl_0.1_win-1: Clustered federated learning
softclusterwin-1 + hard-r: IFCA
ada + win-1_iter: Adaptive-FedAvg
softcluster + H_A_F_1_06_0: FedDrift (ours)
softcluster + mmacc_06: FedDrift-Eagar (ours)

Reference Papers

If you use our code in your work, we would appreciate a reference to the following papers

Ellango Jothimurugesan, Kevin Hsieh, Jianyu Wang, Gauri Joshi, Phillip B. Gibbons. Federated Learning under Distributed Concept Drift. Proceedings of The 26th International Conference on Artificial Intelligence and Statistics (AISTATS), 2023.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.microsoft.com.

When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repositories using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Name		Name	Last commit message	Last commit date
Latest commit History 440 Commits
.github		.github
applications		applications
benchmark		benchmark
data		data
docs		docs
fedml_api		fedml_api
fedml_core		fedml_core
fedml_experiments		fedml_experiments
fedml_mobile		fedml_mobile
tests/fedml_api/standalone/fedavg		tests/fedml_api/standalone/fedavg
.amltignore		.amltignore
.gitignore		.gitignore
.travis.yml		.travis.yml
CI-install.sh		CI-install.sh
DATA-install.sh		DATA-install.sh
INSTALL.md		INSTALL.md
LICENSE		LICENSE
LICENSE-FedML		LICENSE-FedML
README.md		README.md
SECURITY.md		SECURITY.md
__init__.py		__init__.py
azure-pipelines.yml		azure-pipelines.yml
contributor.md		contributor.md
publications.md		publications.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

Federated Learning under Distributed Concept Drift (FedDrift)

Setup Environment

Running Experiments

Reference Papers

Contributing

About

Licenses found

Releases

Sponsor this project

Packages

Contributors 2

Languages

License

Licenses found

microsoft/FedDrift

Folders and files

Latest commit

History

Repository files navigation

Federated Learning under Distributed Concept Drift (FedDrift)

Setup Environment

Running Experiments

Reference Papers

Contributing

About

Resources

License

Licenses found

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Contributors 2

Languages

Packages