PR3: Add deep_mmd_loss files #170

sanaAyrml · 2024-06-07T00:44:13Z

PR Type

[Feature]

Short Description

This is a tentative implementation of the deep MMD loss.

Tests Added

No tests added yet.


emersodb · 2024-06-19T18:25:03Z

fl4health/clients/deep_mmd_clients/ditto_deep_mmd_client.py

+ each batch. Defaults to LossMeterType.AVERAGE.
+ checkpointer (Optional[TorchCheckpointer], optional): Checkpointer to be used for client-side
+ checkpointing. Defaults to None.
+ metrics_reporter (Optional[MetricsReporter], optional): A metrics reporter instance to record the metrics


I think this is probably an issue with the DittoClient docstring as well, but the metrics_reporter is technically not an arg in this implementation.

emersodb · 2024-06-19T18:26:11Z

fl4health/clients/deep_mmd_clients/ditto_deep_mmd_client.py

+ size_feature_extraction_layers: Dict[str, int] = {},
+ ) -> None:
+ """
+ This client implements the DEEP-MMD loss function in the Ditto framework.


Super minor, but is DEEP an acronym here? If not, I'd say we can use the capitalization scheme of Deep-MMD throughout?

emersodb · 2024-06-19T18:28:21Z

fl4health/clients/deep_mmd_clients/ditto_deep_mmd_client.py

+ deep_mmd_loss_weight (float, optional): weight applied to the DEEP-MMD loss. Defaults to 10.0.
+ flatten_feature_extraction_layers (Dict[str, bool], optional): Dictionary of layers to extract features
+ from them what is the flattened feature size. Defaults to {}. If it is -1 then the layer is not
+ flattened.


It looks like size_feature_extraction_layers is missing from the args documentation here, or maybe it's been squashed together with the docs for flatten_feature_extraction_layers?

emersodb · 2024-06-19T18:30:12Z

fl4health/clients/deep_mmd_clients/ditto_deep_mmd_client.py

+ lam: float = 1.0,
+ deep_mmd_loss_weight: float = 10.0,
+ flatten_feature_extraction_layers: Dict[str, bool] = {},
+ size_feature_extraction_layers: Dict[str, int] = {},


We noted that we don't want mutable defaults in the previous PR. Just wanted to put this here as a reminder to change these around too 🙂
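For reference, a minimal sketch of the usual `None`-default pattern. The `DeepMmdConfig` class is a hypothetical stand-in for the client constructor, not code from this PR; only the handling of the two dictionary arguments is the point.

```python
from typing import Dict, Optional


class DeepMmdConfig:
    """Hypothetical stand-in for the client constructor; only the default handling matters."""

    def __init__(
        self,
        flatten_feature_extraction_layers: Optional[Dict[str, bool]] = None,
        size_feature_extraction_layers: Optional[Dict[str, int]] = None,
    ) -> None:
        # Build a fresh dict per instance instead of sharing one default object
        # across every call that omits the argument.
        self.flatten_feature_extraction_layers: Dict[str, bool] = (
            dict(flatten_feature_extraction_layers) if flatten_feature_extraction_layers is not None else {}
        )
        self.size_feature_extraction_layers: Dict[str, int] = (
            dict(size_feature_extraction_layers) if size_feature_extraction_layers is not None else {}
        )


first = DeepMmdConfig()
second = DeepMmdConfig()

# Mutating one instance's dict must not leak into the other.
first.flatten_feature_extraction_layers["layer1"] = True
print(second.flatten_feature_extraction_layers)  # {}
```

With `= {}` in the signature, both instances would share the same dictionary object, so the mutation above would show up in `second` as well.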

emersodb · 2024-06-19T20:46:03Z

fl4health/clients/deep_mmd_clients/ditto_deep_mmd_client.py

+ features = self.local_feature_extractor.get_extracted_features()
+ if self.deep_mmd_loss_weight != 0:
+ # Compute the features of the init_global_model
+ _ = self.init_global_model(input)


I don't think you need to catch this with _, since you're not going to store it anyway.

emersodb · 2024-06-19T20:56:36Z

fl4health/clients/deep_mmd_clients/ditto_deep_mmd_client.py

+ EvaluationLosses: an instance of EvaluationLosses containing checkpoint loss and additional losses
+ indexed by name.
+ """
+ for layer in self.flatten_feature_extraction_layers.keys():


If you're going to be indexing into self.deep_mmd_losses anyway, could we simply do

    for layer_loss_module in self.deep_mmd_losses.values():
        layer_loss_module.training = False

For Ditto, we do this process in validate and train_by_steps/train_by_epochs for the global model, maybe we can just do this there?

I think it's still worth overriding compute_evaluation_loss and compute_training_loss and asserting that every layer_loss_module.training is False (or True, respectively), just to be safe 🙂

I also might be missing this, but I don't see where we set layer_loss_module.training to True in the client. Based on the loss code, this would mean that we won't run training of the deep kernels after the first server round, which I think we want to keep doing?
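A rough sketch of the toggle-and-assert idea from this comment, using a stand-in class rather than the real loss module. `FakeLayerLoss`, `set_deep_mmd_training`, and `assert_deep_mmd_training` are hypothetical names for illustration and are not part of fl4health; the layer names are made up as well.

```python
from typing import Dict


class FakeLayerLoss:
    """Hypothetical stand-in for a per-layer Deep-MMD loss module; only the flag matters."""

    def __init__(self) -> None:
        self.training = False


def set_deep_mmd_training(losses: Dict[str, FakeLayerLoss], mode: bool) -> None:
    # Flip the kernel-training flag on every per-layer loss module at once.
    for layer_loss_module in losses.values():
        layer_loss_module.training = mode


def assert_deep_mmd_training(losses: Dict[str, FakeLayerLoss], expected: bool) -> None:
    # Guard intended for the top of compute_training_loss / compute_evaluation_loss.
    for layer, layer_loss_module in losses.items():
        assert layer_loss_module.training == expected, f"Unexpected training flag for layer {layer}"


deep_mmd_losses = {"features": FakeLayerLoss(), "classifier": FakeLayerLoss()}

# Before train_by_steps / train_by_epochs: deep kernels should be trained.
set_deep_mmd_training(deep_mmd_losses, True)
assert_deep_mmd_training(deep_mmd_losses, True)

# Before validate: deep kernels should be frozen.
set_deep_mmd_training(deep_mmd_losses, False)
assert_deep_mmd_training(deep_mmd_losses, False)
```

Setting the flag back to True at the start of each training pass is what would keep the deep kernels training after the first server round.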

emersodb · 2024-06-19T21:14:57Z

fl4health/clients/deep_mmd_clients/ditto_deep_mmd_client.py

+ if self.deep_mmd_loss_weight != 0:
+ # Compute DEEP-MMD loss
+ total_deep_mmd_loss = torch.tensor(0.0, device=self.device)
+ for layer in self.flatten_feature_extraction_layers.keys():


As above, since we're not accessing components of flatten_feature_extraction_layers, it might be more straightforward to do:

    for layer, layer_loss_module in self.deep_mmd_losses.items():
        layer_deep_mmd_loss = layer_loss_module(
            features[layer], features[" ".join(["init_global", layer])]
        )
        additional_losses["_".join(["deep_mmd_loss", layer])] = layer_deep_mmd_loss
        total_deep_mmd_loss += layer_deep_mmd_loss

emersodb · 2024-06-19T21:40:26Z