We are excited to announce the release of OpenFL 1.6! This release brings the following changes.
New Features and APIs:
-
Federated LLM fine-tuning:
- Horovod: Use horovod to efficiently train LLMs across multiple private clusters
- Neuralchat-7b fine-tuning: Learn how to fine-tune neuralchat-7b using the Intel® Extension for Transformers and the workflow interface.
-
Workflow API enhancements: Introducing an experimental Workspace Export feature that can be used to transform a Workflow API-based FL experiment into the TaskRunner API format for running in a distributed deployment. There is also groundwork laid for a future FederatedRuntime implementation for Workflow API, in addition to the currently supported LocalRuntime.
-
Federated Evaluation: Federated evaluation allows for the assessment of ML models in a federated learning system by validating the model's performance locally on decentralized collaborator nodes, and then aggregating these metrics to gauge overall effectiveness, without compromising data privacy and security. FE is now officially supported by OpenFL, including example tutorials on how to use this new feature (via TaskRunner API).
-
Expanded AI Accelerator Support: Intel® Data Center GPU Max Series support via the Intel® Extension for PyTorch, including examples for training on datasets such as MNIST (via Workflow API) and TinyImageNet (via Interactive API)
-
Improved straggler collaborator handling: Improvements and bug fixes to aggregator’s fault-tolerance when collaborators stop responding or drop out of a federation. Introducing a cut-off timer-based policy and enabling other policies to be plugged-in. This capability is particularly relevant for large or geo-distributed federations.
-
fx CLI Improvements: We have separated the CLI commands are separated for creating a collaborator from the certificate request generation, for offering improved control of the participant setup workflow.
Highlighting Community Research
- Mitigating backdoor attacks in FL - Workflow API example contributed by @perieger (TU Darmstadt) demonstrates how Crowdguard can be used to leverage clients' feedback on individual models, analyze the behavior of neurons in hidden layers, and eliminate poisoned models through an iterative pruning scheme.
Enhanced Documentation:
- Quickstart Tutorial: An updated TaskRunner API quickstart tutorial makes it easier for newcomers to get up and running with built-in workspaces or custom code.
- FL Plan Description Documentation: There is now a detailed FL plan description in the to our documentation to help users better understand and configure the federated learning process.
New Contributors
Thank you and welcome to our 30 first time contributors In OpenFL 1.6!
@ptizzza, @grib0ed0v, @joedevon, @akantak, @wangleflex, @fangxiaoran, @danhe1, @bjklemme-intel, @fstrr, @KeertiX, @porteratzo, @amitport, @ashahba, @andreazanetti, @GabrieleRoncolato, @orionsBeltWest, @sbakas, @perieger, @manuelhsantana, @Hmikihiro, @VukW, @aryanxk02, @pboushy, @ParthMandaliya, @yanmxa, @tonywjs, @theakshaypant, @ishaileshpant, @rajithkrishnegowda, @teoparvanov
Full Changelog: v1.5...v1.6