HEREDITARY is receiving EOSC EU Node funding to advance Federated Learning experiments

The HEREDITARY project has been granted 40,000 credits from the European Open Science Cloud (EOSC EU Node) to support, among other things, its federated learning activities within a secure European research infrastructure.

About the EOSC EU Node

The European Open Science Cloud (EOSC EU Node) is the operational platform of the EOSC Federation, designed to facilitate open, collaborative and data-driven research in Europe. It supports multidisciplinary scientific work by providing access to digital research services such as computing and storage resources, containerized environments, and collaborative tools through institutional credentials. The platform promotes the sharing and reuse of research outputs in a secure, GDPR-compliant cloud ecosystem based on FAIR data principles and a credit-based access model.

The awarded credits will be used to deploy and maintain the central server required for federated learning experiments on EOSC virtual machines. Federated learning allows multiple institutions to collaboratively train machine learning models without sharing sensitive data. By using EOSC infrastructure, HEREDITARY can efficiently manage firewall configurations and incoming connections, overcoming common technical barriers associated with institutional IT restrictions.

The EOSC EU Node credits will support several weeks of experimentation, with server configurations adapted to different model sizes and algorithmic requirements. In parallel, HEREDITARY is exploring the development of an API-based solution that would allow researchers to deploy experiment-specific software containers through the EOSC Cloud Container platform. This approach aims to streamline workflows, facilitate testing and deployment, and potentially deliver open-source tools that could benefit the broader research community.

#DeCoding Federated Learning: from Infrastructure design to secure collaboration across Europe

How can hospitals collaborate on sensitive medical data without ever sharing the data itself? This is the core question behind Federated Learning (FL), and one of the key technological pillars of the HEREDITARY project.

Over the past two years, HEREDITARY has progressively designed, deployed and tested a federated learning infrastructure capable of connecting medical centres across Europe while ensuring that raw patient data never leaves its original location. What began as a technical design challenge has now evolved into a secure network supporting distributed machine learning experiments across heterogeneous datasets.

Building the Foundations: Computing Infrastructures

Federated Learning only works if each participating centre has the technical capacity to train models locally and communicate securely with the rest of the network. The first step was ensuring this. Under Deliverable D2.14 in Month 9 and lead by SURF, partners established secure computing infrastructures capable of handling sensitive clinical and genomic data, equipping centres with appropriate storage, processing power and secure communication channels. Thanks to this, data owners can process data locally, train models without centralising records and exchange model updates securely within the federation.

With local infrastructures in place, the next step was to design and validate the full federated learning architecture. Deliverable D2.11 in Month 18 presents a federated infrastructure that is secure, flexible and deployable across heterogeneous environments, including high-performance computing systems and cloud platforms. Encrypted communication via gRPC/TLS was implemented to protect model exchanges, while Secure Aggregation mechanisms (SecAgg/SecAgg+) were integrated to prevent the central server from accessing individual model updates.

The system was engineered to support both horizontal federated learning (same data types across centres) and vertical federated learning (different data modalities distributed across centres). Dedicated project workshops demonstrated that both approaches could run successfully across geographically distributed nodes, even when accounting for network latency between countries. By Month 18, HEREDITARY had a federated network capable of running both horizontal and vertical learning experiments on ALS data, without moving any raw records.

Securing the Communication: Communication Protocols

Security does not stop at this point. Deliverable D2.15 in Month 22 dives deeper into how model updates are protected during training. SURF analysed and validated advanced communication protocols within the federated learning framework. Three key mechanisms were the driving force behind this:

  • Secure Aggregation ensures that the server can combine model updates without seeing any individual contribution. Clients (Medical Centres) mask their updates using cryptographic techniques so that when all updates are aggregated, the masks cancel out, but no single update can be inspected independently. Tests showed no significant decrease in model performance, with only a modest increase in runtime due to additional communication steps.
  • Differential Privacy was also evaluated, introducing controlled noise to model updates to further reduce the risk of information leakage, again with minimal performance degradation.
  • Trusted Execution Environments were explored as an additional layer of security, though their hardware requirements make them less practical in heterogeneous clinical environments.

Beyond Simulation: paving the way for actual implementation

One key lesson emerging from this work is that federated learning is relatively straightforward in simulation, but deploying it across real institutions introduces new challenges: hardware variability, network latency across countries, IT coordination and regulatory compliance. Through interactive workshops and live experiments, HEREDITARY has moved beyond theoretical experimentation to operational deployment.

Today, the project operates a federated network linking multimodal clinical data without centralising any raw records. Advanced AI models can be trained across distributed datasets and privacy-enhancing technologies can be implemented with limited performance trade-offs. The infrastructure is reliable, secure and resilient. This “data stays at source” approach aligns closely with the principles of the European Health Data Space, demonstrating that privacy-preserving, cross-border health data collaboration is technically feasible.

The next step will arrive in June 2026, when the project moves from validated design to consolidated implementation. Deliverable D2.12 will formalise the full implementation of the federated infrastructure, while Federated Learning will demonstrate its clinical relevance through Deliverable D2.17, presenting intermediate results from the neurodegenerative use cases. Together, these upcoming milestone will mark a transition from infrastructure validation to scientific and clinical impact.

HEREDITARY reaches midterm with strong scientific progress and successful review

The European Horizon Europe project HEREDITARY has successfully reached Month 24 of its execution, marking the halfway point of its four-year duration. This milestone confirms the project’s strong progress and consolidates the solid foundations laid during its first two years of activity, with major deliverables completed and progress achieved.

The end of 2025 closed with particularly positive news for the consortium. HEREDITARY successfully passed its first periodic review at Month 18, with all deliverables approved. Both the external reviewers and the Project Officer praised the high quality of the work, the coherence of the technical developments, and the overall advancement of the project in line with its ambitious objectives.

In December, the consortium reached another remarkable achievement: 14 deliverables were submitted in a single day, representing the highest delivery peak foreseen throughout the entire project. These deliverables span all core scientific and technical work packages, covering clinical use cases, federated and privacy-preserving data infrastructures, semantic integration, advanced analytics, visualisation tools, citizen engagement, project management, and exploitation and intellectual property planning. Altogether, they account for more than 400 pages of technical and scientific results, reflecting an extraordinary collective effort by all partners. At the end of the article, you can review the complete list of all the reports submitted. Check them all out in the Deliverables section of our website.

Among the key achievements at this midpoint, there are also two important milestones: the first operational version of the federated workflow execution engine, enabling secure and distributed analysis across institutions, on top of the federated data management infrastructure, and the progress in data FAIRification, strengthening the discoverability and alignment of HEREDITARY data resources with European initiatives and standards. Both can be consulted in Deliverables 3.2 and Deliverable 3.6, respectively.

Reaching Month 24 represents not only a quantitative success in terms of deliverables and milestones, but also a qualitative one. The results produced so far demonstrate that HEREDITARY is effectively advancing towards its vision of building a federated, interoperable and privacy-preserving ecosystem for the integration and analysis of multimodal health data, with a particular focus on neurodegenerative and gut–brain related disorders.

Looking ahead, the consortium enters the second half of the project with a clear roadmap. The coming period will focus on maturing core scientific contributions, integrating results across work packages, and consolidating HEREDITARY into a coherent and impactful ecosystem.

14 Deliverables Submitted at M24 (December 2025)

DeliverableTitleBrief descriptionDissemination level
D1.5Risk Management Plan, 2nd reportUpdated analysis of project risks identified after the second year of implementation, including mitigation and contingency measures.EU Classified
D2.4Linkage and feature extraction from gut–brain, intermediate evaluationIntegrated brain–gut linkage and behavioural phenotyping to extract features for federated learning, including an intermediate evaluation at M24.Public (PU)
D2.22UCD clinical studies documentationRegulatory, ethical and data access documentation required for the UCD-led clinical studies, including approvals and MTAs where applicable.Public (PU)
D3.2Federated workflow execution methods: first releaseFirst release of the federated query execution engine, including intermediate implementations, optimisations, documentation and testing.Public (PU)
D3.6FAIRification of participating data resourcesReport on improvements in FAIRness of HEREDITARY data sources, with emphasis on discoverability and alignment with EU initiatives.Public (PU)
D3.11Pilot of the genomics data science ontology interconversionPilot demonstrator of a clinical ontology conversion tool enabling interoperability with genomic and other biomedical data.Public (PU)
D4.1KDE datasets and methods: first releaseOpen dataset including newly predicted links from the HEREDITARY knowledge graph using several knowledge graph embedding methods.Public (PU)
D4.3Learning models and spatio-temporal harmonizationDesign and first implementation of multimodal learning algorithms, self-supervised methods, and initial harmonisation libraries.Public (PU)
D5.2Demonstrator of visualization components for sequences, networks, text, and high dimensional dataSoftware libraries implementing visualisation components for heterogeneous data types, including sequences, networks and text.Public (PU)
D5.4Prototype of the visualization components for spatial, image, and simulation dataPrototype visualisation libraries addressing spatial data, biomedical images and simulation-based datasets.Public (PU)
D5.7Requirement analysis and user studies: Initial resultsInitial requirements analysis and early evaluation results derived from user studies of WP5 visual analytics tools.Public (PU)
D5.10First evaluation challenge: report on the data, results, and integration with EOSCReport on the first evaluation challenge, including datasets, results, open lab proceedings and integration within EOSC.Public (PU)
D6.7World café outcome: Priorities and gapsSynthesis of stakeholder perspectives collected during the World Café, identifying priorities and gaps relevant to HEREDITARY.Public (PU)
D8.5Mid Term IPR planMid-term Intellectual Property Rights plan outlining preliminary protection and exploitation strategies for project results.Sensitive (SEN)

Check them all in the Deliverables section of the website.