Skip to main content
Scientific Reports (Nature Publishing Group)

Dataset-Centric Evaluation of Federated Intrusion Detection Models in IoT Networks

Abstract

"A federated learning evaluation across several contemporary IoT and IIoT intrusion detection datasets, benchmarking algorithms such as FedAvg, FedProx, and FedNova with LSTM and Transformer models in in-domain, cross-dataset, and multi-dataset federation scenarios."

Description

Overview

Dataset-Centric Evaluation of Federated Intrusion Detection Models in IoT Networks is a Scientific Reports study that systematically evaluates federated learning approaches for IoT intrusion detection across heterogeneous datasets. It highlights how dataset characteristics influence federated model performance and generalization.

Research Motivation

  • Most prior evaluations use a single dataset, risking overfitting to specific traffic patterns.
  • Federated learning allows collaborative IDS training without sharing raw traffic, preserving privacy.
  • The study investigates how well federated IDS models generalize across diverse IoT datasets.

Datasets and Attack Profiles

  • Multiple recent IoT and IIoT intrusion datasets are leveraged, each with distinct traffic profiles, attacks, and feature representations.
  • Datasets are chosen to capture attack diversity and realistic IoT conditions.
  • Label and feature harmonization is performed to enable multi-dataset federation.

Federated Learning Methodology

  • Algorithms evaluated include FedAvg, FedProx, and FedNova.
  • Model architectures include LSTM and Transformer networks.
  • Clients are constructed to reflect non-IID scenarios typical of IoT deployments.

Evaluation Scenarios and Findings

  • In-domain, out-of-domain, and multi-dataset federation scenarios are explored.
  • Results show strong in-domain performance and highlight challenges for cross-dataset generalization.
  • Attack diversity and dataset heterogeneity are shown to be critical factors.

Use Cases

  • Privacy-preserving collaborative IDS across organizations or deployments.
  • Assessing IDS robustness when deployed in new IoT environments.
  • Guiding the design of future IoT security datasets tailored for federated learning.

Access and License

The methodology and results are published in Scientific Reports under open access. The evaluated datasets are publicly available from their respective sources and must be accessed under their own licenses.

View Data Structure

To explore column names, data types, and sample rows, visit the official dataset page on Scientific Reports (Nature Publishing Group).

Preview on Scientific Reports (Nature Publishing Group)

Cite This Dataset

Bilal, Muhammad Ahmad, Islam, Ihtesham Ul, & others (2026). Dataset-Centric Evaluation of Federated Intrusion Detection Models in IoT Networks. Scientific Reports. [Dataset]. Scientific Reports (Nature Publishing Group). https://doi.org/10.1038/s41598-025-32567-w

Source: Scientific Reports (Nature Publishing Group) (2026) · DOI: 10.1038/s41598-025-32567-w

Indexed by IoTDataset.com on Feb 03, 2026

Ready to Start Your Research?

Download this dataset directly from the official repository and start building your next breakthrough project.

Download Dataset

Related Topics & Keywords

Share This Research

More in IoT Network Security / Federated Learning

View All
Cybersecurity Zenodo

Gotham Dataset 2025 - Large-Scale IoT Network Intrusion Detection

Reproducible large-scale IoT network dataset from 78 emulated devices using MQTT, CoAP, and RTSP protocols. Includes benign and malicious traffic with DoS, brute force, scanning, and C&C attacks in PCAP and CSV formats.

Mar 20, 2026
Cybersecurity Kaggle

Gotham Dataset 2025: Large-Scale Federated IoT IDS Benchmark

The Gotham Dataset is a large-scale, reproducible benchmark for evaluating decentralized Intrusion Detection Systems (IDS) and Federated Learning in virtualized smart cities. It captures interface-level network traffic from 78 heterogeneous IoT devices, including complex attack vectors like Mirai botnets, Merlin C2 traffic, and CoAP amplification, preserving the non-IID nature of edge data for realistic AI security training.

Feb 05, 2026
Cybersecurity Research Paper

Federated IoT Intrusion Detection Dataset - Privacy-Preserving Security

Dataset for evaluating federated learning approaches to IoT intrusion detection published in Nature Scientific Reports January 2026. Features distributed network traffic from multiple IoT deployments with privacy constraints and decentralized learning evaluation metrics.

Jan 24, 2026
Industrial IoT University

Edge-IIoTset - Comprehensive Cyber Security Dataset for IoT and IIoT

Realistic cybersecurity dataset with 14 attack types from 10+ IoT/IIoT device types including sensors, actuators, and industrial controllers. Supports centralized and federated learning with 61 optimized features.

Jan 22, 2026