Skip to main content
Stratosphere Lab

IoT-23: A Labeled Dataset with Malicious and Benign IoT Network Traffic

Network Security IoT
237 views
2 min read
License

Abstract

"Comprehensive dataset from Stratosphere Laboratory containing network traffic from 23 IoT malware captures including Mirai and Torii botnets, with over 325 million labeled connections for cybersecurity research and ML-based threat detection."

Description

Dataset Overview

The IoT-23 dataset is a comprehensive collection of network traffic from Internet of Things devices infected with malware. Created by the Stratosphere Laboratory, it contains 23 different malware captures plus 3 benign scenarios, representing one of the most extensive labeled IoT botnet datasets available for research purposes.

Key Features

  • 23 malware infection scenarios with real IoT botnets
  • 3 benign traffic scenarios for baseline comparison
  • Over 325 million labeled network connections
  • PCAP files with complete packet captures
  • Bidirectional NetFlow data (argus format)
  • Labeled connections (Malicious, Benign, or Unknown)
  • Multiple malware families: Mirai, Torii, Hide and Seek, Hakai
  • Real IoT devices used: Philips HUE, Amazon Echo, Somfy doorlock

Data Structure

Each scenario in the dataset includes multiple data formats:

  • PCAP Files: Complete packet captures for deep analysis
  • Conn.log Files: Connection summaries in Zeek/Bro format
  • Labeled Flows: CSV files with labeled connections
  • Metadata: Information about infection type and device
  • Network Features: Duration, protocol, packets, bytes, ports
  • Behavioral Labels: Malicious, Benign, Background, or Unknown

Data Collection Method

The dataset was created by intentionally infecting real IoT devices with malware in a controlled laboratory environment. Traffic was captured during the infection process and normal operation, providing authentic examples of IoT botnet behavior. All captures were performed with proper isolation to prevent actual attacks.

Malware Families Included

  • Mirai: Famous IoT botnet targeting cameras and routers
  • Torii: Advanced persistent IoT botnet
  • Hide and Seek: P2P-based IoT botnet
  • Hakai: Variant of Mirai targeting specific devices
  • Others: Various IoT-specific malware strains

Research Applications

  • IoT botnet detection and classification
  • Malware behavior analysis in IoT environments
  • Development of network-based intrusion detection systems
  • Machine learning model training for threat detection
  • IoT security protocol evaluation
  • Behavioral analysis of infected IoT devices
  • Comparative studies of different malware families

Machine Learning Use Cases

  • Binary classification (Malicious vs. Benign traffic)
  • Multi-class malware family classification
  • Anomaly detection in IoT network behavior
  • Time-series analysis of infection patterns
  • Deep learning for packet-level analysis
  • Feature engineering from network flows
  • Botnet command and control detection
  • Zero-day malware detection using behavioral models

View Data Structure

To explore column names, data types, and sample rows, visit the official dataset page on Stratosphere Lab.

Preview on Stratosphere Lab

Cite This Dataset

Stratosphere Lab (2026). IoT-23: A Labeled Dataset with Malicious and Benign IoT Network Traffic. [Dataset]. Stratosphere Lab. https://www.stratosphereips.org/datasets-iot23

Source: Stratosphere Lab (2026)

Indexed by IoTDataset.com on Jan 16, 2026

Ready to Start Your Research?

Download this dataset directly from the official repository and start building your next breakthrough project.

Download Dataset

Related Topics & Keywords

Share This Research

More in Network Security IoT

View All
Accelerometer / Gyroscope / Posture Analysis Mendeley Data

MPU6050 Gyroscope Data for Posture Tracking and Analysis [Mendeley Data, March 2025]

Real-time posture monitoring dataset from the MPU6050 6-axis sensor (3D gyroscope + 3D accelerometer) attached to the upper body, streaming wirelessly via Phyphox. Angular orientation values: Angle_X, Angle_Y, Angle_Z. Mendeley Data, March 2025 — DOI: 10.17632/ftn4rjnd6x.1. Used for LSTM-based posture classification and ergonomics.

May 10, 2026
Network Security IoTSyn Generated

IIoT Network Traffic - 21% Attacks [4K rows] #6a63

Synthetic Network Security dataset with 4,000 data points. 16 columns. Config: Attack rate: 21%. CC0 licensed.

May 04, 2026
Household Electric Power Consumption University

IDEAL Household Energy Dataset: 255 UK Homes, Electricity, Gas & Sensors [Nature Scientific Data]

Comprehensive energy dataset from 255 UK homes covering electricity, gas, room temperature, humidity, and appliance-level data for a 39-home sub-cohort. 23 months of data. CSV format. Published on Edinburgh DataShare (DOI: 10.7488/ds/2836). Described in Nature Scientific Data 2021.

May 04, 2026
Household Electric Power Consumption Zenodo

Energy Consumption Dataset for Smart Homes — 5 Appliances, Individual Smart Meters [Zenodo 2025]

Real IoT smart home energy dataset from a testbed of 5 household appliances each connected to an individual smart meter. 507 KB CSV. Published January 2025 on Zenodo. Used for appliance-level energy monitoring, household energy disaggregation, and smart meter analytics.

May 04, 2026