AI4I 2020 Predictive Maintenance — Milling Machine Sensor Failures [10,000 Records]
Abstract
"Synthetic IIoT dataset reflecting real milling machine predictive maintenance scenarios. 10,000 records with 14 features including air temperature, process temperature, rotational speed, torque, and 5 labeled failure types. CSV format. Ideal for multi-label fault classification."
Description
Overview
The AI4I 2020 Predictive Maintenance Dataset is a synthetic dataset carefully designed to reflect real industrial predictive maintenance data encountered in milling machine operations. It was created because actual industrial maintenance datasets are rarely publicly available due to proprietary constraints, yet there is strong demand for labeled IIoT failure data in machine learning research.
The dataset models five distinct failure modes — Tool Wear Failure (TWF), Heat Dissipation Failure (HDF), Power Failure (PWF), Overstrain Failure (OSF), and Random Failure (RNF) — alongside a combined machine failure label. Each record includes air and process temperature, rotational speed, torque, and tool wear duration, making it a compact but highly structured multi-label classification benchmark.
With 10,000 records, no missing values, and clear feature-label relationships, AI4I 2020 is widely used to benchmark and compare ML classifiers including Random Forest, XGBoost, SVM, and neural networks for IIoT fault detection and root-cause analysis.
Column Schema
| Column | Description |
|---|---|
| UDI | Unique identifier ranging from 1 to 10,000. |
| Product_ID | Product quality variant: L (low), M (medium), H (high). |
| Air_temperature_K | Air temperature in Kelvin (generated via random walk). |
| Process_temperature_K | Process temperature in Kelvin. |
| Rotational_speed_rpm | Rotational speed in RPM (calculated from power). |
| Torque_Nm | Torque in Newton-metres. |
| Tool_wear_min | Tool wear duration in minutes. |
| Machine_failure | Binary combined machine failure label (0/1). |
| TWF / HDF / PWF / OSF / RNF | Individual binary failure mode labels (5 types). |
Key Statistics
- Total Records: 10,000
- Features: 14 columns
- Failure Types: 5 individual modes + 1 combined label
- Missing Values: None
- File Format: CSV
- Donated: August 2020
Use Cases
- Multi-label failure mode classification for CNC milling machines
- Benchmarking ML/DL classifiers on IIoT predictive maintenance tasks
- Feature importance analysis for industrial sensor-driven fault detection
- Root cause analysis of manufacturing equipment failures
Source & Attribution
The dataset was created by Stephan Matzka at the Berlin School of Economics and Law and donated to the UCI Machine Learning Repository. It is available via UCI and widely referenced in Industry 4.0 and IIoT machine learning papers.
View Data Structure
To explore column names, data types, and sample rows, visit the official dataset page on UCI.
Preview on UCICite This Dataset
Matzka, Stephan (2020). AI4I 2020 Predictive Maintenance — Milling Machine Sensor Failures [10,000 Records]. [Dataset]. UCI Machine Learning Repository. https://archive.ics.uci.edu/ml/datasets/ai4i+2020+predictive+maintenance+dataset
Source: UCI Machine Learning Repository (2020)
Indexed by IoTDataset.com on Apr 10, 2026
Ready to Start Your Research?
Download this dataset directly from the official repository and start building your next breakthrough project.