A national guideline for building a data standards system was jointly released on October 8, by six government bodies, including the National Development and Reform Commission, the National Data Administration, the Office of the Central Cyberspace Affairs Commission, and the Ministry of Industry and Information Technology.
The aim is to create a data-driven digital economy, positioning data as a critical resource and innovation driver for future growth.
The guideline sets a clear goal: by the end of 2026, China will largely have established a national data standards system.
This system will encompass more than 30 fundamental and universal national standards in various key areas of the data field, such as data circulation infrastructure, data management, data services, training datasets, public data authorization and operations, data rights confirmation, data resource pricing, and corporate data transaction models.
According to the construction framework outlined in the guideline, the data standards system is divided into seven key sections: foundational universal standards, data infrastructure, data resources, data technology, data circulation, integrated applications, and security assurance.
The guideline also provides a detailed breakdown of the content within the national data standards system. Specifically:
●Data infrastructure standards cover storage and computing facilities, network infrastructure, and circulation and utilization facilities.
●Data resource standards focus on foundational resources, development and utilization, data subjects, data governance, and training datasets.
●Data technology standards encompass data aggregation, processing, circulation, application, operation, and destruction technologies.
●Data circulation standards include data products, data rights confirmation, data resource pricing, and data circulation and transactions.
For example, the standards for network infrastructure, particularly for 5G data transmission, regulate the access, transmission, and management of 5G network data, including 5G network data management, access requirements, transmission quality control, protocols, functionality testing, and performance testing.
Regarding training datasets, the standards focus on the collection and processing of datasets used for large-scale model training, covering aspects such as dataset format requirements, classification and grading, collection performance, analysis and monitoring, and quality requirements.
To promote compliant data circulation and trading, the guideline encourages the exploration of diversified methods for data circulation and trading, and supports mutual recognition and interconnection among data trading institutions and data circulation and trading platforms.
In respect of core technologies, the guideline proposes promoting the coordinated development of cloud-edge-terminal computing technologies to form computing service capabilities that accommodate large-scale data aggregation, real-time analysis, and intelligent applications.
It also emphasizes strengthening R&D in trustworthy storage technologies to support large-scale, real-time cross-domain data storage and flow, thereby increasing the proportion of intelligent storage utilization.