Data Management

Feature Overview

Data management is the core module of the IO data platform and covers the full data lifecycle. Here you can centrally manage all data files and retrieve, filter, preview, annotate, and batch-process them. It is the starting point of the data annotation workflow.


Main Features

Data Browsing and Retrieval

Project Filtering

Data management supports multiple project views: you can view data from all projects, select specific projects to view their data, access your personal private data, or browse data shared with your team. This flexible filtering lets users in different roles quickly find the data they need.

Advanced Search Function

The system supports fuzzy and exact matching on data names, along with filtering by source robot, annotation tag, upload time, and file format (MCAP, BAG, video, audio, images). Search conditions can be combined to pinpoint the data you need.
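To illustrate how combined search conditions narrow a result set, here is a minimal sketch in Python. It is not the platform's API: `DataRecord`, `SearchQuery`, and `search` are hypothetical names, fuzzy matching is approximated by a case-insensitive substring test, and the upload-time filter is assumed to be a simple "on or after" cutoff. Every condition that is set must hold for a record to match.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass(frozen=True)
class DataRecord:
    """Hypothetical stand-in for one data file listed in data management."""
    name: str
    robot: str
    tags: frozenset
    uploaded_at: datetime
    file_format: str  # e.g. "MCAP", "BAG", "video"


@dataclass
class SearchQuery:
    """Each field is optional; unset fields do not restrict the search."""
    name_contains: Optional[str] = None   # fuzzy match, approximated as substring
    name_equals: Optional[str] = None     # exact match
    robot: Optional[str] = None
    tag: Optional[str] = None
    uploaded_after: Optional[datetime] = None
    file_format: Optional[str] = None

    def matches(self, rec: DataRecord) -> bool:
        # A record matches only if every condition that was set is satisfied.
        checks = [
            self.name_contains is None
            or self.name_contains.lower() in rec.name.lower(),
            self.name_equals is None or self.name_equals == rec.name,
            self.robot is None or self.robot == rec.robot,
            self.tag is None or self.tag in rec.tags,
            self.uploaded_after is None or rec.uploaded_at >= self.uploaded_after,
            self.file_format is None or self.file_format == rec.file_format,
        ]
        return all(checks)


def search(records, query):
    """Return the records that satisfy all conditions in the query."""
    return [r for r in records if query.matches(r)]
```

For example, `search(records, SearchQuery(name_contains="pick", robot="robot-a"))` would return only records whose name contains "pick" and whose source robot is `robot-a`; the robot identifier here is made up for illustration.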

Status Filtering

You can filter data by assignment status (assigned or unassigned), annotation status (annotated or not annotated), and quality status (high quality, low quality, or pending review) to quickly find data that meets specific conditions.

Data Preview and Playback

Data Preview

The system displays thumbnails so you can quickly browse data content. Alongside each item it shows basic information such as file size, duration, and upload time, and metadata such as robot information and collection parameters, giving you a fuller picture of the data.

Online Playback

The system supports online playback of video files, audio files, and visualized robot data in MCAP format. The player provides controls such as pause, fast forward, slow motion, and loop playback, so you can view and analyze data content flexibly.

Batch Operation Functions

Data Management Operations

The system supports batch operations such as renaming, viewing statistics, managing tags, deleting data, importing external data, and associating robot devices. These functions are especially useful when handling large amounts of data.

After selecting data, you can create an annotation task with one click or append the data to an existing annotation task. You can also view the annotation results, progress, and quality statistics for the selected data.

Data Download and Export

File Download

You can download original data files and converted MCAP files, and download multiple files in one batch as a ZIP archive.
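As a rough illustration of what a batch ZIP download involves, the sketch below packs several files into one archive using Python's standard `zipfile` module. The function name `build_zip` and the file names are hypothetical, and the platform's actual packaging may differ, for example in compression settings or folder layout.

```python
import io
import zipfile


def build_zip(files: dict[str, bytes]) -> bytes:
    """Pack {file name: file content} into a single in-memory ZIP archive."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for name, content in files.items():
            # Each entry keeps its own name inside the archive.
            archive.writestr(name, content)
    return buffer.getvalue()


# Hypothetical file names and contents, for illustration only.
archive_bytes = build_zip({
    "run_001.mcap": b"example mcap bytes",
    "run_002.mcap": b"more example bytes",
})
```

Building the archive in memory keeps the sketch self-contained; for large robot recordings, writing to a file or streaming the archive would be the more practical choice.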

Data Export

The system can export annotation results, statistical reports, and metadata. Exported data can be used directly for model training, data analysis, or other purposes.

Data Quality Monitoring

Quality Indicators

The system continuously monitors key indicators such as annotation completion rate, quality pass rate, annotation efficiency, and abnormal data, giving you an overview of data quality.
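The exact formulas behind these indicators are not given here. As one plausible reading, the sketch below computes a completion rate and a pass rate from the annotation and quality statuses described under Status Filtering. The definitions are assumptions: completion rate is taken as annotated items over all items, and pass rate as high-quality items over items that have already been reviewed (pending items are excluded). `ItemStatus` and both function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ItemStatus:
    """Hypothetical per-item status record."""
    annotated: bool
    quality: str  # "high", "low", or "pending" (pending review)


def completion_rate(items):
    """Assumed definition: share of all items that have been annotated."""
    if not items:
        return 0.0
    return sum(1 for i in items if i.annotated) / len(items)


def pass_rate(items):
    """Assumed definition: share of reviewed items that were rated high quality."""
    reviewed = [i for i in items if i.quality in ("high", "low")]
    if not reviewed:
        return 0.0
    return sum(1 for i in reviewed if i.quality == "high") / len(reviewed)
```

Under these assumed definitions, a set of four items with three annotated, of which one is rated high, one low, and one still pending review, has a completion rate of 0.75 and a pass rate of 0.5.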

Quality Analysis

Quality trend analysis, annotator performance comparison, and problem analysis help you understand how data quality changes over time, identify improvement opportunities, and raise overall annotation quality.

Applicable Roles

Administrator

As a platform administrator, you can view and manage data from all projects, monitor overall data quality, and assign data to different projects. You can also perform system maintenance, such as cleaning up invalid data and optimizing storage space.

Project Manager

Project managers can manage the data for the projects they are responsible for, select data to create annotation tasks, monitor annotation progress, and ensure annotation quality. The data management module gives project managers a full view of the status of their project data.