Data Export
Feature Overview
Data export is an important data delivery module of the IO data platform, providing functionality to export annotated data in multiple standard formats, including JSON, CSV, HDF5, LeRobot, MCap, etc. Through flexible filtering conditions, batch export functionality and export history management, it ensures annotated data can be delivered to downstream systems in the most suitable format, supporting model training, data analysis and various application scenarios.
Main Features
Multi-format Export Support
Standard Data Formats
Supports exporting to multiple standard data formats, including JSON (structured data), CSV (tabular data), HDF5 (scientific computing data), LeRobot (robot learning data), MCap (multimodal data), etc. These formats cover the vast majority of downstream application needs.
Custom Formats
Supports customizing export formats based on specific needs, including field selection, data conversion, format configuration, etc. Through custom formats, meet data export needs for special scenarios.
Format Conversion
Provides intelligent format conversion functionality, allowing data conversion from one format to another, ensuring data compatibility between different systems. Conversion process supports data validation and quality checking.
Flexible Filtering Function
Multi-dimensional Filtering
Supports data filtering by multiple dimensions such as project, time, annotator, quality level, etc. Through flexible filtering conditions, precisely select data that needs to be exported.
Advanced Filtering
Provides advanced filtering functionality, supporting complex filter condition combinations, including logical operations, range filtering, fuzzy matching, etc. Advanced filtering allows precise control of exported data range.
Preview Function
Provides data preview functionality before export, allowing viewing of filter results and confirming exported data meets expectations. Preview functionality avoids unnecessary export operations.
Batch Export Management
Batch Processing
Supports batch export of multiple datasets, can simultaneously process multiple export tasks, greatly improving export efficiency. Batch processing is particularly suitable for large-scale data export scenarios.
Task Queue
Provides export task queue management, supporting queuing and execution of multiple export tasks. Through task queue, orderly process large numbers of export requests.
Progress Monitoring
Real-time monitoring of export progress, including completed quantity, processing speed, estimated completion time, etc. Through progress monitoring, timely understand export status.
Export History Management
History Records
Records history of all export operations, including export time, export format, data volume, operator, etc. Through history records, track data usage.
Version Management
Supports version management of exported data, can save different versions of export results, facilitating data backtracking and comparison. Version management ensures data traceability.
Permission Control
Provides fine-grained permission control, can set different user export permissions for different data. Through permission control, ensure data security and prevent unauthorized export.
Applicable Roles
Administrator
As a platform administrator, you can deliver training data or downstream analysis required data externally, manage export tasks, monitor export progress, and control data export permissions. These functions ensure the platform's data delivery service is secure and efficient.
Project Manager
Project managers can export data related to projects, prepare data for project delivery, monitor data usage, and coordinate data export work. Through data export management, project managers can effectively control project data delivery.