Skip to main content

Data Import

Data import is used to batch import data from external systems to the platform. Mainly supports two scenarios: importing MCap files from local IO-AI Agent devices, and importing LeRobot format datasets.

Typical use cases:

  • Offline Collection Data Import: Batch import data from local collection devices
  • External Dataset Import: Import LeRobot format datasets
  • Data Migration: Migrate data from other systems to platform

Data Import Interface

Data Ingestion Overview

Data can enter the platform via IO Agent, LeRobot packages, or local upload; after ingestion or transcoding it appears in Data Management for annotation and export.

Quick Start: Import Data from IO-AI Agent

What is IO-AI Agent?

IO-AI Agent is software running on local devices, used to manage collected MCap files. Through data import functionality, these files can be batch imported to the platform.

Import Steps

Step 1: Configure Agent Address

  1. On import page, enter IO-AI Agent service address
  2. System will automatically detect Agent service status
  3. After connection succeeds, can browse files on Agent device

Step 2: Select Files to Import

  1. Browse MCap files on Agent device
  2. Display file size, creation time and other metadata
  3. Support search and filter files by name
  4. Check files that need to be imported

Step 3: Select Storage Method

Each file can choose two storage methods:

  • Cloud: Download file and upload to cloud storage (recommended)

    • File will be downloaded from Agent device
    • Then uploaded to configured cloud storage
    • Suitable for data that needs long-term storage
  • Local: Only create dataset record, file remains on Agent device

    • Don't download file, only create metadata
    • File access depends on Agent device being online
    • Suitable for temporary data or saving storage space

Step 4: Start Import

  1. Click "Cloud" or "Local" button to start import
  2. System will display import progress
  3. After import completes, files will appear on Data Management page

Batch Import

Batch Operations:

  • Can check multiple files for batch import
  • Support batch select all files
  • Batch import will process one by one in order

Import Queue:

  • Import tasks will join queue, execute in order
  • Can view import status of each file
  • Support canceling ongoing import tasks

LeRobot Format Import (New in 3.4.0)

What is LeRobot Format?

LeRobot is a popular robot learning framework. If you have LeRobot format datasets, you can directly import them to the platform.

Supported Formats:

  • LeRobot standard folder structure
  • Includes images, videos and annotation data
  • Support compressed package (tar.gz) format import

Format Requirements:

  • Comply with LeRobot standard folder structure
  • Include necessary metadata file (meta/info.json)
  • Annotation data format is correct

Import Steps

  1. Select Data Source: Select LeRobot format folder or compressed package
  2. Format Validation: System automatically identifies format and validates data integrity
  3. Data Parsing: Extract metadata and annotation information
  4. Create Dataset: Automatically create dataset and associate annotations
  5. Complete Import: After import completes, can view in data page
tip

LeRobot Import Notes:

  • Before import, ensure folder structure complies with LeRobot standards
  • Support batch import of multiple folders
  • Import process automatically validates data integrity

Import Management

How to View Import Progress?

Task Status:

  • Pending: Task created, waiting for execution
  • Processing: Downloading or uploading files
  • Completed: File successfully imported, dataset created
  • Failed: Error occurred during processing, can view error information

Progress Information:

  • Real-time display of each file's processing status
  • Display upload progress percentage
  • Display number of processed files and total number
  • Display estimated remaining time

Import Task Queue

Queue Functions:

  • Display list of all import tasks
  • Support filter by status (pending, processing, completed, failed)
  • Support search specific tasks
  • Display task creation time and processing progress

Task Operations:

  • View Details: View task details and included files
  • Cancel Task: Cancel ongoing tasks
  • Retry Task: Retry failed tasks
  • Delete Task: Delete completed tasks

Error Handling

Common Error Types:

  • Network Error: Download or upload failed, support retry
  • Format Error: File format incorrect, need to check file
  • Storage Error: Cloud storage configuration issue, need to check configuration
  • Data Error: Data corrupted or format incompatible

Error Recovery:

  • Auto Retry: Temporary errors can auto recover
  • Manual Retry: Failed tasks can be re-executed
  • Error Logs: Record detailed error information for troubleshooting

Common Questions

What to Do When Agent Connection Fails?

Possible Causes:

  1. Address Error: Check if Agent service address is correct
  2. Network Unreachable: Confirm if browser can access Agent address
  3. Service Not Started: Confirm if IO-AI Agent software is running
  4. Firewall Blocking: Check firewall settings

Solution:

  1. Directly access Agent address in browser, confirm if accessible
  2. Check if Agent software is running normally
  3. Confirm network connection is normal
  4. If problem persists, contact technical support

What to Do When Import Speed is Slow?

Possible Causes:

  1. Network Bandwidth: Insufficient network bandwidth affects download and upload speed
  2. File Size: Large files need more time
  3. System Load: May be slower when system is busy

Optimization Recommendations:

  1. Check network connection, ensure sufficient bandwidth
  2. Large files recommend batch import
  3. Avoid system peak hours for import
  4. Using "Local" storage method can skip upload step

What to Do When Import Fails?

Handling Steps:

  1. View error information to understand failure reason
  2. Take corresponding measures based on error type:
    • Network error: Check network connection then retry
    • Format error: Check if file format is correct
    • Storage error: Check cloud storage configuration
  3. Click "Retry" button to re-import
  4. If problem persists, contact technical support

How to Know if File Has Been Imported?

Check Method:

  1. On import page, imported files will show "Completed" status
  2. Search file name on Data Management page, confirm dataset has been created
  3. View import history records, confirm import success

Duplicate Import:

If file already exists, system will detect and prompt. Can choose:

  • Skip: Don't duplicate import
  • Overwrite: Re-import and overwrite existing data

Applicable Roles

Administrator

You can:

  • Centrally import offline collection data
  • Standardize data entry process
  • Monitor import progress
  • Handle issues during import process
  • Configure Agent connection and cloud storage

Project Manager

You can:

  • Import project-related data
  • Manage import tasks
  • Monitor import progress
  • Ensure data quality

Collector

You can:

  • Import data from collection devices
  • Batch import data from collection tasks
  • View import status and progress
  • Handle import errors

After completing data import, you may also need: