Skip to main content

Data Management

When hundreds or thousands of datasets have accumulated in the platform, how do you quickly find the one you need?

Common scenarios:

  • Project managers need to find all unannotated data in a project to assign to annotators
  • Annotators want to view their own annotation progress
  • Administrators need to count data collected by a specific robot during a specific time period
  • Training engineers need to export completed annotation data for model training

The Data Management page is designed to solve these problems. It provides powerful search, filtering, and batch operation capabilities, allowing you to quickly locate targets in massive data.

Data Management Interface

Quick Start: 3 Steps to Find Target Data

Step 1: Select Project Scope

The project selector at the top of the page lets you decide which data to view:

  • All Projects: View all data you have permission to access (administrators and project managers)
  • Specific Project: Only view data from a specific project, narrowing search scope
  • Personal Space: View your personally uploaded private data

💡 Tip: If data volume is large, selecting a specific project first can significantly improve loading speed.

Step 2: Use Search and Filter

Enter keywords in the search bar, and the system will search for matches in data names. Supports fuzzy matching, for example, entering "grasp" can find all datasets containing "grasp".

Quick Filter Buttons:

  • Assigned/Unassigned: Quickly distinguish which data has been assigned to annotation tasks
  • Annotated/Unannotated: View annotation completion status

Click buttons to toggle filter status, click again to cancel filter.

Advanced Search:

Click the expand button on the right side of the search bar to set more precise search conditions:

  • Data Name: Supports fuzzy matching and exact search
  • Source Robot: Filter by robot that collected the data
  • Tag Filter: Filter by data tags (such as "high quality", "test data", etc.)
  • Upload Time: Select time range, such as "last week", "last month"
  • File Format: MCAP, BAG, video, audio, images, etc.
  • Duration Range: Filter data with specific duration (in seconds)

These conditions can be combined. For example: Find "Robot A" collected "unannotated" "MCAP format" data from "last month".

Advanced Search Interface

Step 3: Preview and Select Data

In the data list, each dataset displays:

  • Thumbnail: Quick preview of data content
  • Basic Information: Name, size, duration, upload time
  • Status Indicators: Whether assigned to task, whether annotated
  • Tags: Data classification tags

Click dataset to view details, check checkbox to select multiple datasets for batch operations.

Data List

Advanced Usage

How to Batch Manage Data?

Scenario: You need to rename 50 datasets, or add tags to them uniformly.

Operation Steps:

  1. Use search and filter to find target datasets
  2. Check datasets that need operation (supports select all)
  3. Click corresponding button in bottom operation bar

Supported Batch Operations:

OperationUse CaseNotes
Batch RenameUnify naming standardsWill pop up dialog, can modify names one by one
View StatisticsUnderstand data overviewDisplay total size, total duration, annotation count, etc.
Manage TagsBatch classify dataCan add or remove tags
Create Annotation TaskQuickly assign workOne-click task creation after selecting data
Append to TaskSupplement existing tasksAdd new data to existing annotation tasks
Delete DataClean up unnecessary dataSoft delete, can be recovered from trash
Associate RobotMark data sourceAdministrator permission, for data traceability
Update MetadataFix metadata errorsRe-extract file information

Batch Operation Interface

⚠️ Note: Batch delete operations will go to trash and can be recovered. But deleting annotations is irreversible, please use with caution.

How to View and Play Data?

Online Preview:

Click dataset name or thumbnail to view detailed information:

  • Basic Information: File size, duration, format, upload time
  • Robot Information: Collection robot, collection parameters
  • Annotation Statistics: Annotation count, annotation progress
  • Task Association: Associated annotation tasks

Online Playback:

Supports online playback of multiple formats:

  • Video Files: MP4, AVI, MOV, etc., support pause, fast forward, slow motion
  • Audio Files: MP3, WAV, etc., support waveform display
  • MCAP Data: Robot data visualization playback, support 3D scene rendering

The player provides complete control functions, allowing you to view data content without downloading.

Data Playback Interface

How to Handle Accidentally Deleted Data?

The platform uses soft delete mechanism. Deleted data does not disappear immediately but goes to trash.

Recover Data:

  1. Go to "Trash" page
  2. Find accidentally deleted dataset
  3. Click "Recover" button

Recovered data will retain all historical information:

  • ✅ Original annotation data
  • ✅ Associated annotation tasks
  • ✅ Data tags and metadata
  • ✅ Access and operation logs

Re-upload Same Name Data:

If you re-upload a previously deleted dataset, the system will detect same name data:

  • Recover Existing Dataset: Retain all historical information (recommended)
  • Create New Dataset: Ignore historical data, create new record

💡 Suggestion: If data was accidentally deleted, choosing "Recover Existing Dataset" can retain all annotations and task associations.

How to Sync and Fix Metadata?

When Do You Need to Sync Metadata?

  • File information displayed incorrectly (such as duration, size)
  • Metadata extraction failed after upload
  • File was modified by external tools

Operation Steps:

  1. Select datasets that need sync (supports batch)
  2. Click "Update" button in bottom operation bar
  3. Select "Update Metadata" in dialog
  4. Confirm operation, system starts processing

Processing Status:

Dialog will display processing status of each dataset in real time:

  • Pending: Waiting for queue processing
  • Processing: Reading file and extracting metadata
  • Completed: Metadata update successful
  • Failed: File corrupted or format error

Error Handling:

If file is corrupted, system will mark as error status. Permanent errors (such as file corruption) will not auto-retry, need to fix file first.

Metadata Sync Interface

How to Create Annotation Tasks?

Create from Data Management Page:

  1. Use search and filter to find data that needs annotation
  2. Check target datasets (can select across pages)
  3. Click "Annotate" button in bottom operation bar
  4. Fill in task information:
    • Task name and description
    • Specify annotators and reviewers
    • Select project
    • Set completion time
  5. Confirm creation

Create Annotation Task

Append to Existing Task:

If annotation task already exists, can append new data:

  1. Select datasets to append
  2. Click "Append to Task"
  3. Select target task
  4. Confirm operation

💡 Tip: Before creating task, recommend using filter function to confirm data status first, avoid re-assigning already annotated data.

How to Export Data for Training?

The Data Management page mainly provides data browsing and management functions. To export data for model training, please use the Data Export function.

On the Data Management page, you can:

  • View Annotation Progress: Understand which data has been annotated
  • Filter Annotated Data: Use "Annotated" filter button
  • Batch Select: Select multiple annotated datasets

Then go to Data Export page and select these datasets for export.

Data Quality Monitoring

View Statistics

After selecting datasets, click "Statistics" button to view:

  • Data Overview: Total size, total duration, dataset count
  • Annotation Statistics: Annotated count, annotation completion rate
  • Quality Metrics: Annotation pass rate, review status distribution

This information helps you understand overall data status.

Quality Filtering

Use "Annotated" filter button to quickly distinguish:

  • Annotated Data: Annotation completed, can be used for training
  • Unannotated Data: Need to assign annotation tasks

Combined with project filter, can view annotation completion status for specific projects.

Common Questions

Possible reasons:

  1. Project Filter: Check if correct project scope is selected
  2. Permission Restrictions: Confirm you have permission to access that data
  3. Search Keywords: Try using broader keywords, or use advanced search
  4. Data Deleted: Check trash, data may have been accidentally deleted

What to Do When Batch Operations Fail?

If some datasets fail in batch operations:

  1. View error information in operation dialog
  2. For metadata sync failures, check if file is corrupted
  3. For annotation deletion failures, confirm dataset indeed has annotation data
  4. Can retry operation separately for failed datasets

How to Improve Search Efficiency?

Suggestions:

  1. Select Project First Then Search: Narrow search scope
  2. Use Tags: Add tags to data for easy filtering later
  3. Combine Filter Conditions: Use multiple filter conditions for precise matching
  4. Save Common Searches: Record common search conditions for direct use next time

What to Do When Data List Loads Slowly?

Optimization Suggestions:

  1. Select Specific Project: Don't view "All Projects"
  2. Use Filter Conditions: Reduce returned data volume
  3. Adjust Items Per Page: Reduce number of datasets loaded at once
  4. Check Network Connection: Ensure network is stable

If problem persists, contact administrator to check system performance.

Applicable Roles

Administrator

You can view and manage data from all projects, including:

  • Cross-project data statistics and monitoring
  • Data assignment and project association
  • System maintenance and data cleanup
  • Permission management and access control

Project Manager

You can manage data for responsible projects, including:

  • View project data overview
  • Create and assign annotation tasks
  • Monitor annotation progress and quality
  • Export data for training

Annotator and Reviewer

You can view and search data (with permission restrictions):

  • View task data assigned to yourself
  • Search datasets that need processing
  • View data details and annotation results

After completing data management, you may also need: