How RoboX Works
From Collection to Model Training
RoboX operates a three-stage process that moves data from raw smartphone recordings to structured, annotated datasets ready for robotics training.
Stage 1: Collection
How Contributors Collect Data
Contributors download the RoboX app, browse active campaigns, and start recording. Each campaign defines the task, sensors, and clip format.
Campaign-Specific Briefs
EgoGrasp asks for close-range hand-object interactions. Capture how you pick up, hold, manipulate, and place everyday items. The focus is on hand visibility, object contact, and natural grasping motion.
EgoNav asks for walking through indoor spaces like offices, malls, apartments, or airports. Capture your perspective as you move, navigate obstacles, and interact with the environment. The focus is on spatial understanding and obstacle awareness.
Contributors pick the campaigns that fit their routine. There's no minimum duration requirement, even brief, high-quality clips are valuable.
Lightweight Capture
Recording uses the phone's native camera, IMU, and where supported, LiDAR or audio. Short, structured clips are captured during normal activity. The app handles scheduling and sensor coordination so you don't have to think about it.
Stage 2: Processing and Anonymization
On-Device Anonymization
Faces, readable text, license plates, and device identifiers are detected and removed locally before anything uploads. Only clean, anonymized sensor data leaves the phone.
This happens on your device, before your data ever touches a server. You retain full control over what's anonymized and what's sent.
Quality Validation
Every clip goes through automated checks:
Lighting: Is the clip well-lit and usable?
Stability: Is the camera steady enough to extract meaningful signals?
Completeness: Does the clip cover the full task or motion?
Relevance: Does the content match the campaign brief?
Only clips that pass are accepted. Contributors get clear feedback on rejections through the app, so you know what to improve next time.
Stage 3: Annotation and Distribution
Layered Annotation
Validated clips are annotated with multiple data layers:
Temporal segmentation: Key phases of the task (approach, contact, manipulation, release)
Object bounding: What objects are present and their spatial bounds
Hand pose: Joint positions and hand configuration throughout the clip
Gaze direction: Estimated eye fixation and attention patterns
Spatial layout: Scene structure, surfaces, and spatial relationships
Interaction labels: Grasp type, object category, action outcome, and task semantics
New annotation layers can be applied to existing recordings as schemas evolve. Your contribution remains useful as new research questions emerge.
Dataset Access
Annotated datasets are organized by campaign and accessible via API or direct download. Teams can:
Browse existing datasets: Search by task, environment, sensor type, or region
Request access to specific campaigns: Get notified when new data is added
Commission custom collections: Work with RoboX to design and execute targeted data missions with defined geography, environment, task, and sensor parameters
For Data Companies and Integrators
RoboX datasets are also available for licensing and redistribution through data marketplace partners. Companies that aggregate, resell, or integrate training data into their own pipelines can access RoboX egocentric datasets in bulk, with:
Flexible licensing terms: Commercial, academic, and research licenses available
Structured metadata: Ready for downstream integration and pipeline automation
Version control: Track dataset updates, changes, and new annotation layers
Compliance support: All datasets include privacy and provenance documentation
Contributor Experience: Step by Step
01 Select a Campaign
Open the app and browse active data missions. Each campaign has a clear brief covering what to record, how long, and which sensors are needed.
02 Record
Go about your routine while the phone captures short, structured clips during the campaign window. No special equipment required.
03 Auto-Anonymize
Before anything uploads, the app strips sensitive identifiers directly on the device, including faces, text on screens, license plates, and voice data.
04 Data Reaches R&D
Validated, anonymized, annotated clips are organized into campaign-specific datasets and made accessible to robotics teams through API or direct download.
What Happens to Your Data
Your contribution to RoboX:
Is anonymized on your device
Before upload, all identifying information is removed
Is validated automatically Quality checks ensure it meets the campaign standard
Is annotated with structured labels Making it useful for robotics training
Is organized into public or commissioned datasets Ready for research and development
Stays under your control You can request deletion anytime, and you can see where it's used
Is tracked on-chain Data provenance is cryptographically verified
Last updated