MobileEgo Anywhere: Open Infrastructure for long horizon egocentric data on commodity hardware

Senthil Palanisamy; Satpal Singh Rathore; Abhishek Anand; Pratyush Kumar Patnaik; Shubhanshu Khatana; Ekaksh Janweja

Published: 31 May 2026, Last Modified: 01 Jun 2026

Venue: Beyond Teleop workshop, ICRA 2026 (Poster)

License: CC BY 4.0

Keywords: VLA training dataset, egocentric dataset, robotics, commoditized VLA data collection, long-horizon tracking.

TL;DR: MobileEgo Anywhere democratizes robotics data collection by using standard iPhones to capture high-fidelity, long-horizon egocentric data that can be used for VLA training.

Abstract: Vision-language-action (VLA) models have driven demand for large-scale egocentric datasets, yet the hardware and infrastructure needed to collect long-horizon data remain inaccessible. Episodes in existing datasets are typically only a few minutes long, too short to capture the long-horizon temporal dependencies that complex robotic task execution requires. We present MobileEgo Anywhere, a framework for collecting hour-plus egocentric trajectories on commodity mobile hardware. It uses modern smartphone sensors for long-term pose tracking, avoiding the hardware barriers of traditional robotics data collection. We release three components: (1) STERA, an open-source video-processing pipeline that converts raw mobile captures into standardized, training-ready formats for VLA and foundation-model research; (2) a free mobile app that lets any user record egocentric activity; and (3) a 200-hour dataset of diverse, long-form egocentric data with persistent state tracking across 584 sessions. We further show that this data provides a usable training signal: mid-training a VLA on it lowers held-out action-prediction error.

Submission Number: 28
