Shared Representation for 3D Pose Estimation, Action Classification, and Progress Prediction from Tactile Signals
Isaac Han , Seoyoung Lee , Sangyeon Park , Ecehan Akan , Yiyue Luo , Jeffrey DelPreto , Kyung-Joong Kim
SCOTTI (Shared COnvolutional Transformer for Tactile Inference) simultaneously addresses three tasks from foot tactile signals: 3D human pose estimation, action class categorization, and action completion progress estimation. This is the first work to explore action progress prediction using foot tactile signals. Multi-task learning enables mutual benefit across tasks, achieving superior performance compared to task-specific models. A new dataset with 15 participants, 8 actions, and 200,000+ synchronized tactile+visual frames is introduced.