TAP-Net: Tracking Any Point in a Video
RGB-Stacking Point Tracking
TAP-Net generalizes to robotics videos from the RGB-Stacking environment, which contain little texture. For each example, we show each tracked point in a different color. For simplicity, all query points are given on the first frame, although our network can track queries from any frame. The points are typically tracked well despite the lack of texture and heavy occlusions. However, the objects' rotational symmetries can cause problems: the network does not always track points on the object surfaces well.
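To make the setup concrete, here is a minimal sketch of how first-frame query points and per-point colors could be prepared for a visualization like this one. It is not the project's code: the helper names `first_frame_queries` and `distinct_colors`, the `(t, y, x)` ordering of each query, the regular grid layout, and the 256x256 frame size are all assumptions made for illustration.

```python
import colorsys


def first_frame_queries(height, width, rows, cols):
    """Return a rows x cols grid of (t, y, x) queries, all on frame 0.

    The (t, y, x) ordering is an assumption for this sketch.
    """
    queries = []
    for r in range(rows):
        for c in range(cols):
            # Place each point at the centre of its grid cell.
            y = (r + 0.5) * height / rows
            x = (c + 0.5) * width / cols
            queries.append((0, y, x))
    return queries


def distinct_colors(n):
    """Return n RGB tuples (0-255) with evenly spaced hues."""
    colors = []
    for i in range(n):
        r, g, b = colorsys.hsv_to_rgb(i / n, 1.0, 1.0)
        colors.append((round(r * 255), round(g * 255), round(b * 255)))
    return colors


# Hypothetical 256x256 video with a 2x3 grid of query points.
queries = first_frame_queries(height=256, width=256, rows=2, cols=3)
colors = distinct_colors(len(queries))
```

Querying from a later frame would only change the first element of each tuple, which is why placing every query on frame 0 is a simplification rather than a limitation.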