In many computer vision contexts, this specific naming convention (b + four digits) is used for processed sequences in tracking and segmentation benchmarks. To provide the exact paper you are looking for, could you share a bit more context? Specifically:
(e.g., Detectron2, MMCV, or a specific university project). b1306.mp4
(e.g., a specific GitHub repo like Video-Recognition or a dataset folder). In many computer vision contexts, this specific naming