We propose a robust algorithm to generate video segment
proposals. The proposals generated by our method
can start from any frame in the video and are robust to
complete occlusions. Our method does not assume specific
motion models and even has a limited capability to
generalize across videos. We build on our previous least
squares tracking framework, where image segment proposals
are generated and tracked using learned appearance
models. The innovation in our new method lies in the use
of two efficient moves, the merge move and free addition,
to efficiently start segments from any frame and track them
through complete occlusions, without much additional computation.
Segment size interpolation is used for effectively
detecting occlusions. We propose a new metric for evaluating
video segment proposals on the challenging VSB-100
benchmark and present state-of-the-art results. Preliminary
results are also shown for the potential use of our framework
to track segments across different videos.