This paper proposes CRANE - Concept Ranking According to Negative Exemplars - for semantic two-class segmentation of weakly labeled videos. The task can be summarized as follows: Given an oversegmentation of a video, tagged weakly with a concept such as "cat" or "dog", decide which segments actually belong to the concept. As known from semantic segmentation of weakly-labeled images, common difficulties are high in-class variation of concepts like "cat" and "dog" as well as the unknown location of the concept. In videos, an additional difficulty is the unknown temporal location of the concept.
An overview of CRANE is given on the paper's webpage.
What is your opinion on the summarized work? Or do you know related work that is of interest? Let me know your thoughts in the comments below: