Sketch-based Video Browser (or Video Hunter)

A video retrieval tool for known-item search tasks.

Sketch-based Video Browser (or Video Hunter) is a new tool for known-item search tasks, in which users have seen a video scene and know it is contained in a collection, but do not know where it is located. Users therefore have to search and browse the collection using techniques that support query initialization, result visualization and browsing. With such support, a successful known-item search does not depend on formulating an ideal query. The Video Hunter tool has participated in the Video Browser Showdown, winning the competition in 2014 and 2015.

The Video Hunter tool focuses on an intuitive, interactive interface that lets users materialize a query idea and supports various forms of visualization and browsing.

Query formulation

To materialize a known item, users can upload an example image, provide keywords and/or draw a multi-sketch with color/edge features. The tool employs state-of-the-art deep convolutional neural networks to process queries based on example query objects (using DeCAF features) and keywords (annotations based on network classifications). The multi-sketch retrieval relies on position-color feature signatures and edge histograms.
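The idea behind matching a color sketch against position-color feature signatures can be illustrated with a minimal sketch. It assumes that both the drawn sketch and each keyframe signature are lists of (x, y, r, g, b) tuples scaled to [0, 1], and that a sketch is scored by the average distance from each drawn point to its nearest signature centroid. The function names, the weights and the distance itself are illustrative assumptions, not the tool's actual ranking model.

```python
import math

# Hypothetical representation: every sketch point and every signature
# centroid is a tuple (x, y, r, g, b) with all values scaled to [0, 1].

def point_distance(p, q, position_weight=1.0, color_weight=1.0):
    """Weighted sum of the Euclidean distances in position and in color."""
    position = math.dist(p[:2], q[:2])
    color = math.dist(p[2:], q[2:])
    return position_weight * position + color_weight * color

def sketch_distance(sketch, signature):
    """Average distance from each sketch point to its nearest centroid."""
    nearest = [min(point_distance(s, c) for c in signature) for s in sketch]
    return sum(nearest) / len(nearest)

def rank_keyframes(sketch, signatures):
    """Return keyframe ids ordered from best to worst match.

    `signatures` maps a keyframe id to its list of centroids.
    """
    return sorted(signatures, key=lambda kf: sketch_distance(sketch, signatures[kf]))
```

For example, a single red point drawn in the upper-left corner ranks a keyframe with a red centroid near that position ahead of a keyframe whose only centroid is blue and in the opposite corner.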


The visualization component for results supports two modes. The first mode displays a list of keyframes ordered by relevance to the query. Each keyframe is accompanied by its temporal context, i.e., the keyframes preceding and following the matched keyframe in the same video. In its basic form, this mode shows one matched keyframe per line together with its temporal context, but more results per line can be enabled. The second mode displays hundreds of keyframes as a color-sorted map, without temporal context. This option suits the first phases of retrieval, where users first try to find a candidate query object. Besides these two main modes, the tool also supports interactive navigation summaries and compact visualizations of groups of similar keyframes.
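A color-sorted map can be approximated very roughly by ordering keyframes by hue and cutting the ordered list into rows. The sketch below assumes each keyframe is summarized by a single average RGB color in [0, 1]; the function name and the hue-based ordering are assumptions made for illustration, and the tool may arrange its map quite differently.

```python
import colorsys

def color_sorted_grid(keyframe_colors, columns):
    """Arrange keyframe ids in a grid ordered by hue, saturation and value.

    `keyframe_colors` maps a keyframe id to its average (r, g, b) color,
    each channel in [0, 1]. Rows are filled left to right.
    """
    ordered = sorted(
        keyframe_colors,
        key=lambda kf: colorsys.rgb_to_hsv(*keyframe_colors[kf]),
    )
    return [ordered[i:i + columns] for i in range(0, len(ordered), columns)]
```

A one-dimensional hue sort is the simplest possible choice. It places similar colors next to each other within a row but not necessarily between rows, which a true two-dimensional layout would also try to achieve.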


Vertical scrolling through the pages of ranked results is a standard way to improve recall. Once users find a promising scene or video, they need to inspect its temporal neighborhood. The Video Hunter tool enables horizontal browsing within each result line (extending the temporal context) as well as browsing using the interactive navigation summary.
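Extending the temporal context of a result line can be thought of as widening a window around the matched keyframe. The following minimal sketch assumes that the keyframes of one video are stored as a list in temporal order; the function name and the `radius` parameter are hypothetical and only illustrate the idea.

```python
def temporal_context(video_keyframes, matched_index, radius):
    """Return the matched keyframe with up to `radius` neighbors per side.

    `video_keyframes` is the temporally ordered list of keyframes of one
    video. The window is clipped at the start and end of the video.
    """
    start = max(0, matched_index - radius)
    end = min(len(video_keyframes), matched_index + radius + 1)
    return video_keyframes[start:end]
```

Under this model, horizontal browsing corresponds to calling the function again with a larger `radius` for the same result line, while vertical scrolling moves on to the next matched keyframes in the ranking.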



