Two-Stream Action Recognition-Oriented Video Super-Resolution
Published in International Conference on Computer Vision (ICCV), 2019
In this paper, we study the video super-resolution (SR) problem for facilitating video analytics tasks, e.g. action recognition, instead of for visual quality. Tailored for two-stream action recognition networks, we propose two video SR methods for the spatial and temporal streams respectively.

On the one hand, we observe that regions with action are more important to recognition, and we propose an optical-flow guided weighted mean-squared-error loss for spatial-oriented SR network to emphasize the reconstruction of moving objects. On the other hand, we observe that existing video SR methods incur temporal discontinuity between frames, which also worsens the recognition accuracy, and we propose a siamese network for our temporal-oriented SR training that emphasizes the temporal continuity between consecutive frames.

Citation:
@inproceedings{zhang2019two,
title={Two-stream action recognition-oriented video super-resolution},
author={Zhang, Haochen and Liu, Dong and Xiong, Zhiwei},
booktitle={IEEE/CVF International Conference on Computer Vision (ICCV)},
pages={8799--8808},
year={2019}
}