Spatiotemporal Super-Resolution with Cross-Task Consistency
and its Semi-supervised Extension


Abstract

Spatiotemporal super-resolution (SR) aims to upscale both the spatial and temporal dimensions of input videos, and produces videos with higher frame resolutions and rates. It involves two essential sub-tasks: spatial SR and temporal SR. We design a two-stream network for spatiotemporal SR in this work. One stream contains a temporal SR module followed by a spatial SR module, while the other stream has the same two modules in the reverse order. Based on the interchangeability of performing the two sub-tasks, the two network streams are supposed to produce consistent spatiotemporal SR results. Thus, we present a cross-stream consistency to enforce the similarity between the outputs of the two streams. In this way, the training of the two streams is correlated, which allows the two SR modules to share their supervisory signals and improve each other. In addition, the proposed cross-stream consistency does not consume labeled training data and can guide network training in an unsupervised manner. We leverage this property to carry out semi-supervised spatiotemporal SR. It turns out that our method makes the most of training data, and can derive an effective model with few high-resolution and high-frame-rate videos, achieving the state-of-the-art performance.

Papers

Citation

Han-Yi Lin, Pi-Cheng Hsiu, Tei-Wei Kuo, and Yen-Yu Lin, "Spatiotemporal Super-Resolution with Cross-Task Consistency and its Semi-supervised Extension", in International Joint Conference on Artificial Intelligence (IJCAI), 2020.


BibTex
@inproceedings{STSRwithCrossTask,
  author    = {Lin, Han-Yi and Hsiu, Pi-Cheng and Kuo, Tei-Wei and Lin, Yen-Yu},
  title     = {Spatiotemporal Super-Resolution with Cross-Task Consistency and Its Semi-supervised Extension},
  booktitle = {International Joint Conference on Artificial Intelligence (IJCAI)},
  year      = {2020}
}