6 September 2019 Deep learning and video quality analysis
P. Topiwala, M. Krishnan, W. Dai
Abstract
For more than 30 years, the video coding industry has used PSNR, which is based on mean squared error (MSE), as a measure of video quality, despite evidence of its inadequacy. Moreover, inside the encoder, the sum of absolute differences (SAD) is used instead of MSE to save multiplications. We quantify how poorly these measures correlate with subjective scores and obtain new measures that correlate much better. We focus on the problem of full-reference assessment of video degraded only by coding and scaling errors, such as those experienced by streaming services, and set aside transmission issues such as timing jitter and rebuffering. We begin with the Video Multimethod Assessment Fusion (VMAF) algorithm introduced by Netflix. Using a neural network model, we report results with up to 97% correlation to subjective scores on two Netflix datasets.
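For readers unfamiliar with the baseline measures named in the abstract, the following is a minimal sketch of MSE, SAD, and PSNR for a pair of reference and distorted pixel sequences, plus a Pearson correlation helper of the kind that could be used to compare a metric against subjective scores. It is illustrative only and is not taken from the paper: the function names, the flat-list input layout, and the 8-bit peak value of 255 are assumptions, and the abstract does not say which correlation coefficient the authors report.

```python
import math


def mse(ref, dist):
    """Mean squared error between two equal-length pixel sequences."""
    if len(ref) != len(dist) or len(ref) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((r - d) ** 2 for r, d in zip(ref, dist)) / len(ref)


def sad(ref, dist):
    """Sum of absolute differences; needs no multiplications."""
    if len(ref) != len(dist) or len(ref) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(r - d) for r, d in zip(ref, dist))


def psnr(ref, dist, peak=255.0):
    """PSNR in dB; peak=255 assumes 8-bit samples (an assumption here)."""
    err = mse(ref, dist)
    if err == 0:
        return math.inf  # identical inputs: PSNR is unbounded
    return 10.0 * math.log10(peak ** 2 / err)


def pearson(xs, ys):
    """Pearson linear correlation between two equal-length score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 items")
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical 4-pixel example (values chosen for illustration only).
reference = [100, 100, 100, 100]
distorted = [110, 90, 100, 100]
print(mse(reference, distorted))   # 50.0
print(sad(reference, distorted))   # 20
print(psnr(reference, distorted))
```

In practice these quantities are computed per frame on full luma planes and then pooled over time; the sketch omits that step.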
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
P. Topiwala, M. Krishnan, and W. Dai, "Deep learning and video quality analysis," Proc. SPIE 11137, Applications of Digital Image Processing XLII, 111370T (6 September 2019); https://doi.org/10.1117/12.2530557
KEYWORDS
Video
Video compression