"Crossing" Comparison Details

Video: "Crossing"
Resolution: 1920 pixels (horizontal) by 1080 pixels (vertical)
Frame Rate: 24
Camera Bitrate: 1194.39 Mbit/s
Video Duration: 152 seconds
Camera Filesize: 21.13 GB

Codec: EMPEG-H.264
Compressed Bitrate:
2.0 Mbit/s
Compressed Filesize: 39.5 MB

"ARRI" - Comparison Details

Video: "ARRI"
Resolution: 1920 pixels (horizontal) by 852 pixels (vertical)
Frame Rate: 25
Camera Bitrate: 981.50 Mbit/s
Video Duration: 185 seconds
Camera Filesize: 21.16 GB

Codec: EMPEG-H.264
Compressed Bitrate:
2.0 Mbit/s
Compressed Filesize: 49.7 MB

More Videos

4K - Download & Compare

Video: "Taipei"
Resolution: 3840 pixels (horizontal) by 2160 pixels (vertical)
Frame Rate: 29.97 fps
Camera Bitrate: 5966.0 Mbit/s
Video Duration: 107 seconds
Camera Filesize: 74.46 GB

Codec: EMPEG-H.264
Compressed Bitrate: 
3.0/6.0 Mbit/s
Compressed Filesize: 42.2/84.4 MB

4K - Download & Compare

Video: "Honey Bees"
Resolution: 4096 pixels (horizontal) by 2304 pixels (vertical)
Frame Rate: 24 fps
Camera Bitrate: 5,435.82 Mbit/s
Video Duration: 227 seconds
Camera Filesize: 143.94 GB

Codec: EMPEG-H.264
Compressed Bitrate:
3.0/6.0 Mbit/s
Compressed Filesize: 89.5/179.0 MB

HD - Download & Compare

Video: "Avatar Trailer"
Resolution: 1920 pixels (horizontal) by 800 pixels (vertical)
Frame Rate: 23.976
Camera Bitrate: 883.85 Mbit/s
Video Duration: 211 seconds
Camera Filesize: 21.72 GB

Codec: EMPEG-H.264
Compressed Bitrate:
1.0/2.0 Mbit/s
Compressed Filesize: 29.9/59.8 MB

HD - Download & Compare

Video: "Wonder Woman Trailer"
Resolution: 1920 pixels (horizontal) by 804 pixels (vertical)
Frame Rate: 24
Camera Bitrate: 889.16 Mbit/s
Video Duration: 166 seconds
Camera Filesize: 17.19 GB

Codec: EMPEG-H.264
Compressed Bitrate:
1.0/2.0 Mbit/s
Compressed Filesize: 23.5/47.0 MB

4K - Download & Compare

Video: "Tears Of Steel"
Resolution: 3840 pixels (horizontal) by 1714 pixels (vertical)
Frame Rate: 24 fps
Camera Bitrate: 3791.09 Mbit/s
Video Duration: 12 minutes : 14 seconds
Camera Filesize: 323.94 GB

Codec: EMPEG-H.264
Compressed Bitrate:
3.0/6.0 Mbit/s
Compressed Filesize: 287.7/575.4 MB

4K - Download & Compare

Video: "Big Buck Bunny"
Resolution: 3840 pixels (horizontal) by 2160 pixels (vertical)
Frame Rate: 30 fps
Camera Bitrate: 5,971.97 Mbit/s
Video Duration: 10 minutes : 35 seconds
Camera Filesize: 441.15 GB

Codec: EMPEG-H.264
Compressed Bitrate:
3.0/6.0 Mbit/s
Compressed Filesize: 248.4/496.8 MB

  1. Camera bitrate and camera filesize are calculated for 24-bit pixel depth.
  2. 1 GB = 1,024 * 1,024 * 1,024 Bytes; 1 MB = 1,024 * 1,024 Bytes; 1 GB = 1,024 MB.
  3. Better compression is possible with EMPEG-H.265 but is not supported on all browsers.
  4. Some video content may not function on some platforms if they lack support for common codecs. Try using a different browser or video player.
  5. *The video content presented in these demonstrations is being used solely for illustrative, non-commercial, and educational purposes. We assert no ownership over the copyrighted material and acknowledge that the rights, trademarks, and copyrights are the property of their respective owners.

VMAF

Video Multimethod Assessment Fusion (VMAF) is an objective full-reference video quality metric developed by Netflix in cooperation with the University of Southern California and the Laboratory for Image and Video Engineering (LIVE) at The University of Texas at Austin. It predicts subjective video quality based on a reference and distorted video sequence. The metric can be used to evaluate the quality of different video codecs, encoders, encoding settings, or transmission variants.

PSNR

Peak signal-to-noise ratio, often abbreviated PSNR, is an engineering term for the ratio between the maximum possible power of a signal and the power of corrupting noise that affects the fidelity of its representation. Because many signals have a very wide dynamic range, PSNR is usually expressed in terms of the logarithmic decibel scale.

SSIM

The structural similarity index measure (SSIM) is a method for predicting the perceived quality of digital television and cinematic pictures, as well as other kinds of digital images and videos. SSIM is used for measuring the similarity between two images. The SSIM index is a full reference metric; in other words, the measurement or prediction of image quality is based on an initial uncompressed or distortion-free image as reference.

Video Compression metrix vs Trireme EMPEG

Many if not all of the videos that you watch on Netflix and YouTube have been configured using video quality metrics. Over the last 4 years or so, Netflix’s encoding ladders have been driven by the company’s Video Multimethod Assessment Fusion (VMAF) metric and, before that, Peak Signal to Noise Ratio (PSNR). As well, YouTube uses a neural network based upon constant rate factor encoding, which itself is driven by an internal video quality metric.

Simply stated, video quality metrics attempt to predict how a subjective viewer would rate a particular video, and metrics are comparatively rated based upon the accuracy of these predictions. There are many purists who insist that subjective comparisons are the only valid technique for gauging quality, and indeed, properly administered subjective tests are the gold standard.

However, when you consider that 500 hours of video are uploaded to YouTube each minute, you can appreciate that the service has a strong need to encode its streams as efficiently as possible and a total inability to deploy humans to make it happen. Even Netflix, with a comparatively paltry 2,800 hours of new content in 2019, can’t use human eyes to create the customized encoding ladders for each video. For both companies, and many others, objective quality metrics are essential.

ERROR-BASED METRICS

The first class of metrics are error-based. They compare the compressed image to the original and create a score that mathematically represents the differences between the two images, also called noise or error. The PSNR is a good example. Metrics based upon this approach are simple and easy to compute, but scores often don’t correlate well with subjective ratings because human eyes perceive errors differently.

PERCEPTUAL-BASED MODELS

At a high level, perceptual-based models like the SSIM attempt to incorporate how humans perceive errors, or “human visual system models,” to more accurately predict how humans will actually rate videos. For example, while PSNR estimates absolute errors, Structural Similarity Index (SSIM) is a perception-based model that considers image degradation as perceived change in structural information, while also incorporating important perceptual phenomena, including both luminance masking and contrast masking terms.” In other words, perceptual-based metrics measure the errors and attempt to mathematically model how humans perceive them.

Netflix’s VMAF is another metric that was built to be trained (AI), using what’s called a support vector machine. Since the primary use for VMAF is to help Netflix produce encoding ladders for its per-title encoding, the Netflix training dataset includes clips ranging in resolution from 384x288 to 1080p at data rates ranging from 375Kbps to 20Mbps. Again, by correlating the mathematical result with subjective MOS scores, VMAF became much better at making the 540p vs. 720p decision mentioned above.

As the name suggests, VMAF is a fusion of three metrics, two that measure image quality and one that measures temporal quality, making it a true “video” metric.

Trireme has recently conducted extensive comparative studies to evaluate and analyze the performance of our compression against industry accepted benchmarks. The following studies include multiple videos at various bitrates measured by SSIM, VMAF & PSNR all compared to FFMPEG.

The evaluations were performed on 2 complex video sequences in H.264/AVC compression with 10s and 2s IDR period. The compression with 10s IDR period has been performed from uncompressed video as an input.

The compression with 2s IDR period has been performed from 30 Mbit/s ffmpeg x264 compressed video in the default placebo mode with 2s IDR period as an input, which (logically and obviously) benefits subsequent ffmpeg x264 compression in the default placebo mode, especially near 30 Mbit/s bitrate.

The objective quality difference between EMPEG compression in the default mode and FFMPEG compression in the default placebo mode is bigger than between two video compression standards, such as H.265/HEVC and H.264/AVC according to the attached comparison example in the jpg file. We have provided below a summary of the results and attached the corresponding full studies along with links to the videos.