Sustainability of Digital Formats
 Planning for Library of Congress Collections

V210 Video Picture Encoding

Identification and description

Full name V210 Video Picture Encoding
Description

A digital, color-difference component video picture format identified by the FOURCC code V210. Associated with Apple's QuickTime technology, V210 employs 4:2:2 chroma subsampling, with each sample represented by 10 bits of data. The structure consists of twelve 10-bit unsigned components packed into four 32-bit little-endian words; each word carries three components and 2 zero bits of padding, for 8 padding bits per 128-bit group. The Apple developer site provides additional information and a helpful diagram.
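The packing described above can be illustrated with a short Python sketch. It is illustrative only and not taken from the Apple documentation: the function names are hypothetical, and the component order within each group (Cb0 Y0 Cr0, Y1 Cb1 Y2, Cr1 Y3 Cb2, Y4 Cr2 Y5, with the first component in the least significant bits of each word) is an assumption that should be checked against the Apple diagram.

```python
import struct

MASK_10 = 0x3FF  # selects the ten low-order bits of a value


def unpack_v210_block(block: bytes) -> dict:
    """Unpack one 16-byte V210 group into 6 luma and 3 + 3 chroma samples."""
    if len(block) != 16:
        raise ValueError("a V210 group is exactly 16 bytes (four 32-bit words)")
    words = struct.unpack("<4I", block)  # four little-endian 32-bit words
    comps = []
    for w in words:
        # Three components per word in bits 0-9, 10-19, 20-29.
        # Bits 30-31 are padding and are simply ignored here.
        comps.extend((w >> shift) & MASK_10 for shift in (0, 10, 20))
    # Assumed component order (see lead-in):
    cb0, y0, cr0, y1, cb1, y2, cr1, y3, cb2, y4, cr2, y5 = comps
    return {
        "Y": [y0, y1, y2, y3, y4, y5],
        "Cb": [cb0, cb1, cb2],
        "Cr": [cr0, cr1, cr2],
    }


def pack_v210_block(y, cb, cr) -> bytes:
    """Pack 6 luma and 3 + 3 chroma 10-bit samples into one 16-byte group."""
    order = [cb[0], y[0], cr[0], y[1], cb[1], y[2],
             cr[1], y[3], cb[2], y[4], cr[2], y[5]]
    if any(not 0 <= c <= MASK_10 for c in order):
        raise ValueError("components must fit in 10 bits")
    # The two high-order bits of each word are left as zero padding.
    words = [order[i] | (order[i + 1] << 10) | (order[i + 2] << 20)
             for i in range(0, 12, 3)]
    return struct.pack("<4I", *words)
```

Packing six pixels and unpacking the result should return the original samples, since the padding bits carry no picture data.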

V210 encoding (like all uncompressed formats identified by FOURCC codes) allows for variation in features like picture size, aspect ratio (square or non-square pixels), and component levels (i.e., levels for Y, Cb, and Cr in either video range or wide range). A given instance may also contain video from interlaced or progressive sources. (See the comment about interlacing and field order in the Notes below.) For an application to play the video back correctly or, say, to hand it off for inclusion in a video production side by side with other footage, all of these facts ought to be declared in metadata embedded in the file wrapper or associated with the file. The need for such metadata (and a way to compensate if it is missing) is suggested by the "Appendix: Backwards Compatibility" section of an Apple developer page, which provides "best assumptions" to support playback for certain video types (not including V210) when the metadata is absent.

Specialists in the video community debate the pros and cons of 10-bit sampling and the V210 structure. Sampling at 10 bits can reduce picture artifacting as compared to 8-bit sampling although, as stated in the Notes below, some commentators argue that this benefit is limited to certain types or classes of original (source) video. Most specialists agree that 10-bit sampling has value when digitizing analog sources. The padding bits required by the V210 structure can also be seen as inefficient. One expert commentator called V210 "a very poorly compressing codec." For efficient handling when transcoding from V210, for example, the ffmpeg tool decodes the packed data to a planar form (AV_PIX_FMT_YUV422P10LE), discarding the padding bits with no loss. (When outputting as V210, ffmpeg inserts the padding as needed.)
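The cost of the padding can be put in rough numbers. The Python sketch below is a hedged illustration: the function names are hypothetical, and the rule that V210 rows are padded to a multiple of 48 pixels (128 bytes) is an assumption drawn from Apple's description of the format, not something stated on this page. The 8-bit comparison assumes 2 bytes per pixel with no row padding.

```python
# Eight of every 128 bits in a V210 group are padding.
PADDING_FRACTION = 8 / 128  # 0.0625, i.e. 6.25 percent


def v210_row_bytes(width: int) -> int:
    """Bytes per picture row for V210.

    Assumption: rows are padded to a multiple of 48 pixels, and
    every 48 pixels (eight 16-byte groups) occupy 128 bytes.
    """
    return ((width + 47) // 48) * 128


def uyvy_row_bytes(width: int) -> int:
    """Bytes per picture row for 8-bit 4:2:2 (2 bytes per pixel, no row padding)."""
    return width * 2


def frame_bytes(row_bytes: int, height: int) -> int:
    """Bytes for one full frame, given the per-row size."""
    return row_bytes * height
```

Under these assumptions a 720-pixel row occupies 1,920 bytes as V210 and 1,440 bytes as 8-bit 4:2:2, so the 10-bit picture data is about one third larger before any wrapper overhead is counted.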

Production phase Employed in post-production or editing (middle phase) and dissemination (final phase).
Relationship to other formats
    Used by AVI_V210, AVI OpenDML File Format with V210 Video Encoding
    Used by QT_V210, QuickTime File Format with V210 Video Encoding
    Used by MXF_GC_UNC_V210, MXF Generic Container with Uncompressed Video Essence V210
    Used by Matroska_V210, Matroska File Format with V210 Video Encoding
    Used by Other video file wrappers not described at this Web site at this time.

Local use

LC experience or existing holdings No extensive experience.
LC preference None. For preservation reformatting, the Library of Congress' Packard Campus for Audio-Visual Conservation has chosen MXF_OP1a_JP2_LL (lossless JPEG 2000 wrapped in MXF operational pattern 1a).

Sustainability factors

Disclosure V210 structure is described at an open Web site managed by Apple, referenced by a number of other organizational sites, including FOURCC.org.
    Documentation Described in a segment of the Apple developer page titled Uncompressed Y´CbCr Video in QuickTime Files. See also SMPTE ST 377:2011, annex G, including amendment 2 (to be published in 2013).
Adoption Widely adopted.
    Licensing and patents None.
Transparency Relatively transparent; requires tools to write and read.
Self-documentation Not applicable; provided by the file wrapper.
External dependencies None.
Technical protection considerations None.

Quality and functionality factors

Moving Image
Normal rendering Supported
Clarity (high image resolution) Excellent; this 10-bit sampling surpasses the 8-bit sampling found in UYVY, YUY2, and other encodings. See also Notes below.
Functionality beyond normal rendering Not applicable.

File type signifiers

Tag: Microsoft FOURCC
Value: V210
Note: From the FOURCC YUV page. This identifier will be found in files like AVI_V210. The Apple developer page that describes this format also provides this Apple codec name: Component Y'CbCr 10-bit 4:2:2.

Tag: Apple Video Sample Description
Value: kCVPixelFormatType_422YpCbCr10
Note: From an Apple developer page, using the term Apple Core Video pixel format description. The compiler of this resource is uncertain as to whether this identifier is found in file headers or elsewhere in the file; comments welcome. It may appear in applications that handle the picture data.

Tag: Other
Value: Registry: 06.0E.2B.34.04.01.01.0A; Item: 04.01.02.01.01.02.02.01
Note: SMPTE Universal Label, as found in ST 377-1:2009, also in SMPTE Labels Registry as specified in RP224v12-2012. This identifier will be found in files like MXF_GC_UNC_V210.

Notes

General

Interlacing and field order. This writer's understanding is that, when the source footage is interlaced, V210 encoding does not group together all the odd and even lines as they would be grouped in the two fields that make up a video frame on, say, a videotape. All V210 pixel data is presented in a left-right, top-down order that could be called progressive. To put it another way, for interlaced picture data, the tape presents alternate lines in the two fields that make up a frame: line 1, line 3, line 5, etc., and then line 2, line 4, line 6, etc. In contrast, V210 encodes the data as line 1, line 2, line 3, line 4, line 5, line 6, etc.

On a tape, which set of lines comes first (1, 3, 5 or 2, 4, 6) is what is called field order or field cadence. There are two options for field order: upper (field 2 is dominant, so the second field is drawn first) and lower (field 1 is dominant, so the first field is drawn first). Generally, upper is used by 640 x 480 systems, while lower is most common in professional 720 x 486 and DV 720 x 480 systems.

As noted in the Description section above, even though V210's serialization of the data is progressive, and many players will work well without help, it may be the case that a user will wish to "restore" footage archived as V210 in a manner that will, say, intercut with other footage. Such restoration may require the decoding system to be aware that this footage was originally interlaced and to know the original field order. That's a job for metadata.
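The line ordering discussed in this note can be sketched in Python. This is a minimal illustration with hypothetical function names, working on a list of picture rows in V210's top-down order and using the 1-based line numbers from the text above. It does not decide which field should be displayed first; as noted, that has to come from metadata.

```python
def split_fields(rows):
    """Split top-down rows into two fields.

    Returns (lines 1, 3, 5, ..., lines 2, 4, 6, ...), counting
    lines from 1 as in the text, so list index 0 is line 1.
    """
    return rows[0::2], rows[1::2]


def weave_fields(odd_lines, even_lines):
    """Interleave two fields back into top-down row order."""
    rows = []
    for odd, even in zip(odd_lines, even_lines):
        rows.extend([odd, even])
    # With an odd number of rows, the first field has one extra line.
    if len(odd_lines) > len(even_lines):
        rows.append(odd_lines[-1])
    return rows


frame = ["line 1", "line 2", "line 3", "line 4", "line 5", "line 6"]
odd_field, even_field = split_fields(frame)
restored = weave_fields(odd_field, even_field)
```

Splitting and then weaving returns the rows in their original order, which is why the top-down serialization loses no picture data even when the source was interlaced.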

8- and 10-bit sampling. In principle, 10-bit encoding is superior in clarity to 8-bit, due to the reduction in tonal contouring and other artifacts. Some specialists argue, however, that there is no benefit for certain classes of material. One university expert wrote, "We digitize Betacam SX tape to 8-bit UYVY but Digibeta to 10-bit V210 because these selections align with the nature of the data that is actually sent out over SDI from these tapes. . . . SDI is 10-bit data, but when I piped the SDI video data from an SX tape to a binary display I could see the 9th and 10th bits were always zero. Thus by taking only the first 8 bits I could get all meaningful data . . . . I have about 3,000 SX hours to preserve and choosing 8-bit instead of 10 saves me about 90 TB of storage" (private communication).
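The kind of check the quoted expert describes can be sketched as follows. This Python example is an illustration under stated assumptions, not the expert's actual procedure: it assumes that "the 9th and 10th bits" refers to the two least significant bits of each 10-bit sample, and the function names are hypothetical.

```python
def low_bits_unused(samples) -> bool:
    """True if the two least significant bits of every 10-bit sample are zero."""
    return all((s & 0b11) == 0 for s in samples)


def to_8bit(samples):
    """Drop the two low-order bits; refuse if doing so would discard data."""
    if not low_bits_unused(samples):
        raise ValueError("low-order bits carry data; 8-bit storage would be lossy")
    return [s >> 2 for s in samples]


def to_10bit(samples_8bit):
    """Restore the 10-bit scale by appending two zero bits."""
    return [s << 2 for s in samples_8bit]


# Example 10-bit values that are all multiples of 4 (assumed test data).
ten_bit = [64, 512, 940, 1020]
eight_bit = to_8bit(ten_bit)
```

When the check passes, converting to 8 bits and back reproduces the 10-bit samples exactly, which matches the reasoning in the quotation; when it fails, the reduction would discard picture information.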

History  

Format specifications


Useful references

URLs


Last Updated: Wednesday, 09-Mar-2016 14:54:09 EST