
Programmer's Guide to Video Systems

By Chris Pirazzi. Many thanks to those who have provided valuable info, including Charles Poynton and Bob Williams. Thanks to Andy Walls for some insights on standard def square luma sampling frequencies.

Introduction

When we software types hear that there are different "video systems" such as 480i, 576i, 1080i, or 720p, we immediately think of two things: a resolution and a frame rate.

For example, if you look at the various wiki pages about video systems, you will see statements like "480i is the shorthand name for a video mode," and you'll see pictures like this:

as if video can be considered just a special setting of a VGA graphics adapter.

Well, it turns out that video is much more complex than just that, and it is absolutely crucial to understand certain additional videosyncrasies in order to write video capture, editing, processing, or playback software that does not cause bugs and compatibility problems for end-users.

This document will give you the basics that you need in order to avoid getting in trouble with video!

Disclaimer: there's one important videosyncrasy that is not yet covered here, and that's the different color systems (both Y'CbCr and R'G'B') that you will need to deal with in video software. It's important to get that right as well for your video software to work. Eventually, I will add material on this, but for now, check out Charles Poynton's excellent book on the subject. The situation is also summarized in the QuickTime uncompressed standard that I wrote for Apple.

The Reality of Video Systems

We programmers like to think of video as a series of frames. Each frame, we imagine in our pleasant dreams, consists of one whole picture, say 640x480 pixels, all snapped at a single instant of time, and we like to believe there are, say, 30 of these pictures per second. To play back the video, we just display one picture after the other, at 30 images per second. We blissfully assume that the image size of the picture (640x480) is totally standardized and represents the "complete" picture. And, we assume that the pixels are square: that 100 vertical pixels is the same distance on the display monitor as 100 horizontal pixels.

Unfortunately, we're in for a rude awakening, because in many important cases that we must handle in video software today, some or all of these assumptions are wrong.

We programmers also like to think of video exclusively as data in our computer memory or hard disks. We often try to ignore the fact that video is also transmitted as an electrical (analog or digital) signal over wires, and stored on (gasp) videotape.

It turns out that if we take a moment to understand the bigger picture of video (no pun intended)—how it is transmitted electrically, how it is displayed by TVs and monitors, and how video geeks think of it—then suddenly it becomes tremendously easier to understand where these videosyncrasies come from, and to predict and handle them correctly in our video software.

So here we go...

What is a Video System?

A video system determines much more than just a resolution and frame rate. In the following sections, we'll introduce the many new concepts that a video system defines.

What Are the Video Systems?

There are many video systems, but we will cover the most common ones:

SD/HD                          System      Rate                Used In
Standard Definition TV (SDTV)  480i/60M    60M fields/second   US/Japan (example standard: NTSC)
                               576i/50     50 fields/second    rest of world (example standard: PAL)
High Definition TV (HDTV)      1080i/60M   60M fields/second   US/Japan
                               1080i/50    50 fields/second    rest of world
                               720p/60M    60M frames/second   US/Japan
                               720p/50     50 frames/second    rest of world

As you can see in the table, to unambiguously name one system, you should include the rate.

In this document, we will generally omit the rate from 480i and 576i because there is only one possibility.

In this document, we will drop the rate from 1080i and 720p in situations where our text applies to both the 50 and 60M systems. So 1080i and 720p are really groups of systems.
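If your software needs to keep track of these names, the naming convention above is easy to capture in code. Here is a toy sketch of my own (the function and its return shape are invented for illustration; only the naming convention itself comes from this document), using the 1/1.001 "magic M" factor explained in the next sections:

```python
# Toy parser for video system names like "480i/60M" or "720p/50".
# My own sketch: the naming convention is from this document, but the
# function and its return shape are invented for illustration.
from fractions import Fraction

def parse_system(name: str):
    size_scan, rate = name.split("/")
    lines = int(size_scan[:-1])          # e.g. 480, 576, 1080, 720
    scan = size_scan[-1]                 # 'i' = interlaced, 'p' = progressive
    if rate.endswith("M"):               # magic M = 1/1.001, explained below
        hz = Fraction(int(rate[:-1]), 1) * Fraction(1000, 1001)
    else:
        hz = Fraction(int(rate), 1)
    return lines, scan, hz               # hz is fields/sec (i) or frames/sec (p)
```

For example, parse_system("480i/60M") yields (480, 'i', Fraction(60000, 1001)): the exact rate, not the rounded "59.94."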

Other Video Systems

A system that is beginning to see use now is 1080p. SMPTE 274M-1995 specifies 1080p/24 and 1080p/24M, with 24 and 24M frames per second respectively, which are sometimes used in HD broadcast; it also specifies 1080p/50 and 1080p/60M, which double the data rate of the currently broadcast 1080i/50 and 1080i/60M by providing a full 50 or 60M frames per second. At some point I will add that to this document.

The next most common system might be 480p, which used to be fairly common for high-end "progressive scan" DVD players in the US and Japan, but is increasingly being replaced by the HD formats. We will not cover 480p in this document. You can get some of the vital statistics for one 480p electrical standard by reading SMPTE 267M-1995 or searching for 267M in the QuickTime uncompressed standard that I wrote for Apple.

There are a set of other "segmented field progressive" systems, such as 1080pSF/24, which are a kind of hack to carry a frame-based signal within what appears to be a field-based signal. These hacks arose in the early days of HD so that people could re-use their hugely expensive field-based equipment for frame-based imagery. I talk about these systems a little more in this other document, but I will not cover these systems in this document.

There is a truly bizarre video system called M-PAL that they use in Brazil. It is interlaced and has the infamous NTSC rate of 60M, but it is based on PAL color encoding and has 525 lines. It even has its own form of drop-frame timecode. We do not cover M-PAL in this document.

There was an old 1035i Japanese HDTV system (MUSE/Hi-Vision) with some very odd properties (optional 5:3 or 16:9 picture aspect ratio and even weirder pixel aspect ratio), but the last broadcast using that system ended on September 30, 2007 and Japan has totally migrated to 1080i and 720p. We don't cover 1035i in this document, but if one day you are required to handle legacy data from this system, you'll need to consult the relevant Japanese standards (in the QuickTime uncompressed standard that I wrote for Apple, I give the vitals on one 1035i format, the 16:9 SMPTE 240M-1995/SMPTE 260M-1992, but as this page shows, that wasn't the only format the Japanese used, so you'll have to see if your data follows the SMPTE standard).

Frame-Based and Field-Based Video

As software types, we like to think of video as a series of complete frames, say 640x480 pixels, all snapped at a single instant of time. This is the frame-based (aka progressive-scan) model.

Unfortunately, the majority of systems (480i, 576i, and 1080i) are field-based (aka interlaced or interleaved).

Roughly speaking, rather than representing video as a series of, say, 30 640x480 images per second, field-based systems represent video as a series of 60 640x240 images per second, and each image contains information for only half of the lines of the overall picture. In particular, within each pair of fields, one field has lines 0, 2, 4, ... and the other field has lines 1, 3, 5, .... But the catch is that all the fields are temporally distinct, so you simply do not have all the data you need to reconstruct a complete picture at any given moment of time!
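Concretely, the even/odd split described above looks like this in code (a minimal sketch of my own; the standard F1/F2 naming and line numbering, described later, are more involved than simple parity):

```python
# Split a frame's scanlines into its two fields by line parity, and
# weave them back together. A minimal sketch of my own, not from any
# standard; real systems use the F1/F2 line numbering described later.
def split_fields(frame_rows):
    """frame_rows: list of scanlines, top to bottom."""
    return frame_rows[0::2], frame_rows[1::2]  # lines 0,2,4,... and 1,3,5,...

def weave_fields(even_field, odd_field):
    """Interleave two fields back into a full frame."""
    frame = []
    for e, o in zip(even_field, odd_field):
        frame.extend([e, o])
    return frame
```

Remember, though, that the two fields come from different instants in time, so weaving them back together only reconstructs a true still image when nothing in the scene was moving.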

This is such an important issue, with such deep implications for software, that we have a whole separate page dedicated to it:

All About Video Fields
You should pop over and read that page, either now or after you're done with this one.

Frame Rate or Field Rate and the Magic M (30/1.001, not 29.97)

Frame-based systems transmit a series of frames, and so they have a frame rate.

Field-based systems transmit a series of fields, and so they have a field rate, and a frame rate which is half of the field rate.

When you see the rate for the US/Japan standards listed in casual discussions and even some standards, you will often see it written as 60 or 59.94 for brevity. But it's not really 60 and it's not really 59.94—it's actually 60/1.001, for astounding historical reasons that go back to the 1950s standardization of color NTSC TV and still haunt us today. Similarly, if you see 30 or 29.97, it's really 30/1.001. And if you see 23.98, it's really 24/1.001.

While this distinction may not matter for casual reference to video standards, it is very important for software, because you often need to synchronize video with audio and 60 is quite different from 60/1.001. If you use the wrong number, your audio and video will quickly slide out of sync and your customer will report bugs.

For this reason, this document is precise. We consistently add the magic factor M, which is equal to 1/1.001, whenever it is called for. If you see 60 in this document, it's really 60. If you see 60M, then it's 60/1.001.

Nowadays, there's yet another reason not to be sloppy. The SMPTE specs which define 1080i and 720p actually define both 60 and 60M versions of the standard (and both 30 and 30M, and both 24 and 24M)! It's true that in most cases, if you see 60, it's really 60M, but it's quite possible that you may run into true 60, particularly as a compatibility bridge in 50 field/frame per second environments.

Fortunately, you don't have to worry about magic M for the European standards based on 25 or 50 frames/fields per second. They never had the sordid NTSC history that brought this all about.

You may also sometimes see the magic numbers 59.94 or 29.97. These numbers do not represent the rate of video (59.94 is not equal to 60/1.001, nor is it close enough for your software). Those numbers come from a separate hack relating to drop-frame SMPTE timecode and unless you are dealing with SMPTE timecode in your software, you should never use these numbers in your software. I cannot overstate how many buggy pieces of software, and how many undeniably mislabeled movie files, have been created over this one little misunderstanding.
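To see why the exact value matters, here is a quick back-of-the-envelope check (my own illustration, using exact fractions; only the rates themselves come from the specs):

```python
# Why 60/1.001 (not 60, and not 59.94) matters for A/V sync.
# My own illustration; the rates themselves are from the SMPTE specs.
from fractions import Fraction

true_rate = Fraction(60000, 1001)        # 60/1.001 fields per second, exactly

# "59.94" is only a rounded shorthand, not the real rate:
assert true_rate != Fraction(5994, 100)

# Play one hour of 60M video back assuming exactly 60 fields/second:
fields_in_hour = true_rate * 3600
playback_seconds = fields_in_hour / 60
drift = 3600 - playback_seconds          # the video finishes early by...
```

float(drift) comes to about 3.6 seconds per hour: easily enough slippage for your customer to notice lip-sync problems.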

The SMPTE and ITU industry standards themselves set a horrible example that will no doubt confuse the heck out of the industry. For example, SMPTE 274M-2005, which defines 1080i, uses the nomenclature "1920 x 1080/59.94/I" (Table 1) for 1080i/60M. Similar sloppiness exists in many of the SMPTE and ITU specs. The real 1/1.001 rate is in the spec of course, but I'm certain that the shorthand will cause problems. Ouch!

Picture Aspect Ratio

Each video system defines a picture aspect ratio, which is simply the horizontal-to-vertical aspect ratio of the TV or other display device on which the user watches the video.

Picture aspect ratio is not the same as pixel aspect ratio, which we will discuss later.

The standard-definition systems we cover in this document (480i and 576i) are 4:3.

The high-definition systems we cover in this document (1080i and 720p) are 16:9.

As you probably know, before HDTV arrived, people came up with several ways to do 16:9 standard-def:

If you need to be specific in your own writings, I would recommend using this terminology for the three different systems:

and similarly for 576i.

As we mentioned earlier, we will not be covering the strange old 1035i Japanese HDTV systems with a 5:3 picture aspect ratio.

Total Lines Per Frame: There's More Than You Thought

A video system doesn't just define dimensions for the picture (e.g. 640x480).

Instead, it introduces a broader concept of video lines, where some lines are picture lines and some are non-picture lines. For example, a video frame of an interlaced video signal looks like this:

These non-picture lines are used for many purposes:

You may have heard video geeks refer to 480i video as "525-line" and 576i video as "625-line." These names reflect the actual number of video lines in the video frame, including all those extra non-picture lines:

System                 Total Number of Lines
480i/60M               525
576i/50                625
1080i (all variants)   1125
720p (all variants)    750

Video Line Numbering

The video system also defines a standard numbering scheme for those lines. For example, every 480i frame has lines numbered from 1 to 525, regardless of whether it's expressed as an analog (NTSC) video signal or a digital (SDI) video signal. The specification for each electrical standard will then provide the mapping between those numbers and the waveform or bits of that particular electrical standard.

Field Names: F1 and F2

If the video system is field-based, it also defines standard names for each field, F1 and F2, in terms of which video lines they occupy.

F1 and F2 are purely properties of the signal. They have nothing to do with software, or field dominance, or any other external factors. Specifically,

How Big is the Picture?

An important question for us software folks is obviously: which lines are picture lines and which are not? That will determine how many lines our video images in memory will have.

A good question, but we'll have to hold off on the answer until we've explained a few more videosyncrasies below.

How Fields Weave into a Frame

The video system also determines how the lines of each field weave into a frame, and it does that using the standard video line numbering scheme, and the standard F1 and F2 names, also defined by the video system.

For example, in the 480i system, here is how F1 and F2 line up:

Notice we've strategically avoided saying where the top and bottom of the picture are :) We'll be able to give you the answer to this later, after explaining some more videosyncrasies.

Analog Electrical Standards

Each video system encompasses one or more analog electrical standards that specify a way to transmit video using that system over an electrical wire. For example, some common standards are:

System                 Analog Electrical Standards
480i/60M               NTSC (component, composite, s-video)
576i/50                PAL (component, composite, s-video)
                       SECAM (component, composite, s-video)
1080i (all variants)   component analog HD
720p (all variants)    component analog HD

Analog video has a couple of unique properties which, it turns out, have caused some pretty serious ramifications that bubble all the way up to the software level.

To understand the situation, let's take a quick look at an analog waveform. Analog video, as the name suggests, encodes the picture data along each line as a smooth, continuous electrical signal, like this:

In this figure (stolen from Hamlet Video International and heavily doctored), you can see a sample colorbar video image on the top. On the bottom, you see the electrical waveform for one line of this video signal (say, video line L, which we have marked). The vertical axis is voltage (actually IRE units which are proportional to voltage), and the horizontal axis is time (roughly 63.5 microseconds across).

Notice how the video signal consists of some funky pulses at the left (known as horizontal sync and colorburst), followed by the actual video signal for that line. You can see how the differently colored bars produce different analog signals.

It's not important exactly how the luma and chroma get encoded as voltage; the main point to take home is that the voltage varies smoothly and continuously. In other words:

So here's a situation where you could do it different ways. And, as a seasoned engineer, you know what that means: people did do it different ways, and this will cause problems for you.

We'll give you more details on this further on in the document.

Digital Electrical Standards

In addition to the analog electrical standards you're familiar with, the video industry also defines digital electrical standards for each video system.

These standards are not like DV/1394/Firewire or MPEG-2. Don't let the word "digital" fool you into thinking it's a computer thing.

These standards are high-bandwidth, clocked (not packet-based) dedicated interconnects that are used in TV production studio environments to transmit 8- or 10-bit uncompressed video between different devices in the studio. The connection is typically a coax cable with BNC jacks, and the standard is called serial digital (SDI). There is an HD flavor, creatively named HD serial digital (HD-SDI). Sometimes they use SDI or HD-SDI connections in pairs (e.g. to transmit R'G'B' data) and this is called "dual link."

A typical modern studio will have an ungodly expensive video switch that routes SDI or HD-SDI signals all around the facility, and every device (such as studio cameras, D1 or Digital Betacam VTRs, old-school dedicated effects boxes, and studio monitors) will have SDI or HD-SDI inputs and outputs.

The various industry specs that define the standard definition (480i, 576i) electrical standards all point back to ITU-R BT.601-4 (commonly known as "Rec. 601") for some basic parameters. For that reason, you will often hear these electrical standards referred to as "601." One of the first VTR tape formats to have VTRs supporting this standard was the uncompressed D1 tape format, so you will also hear this electrical standard referred to as "D1."

These digital electrical standards also share the property with the analog electrical standards that there is more stuff encoded on each line than just the picture. In fact, there's quite a lot of so-called horizontal ancillary space available to store data; it occurs in the same area where the analog signal is having its funky sync and colorburst pulses, and like the vertical ancillary space, it can contain timecode, audio, and other data.

While the digital electrical standards are rarely seen in the consumer world, they are the lifeblood of video production, and much of the design and influence on video devices, libraries, and software on computers comes from these standards (not to mention the design of the DVD format and satellite broadcast standards). For example, you've almost certainly seen "601" or "D1" pop up in the UIs of video software.

In particular, the geniuses who designed the standard definition (480i, 576i) digital electrical standards decided to make every line have 720 non-square pixels, which we'll talk about next, and these are the exact same 720 non-square pixels that you are often forced to deal with by various video devices on PCs.

Fortunately, sanity prevailed for HD (1080i and 720p), whose digital electrical signals all have square pixels.

Non-Square Pixels and Pixel Aspect Ratio

In our blissful graphics software world, we are used to assuming that a 100 pixel vertical line is the same length, as seen on the screen by the user, as a 100 pixel horizontal line. This is called square pixels.

Unfortunately, when video engineers standardized the 480i and 576i digital electrical standards, they decided to use non-square pixels, and as a result, you as a software person will often be called upon to read and write non-square pixel data.

If your application wants to draw a circular circle or process a video image with a symmetric blur, you need to consider the pixel aspect ratio of your video data.

We have a quick how-to guide page dedicated to dealing with this important videosyncrasy:

Square and Non-square Pixels
But in the next section we will take you into the bowels of the past so that you may really understand the non-square issues in all their evil glory.

What Pixels are Square? How Non-Square is Non-Square?

Think about old-school analog equipment—an analog camera, VTR, monitor, or video switch. As we explained above, there are no pixels!

All analog equipment cares about is that the video doesn't get squished or stretched in one direction, and stays centered. And the way video engineers deal with this is to put up a standard video test pattern from a trusted test pattern generator and tweak the knobs on their monitor, by hand, until the monitor image is properly positioned and scaled on the screen. Obviously, this is not a very precise process. Modern monitors are consistent enough that this adjustment is rarely, if ever, needed, because the criteria for the image proportions being "good enough" are very loose—they are the limits of human perception of a video monitor.

Now enter computers.

Being mostly sane, the people who designed analog video input hardware for computers wanted to be able to capture square pixels, since square pixels were much easier to understand, manipulate, and display on a computer screen.

So those people were faced with this seemingly simple question: how should their hardware sample the analog signal in order to get square pixels? They want each pixel along the line to represent the same distance, on a video monitor, as the distance from one video picture line to the next. So how far apart, in the incoming video waveform, should each pixel sample be? In other words, how many million pixels per second (MHz) should their hardware capture (the "luma sampling frequency")?

In order to answer that, the video engineers turned to the analog video specs. You would think that the specs would very clearly state how the vertical data should stay in proper proportion to the horizontal data. For example, you'd expect to find a ratio saying that H microseconds of horizontal "distance" scanning across the monitor must be the same length as exactly V video lines of vertical "distance" on the monitor.

Unfortunately, it turns out that there is no such number. You cannot even derive such a number from the other numbers in the spec. Even if you employ death-defying amounts of handwaving to interpret the spec, like some Bible scholar looking for the hidden lottery numbers revealed inside, you still cannot come up with an exact figure.

Because, historically, nobody needed a super-precise definition.

So, what did the fearless video engineers of the 90s do? They made something up.

For reasons that I have never been able to completely figure out, the industry adopted a convention of sampling analog 480i video at exactly 12 3/11 MHz, and 576i video at exactly 14.75 MHz (and they even used decimals instead of fractions :), in order to get square pixels. The only clue we have is that these frequencies give us an integral number of square sampling instants per line, something which makes hardware people very happy.

Lurker's Guide Trivia Contest! If you know a way to derive the 12 3/11 MHz and 14.75 MHz industry-standard luma sampling rates, or you know some historical clues as to where they came from, then please send me some mail! Be sure to read this page first to see what we've figured out so far. The prize is your name in lights on the top of this page. Well, ok, your name on the top of this page. And maybe some other pages too! So far thanks to Charles Poynton and Andy Walls for helpful pointers that get us part of the way.
That is the sordid truth, and hurt though it may, knowing that will make it a lot easier for you to understand what comes next:

Now think about old-school digital equipment: it all uses ITU-R BT.601-4 ("Rec. 601"), which, as we mentioned above, specifies that each line contains exactly 720 pixels that are non-square.

However, being pesky video engineers, the authors of ITU-R BT.601-4 clean forgot to mention exactly what the pixel aspect ratio of those pixels was. They did, however, specify a very precise luma sampling frequency of 13.5 MHz (not 13.5M MHz).

So, this gives us an incontrovertible way to answer the question of "how non-square are non-square pixels?" If 480i square pixels are sampled at 12 3/11 MHz, and 480i non-square pixels are sampled at 13.5 MHz, that must mean that each non-square pixel has aspect ratio:

12 3/11 MHz / 13.5 MHz = 10 / 11
A similar derivation can be used for the 576i system, to give us these values:

System      hSpacing    vSpacing
480i/60M    10          11
576i/50     59          54

You can think of these values as the ratio of the width of a pixel to the height of a pixel. For example, say you want to draw a circle that appears round on the display device and whose diameter is n horizontal pixels (luma sampling instants). Draw an ellipse which is n pixels wide and n*hSpacing/vSpacing pixels (picture lines) high. Notice that vSpacing is in the denominator: the greater the vertical spacing of pixels (picture lines), the fewer vertical pixels you need in order to match a given number of horizontal pixels (luma sampling instants) on the display device.
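Both derivations, and the circle-drawing rule above, are easy to check with exact fractions. This is a sketch of my own (the helper name is invented; only the sampling rates and ratios come from the text):

```python
# Check the pixel-aspect-ratio derivations and the circle rule above.
# My own sketch; only the sampling rates and ratios come from the text.
from fractions import Fraction

REC601 = Fraction(27, 2)                     # 13.5 MHz non-square sampling
SQUARE = {"480i/60M": Fraction(135, 11),     # 12 3/11 MHz square sampling
          "576i/50":  Fraction(59, 4)}       # 14.75 MHz square sampling

# hSpacing/vSpacing = (square-pixel rate) / (Rec. 601 rate):
assert SQUARE["480i/60M"] / REC601 == Fraction(10, 11)
assert SQUARE["576i/50"]  / REC601 == Fraction(59, 54)

def circle_height(system: str, width_px: int) -> Fraction:
    """Picture lines needed for a circle width_px samples wide to look round."""
    return width_px * (SQUARE[system] / REC601)  # n * hSpacing / vSpacing
```

For example, a 110-sample-wide circle in non-square 480i video should be drawn 110 * 10/11 = 100 picture lines tall.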

Computer Industry Mass Confusion on Pixel Aspect Ratio

The software industry did not know or realize that the pixel aspect ratio of non-square pixels had already been determined by the hardware guys' choice of pixel sampling frequency.

Instead, what ensued was an unbelievable religious war of epic proportions, whereby uninformed computer people like us invented perhaps dozens of incorrect aspect ratios and argued about them endlessly like two Victorians bickering over whether the moon is made from cheese or crumpets. That unfortunately led to the creation of lots of mis-scaled and often un-labeled or mis-labeled video data, which will come back to haunt us in the future.

What mis-ratios did we use? There are the obvious mis-candidates, such as 640/720 and 768/720, but there are also amazingly nasty fractions created with Rube Goldberg-like proofs based on circularly dependent source data!

Even the MPEG committees got in on the action:

Hopefully from this data you have learned not to trust the pixel aspect ratio encoded in an MPEG-1 video bitstream.

And what about MPEG-2?

So I would watch out for the pixel aspect ratios encoded in all kinds of MPEG. If you know that data came from non-square-sampled video, use your own judgement.

SMPTE RP 187-1995 to the Rescue—Or Not

After the industry made something up and got back to work, a SMPTE standards committee was hard at work writing SMPTE RP 187-1995, a document that was finally supposed to address this pesky square and non-square issue.

Unfortunately, the pixel aspect ratio numbers they came up with for standard def (177/160 (y/x) for 480i and 1035/1132 (y/x) for 576i) did not match any industry practice, so they were quietly ignored.

Strangely, RP 187's informative annex A.4 talks about square to non-square conversion of computer images, and suggests resampling the image by a factor of 11:10—thus using the industry standard values and contradicting itself!

Fortunately, SMPTE RP 187-1995 made a major, positive contribution for HD (1080i and 720p), as we'll discuss later.

HDMI/CEA-861: Do We Ever Learn?

This disheartening but excellent submission comes in from reader Kevin Bracey:

I don't know if you're aware but CEA-861, and hence the HDMI spec which is derived from it, happily specify 720x576 and 720x480 digital signals (with 13.5MHz timing, and a statement that they're derived from ITU-R BT.656 and CEA-770.2), but then declares that those signals are exactly 4:3 or 16:9, and gives pixel aspect ratios corresponding to that, eg 16:15 for 4:3 576i (=1.067:1). Oops.

This is causing geometry errors in consumer kit. DVD players and the like are outputting their MPEG data with 1:1 pixel mapping to HDMI, and SD->HD upscalers and some HDMI-equipped displays treat the 720x576 HDMI frame as being 16:9. Pictures are 2.5% too thin.

You can switch between analogue and HDMI connections from the same source, and watch the picture get wider and narrower. And, often, you get little black bars at the left and right, from signals which don't have a full 720 pixels of image data - quite common.

It's a real struggle trying to talk to the producers of offending kit when such a vital spec is flouting reality. The HDMI spec itself doesn't go into aspect ratio detail, so it's the CEA-861 spec that is the problem.

The HDMI spec does sort of contradict the CEA spec in one place:

"For example, if a Source is processing material with fewer active pixels per line than required (i.e. [sic] 704 pixels vs. 720 pixels for standard definition MPEG2 material), it may add pixels to the left and right of the supplied material before transmitting across HDMI"
...a strong suggestion that it thinks that normal 704x576 material is narrower than HDMI, and it doesn't really think that a 702/704-wide 16:9 source should be scaled up to 720-wide because of the different aspect ratio.

In summary, we've got:

  1. DVD players, DVB receivers, etc, ignoring the pixel aspect ratio claimed for HDMI and outputting 1:1 pixel mapping. Good lads.

  2. Some TVs that default to treating 720x576 HDMI as 16:9 - can usually be corrected in the service menu, as they have separate geometry settings for SD and HD.

  3. Almost all SD->HD upscalers with HDMI output (either built in to an SD device, or separate) scale 720x576 to fill a 1920x1080 frame. If you're going through such an upscaler, you can't correct it, as the scalers don't usually have any manual scaling controls, and it can't be compensated for in the TV without screwing up proper 1080-line HD sources.

At least some standards people are getting it right, such as NorDig (section 5.2.2.3), but this underlying standard isn't.

Sigh.

The Standard-Def Pixel Debacle

Now you're equipped with the knowledge to understand another horrifying, sordid tale, which went down in the 1990s, that still plagues software writers today when working with standard-def video.

The story begins like this:

Old-School Equipment Doesn't Care Where the Picture Ends

Consider this:

Do you see the pattern in all this?

The deep, dark secret of old-school video equipment is that it actually doesn't care exactly where the edges of the picture are located. It just leaves enough margin so that nobody's information will get cut off, and is happy with that.

Instead, old-school video engineers are much more concerned with making sure that the picture center is maintained by all pieces of processing gear.

That is the reason why if you look at the analog and digital electrical specifications, you will see the picture center being specified in insane detail, but you will see little, if any, mention of where the picture starts and ends (either vertically or horizontally).

In the standard def 480i and 576i analog specs, each field actually begins and ends with a "half-line," where only half the line is allowed to contain picture data.

In the standard def 480i digital spec, not only are the half-lines gone, but we actually get an extra bonus picture line at the top of field 1, changing the "top" and "bottom" sense of F1 and F2.

So, as a software person, you must be getting frustrated and asking yourself "Ok, so, which lines do I use? Where do I start? Do I include the half-lines? Do I skip them? For analog video, where do I start and stop sampling along each line?" You need to write a for() loop somewhere and it needs to have a definite start and end.

The video engineer's answer, of course, is "who the hell cares!" He doesn't share your perspective or your needs. Which explains why the original specs, to this day, have never been amended to clearly answer this critical question.

The Result: Chaos

The result was that, for many many years, different brands and models of video input hardware would give you images with:

Because many software engineers were not familiar with video, they would assume that if they have an image of a certain size, say, 640x480, that it must line up exactly with all other images of that size.

By the time all the dust settled (say, around 2000), we had some pretty clear de facto conventions, but in the meantime lots of legacy data had been captured with differing horizontal and vertical offsets.

And that is the reason why I have to spend so much energy explaining this mess to you, the software engineer. You shouldn't have to know about it, but you do, because your software will probably have to interpret that legacy data.

For example, if you write image compositing software, and the user imports and composites legacy data captured with two different hardware devices, you may have to offset those images (or force your user to do so) if they don't represent the same part of the underlying video signal, even if they have the same image size.
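For example, a minimal sketch in Python of how such compositing code might align two same-size captures (the per-device offsets below are made-up illustrative values, not measurements from any real hardware):

```python
# Sketch: aligning two same-size legacy captures before compositing.
# Each offset describes where a capture's top-left sample sits in the
# underlying video signal (x in luma samples, y in frame lines).

def relative_shift(offset_a, offset_b):
    """Return (dx, dy): where image B's top-left pixel lands in A's
    pixel coordinates, so B can be positioned correctly over A."""
    ax, ay = offset_a
    bx, by = offset_b
    return (bx - ax, by - ay)

# Hypothetical example: device B started sampling 8 luma samples later
# and one line lower in the signal than device A did:
dx, dy = relative_shift((0, 283), (8, 284))
print(dx, dy)  # 8 1: place B's image 8 px right and 1 line down in A
```

The point is simply that the shift depends on signal-level offsets that the image sizes alone do not reveal; you (or your user) have to know or discover them per device.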

The De Facto Standard

In the section on 480i and 576i below, we will show the de facto square-pixel and non-square-pixel sampling pattern that emerged in the industry, including which lines get sampled and which part of each line gets sampled.

SMPTE RP 187-1995 to the Rescue Again—Sort Of

The SMPTE RP 187-1995 committee was also trying to address the pesky image size and offset issue.

The Good News: Apertures

Fortunately, SMPTE RP 187-1995 introduced two very useful terms into the industry: the production aperture (the full rectangle of picture samples that equipment should capture, store, and pass through) and the clean aperture (a smaller, co-centric rectangle that is guaranteed to remain free of edge artifacts as the signal passes through processing gear).

The Bad News: Bad SD Choices

Unfortunately, the values that the committee chose for pixel aspect ratio, clean aperture and production aperture for standard definition systems (480i and 576i) did not match any industry practice, and so the result was ignored for those systems.

Instead, the industry adopted four de facto standards for production aperture: 640x486 (480i, square pixels), 720x486 (480i, non-square pixels), 768x576 (576i, square pixels), and 720x576 (576i, non-square pixels).

Here we have just given you the production aperture size. We'll detail the offsets as well in our sections on 480i and 576i below.

Already there is something amiss here: the industry had also adopted de facto pixel aspect ratios for non-square pixels in 480i video (10/11) and 576i (54/59), and if you apply those ratios to the above production apertures, you will see that the apertures differ between the square and non-square cases:

(The original page shows a diagram here comparing, for 480i, the 720-non-square-pixel aperture against the 640-square-pixel aperture, and for 576i, the 720-non-square-pixel aperture against the 768-square-pixel aperture.)

This breaks the intended model of SMPTE RP 187-1995: the production aperture should be constant for a given system; it should not depend on how the video is sampled.

Oh, well. That's why, when you convert between square and non-square images, you must not only scale them but also pad or crop data, as we explain in this how-to guide:

Square and Non-square Pixels
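To see why a pure scale can't work, here is a rough sketch of the 480i arithmetic in Python (using `fractions` to keep the ratios exact): the two apertures cover different widths of the underlying signal, so converting in either direction leaves pixels to crop or pad.

```python
from fractions import Fraction

PAR_480I = Fraction(10, 11)   # de facto 480i non-square pixel aspect ratio

# The 720-pixel non-square aperture, expressed in square pixels:
nonsquare_as_square = 720 * PAR_480I             # wider than 640
# The 640-pixel square aperture, expressed in non-square pixels:
square_as_nonsquare = Fraction(640) / PAR_480I   # narrower than 720

print(float(nonsquare_as_square))  # ~654.5: crop ~14.5 px to fit 640
print(square_as_nonsquare)         # 704: pad 16 px to reach 720
```

So a 720-wide non-square image scaled to square pixels spans about 654.5 pixels, of which the de facto 640-wide aperture keeps only the center; going the other way, 640 square pixels become 704 non-square pixels, which must be padded out to 720.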

HD: They've Seen the Light!

You might be pretty depressed over all this standard-def bad news.

Fortunately, with HDTV, the video standard designers had the benefit of hindsight.

Most joyously, 1080i and 720p pixels are square!

Secondly, the standards documents (including SMPTE RP 187-1995) clearly answer the question of exactly which HD video lines, and which part of each video line, constitutes the picture data which must be stored and transmitted—the production aperture.

Even better, the production aperture has the familiar-sounding dimensions of 1920x1080 (1080i) and 1280x720 (720p), and the SMPTE spec clearly states which video lines of the underlying video signal, and which pixels within the line, that includes.

The SMPTE committee was even clever enough to choose clean apertures and production apertures that both had a 16:9 aspect ratio, even though that was only really required of the clean aperture.

We can now hope that video hardware will be designed so that it inputs and outputs this production aperture, and software will use this production aperture for its standard image size. In this ideal world, software people such as yourself won't even have to know that the "picture" is embedded into a larger raster. You'll just be able to think of HD as field or frame images.

Ok, ok, stop laughing.

Below I will give you all the line number details for 1080i and 720p, just as I did for standard def, so that just in case some hardware designer messes it up, you'll still be able to talk to him to figure out what video you're really getting in your memory buffers.
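As a quick sanity check of that 16:9 claim, a few lines of Python confirm that both the production and clean aperture sizes given in the tables below reduce to exactly 16:9:

```python
from fractions import Fraction

# HD production and clean aperture sizes from the tables below:
apertures = {
    "1080i production": (1920, 1080),
    "1080i clean":      (1888, 1062),
    "720p production":  (1280, 720),
    "720p clean":       (1248, 702),
}
for name, (w, h) in apertures.items():
    # Fraction reduces each width:height ratio to lowest terms.
    print(name, Fraction(w, h) == Fraction(16, 9))  # all True
```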

Computer Representations of Video

Phew! Enough about the video geek's world-view!

On computers (which includes your code and the libraries you use, but also encoding and transmission standards like M-JPEG, DVC, MPEG, DVB, ATSC), we generally just store and transmit the picture. We don't include all those extra lines or extra stuff on each line.

Typically, we'll have buffers in memory that either contain one field or one frame. Those fields or frames may be uncompressed or they may be compressed with M-JPEG, MPEG, DVC, or other algorithms.

Even though they're "just pictures," by now it should be clear that we still have to think about how our pixels in memory map onto the video system for which they are intended, because in order to process video data correctly:

We have to figure this out on a case-by-case basis.
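For instance, if your buffers hold separate fields but a codec or display expects woven frames, the mapping includes the field weave. A minimal sketch, using plain Python lists of rows as stand-ins for field buffers, and assuming F1 supplies the top frame line (an assumption you must verify for your system, as discussed above):

```python
def weave_fields(f1, f2):
    """Interleave two fields (lists of pixel rows) into one frame.

    Assumes f1 supplies the top frame line; swap the arguments if
    your video system's field order is the opposite.
    """
    assert len(f1) == len(f2), "fields must have the same line count"
    frame = []
    for top_row, bottom_row in zip(f1, f2):
        frame.append(top_row)     # F1 line -> even frame line
        frame.append(bottom_row)  # F2 line -> odd frame line
    return frame

f1 = [[10, 10], [30, 30]]   # field 1: frame lines 0 and 2
f2 = [[20, 20], [40, 40]]   # field 2: frame lines 1 and 3
print(weave_fields(f1, f2))  # [[10, 10], [20, 20], [30, 30], [40, 40]]
```

Getting the argument order wrong here is exactly the kind of bug that produces juddering motion on playback, which is why the F1/F2 definitions in the tables below matter to software.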

Here are some hints for different situations:

The 480i/60M Video System

Vital Statistics

480i/60M Vital Statistics
Scanning System: Field-based (2:1 interleaved)
Rate: 60M fields per second
Picture Aspect Ratio: 4:3
Total Lines Per Frame: 525
Analog Electrical Standards: SMPTE 170M-1994 (NTSC component and composite); S-video based on NTSC.
Digital Electrical Standards: ITU-R BT.601-4 (basic parameters of digital image); SMPTE 125M-1995 (parallel digital); SMPTE 259M-1997 (serial digital, aka SDI)
Picture Center:
  Vertical: halfway between line 404 (field 2) and line 142 (field 1)
  Horizontal (analog): 481.5 luma sample periods from 0H
  Horizontal (digital): halfway between luma samples 359 and 360 of the digital active line, which is halfway between luma samples 481 and 482 from 0H
Definition of F1 and F2:
  ITU-R BT.601-4 (also known as Rec. 601 and formerly CCIR 601) defines an encoding scheme for digital video.
  ANSI/SMPTE 170M-1994 defines Field 1, Field 2, Field 3, and Field 4 for NTSC (figure 7).
  ANSI/SMPTE 125M-1992 defines the 525-line version of the bit-parallel digital Rec. 601 signal, using an NTSC waveform for reference. 125M defines Field 1 and Field 2 for the digital signal (figure 4).
  ANSI/SMPTE 259M-1993 defines the 525-line version of the bit-serial digital Rec. 601 signal in terms of the bit-parallel signal.
  F1 is defined as an instance of Field 1 or Field 3.
  F2 is defined as an instance of Field 2 or Field 4.
Field Weave: See sampling methods below

Sampling

As we mentioned above, in the 1990s, there was massive chaos because different vendors in the industry could not agree on which rectangle (which production aperture) to extract from the video signal.

By around year 2000, the following de facto industry conventions had emerged for square-pixel and non-square-pixel sampling, although you will certainly find lots of legacy data and devices which use a different rectangle.

Square-Pixel Sampling

480i/60M Square-Pixel Sampling
Pixel Aspect Ratio: 1H : 1V (square)
Production Aperture (software image):
  Size: 640x486 pixels
  Heuristic: if you encounter 640x480 instead of 640x486, it likely, though not necessarily, begins on line 283 as the 640x486 aperture does in the diagram below (and is therefore not centered about the picture center). See above for more.
  Horizontal: 640 points, spaced at exactly 12 3/11 MHz (no M), centered around the system's horizontal picture center (see above).
  Vertical: (same as non-square sampling below)
Clean Aperture:
  Size: 640x480 pixels
  Position: co-centric with production aperture

Non-Square-Pixel Sampling

480i/60M Non-Square-Pixel Sampling
Pixel Aspect Ratio: 10H : 11V (non-square: click here for more)
Production Aperture (software image):
  Size: 720x486 pixels
  Heuristic: if you encounter 720x480 instead of 720x486, it likely, though not necessarily, begins on line 283 as the 720x486 aperture does in the diagram below (and is therefore not centered about the picture center). See above for more.
  Horizontal: 720 points, spaced at exactly 13.5 MHz (no M), centered around the system's horizontal picture center (see above). These are the same 720 active video points defined in ITU-R BT.601-4 ("Rec. 601").
  Vertical: (same as square-pixel sampling above)
Clean Aperture:
  Size: (640*(11/10))x480 pixels (explanation)
  Position: co-centric with production aperture
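These square- and non-square-pixel numbers are mutually consistent, which is worth checking when you implement them. In Python (using `fractions` for exact arithmetic), the 12 3/11 MHz square-pixel rate is just the 13.5 MHz Rec. 601 rate scaled by the 10/11 pixel aspect ratio, and the (640*(11/10)) clean-aperture width works out to a whole 704 samples:

```python
from fractions import Fraction

rec601_rate = Fraction(27, 2)      # 13.5 MHz, the Rec. 601 luma rate
par_480i = Fraction(10, 11)        # de facto 480i pixel aspect ratio (10H : 11V)

square_rate = rec601_rate * par_480i
print(square_rate)                 # 135/11 MHz, i.e. exactly 12 3/11 MHz

clean_width = 640 * Fraction(11, 10)
print(clean_width)                 # 704: the non-square clean-aperture width
```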

The 576i/50 Video System

Vital Statistics

576i/50 Vital Statistics
Scanning System: Field-based (2:1 interleaved)
Rate: 50 fields per second
Picture Aspect Ratio: 4:3
Total Lines Per Frame: 625
Analog Electrical Standards: ITU-R BT.470-3 (PAL component and composite); S-video based on PAL.
Digital Electrical Standards: ITU-R BT.601-4 (basic parameters of digital image); ITU-R BT.656-2 (parallel and serial digital, aka SDI)
Picture Center:
  Vertical: halfway between line 479 (field 2) and line 167 (field 1)
  Horizontal (analog): 491.5 luma sample periods from 0H
  Horizontal (digital): halfway between luma samples 359 and 360 of the digital active line, which is halfway between luma samples 491 and 492 from 0H
Definition of F1 and F2:
  ITU-R BT.601-4 (also known as Rec. 601 and formerly CCIR 601) defines an encoding scheme for digital video.
  ITU-R BT.470-3 (formerly known as CCIR Report 624-1) defines "first field" (F1) and "second field" (F2) (figure 2) for 625-line PAL.
  ITU-R BT.656-2 describes a 625-line version of the bit-serial and bit-parallel Rec. 601 digital video signal. It defines Field 1 (F1) and Field 2 (F2) for that signal (table I).
Field Weave: See sampling methods below

Sampling

As we mentioned above, in the 1990s, there was massive chaos because different vendors in the industry could not agree on which rectangle (which production aperture) to extract from the video signal.

By around year 2000, the following de facto industry conventions had emerged for square-pixel and non-square-pixel sampling, although you will certainly find lots of legacy data and devices which use a different rectangle.

Square-Pixel Sampling

576i/50 Square-Pixel Sampling
Pixel Aspect Ratio: 1H : 1V (square)
Production Aperture (software image):
  Size: 768x576 pixels
  Horizontal: 768 points, spaced at 14.75 MHz, centered around the system's horizontal picture center (see above).
  Vertical: (same as non-square sampling below)
Clean Aperture:
  Size: 768x576 pixels
  Position: co-centric with production aperture (identical in this case)

Non-Square-Pixel Sampling

576i/50 Non-Square-Pixel Sampling
Pixel Aspect Ratio: 59H : 54V (non-square: click here for more)
Production Aperture (software image):
  Size: 720x576 pixels
  Horizontal: 720 points, spaced at 13.5 MHz, centered around the system's horizontal picture center (see above). These are the same 720 active video points defined in ITU-R BT.601-4 ("Rec. 601").
  Vertical: (same as square-pixel sampling above)
Clean Aperture:
  Size: (768*(54/59))x576 pixels (explanation)
  Position: co-centric with production aperture
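The same cross-check works for 576i, with one wrinkle: the 14.75 MHz square-pixel rate is the 13.5 MHz Rec. 601 rate scaled by the 59/54 pixel aspect ratio, but the (768*(54/59)) clean-aperture width is not a whole number, so software has to pick a rounding:

```python
from fractions import Fraction

rec601_rate = Fraction(27, 2)      # 13.5 MHz, the Rec. 601 luma rate
par_576i = Fraction(59, 54)        # de facto 576i pixel aspect ratio (59H : 54V)

square_rate = rec601_rate * par_576i
print(square_rate)                 # 59/4 MHz, i.e. exactly 14.75 MHz

clean_width = 768 * Fraction(54, 59)
print(float(clean_width))          # ~702.9: not whole, so software must round
```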

The 1080i/60M and 1080i/50 Video Systems

Vital Statistics

1080i/60M and 1080i/50 Vital Statistics
Scanning System: Field-based (2:1 interleaved)
Rate:
  1080i/60M: 60M fields per second
  1080i/50: 50 fields per second
Picture Aspect Ratio: 16:9
Total Lines Per Frame: 1125
Analog Electrical Standards: SMPTE 274M-1995 (component analog HD)
Digital Electrical Standards: SMPTE 274M-1995 (HD signal structure); SMPTE 292M-2006 (HD serial digital, aka HD-SDI)
Picture Center:
  Vertical: halfway between line 291 (field 1) and line 853 (field 2)
  Horizontal (analog): 1151.5 luma sample periods from 0H
  Horizontal (digital): halfway between luma samples 959 and 960 of the digital active line, which is halfway between luma samples 1151 and 1152 from 0H
Definition of F1 and F2: SMPTE 274M-1995 defines "first field" (F1) and "second field" (F2) for the analog and serial digital signals (clause 6.3).
Field Weave: See sampling methods below

Sampling

Blissfully, 1080i has square pixels and no pixel debacles, so there's just one clear way to sample it:

1080i/60M and 1080i/50 Sampling
Pixel Aspect Ratio: 1H : 1V (square)
Production Aperture (software image):
  Size: 1920x1080 pixels
  Horizontal: 1920 points, spaced at:
    • 1080i/60M: 74.25M MHz
    • 1080i/50: 74.25 MHz
  centered around the system's horizontal picture center (see above).
  Vertical:
Clean Aperture:
  Size: 1888x1062 pixels
  Position: co-centric with production aperture

The 720p/60M and 720p/50 Video Systems

Vital Statistics

720p/60M and 720p/50 Vital Statistics
Scanning System: Frame-based (progressive scan)
Rate:
  720p/60M: 60M frames per second
  720p/50: 50 frames per second
Picture Aspect Ratio: 16:9
Total Lines Per Frame: 750
Analog Electrical Standards: SMPTE 296M-1995 (component analog HD)
Digital Electrical Standards: SMPTE 296M-1995 (HD signal structure); SMPTE 292M-2006 (HD serial digital, aka HD-SDI)
Picture Center:
  Vertical: halfway between line 385 and line 386
  Horizontal (analog): 899.5 luma sample periods from 0H
  Horizontal (digital): halfway between luma samples 639 and 640 of the digital active line, which is halfway between luma samples 899 and 900 from 0H
Definition of F1 and F2: frame-based system: there are no fields
Field Weave: frame-based system: there are no fields

Sampling

Blissfully, 720p has square pixels and no pixel debacles, so there's just one clear way to sample it:

720p/60M and 720p/50 Sampling
Pixel Aspect Ratio: 1H : 1V (square)
Production Aperture (software image):
  Size: 1280x720 pixels
  Horizontal: 1280 points, spaced at:
    • 720p/60M: 74.25M MHz
    • 720p/50: 74.25 MHz
  centered around the system's horizontal picture center (see above).
  Vertical:
Clean Aperture:
  Size: 1248x702 pixels
  Position: co-centric with production aperture

Copyright: All text and images copyright 1999-2011 Chris Pirazzi unless otherwise indicated.