Understanding SC 1.2.8: Media Alternative (Prerecorded) (Level AAA)
In Brief
- Goal
- Prerecorded videos can be understood by more people.
- What to do
- Provide a text equivalent for all content in videos.
- Why it's important
- More people, including those who are deaf-blind, can better understand the content at their own pace.
Success Criterion (SC)
An alternative for time-based media is provided for all prerecorded synchronized media and for all prerecorded video-only media.
Intent
The intent of this Success Criterion is to make audio-visual material available to individuals whose vision is too poor to reliably read captions and whose hearing is too poor to reliably hear dialogue and audio description. This is done by providing an alternative for time-based media in the same human language as the video or page on which it appears.
This approach involves providing all of the information in the synchronized media (both visual and auditory) in text form. An alternative for time-based media provides a running description of all that is going on in the synchronized media content. The alternative for time-based media reads something like a book. Unlike audio description, the description of the video portion is not constrained to just the pauses in the existing dialogue. Full descriptions are provided of all visual information, including visual context, actions and expressions of actors, and any other visual material. In addition, non-speech sounds (laughter, off-screen voices, etc.) are described, and transcripts of all dialogue are included. The sequence of descriptions and dialogue transcripts is the same as the sequence in the synchronized media itself. As a result, the alternative for time-based media can provide a much more complete representation of the synchronized media content than audio description alone.
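One way to picture the structure described above is as an ordered list of entries, each a visual description, a non-speech sound, or a line of dialogue. The following is a minimal sketch under that assumption. The `Entry` class, the kind constants, and `render_alternative` are hypothetical names invented for illustration; WCAG does not define any data format for an alternative for time-based media.

```python
from dataclasses import dataclass

# Hypothetical entry kinds; the names are illustrative, not defined by WCAG.
VISUAL = "visual"      # description of visual context, actions, expressions
SOUND = "sound"        # non-speech sound such as laughter
DIALOGUE = "dialogue"  # transcribed speech


@dataclass
class Entry:
    start_seconds: float  # position of the event in the media timeline
    kind: str             # one of VISUAL, SOUND, DIALOGUE
    text: str
    speaker: str = ""     # only used for dialogue entries


def render_alternative(entries):
    """Return the entries as plain text, in the order they occur in the media."""
    lines = []
    # sorted() is stable, so entries sharing a timestamp keep their given order.
    for entry in sorted(entries, key=lambda e: e.start_seconds):
        if entry.kind == DIALOGUE:
            lines.append(f"{entry.speaker}: {entry.text}")
        elif entry.kind == SOUND:
            lines.append(f"[{entry.text}]")
        else:
            lines.append(entry.text)
    return "\n".join(lines)


# Made-up sample content, deliberately listed out of order.
entries = [
    Entry(4.0, DIALOGUE, "Press the green button to start.", speaker="Trainer"),
    Entry(0.0, VISUAL, "A trainer stands beside a ticket kiosk and smiles."),
    Entry(6.5, SOUND, "kiosk beeps"),
]
print(render_alternative(entries))
```

Because the output is ordinary text in media order, it can be read at the user's own pace, for example on a refreshable braille display. Unlike audio description, the descriptions are not limited to the length of pauses in the dialogue.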
If there is any interaction as part of the synchronized media presentation (e.g., "press now to answer the question") then the alternative for time-based media would provide hyperlinks or whatever is needed to provide parallel functionality.
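As a rough illustration of checking for parallel functionality, the sketch below compares the interactions present in the media with the links offered in the text alternative. The function name, the interaction ids, and the URL are all hypothetical and exist only for this example.

```python
def missing_interactions(media_interactions, alternative_links):
    """Return ids of media interactions with no equivalent in the text alternative.

    media_interactions: ids of time-based prompts in the media (hypothetical).
    alternative_links: maps an interaction id to the link offered in the
        text alternative; a missing or empty value counts as no equivalent.
    """
    return [i for i in media_interactions if not alternative_links.get(i)]


# Made-up sample data.
media_interactions = ["answer-question-1", "skip-intro"]
alternative_links = {"answer-question-1": "quiz.html#question-1"}
print(missing_interactions(media_interactions, alternative_links))
```

An empty result would suggest that every interaction in the media has a counterpart. Whether each counterpart actually achieves the same outcome still needs human review.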
Individuals whose vision is too poor to reliably read captions and whose hearing is too poor to reliably hear dialogue can access the alternative for time-based media by using a refreshable braille display.
Note
1.2.3, 1.2.5, and 1.2.8 overlap somewhat with each other. This is to give the author some choice at the minimum conformance level, and to provide additional requirements at higher levels. At Level A, under Success Criterion 1.2.3, authors have the choice of providing either an audio description or a full text alternative. To conform at Level AA, under Success Criterion 1.2.5 authors must provide an audio description. This requirement is already met if they chose audio description for 1.2.3; otherwise it is an additional requirement. At Level AAA, under Success Criterion 1.2.8, they must provide an extended text description. This is an additional requirement if both 1.2.3 and 1.2.5 were met by providing an audio description only. If, however, 1.2.3 was met by providing a text description and the 1.2.5 requirement for an audio description was also met, then 1.2.8 does not add new requirements.
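The overlap described in this note can be summarized as a small truth table. The sketch below is a simplification that assumes prerecorded synchronized media and considers only whether an audio description and a full text alternative are provided. The function name `criteria_met` is hypothetical, and the result is not a conformance evaluation.

```python
def criteria_met(has_audio_description: bool, has_text_alternative: bool) -> dict:
    """Map each Success Criterion to whether the provided alternatives satisfy it."""
    return {
        # Level A: either an audio description or a full text alternative.
        "1.2.3": has_audio_description or has_text_alternative,
        # Level AA: an audio description is required.
        "1.2.5": has_audio_description,
        # Level AAA: a full text alternative is required.
        "1.2.8": has_text_alternative,
    }


# Audio description only: 1.2.3 and 1.2.5 are met, 1.2.8 is still outstanding.
print(criteria_met(has_audio_description=True, has_text_alternative=False))

# Both provided: 1.2.8 adds nothing beyond what 1.2.3 and 1.2.5 required.
print(criteria_met(has_audio_description=True, has_text_alternative=True))
```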
Benefits
- People who are deaf-blind (who cannot see well or at all and also cannot hear well or at all) can get access to information in audio-visual presentations.
Examples
- Example 1. Alternative for time-based media for a training video
- A community center purchases a training video for use by its clients and puts it on the center's intranet. The video explains how to use a new technology and shows a person talking and demonstrating things at the same time. The community center provides an alternative for time-based media that all clients, including those who can neither see the demonstrations nor hear the explanations in the synchronized media, can use to better understand what is being presented.
Related Resources
Resources are for information purposes only, no endorsement implied.
- Transcripts, in Making Audio and Video Media Accessible, W3C Web Accessibility Initiative (WAI)
- uiAccess list of transcription services
- Transcripts on the Web: Getting people to your podcasts and videos
Techniques
Each numbered item in this section represents a technique or combination of techniques that the Accessibility Guidelines Working Group deems sufficient for meeting this Success Criterion. A technique may go beyond the minimum requirement of the criterion. There may be other ways of meeting the criterion not covered by these techniques. For information on using other techniques, see Understanding Techniques for WCAG Success Criteria, particularly the "Other Techniques" section.
Sufficient Techniques
Select the situation below that matches your content. Each situation includes techniques or combinations of techniques that are known and documented to be sufficient for that situation.
Situation A: If the content is prerecorded synchronized media:
- G69: Providing an alternative for time based media using one of the following techniques:
- Linking to the alternative for time-based media using one of the following techniques:
Situation B: If the content is prerecorded video-only:
Failures
The following are common mistakes that are considered failures of this Success Criterion by the Accessibility Guidelines Working Group.
Key Terms
- alternative for time-based media
document including correctly sequenced text descriptions of time-based visual and auditory information and providing a means for achieving the outcomes of any time-based interaction
Note
A screenplay used to create the synchronized media content would meet this definition only if it was corrected to accurately represent the final synchronized media after editing.
- ASCII art
picture created by a spatial arrangement of characters or glyphs (typically from the 95 printable characters defined by ASCII)
- assistive technology
hardware and/or software that acts as a user agent, or along with a mainstream user agent, to provide functionality to meet the requirements of users with disabilities that go beyond those offered by mainstream user agents
Note 1
Functionality provided by assistive technology includes alternative presentations (e.g., as synthesized speech or magnified content), alternative input methods (e.g., voice), additional navigation or orientation mechanisms, and content transformations (e.g., to make tables more accessible).
Note 2
Assistive technologies often communicate data and messages with mainstream user agents by using and monitoring APIs.
Note 3
The distinction between mainstream user agents and assistive technologies is not absolute. Many mainstream user agents provide some features to assist individuals with disabilities. The basic difference is that mainstream user agents target broad and diverse audiences that usually include people with and without disabilities. Assistive technologies target narrowly defined populations of users with specific disabilities. The assistance provided by an assistive technology is more specific and appropriate to the needs of its target users. The mainstream user agent may provide important functionality to assistive technologies like retrieving web content from program objects or parsing markup into identifiable bundles.
- audio
the technology of sound reproduction
Note
Audio can be created synthetically (including speech synthesis), recorded from real world sounds, or both.
- audio description
narration added to the soundtrack to describe important visual details that cannot be understood from the main soundtrack alone
Note 1
Audio description of video provides information about actions, characters, scene changes, on-screen text, and other visual content.
Note 2
In standard audio description, narration is added during existing pauses in dialogue. (See also extended audio description.)
Note 3
Where all of the important video information is already provided in existing audio, no additional audio description is necessary.
Note 4
Also called "video description" and "descriptive narration."
Note 5
In the context of audio descriptions, synchronized media predominantly describes audio and video technologies that are combined and played in tandem, which allows an author to align the timing of the audio descriptions in the audio track with the appearance of the relevant visuals in the video. In pre-recorded content, there can be intentionally asynchronous sound or images; however, they will always occur at the same time on each playback of synchronized media.
- captions
synchronized visual and/or text alternative for both speech and non-speech audio information needed to understand the media content
Note 1
Captions are similar to dialogue-only subtitles except captions convey not only the content of spoken dialogue, but also equivalents for non-dialogue audio information needed to understand the program content, including sound effects, music, laughter, speaker identification and location.
Note 2
Closed Captions are equivalents that can be turned on and off with some players.
Note 3
Open Captions are any captions that cannot be turned off, for example, captions that are visual equivalent images of text embedded in the video.
Note 4
Captions should not obscure or obstruct relevant information in the video.
Note 5
In some countries, captions are called subtitles.
Note 6
Audio descriptions can be, but do not need to be, captioned since they are descriptions of information that is already presented visually.
- extended audio description
audio description that is added to an audiovisual presentation by pausing the video so that there is time to add additional description
Note
This technique is only used when the sense of the video would be lost without the additional audio description and the pauses between dialogue/narration are too short.
- human language
language that is spoken, written or signed (through visual or tactile means) to communicate with humans
Note
See also sign language.
- image of text
text that has been rendered in a non-text form (e.g., an image) in order to achieve a particular visual effect
Note
This does not include text that is part of a picture that contains significant other visual content.
- live
information captured from a real-world event and transmitted to the receiver with no more than a broadcast delay
Note 1
A broadcast delay is a short (usually automated) delay, for example used in order to give the broadcaster time to cue or censor the audio (or video) feed, but not sufficient to allow significant editing.
Note 2
If information is completely computer generated, it is not live.
- media alternative for text
media that presents no more information than is already presented in text (directly or via text alternatives)
Note
A media alternative for text is provided for those who benefit from alternate representations of text. Media alternatives for text may be audio-only, video-only (including sign-language video), or audio-video.
- non-text content
any content that is not a sequence of characters that can be programmatically determined or where the sequence is not expressing something in human language
Note
This includes ASCII art (which is a pattern of characters), emoticons, leetspeak (which uses character substitution), and images representing text.
- prerecorded
information that is not live
- programmatically determined
determined by software from author-supplied data provided in a way that different user agents, including assistive technologies, can extract and present this information to users in different modalities
- sign language
a language using combinations of movements of the hands and arms, facial expressions, or body positions to convey meaning
- synchronized media
audio or video synchronized with another format for presenting information and/or with time-based interactive components, unless the media is a media alternative for text that is clearly labeled as such
- text
sequence of characters that can be programmatically determined, where the sequence is expressing something in human language
- text alternative
Text that is programmatically associated with non-text content or referred to from text that is programmatically associated with non-text content. Programmatically associated text is text whose location can be programmatically determined from the non-text content.
Note
Refer to Understanding Text Alternatives for more information.
- user agent
any software that retrieves and presents web content for users
- video
the technology of moving or sequenced pictures or images
Note
Video can be made up of animated or photographic images, or both.
- video-only
a time-based presentation that contains only video (no audio and no interaction)
Test Rules
The following are Test Rules for certain aspects of this Success Criterion. It is not necessary to use these particular Test Rules to check for conformance with WCAG, but they are defined and approved test methods. For information on using Test Rules, see Understanding Test Rules for WCAG Success Criteria.