BROADCAST FILM AND VIDEO DIRECTORY

Broadcast News

05/06/2014

A Viable Economic Alternative to AD...

Both Spoken Subtitling and Audio Description may be candidate services that could benefit from the application of Text-To-Speech technology, says John Birch, Strategic Partnerships Manager, Screen Systems

Speech synthesis has improved dramatically in recent years, with computer generated voices that can convey emphasis and emotion, without excessive need for 'mark-up' within the text used as input. In addition, recent research indicates that users may be happy with computer generated speech for Audio Description if it leads to more provision. As provision of Audio Description is being increasingly mandated, it is suggested that Text To Speech is a viable economic alternative to using voice talents, particularly for non-premium channels.

Audio Description
Traditionally, a trained 'describer' identifies appropriate points in the audio timeline where a description is needed (and can be placed) and produces a script. This process has much in common with captioning, but, perhaps for historical reasons, is almost always performed separately. Typically, the ‘describer’ voices and records the descriptions, although sometimes the ‘voicing’ is performed by a separate 'voice talent'. The (mono) recordings of each description are used to produce a full-length audio description track, which may be 'pre-mixed' to create a separate audio track in advance, or live mixed with the original audio at play-out.
Live mixing generally uses the mono description track and a control track. This control track (sometimes termed a ‘warble track’ because of its sound) contains a low rate digital signal that encodes pan and fade information. This information defines how the descriptive audio should be mixed with the original audio, allowing the balance between the description and the original audio to be controlled.

Spoken Subtitles
Spoken Subtitles are currently provided by national television broadcasters in some European countries as a service to provide accessibility for the blind and partially sighted viewers.
Spoken Subtitles supplement an Audio Description service and replace the inaccessible foreign language narrative provided as text subtitles. Unlike Audio Description, Spoken Subtitles are traditionally provisioned using Text To Speech, because the textual data is already available in the form of subtitle files and adding machine ‘reading’ of these is not operationally challenging. It is highly unusual for Spoken Subtitles to receive any special preparatory effort (for example, to match the voice with the gender of the speaker).
It should be understood that the 'quality' requirements of Spoken Subtitles may be different to Audio Description. Spoken Subtitles can be easily provided automatically for all programmes that have translation subtitles by default. In some regions and channels this is ALL programmes. As the spoken subtitles are automatically derived from the subtitle texts, the original spoken audio (in a foreign language) may be left audible, as it carries hint information such as the mood and gender of speaker. (The typical Audio Description practise of muting the original audio would detract from the quality of the viewing experience as audible cues would then be removed).

Technical Implementation
Both Spoken Subtitles and Audio Description have a common root in a timed script file. From this timed script file, audio is created. The main difference between the two practices is the 'typical' method of audio creation, using ‘voice talents’ for Audio Description and using Text To Speech for Spoken Subtitles. Additionally there may be a difference in the mixing of audio tracks due to a desire to retain connected information in the original program audio for Spoken Subtitles. From a technical perspective, both Spoken Subtitles and Audio Description may be provisioned using the same Text To Speech 'engine'.

Live insertion at programme playout
Screen has developed an output driver for our Polistream subtitle and caption transmission system that acts like all other Polistream output encoders. This specialist Polistream module receives 'subtitle texts' and renders them (using a programming interface called SAPI 5) to drive a Text to Speech engine and produces audio snippets.
The module will attempt to fit the generated audio snippets into the available time by re-rendering audio that is too long (by speeding up the spoken rate). If a delay in audio snippets occurs then the module will cut and fade the generated audio snippets. 'Live' Audio Description can also be rendered using the module, but in this case, the duration is unknown ahead of time, so there is no rate modification.
The specialist module can also detect the presence of an audio filename (hidden as metadata) in the subtitle file, in which case the identified sound file is loaded instead of performing a Text To Speech operation. This allows for traditional 'voice-talent' produced Audio Description to be supported, allowing a combination of Text To Speech and ‘voice-talent’ produced Audio Description, or the playout of pre-recorded voiceovers or other short audio prompts.
The same technology is also available as a module for our MediaMate product, to allow offline processing for file based workflows.

The article is also available in BFV online

(IT)

Top Related Stories
Click here for the latest broadcast news stories.

11/11/2016
What Is The Future For Immersive Audio?
Peter Poers, Managing Director at Jünger Audio, looks at production efforts versus consumer experience. Introduction Along with the evolution of highe

02/11/2009
Itfc Provides Subtitling And Audio Description For Michael Jackson Film
itfc, the leading London-based media access services provider, has completed work on This is It, a film that followed Michael Jackson as he prepared f

20/02/2024
NADiV Audio Introduces Range Of Dante Audio And Control Devices
NADiV Audio has launched its NADiV range of Dante-enabled audio interface and control devices for portable and installed AV and pro audio environments

28/07/2023
DHD audio Unveils XS3 Core Audio Processor
DHD audio has announced a new addition to its modular range of audio studio equipment and systems. The XS3 core audio processor supports up to 20 ster

19/04/2023
Bridge Technologies Adds Dolby E Monitoring To VB440
Bridge Technologies is to demonstrate the enhanced audio functionalities that they have brought to their leading production probe, the VB440 at NAB 20

10/04/2002
Sonic introduces DVD-Audio Centre LE
Sonic Solutions have introduced the DVD-Audio Creator LE – a highly-affordable DVD-Audio authoring system with advanced features. Incorporating core t

20/04/2023
Nomono Sound Capsule Now Shipping
Nomono has announced its groundbreaking Nomono Sound Capsule, a cloud-connected, self-contained recording kit for capturing audio in the field, is now

21/11/2018
Subtitling Is A Profit-Boosting Opportunity For Broadcasters
They are not limited to just being used as translation devices for foreign films or only of benefit to the hearing-impaired. Therefore, why is it that

09/08/2012
Blackmagic Design Announces Major New Software Update
Blackmagic Design has announced a major new software update that adds full-audio mixing capability to its ATEM 1 M/E Production Switcher and ATEM Tele

22/05/2023
Synthax Audio Appointed Distributor For TIERRA Audio
Synthax Audio UK has been appointed UK and Ireland distributor for TIERRA Audio's range of professional audio products. Founded in 2018 in Madrid, Spa

22/07/2015
Jünger Audio Prototype For IBC 2015
Jünger Audio will use IBC 2015 to showcase a prototype audio monitoring solution that will allow broadcasters to check the quality of all immersive au

17/07/2023
ES-Pro Audio Appointed To Handle Prism Sound's Range Of Audio Converters
Prism Sound has appointed ES-Pro Audio to handle its entire range of audio converters to the professional market in Germany. Formerly a Prism Sound re

01/03/2013
Jünger Audio Helps Two US Broadcasters Comply With CALM
WBPH-TV and WMBC-TV choose Jünger Audio’s T*AP Television Audio Processor as their "one-box" Loudness Control solution Berlin, Germany: The implementa

16/10/2015
Production News : French Drama First To Be Broadcast With Audio Description
French drama series The Returned is to become the first drama broadcast by Channel 4 with audio description. The audio description of the eight-part d

15/01/2013
Jünger Audio Crontrols Loudness For TVE
Spain's national broadcaster TVE has selected Jünger Audio’s loudness control technology to normalize the audio across a number of its television chan

More News From Around The Web

Broadcast News

A Viable Economic Alternative to AD...

Top Related StoriesClick here for the latest broadcast news stories.

Top Related Stories
Click here for the latest broadcast news stories.