Perceived Loudness (EBU R 128)

EBU: European Broadcasting Union

Example of Implementation(Source Code):

https://github.com/jiixyj/libebur128

The specification EBU R 128 / ITU-R Bs.1770-4 describes an algorithm for calculation of the perceived loudness of real world audio signals, for example no pure sine tones.

For details of the recommendations please see EBU R128 Loudness normalisation and permitted maximum level of audio signals.

LUFS

k frequency weighting function

the k-weighting approximates the characteristics of the human ear

EBU R 128 loudness is:

  • measured in LUFS (Loudness Units Full Scale, or equivalently LKFS, Loudness k-weighted Full Scale) for absolute values or LU (Loudness Units) for relative values, whereat 0 LUFS are equivalent to full scale.
  • The loudness is defined as the k-weighted (see Filter Curve EQ or Graphic EQ) sum of the squared values of all channels, in which the k-weighting approximates the characteristics of the human ear. Thus, LU are somewhat related to the mean square without root.
  • Usually, the perceived loudness is calculated in blocks of 400 milliseconds every 100 milliseconds** (so called momentary loudness).
  • LU's are defined from a physical point of view that multiple audio sources add up within air and thus the loudness increases by adding another audio source. A consequence of the LU definition is that the amplitude of multi-channel audio decreases if another channels is added while the loudness is kept constant.

Multi-channel audio

In multi-channel audio, the individual channels are weighted before adding them to the total loudness. In Audacity's case which supports only mono and stereo tracks at the time of writing, all channels have equal contribution.

LU's are defined from a physical point of view that multiple audio sources add up within air and thus the loudness increases by adding another audio source. A consequence of the LU definition is that the amplitude of multi-channel audio decreases if another channels is added while the loudness is kept constant. While this point of view is completely valid it causes problems together with all known PC audio drivers. Those drivers simply double mono signals that are played on stereo speakers without attenuating their value which results in the double playback loudness. To compensate this, Audacity's Loudness Normalization effect has an option to take this loudness doubling into account.

Integrative loudness and loudness gating

Loudness gating example

Since the perceived loudness is mostly determined by the louder parts of a track,

an accurate loudness measurement has to ignore quier or even silent parts.

This is realized by loudness gating.

  • Absolute Threshold: First, loudness gating as defined by EBU R 128 discards all measurement blocks that are below an absolute threshold of -70 LUFS(grey in the example histogram, the large peak in the lowest bin is caused by silence at the start and end of the track).
  • Relative Threshold: In the next step, a relative threshold is calculated 10 LU below the average of the blocks above -70 LUFS(red in the example histogram, since the average is -40 LUFS).

    All blocks below that relative threshold are discarded as well.
  • the Perceived Loudness: Finally, the perceived loudness is calculated as the average of the remaining blocks (blue in the example histogram).

THE SPECIFICATION OF EBU R 128

R 128 LOUDNESS NORMALISATION AND PERMITTED MAXIMUM LEVEL OF AUDIO SIGNALS

Status: EBU Recommendation

Geneva, November 2023

EBU R 128-2023

Audio loudness normalisation & permitted maximum level

Loudness normalisation and permitted maximum level of audio signals

The EBU has studied the needs of audio signal levels in production, distribution and transmission of broadcast programmes.

  • It is of the opinion that an audio-levelling paradigm based on loudness is needed.
  • The EBU recommends the measurement of the average loudness of a programme ('Programme Loudness') for the normalisation of audio signals.
  • The 'Maximum True Peak Level' of an audio signal should be used to check compliance with the upper technical limit of the signal chain.
  • The measures 'Loudness Range', 'Maximum Momentary Loudness' and 'Maximum Short-term Loudness' can be used to further characterise an audio signal as well as to fulfil the aesthetic needs of each programme/station depending on the genre(s), the target audience and the distribution platform.

The EBU, considering:

  1. a) that peak normalisation of audio signals has led to considerable loudness differences between programmes and between broadcast channels;
  2. b) that the resulting loudness inconsistencies between programmes and between channels are the cause of the most viewer/listener complaints;
  3. c) that, when used to read peaks in the usual way, the QPPM(Quasi-Peak Programme Meter) specified in EBU Tech 3205-E [1] does not reflect the loudness of an audio signal, and that the QPPM is not designed to indicate a long-term average;
  4. d) that with the proliferation of digital production, distribution and transmission systems, the permitted maximum level of an audio signal specified in ITU-R BS.645 [2] is no longer appropriate;
  5. e) that measuring audio programme loudness has been defined in ITU-R BS.1770 [3], introducing the measures LU (Loudness Unit) and LUFS (Loudness Units, referenced to Full Scale)1;
  6. f) and that the level-gated measurement of Programme Loudness defined in ITU-R BS.1770 Eq.(7) (which hence measures foreground loudness) is advantageous to improve the loudness matching of programmes with a wide loudness range;

recommends:

7. g) that the measure Programme Loudness shall generally be used to normalise an audio signal;

8. h) that the Programme Loudness Level shall be normalised to a Target Level of −23.0 LUFS. Where attaining the Target Level is not achievable practically (for example, live programmes), a tolerance of ±1.0 LU is permitted. A broadcaster should ensure that a deviation from the Target Level towards the limits of the tolerance does not become standard practice;

9. i) that for the implementation of Loudness workflows (for example, in Quality Control environments) a tolerance of ±0.2 LU is allowed in order to take account of measurement errors;

10. j) that in special cases the Programme Loudness Level may be normalised to a Target Level lower than −23.0 LUFS on purpose. This exception shall be clearly indicated to ensure that such a lower Programme Loudness Level is not compensated;

11. k) that the measurement shall be made with a loudness meter compliant with ITU-R BS.1770 (including the level-gating method described in equation (7)) and EBU Tech 3341 [4];

12. l) that the audio signal shall generally be measured in its entirety, without emphasis on specific foreground elements such as speech, music or sound effects;

13. m)and that the True Peak Level of a programme shall not exceed −1 dBTP (dB True Peak) during production (linear audio), measured with a meter compliant with ITU-R BS.1770 and EBU Tech 3341. The measurement tolerance is ±0.3 dB (for signals with a bandwidth limited to 20 kHz). Permitted Maximum True Peak Levels may be lower for different distribution systems and data reduction rates. A broadcaster should check EBU Tech 3344 [5] for details;

The EBU further recommends:

14. n) that the measure Loudness Range (measured in compliance with EBU Tech 3342 [6]) may be used to evaluate the loudness variation of a programme2, its potential subsequent dynamic treatment and the dynamic integrity of a distribution path;

15. o) that Maximum Momentary Loudness and Maximum Short-term Loudness (measured in compliance with EBU Tech 3341) may be used to determine if a programme exceeds the upper loudness tolerance limit of the target audience;

16. p) that Loudness Metadata shall correctly indicate the actual Programme Loudness. Additional metadata may be used by the broadcaster to ensure a playback loudness level deviating from the Target Level (for example, for programmes according to item j));

17. q) that production and normalisation of short-form content (adverts; promos etc.) should be made in compliance with EBU R 128 s1 [7];

18. r) that guidance for the normalisation of content for streaming is given in EBU R 128 s2 [8];

19. s) that guidance for Loudness in Radio is given in EBU R 128 s3 [9] and EBU Tech 3401 [10];

20. t) that guidance for Loudness normalisation of Cinematic Content is given in EBU R 128 s4 [11];

21. u) that audio processes, systems and operations concerning production of programmes should be made in compliance with EBU Tech 3343 [12];

22. v) and that audio processes, systems and operations concerning distribution and reproduction of programmes should be made in compliance with EBU Tech 3344.

Notes :

  1. For programmes shorter than 1 minute, the use of the measure Loudness Range is not recommended due to too few data points (Loudness Range is based on the Short-term-Loudness values (3-seconds-window)).
  2. LUFS' is equivalent to 'LKFS' (which is used in ITU-R BS.1770).

    The EBU uses 'LUFS' which is compliant with international naming conventions.

Definitions

  • Programme: An individual, self-contained audio-visual or audio-only item to be presented in Radio, Television or other electronic media. An advertisement (commercial), trailer, promotional item (‘promo’), interstitial or similar item (“Short-form Content”) shall also be considered to be a programme in this context.
  • Programme Loudness: The integrated loudness over the duration of a programme -Programme Loudness Level is the value (in LUFS) of Programme Loudness.
  • Loudness Range (LRA): The distribution of loudness within a programme
  • Maximum True Peak Level: The maximum value of the audio signal waveform of a programme in the continuous time domain

References

[1] EBU Tech 3205-E 'The EBU standard peak-programme meter for the control of international transmissions'

[2] ITU-R BS.645 'Test signals and metering to be used on international sound programme connections'

[3] ITU-R BS.1770 'Algorithms to measure audio programme loudness and true-peak audio level'

[4] EBU Tech 3341 'Loudness Metering: "EBU Mode" metering to supplementloudness normalisation in accordance with EBU R 128'

[5] EBU Tech 3344 'Guidelines for Distribution and Reproduction of Programmes in accordance with EBU R 128'

[6] EBU Tech 3342 'Loudness Range: A measure to supplement loudness normalisation in accordance with EBU R 128'

[7] EBU R 128 s1 'Loudness Parameters for Short-form Content (Adverts; Promos etc.)' supplement 1 to EBU R 128

[8] EBU R 128 s2 'Loudness in Streaming'; supplement 2 to EBU R 128

[9] EBU R 128 s3 'Loudness in Radio; supplement 3 to EBU R 128

[10] EBU Tech 3401 'Guidelines for Radio production and distribution in accordance with EBU R 128'

[11] EBU R 128 s4 'Loudness normalization for Cinematic Content; supplement 4 to EBU R 128'

[12] EBU Tech 3343 'Guidelines for Production of Programmes in accordance with EBU R 128'

SciTech-AV-Audio-DAP(Digital Audio Processing)-Perceived Loudness(感知响度)-EBU R 128 / ITU-R Bs.1770-4的更多相关文章

  1. Building a Radio Listening Station to Decode Digital Audio & Police Dispatches

    On April 7, 2017, residents in Dallas, Texas, woke to the sound of emergency sirens blaring all over ...

  2. Digital image processing(数字图像处理)

    In computer science, digital image processing is the use of computer algorithms to perform image pro ...

  3. Digital Image Processing 学习笔记3

    第三章 灰度变换与空间滤波 3.1 背景知识 3.1.1 灰度变换和空间滤波基础 本章节所讨论的图像处理技术都是在空间域进行的.可以表示为下式: $$g(x, y) = T[f(x,y)]$$ 其中$ ...

  4. 信号处理的好书Digital Signal Processing - A Practical Guide for Engineers and Scientists

    诚心给大家推荐一本讲信号处理的好书<Digital Signal Processing - A Practical Guide for Engineers and Scientists>[ ...

  5. Digital Audio - Creating a WAV (RIFF) file

    Abstract:This tutorial covers the creation of a WAV (RIFF) audio file. It covers bit size, sample ra ...

  6. audio 基本功能实现(audio停止播放,audio如何静音,audio音量控制等)

    audio最简单原始的播放.暂停.停止.静音.音量大小控制的功能,注意某些浏览器会有权限无法自动播放噢(video也会如此) <!doctype html> <html> &l ...

  7. c# Use NAudio Library to Convert MP3 audio into WAV audio(将Mp3格式转换成Wav格式)

    Have you been in need of converting mp3 audios to wav audios?  If so, the skill in this article prov ...

  8. audio标签以及audio对象

    一.audio标签 简单语法 <audio src="音频链接"></audio> 属性 属性 值 描述 autoplay 如果出现该属性,则音频在就绪后马 ...

  9. Digital Imaging Processing 数字图像处理

    8-Bit and 16-Bit Images 关于量化压缩与量化补偿 RGB Bayer Color分析 彩色CCD/CMOS的格式和计算机中的读取格式

  10. Digital Image Processing 学习笔记2

    第二章 2.1视觉感知要素 2.1.1 人眼的结构 眼睛由角膜与巩膜外壳.脉络膜和视网膜包围,晶状体由通信的纤维细胞层组成,并由附在睫状体上的纤维悬挂:视网膜上分布两类光感受器(锥状体和杆状体),他们 ...

随机推荐

  1. 6.4K star!轻松搞定专业领域大模型推理,这个知识增强框架绝了!

    嗨,大家好,我是小华同学,关注我们获得"最新.最全.最优质"开源项目和高效工作学习方法 「垂直领域大模型落地难?逻辑推理总出错?这个来自OpenSPG的开源框架,让专业领域知识服务 ...

  2. 操作系统综合题之“采用时间片轮转调度算法(Round-Robin,RR)执行,分时系统中的进程可能出现的状态变化”

    一.问题:某分时系统中的进程可能出现下图所示的状态变化,请回答下列问题: 1.根据图示,您认为该系统采用的是什么进程调度策略? 2.把图中所示的每一个状态变化的原因填在下表相应位置. 变化 原因 1 ...

  3. Linux之删除命令

    一.格式 只能删除文件夹 rmdir 文件名 删除文件 rm 是强大的删除命令,它可以 永久性地 删除文件系统中指定的文件或目录. 在使用 rm 命令删除文件或目录时,系统不会产生任何提示信息.此命令 ...

  4. 在鸿蒙NEXT中实现完全自定义导航栏

    在日常app开发中,导航栏扮演着重要的角色.鸿蒙提供了系统导航栏Navigation,它支持很多属性的修改,但是应用需求更加灵活多变,比如有的导航栏有背景图片,有的导航栏要求渐变色,有的导航栏需要随时 ...

  5. vue3 基础-API-响应式 toRef, context

    上篇讲了构造响应式变量 ref 和 reactive 函数, 其中 ref 针对基础类型, reactive 针对引用类型. 而 toRefs 可以将响应式变量进行结构后, 仍然具备响应式. 而本篇继 ...

  6. 【公众号搬运】React-Native开发鸿蒙NEXT(6)

    .markdown-body { line-height: 1.75; font-weight: 400; font-size: 16px; overflow-x: hidden; color: rg ...

  7. Java 删除目录下相同文件

    摘要:通过MD5算法对指定目录下的文件去重. 背景   之前写了一篇博客<Java判断两个文件是否相同>,介绍如何校验两个文件内容是否相同,但是,不适用于对目录下文件去重的场景,故对其进行 ...

  8. Docker和K8S存在的意义

    之前一直没搞明白为什么要用docker和k8s,下面简单说一下: 先说现实中需求: 1.服务器上的环境.数据,有时需要迁移 2.服务器上的资源,需要动态伸缩,比如双十一的时候搞促销,就需要扩容更多的服 ...

  9. 卡掉hash的方法

    大质数hash 通常,这个质数会选择在 \(10^9\) 附近,如 \(998244353\),\(10^9+7\). 考虑生日碰撞,欲达到 50% 成功率,需要尝试的次数为 \[\begin{ali ...

  10. 我们不可能永远都在救火 ——Scrum中技术债务“偿还”指南

    技术债务是指开发人员为了加速软件开发,在应该采用最佳方案时进行了妥协,改用了短期内能加速软件开发的方案,以至于未来给自己带来额外的开发负担. 软件工程师 Ward Cunningham首次将技术的复杂 ...