PulseAudio
PulseAudio is a network-capable sound server program distributed via the freedesktop.org project. It runs mainly on Linux, various BSD distributions such as FreeBSD and OpenBSD, macOS, as well as Illumos distributions and the Solaris operating system. Microsoft Windows was previously supported via the MinGW toolchain (implementation of the GNU toolchain, which includes various tools such as GCC and binutils). The Windows port has not been updated since 2011, however.[5]
Developer(s) | Lennart Poettering Pierre Ossman Shahms E. King Tanu Kaskinen Colin Guthrie Arun Raghavan David Henningsson |
---|---|
Initial release | 17 July 2004[1] |
Stable release | 13.0[2]
/ 13 September 2019 |
Repository | gitlab |
Written in | C[3] |
Operating system | FreeBSD, NetBSD, OpenBSD, Linux, Illumos, Solaris, macOS, and Microsoft Windows (not maintained) |
Platform | ARM, PowerPC, x86 / IA-32, x86-64, and MIPS |
Type | Sound server |
License | GNU Lesser General Public License 2.1[4] |
Website | pulseaudio.org |
PulseAudio is free and open-source software, and is licensed under the terms of the GNU Lesser General Public License version 2.1.[4]
It was created in 2004 under the name Polypaudio but was renamed in 2006 to PulseAudio.[6]
Software architecture
In broad terms ALSA is a kernel subsystem that provides the sound hardware driver, and PulseAudio is the interface engine between Applications and ALSA.
PulseAudio acts as a sound server, where a background process accepting sound input from one or more sources (processes, capture devices, etc.) is created. The background process then redirects these sound sources to one or more sinks (sound cards, remote network PulseAudio servers, or other processes).[7]
One of the goals of PulseAudio is to reroute all sound streams through it, including those from processes that attempt to directly access the hardware (like legacy OSS applications). PulseAudio achieves this by providing adapters to applications using other audio systems, like aRts and ESD.
In a typical installation scenario under Linux, the user configures ALSA to use a virtual device provided by PulseAudio. Thus, applications using ALSA will output sound to PulseAudio, which then uses ALSA itself to access the real sound card. PulseAudio also provides its own native interface to applications that want to support PulseAudio directly, as well as a legacy interface for ESD applications, making it suitable as a drop-in replacement for ESD.
For OSS applications, PulseAudio provides the padsp
utility, which replaces device files such as /dev/dsp
, tricking the applications into believing that they have exclusive control over the sound card. In reality, their output is rerouted through PulseAudio.
libcanberra
libcanberra is an abstract API for desktop event sounds and a total replacement for the "PulseAudio sample cache API":
- Complies with the XDG Sound Theme and Naming Specifications.
- Defines a simple abstract interface for playing event sounds.[8]
- Interfaces with ALSA through libasound.[9]
- Has a back-end to PulseAudio.[10]
libSydney
libSydney is a total replacement for the "PulseAudio streaming API", and plans have been made for libSydney to eventually become the only audio API used in PulseAudio.[11]
Features
The main PulseAudio features include:[7]
- Per-application volume controls.[12]
- An extensible plugin architecture with support for loadable modules.
- Compatibility with many popular audio applications.[13]
- Support for multiple audio sources and sinks.
- Low latency operation and latency measurement.
- A zero-copy memory architecture for processor resource efficiency.
- Ability to discover other computers using PulseAudio on the local network and play sound through their speakers directly.
- Ability to change which output device applications use to play sound through while they are playing sound (Applications do not need to support this, PulseAudio is capable of doing this without applications detecting that it has happened)
- A command-line interface with scripting capabilities.
- A sound daemon with command line reconfiguration capabilities.
- Built-in sample conversion and resampling capabilities.
- The ability to combine multiple sound cards into one.
- The ability to synchronize multiple playback streams.
- Bluetooth audio device support with dynamic detection capabilities.
- The ability to enable system wide equalization.
Adoption
PulseAudio first appeared for regular users in Fedora Linux, starting with version 8,[14] then was adopted by major Linux distributions such as Ubuntu, Debian,[15] Mageia, Mandriva Linux, Linux Mint, openSUSE, and OpenWrt.[16] There is support for PulseAudio in the GNOME project, and also in KDE, as it is integrated into Plasma Workspaces, adding support to Phonon (the KDE multimedia framework) and KMix (the integrated mixer application) as well as a "Speaker Setup" GUI to aid the configuration of multi-channel speakers. PulseAudio is also available in the Illumos distribution OpenIndiana, and enabled by default in its MATE environment.
Various Linux-based mobile devices, including Nokia N900, Nokia N9 and the Palm Pre[17] use PulseAudio.
Tizen, an open-source mobile operating system, which is a project of the Linux Foundation and is governed by a Technical Steering Group (TSG) composed of Intel and Samsung, uses PulseAudio.
Problems during adoption phase
- The PortAudio API was incompatible with PulseAudio's design and needed to be modified.[18] Almost all packages using OSS and many of the packages using ALSA needed to be modified to support PulseAudio.[19] Further development of the glitch-free audio feature required a complete rewrite of the PulseAudio core, and also changes to the ALSA API and internals were needed.[20][21]
- When first adopted by distributions, PulseAudio developer Lennart Poettering (also the creator of systemd) described it as "the software that currently breaks your audio".[22] Poettering later claimed that "Ubuntu didn't exactly do a stellar job. They didn't do their homework" in adopting PulseAudio[23] for Ubuntu "Hardy Heron" (8.04), a problem that was improved with subsequent Ubuntu releases.[24] However, in October 2009, Poettering reported that he was still not happy with Ubuntu's integration of PulseAudio.[25]
- Interaction with old sound components by particular software: Certain programs, such as Adobe Flash for Linux, caused instability in PulseAudio.[26][27] Newer implementations of Flash plugins do not require the conflicting elements, and as a result Flash and PulseAudio are now compatible.
- Early management of buffer over/underruns: Earlier versions of PulseAudio sometimes started to distort the processed audio due to incorrect handling of buffer over/underruns.[28]
Related software
Other sound servers
JACK is a sound server that provides real-time, low latency (i.e. 5 milliseconds or less) audio performance and, since JACK2, supports efficient load balancing by utilizing symmetric multiprocessing; that is, the load of all audio clients can be distributed among several processors. JACK is the preferred sound server for professional audio applications such as Ardour, ReZound, and LinuxSampler; multiple free audio-production distributions use it as the default audio server.
It is possible for JACK and PulseAudio to coexist: while JACK is running, PulseAudio can automatically connect itself as a JACK client, allowing PulseAudio clients to make and record sound at the same time as JACK clients.[29]
PipeWire is an audio and video server that "aims to support the usecases currently handled by both PulseAudio and Jack".[30][31]
General audio infrastructures
Before JACK and PulseAudio, sound on these systems was managed by multi-purpose integrated audio solutions. These solutions do not fully cover the mixing and sound streaming process, but they are still used by JACK and PulseAudio to send the final audio stream to the sound card.
- ALSA provides a software mixer called dmix, which was developed prior to PulseAudio. This is available on almost all Linux distributions and is a simpler PCM audio mixing solution. It does not provide the advanced features (such as timer-based scheduling and network audio) of PulseAudio. On the other hand, ALSA offers, when combined with corresponding sound cards and software, low latencies.
- OSS was the original sound system used in Linux and other Unix operating systems, but was deprecated after the 2.5 Linux kernel.[32] Proprietary development was continued by 4Front Technologies, who in July 2007 released sources for OSS under CDDL for OpenSolaris and under GPL for Linux.[33] The modern implementation, Open Sound System v4, provides software mixing, resampling, and changing of the volume on a per-application basis; in contrast to PulseAudio, these features are implemented within the kernel. PulseAudio support in OpenIndiana and other illumos distributions relies on the in-kernel OSS implementation ("Boomer").
References
- "OldNews". freedesktop.org.
- Kaskinen, Tanu (13 September 2019). "PulseAudio 13.0". pulseaudio-discuss (Mailing list). Retrieved 13 September 2019.
- "PulseAudio", Analysis Summary, Open Hub
- "License", PulseAudio git, Free desktop, archived from the original on 4 March 2014, retrieved 16 June 2011
- PulseAudio on Windows
- The Project Formerly Known as Polypaudio
- "About", PulseAudio, Free desktop, retrieved 11 March 2013
- webmaster@debian.org, Debian Webmaster. "Debian -- Package Search Results -- libcanberra". packages.debian.org.
- webmaster@debian.org, Debian Webmaster. "Debian -- Package Search Results -- libasound". packages.debian.org.
- webmaster@debian.org, Debian Webmaster. "Debian -- Package Search Results -- libcanberra-pulse". packages.debian.org.
- Poettering, Lennart (8 February 2007). "FOMS/LCA Recap". 0pointer.de. Retrieved 13 March 2017.
- Poettering, Lennart, "Interviews", Fedora Project, Red Hat, retrieved 3 July 2009
- Pulse Audio wiki, PulseAudio, archived from the original on 18 October 2009, retrieved 19 July 2009
- "LPC: Linux audio: it's a mess [LWN.net]". 18 September 2008. Retrieved 11 July 2019.
- PulseAudio, Debian, archived from the original (wiki) on 9 November 2013, retrieved 9 November 2013
- PulseAudio (wiki), OpenWRT, retrieved 8 January 2012
- "Open source identity: PulseAudio creator Lennart Poettering", TechWorld, 8 October 2009
- Poettering, Lennart (25 September 2004). "Writing a PortAudio driver". audio.portaudio.devel. git.net. Retrieved 28 February 2017.
- Poettering, Lennart. "PulseAudio is now enabled by default on new Fedora installs". Fedora Development ML. Red Hat. Retrieved 1 March 2017.
- "Features: Glitch-free Audio". Fedora Project Wiki. Retrieved 28 February 2017.
- Poettering, Lennart. "Alsa Issues". PulseAudio - Trac. Archived from the original on 16 October 2008. Retrieved 28 February 2017.
- LPC: Linux audio: it's a mess, LWN, 18 September 2008, archived from the original on 18 October 2009, retrieved 3 July 2009
- Lennart Poettering (18 July 2008), PulseAudio FUD, 0pointer.de, archived from the original on 18 October 2009, retrieved 30 December 2009
- How-to: PulseAudio Fixes & System-Wide Equalizer Support, Ubuntu Forums, 10 May 2008, archived from the original on 18 October 2009, retrieved 18 October 2009
- I'll Break Your Audio, Lennart Poettering Blog, 19 October 2009, retrieved 26 December 2009
- No sound after running Flash, YouTube, etc. (pulseaudio solution), Ubuntu Forums, archived from the original on 18 October 2009, retrieved 18 October 2009
- PulseAudio, Ubuntu Wiki, archived from the original on 18 October 2009, retrieved 18 October 2009
- "Over-optimistic buffering in PulseAudio causes underruns (audible stuttering, pops)". Launchpad. Retrieved 9 November 2013.
- See “Loadable Modules.” Modules, Freedesktop.org, https://www.freedesktop.org/wiki/Software/PulseAudio/Documentation/User/Modules/#index9h2, retrieved August 28, 2019
- "PipeWire". pipewire.org.
- "On the Road to Fedora Workstation 31 — Christian F.K. Schaller".
- An introduction to Linux sound systems and APIs, Linux.com, 9 August 2004, archived from the original on 19 October 2014, retrieved 23 March 2013,
OSS is available not only for Linux but also for BSD OSes and other Unixes. That may be its only advantage, because this system is not very powerful and was officially replaced by ALSA in 2.5 kernels...
- 4Front technologies releases the source code for open sound system, Linux PR, 14 June 2007, retrieved 8 January 2012.