Ambisonics: The Theory Behind the Format

Introduction: Beyond Point-Source Audio

To understand Ambisonics, one must first dismantle the traditional concept of “channels.” In a standard Stereo or 5.1 setup, we work with Channel-Based Audio. If you want a sound to come from the left, you send a signal to the left speaker.

Ambisonics is fundamentally different. It is a Scene-Based Audio format. It doesn’t care about where your speakers are located during the recording phase. Instead, it captures a complete 360-degree “sphere” of sound centered on a single point in space. This is made possible through a mathematical framework known as Spherical Harmonics.


1. The Foundation: Spherical Harmonics

The core theory of Ambisonics relies on the idea that any sound field at a specific point in space can be decomposed into a series of functions called Spherical Harmonics.

Think of Spherical Harmonics as the 3D equivalent of the Fourier Transform. Just as a complex waveform can be broken down into a sum of simple sine waves, a complex three-dimensional sound field can be broken down into a sum of spherical components.

  • Zero-Order: Represents the overall pressure (Omnidirectional).
  • First-Order: Represents the pressure gradients along the three primary axes (Dipoles).
  • Higher-Orders: Represent increasingly complex and detailed directional patterns.

2. The Transmission Tiers: From A-Format to B-Format

The journey of an Ambisonic signal from the microphone to your ears involves two critical stages of representation.

A-Format: The Raw Capture

When you use a microphone like a Sennheiser AMBEO or a Soundfield mic, you are dealing with four discrete cardioid or sub-cardioid capsules arranged in a tetrahedron. The signals from these four capsules are known as A-Format. While they contain all the necessary spatial information, they are not yet “Ambisonic.” They are simply four directional recordings that share a common center.

B-Format: The Universal Language

To become a true Ambisonic signal, the A-Format must undergo a mathematical transformation (matrixing) to become B-Format. This is the standard format for processing and storage. In First-Order Ambisonics (FOA), the B-Format consists of four signals:

  1. W (The Scalar): The omnidirectional component. It contains the total sound pressure. It is the “body” of the recording.
  2. X (Front-to-Back): A figure-8 pattern capturing the longitudinal velocity.
  3. Y (Left-to-Right): A figure-8 pattern capturing the lateral velocity.
  4. Z (Up-to-Down): A figure-8 pattern capturing the vertical velocity.

These four components (W,X,Y,Z) allow us to reconstruct the sound field from any direction.


3. Resolution and Precision: Higher-Order Ambisonics (HOA)

One common criticism of First-Order Ambisonics (4 channels) is that the “spatial resolution” is low. The “sweet spot” is small, and sounds can feel “blurry” or lack pinpoint localization.

The solution is Higher-Order Ambisonics (HOA). By adding more channels based on higher-order spherical harmonics, we increase the spatial resolution significantly.

  • 1st Order: 4 channels (Low resolution).
  • 2nd Order: 9 channels.
  • 3rd Order: 16 channels (High resolution, the current pro standard for high-end VR).
  • 7th Order: 64 channels (Used in high-density loudspeaker arrays and advanced research).

The higher the order, the narrower the “virtual microphones” become, allowing for much sharper localization of sound sources within the sphere.

Zekya Mic

4. The Standardization Wars: AmbiX vs. FuMa

As the theory moved into the digital realm, a conflict arose regarding how to order the channels and how to normalize the levels.

FuMa (Furse-Malham)

The traditional standard used in the early days. It uses a specific channel order (W,X,Y,Z,…) and a normalization known as MaxN. While it was the pillar for decades, it is now considered “Legacy.”

AmbiX (Ambisonic Exchange)

The modern winner. AmbiX uses the ACN (Ambisonic Channel Numbering) and SN3D normalization. This is the format required by YouTube, Meta (Facebook), and most modern VR engines.

  • Why it won: It is more scalable for Higher-Order Ambisonics and handles signal levels more efficiently for digital distribution.

5. Decoding: The Speaker-Independent Magic

The most powerful theoretical advantage of Ambisonics is that it is Speaker-Independent.

In channel-based audio, if you mix for 5.1, your mix only works perfectly on a 5.1 system. In Ambisonics, you mix once in the “B-Format sphere.” This master mix can then be decoded to any playback system:

  • Binaural: For headphones (using HRTFs – Head Related Transfer Functions).
  • Stereo: A simple downmix.
  • 5.1, 7.1, or 9.1.4 (Atmos): Mapping the sphere to specific speaker coordinates.
  • 22.2 Arrays: For specialized immersive theaters.

The decoder acts as a “virtual microphone” array, extracting the signals for each speaker based on its position relative to the center of the Ambisonic sphere.


6. Conclusion: Why Theory Matters for the Sound Designer

Understanding the theory behind Ambisonics isn’t just an academic exercise. It changes how you record and how you mix.

When you record a Room Tone or an Urban Interior in Ambisonics, you aren’t just capturing a “stereo file.” You are capturing a mathematical snapshot of a physical space. This snapshot allows the end-user (the film editor or game developer) to rotate, tilt, and manipulate that space to fit their narrative perfectly.

Ambisonics is the bridge between the physics of sound waves and the infinite possibilities of digital 3D space.

Products by Category

Get 20% Discount

We don’t spam! Read our privacy policy for more info.

Alberto Carlassare
Alberto Carlassare

Alberto Carlassare is a freelance sound designer and location sound recordist specializing in film, series, documentaries, advertising, and visual arts. With a degree in Arts, Music, and Entertainment from the University of Bologna and trained at ECAM in Madrid, he has worked on award-winning projects such as The Year of the Discovery (Goya 2021 for Best Documentary). He has collaborated with directors including Luis López Carrasco, Miguel Llansó, Pedro G. Romero, and Pablo Hernando.

Articles: 5

Leave a Reply