Technical
Beyond Frequency | Where Loudspeakers Lose Time — Part II
Why Loudspeakers Need Crossovers and Where Timing Breaks Down
A loudspeaker must reproduce more than ten octaves of sound. No single drive unit can do this efficiently or without distortion. Crossovers exist to divide the signal so that each driver operates where it performs best. However, the way this division is implemented has profound consequences for timing.
A crossover, however, shapes not only amplitude but also phase and time. Every filter used to redirect energy between drivers introduces Phase Rotation and Group Delay. In passive crossovers built from coils, capacitors and resistors, these effects are unavoidable and increase rapidly with filter order.
Physical time alignment of drivers at their acoustic centres is often presented as a solution to timing problems in loudspeakers. While this is beneficial, it addresses only one part of the issue. Aligning the acoustic centres can synchronise the initial arrival of sound at a single frequency or at a specific crossover point, but it does not resolve the frequency-dependent Group Delay introduced by the crossover itself.
Conventional passive crossovers, such as fourth-order Linkwitz–Riley designs, are optimised for flat summed frequency response under anechoic conditions, not for Time Coherence. Around the crossover frequency, each branch exhibits a pronounced Group Delay peak. In typical loudspeakers with crossover points between 80 and 300 Hz, this produces several milliseconds of delay before enclosure effects are considered.
In ported loudspeaker systems, this timing error is compounded by stored energy in the Helmholtz resonator formed by the enclosure and port. The port output reaches its maximum precisely where Group Delay is also highest. Published measurements commonly show 30 to 50 milliseconds of delay near the tuning frequency, with some systems exceeding this. By comparison, sealed enclosures usually remain below 10 milliseconds, though still far from constant.
“Bass reflex and bandpass systems often exhibit substantial Group Delay” [in excess of 50 milliseconds]., Audioholics summary (secondary) Audioholics
The combined result is that the low-frequency portion of a transient can arrive tens of milliseconds after the midrange and high-frequency components. This misalignment lies directly within the auditory system’s critical timing window.
Psychoacoustic research shows gap detection thresholds around 0.5 to 5 milliseconds, cross-band synchrony evaluated within approximately plus or minus 10 to 20 milliseconds, and clear perceptual asynchrony beyond about 20 to 30 milliseconds. These values overlap precisely with the musically meaningful microtiming range of roughly 5 to 50 milliseconds, where rhythm, drive and phrasing are encoded.
For many engineers, this realisation comes only after the electronics have been refined. Once bandwidth, noise and linearity are addressed upstream, it becomes clear that the dominant limitations arise beyond the preamplifier. The loudspeaker, and particularly the crossover and enclosure, becomes the defining bottleneck for time accuracy.
Looking closely at the charts above, we see the effect of a filter with non linear phase. The fundamental and its harmonics are not time aligned because the filter’s group delay varies with frequency, so different frequency components are delayed by different amounts.
Group delay is formally defined as the negative slope of phase with respect to angular frequency. If group delay is constant with frequency, the system is linear phase within the band of interest. If group delay changes with frequency, the system is not linear phase, and multi frequency waveforms will be time smeared.
A square wave can be represented by its Fourier series, meaning it can be built from sine waves at integer multiples of a fundamental frequency. For a 50 percent duty cycle square wave, the spectrum contains odd harmonics only, with amplitudes decreasing as 1 over harmonic number. The steepness of a square wave’s flanks is determined by two factors:
- Bandwidth: how many high order harmonics are present and at meaningful amplitude
- Time coherence: whether those harmonics arrive together in time, which requires linear phase, or equivalently constant group delay, over the relevant band
This links directly to loudspeakers because each rising or falling edge of a square wave is effectively a step, so the loudspeaker’s step response describes how cleanly it can reproduce rapid waveform transitions. A time coherent loudspeaker produces a step response that rises promptly and settles cleanly, while excess phase rotation, misaligned drivers, or stored energy appears as overshoot, ringing, or a smeared rise.
The charts below show the same sine wave with its first nineteen odd harmonics and the corresponding much cleaner square wave after passing through a linear phase filter.
