

Taxonomy of spectra

Figure 5.1 introduces a way of visualizing the spectrum of an audio signal. The spectrum describes, roughly speaking, how the signal's power is distributed into frequencies. (Much more precise definitions can be given than those that we'll develop here, but they would require more mathematical background.)

Figure 5.1: A taxonomy of timbres. The spectral envelope describes the shape of the spectrum. The sound may be discretely or continuously distributed in frequency; if discretely, it may be harmonic or inharmonic.
\begin{figure}\psfig{file=figs/fig05.01.ps}\end{figure}

Part (a) of the figure shows the spectrum of a harmonic signal, which is a periodic signal whose fundamental frequency is in the range of perceptible pitches, roughly between 50 and 4000 Hertz. The Fourier series (Page [*]) gives a description of a periodic signal as a sum of sinusoids. The frequencies of the sinusoids are in the ratio $0:1:2:\cdots$. (The constant term in the Fourier series may be thought of as a sinusoid,

\begin{displaymath}
{a_0} = {a_0}\cos(0 \cdot \omega n),
\end{displaymath}

whose frequency is zero.)

In a harmonic signal, the power shown in the spectrum is concentrated on a discrete subset of the frequency axis (a discrete set consists of isolated points, only finitely many in any bounded interval). We call this a discrete spectrum. Furthermore, the frequencies where the signal's power lies are in the $0:1:2:\cdots$ ratio that arises from a periodic signal. (It's not necessary for all of the harmonic frequencies to be present; some harmonics may have zero amplitude.) For a harmonic signal, the graph of the spectrum shows the amplitudes of the partials of the signal. Knowing the amplitudes and phases of all the partials fully determines the original signal.
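As a rough illustration of the last point, the following Python sketch rebuilds a harmonic signal from a list of partial amplitudes and phases and checks that the result is periodic. The function name `harmonic_signal`, the period of 100 samples, and the particular amplitudes and phases are all assumptions chosen for the example; they do not come from the text.

```python
import math

def harmonic_signal(amps, phases, omega, num_samples):
    """Sum of sinusoids at angular frequencies 0, omega, 2*omega, ...

    amps[k] and phases[k] describe the k-th harmonic; k = 0 is the
    constant (zero-frequency) term.
    """
    return [
        sum(a * math.cos(k * omega * n + p)
            for k, (a, p) in enumerate(zip(amps, phases)))
        for n in range(num_samples)
    ]

# Hypothetical example values: a period of 100 samples, with the
# second harmonic absent (zero amplitude).
period = 100
omega = 2 * math.pi / period
amps = [0.1, 1.0, 0.0, 0.5]
phases = [0.0, 0.3, 0.0, -1.2]
x = harmonic_signal(amps, phases, omega, 3 * period)

# Largest difference between the signal and itself one period later;
# this should be zero up to floating-point rounding.
max_diff = max(abs(x[n] - x[n + period]) for n in range(2 * period))
```

Setting an entry of `amps` to zero, as done for the second harmonic here, leaves the signal harmonic with the same period.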

Part (b) of the figure shows a spectrum which is also discrete, so that the signal can again be considered as a sum of a series of partials. In this case, however, there is no fundamental frequency, i.e., no audible common submultiple of all the partials. This is called an inharmonic signal. (The terms ``harmonic'' and ``inharmonic'' may be used to describe both the signals and their spectra.)
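The idea of an ``audible common submultiple'' can be sketched crudely in Python for partials given as whole numbers of Hertz. This is only an illustration under strong assumptions: real partials are rarely exact integers, and the 50 Hertz threshold is simply borrowed from the lower end of the pitch range quoted earlier. The function names are hypothetical.

```python
from functools import reduce
from math import gcd

def common_submultiple(freqs_hz):
    """Largest integer frequency of which every partial is a multiple."""
    return reduce(gcd, freqs_hz)

def looks_harmonic(freqs_hz, lowest_pitch_hz=50):
    """Crude test: is the common submultiple high enough to be a pitch?

    lowest_pitch_hz is an assumed threshold, not a perceptual rule.
    """
    return common_submultiple(freqs_hz) >= lowest_pitch_hz

# Partials at 200, 300 and 400 Hz share a submultiple of 100 Hz,
# even though no partial is present at 100 Hz itself.
harmonic_case = [200, 300, 400]

# Partials at 200, 301 and 407 Hz share only 1 Hz, far below the
# range of audible pitch, so this crude test calls them inharmonic.
inharmonic_case = [200, 301, 407]
```

With these example values, `looks_harmonic(harmonic_case)` is true and `looks_harmonic(inharmonic_case)` is false.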

When dealing with discrete spectra, we report a partial's amplitude in a slightly non-intuitive way. Each component sinusoid,

\begin{displaymath}
a \cos (\omega n + \phi)
\end{displaymath}

only counts as having amplitude $a/2$ as long as the angular frequency $\omega$ is nonzero. For a component of zero frequency, for which $\omega = \phi = 0$, the amplitude is instead given as $a$, without dividing by two. (Components of zero frequency are often called DC components; ``DC'' is historically an acronym for ``direct current''.) These conventions for amplitudes in spectra will simplify the mathematics later in this chapter; a deeper reason for them will become apparent in Chapter 7.
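One way to make this convention plausible numerically is to measure a signal with a discrete Fourier transform normalized by the number of samples. That measurement is an assumption of this sketch, not something the text has introduced at this point. Under it, a real sinusoid of amplitude $a$ shows up as two components of magnitude $a/2$ (at the positive frequency and at its mirror image), while a constant term $a_0$ shows up once with magnitude $a_0$. The signal length, frequency bin, and amplitudes below are arbitrary example values.

```python
import cmath
import math

def dft_amplitudes(x):
    """Magnitudes of the DFT of x, divided by the number of samples."""
    N = len(x)
    return [
        abs(sum(x[n] * cmath.exp(-2j * math.pi * k * n / N)
                for n in range(N))) / N
        for k in range(N)
    ]

# Hypothetical test signal: DC term a0 plus one sinusoid of amplitude a
# whose frequency falls exactly on DFT bin k0.
N = 64
a0, a, k0, phi = 0.3, 1.0, 5, 0.7
omega = 2 * math.pi * k0 / N
x = [a0 + a * math.cos(omega * n + phi) for n in range(N)]

amps = dft_amplitudes(x)
dc_amplitude = amps[0]            # close to a0, not a0 / 2
partial_amplitude = amps[k0]      # close to a / 2
mirror_amplitude = amps[N - k0]   # also close to a / 2
```

All other entries of `amps` come out as zero up to rounding error, because the sinusoid completes a whole number of cycles in the analysis window.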

Part (c) of the figure shows a third possibility: the spectrum might not be concentrated into a discrete set of frequencies, but instead might be spread out among all possible frequencies. This can be called a continuous, or noisy, spectrum. Spectra don't necessarily fall into either the discrete or the continuous category; real sounds, in particular, are usually somewhere in between.

Each of the three parts of the figure shows a continuous curve called the spectral envelope. In general, sounds don't have a single, well-defined spectral envelope; there may be many ways to draw a smooth-looking curve through a spectrum. On the other hand, a spectral envelope may be specified intentionally; in that case, it is usually clear how to make a spectrum conform to it. For a discrete spectrum, for example, we could simply read off, from the spectral envelope, the desired amplitude of each partial and make it so.
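Reading partial amplitudes off a specified spectral envelope can be sketched as follows. The triangular envelope, its peak at 1000 Hertz, and the 200 Hertz fundamental are invented for the example; any function from frequency to amplitude could stand in for the envelope.

```python
def partial_amplitudes(envelope, fundamental_hz, num_partials):
    """Evaluate the envelope at each harmonic of the fundamental.

    Returns a list of (frequency, amplitude) pairs for partials 1..num_partials.
    """
    return [
        (k * fundamental_hz, envelope(k * fundamental_hz))
        for k in range(1, num_partials + 1)
    ]

def triangular_envelope(freq_hz, peak_hz=1000.0, width_hz=800.0):
    """Hypothetical envelope: 1 at peak_hz, falling linearly to 0
    at a distance of width_hz on either side."""
    return max(0.0, 1.0 - abs(freq_hz - peak_hz) / width_hz)

# Eight partials of a 200 Hz fundamental, shaped by the envelope.
partials = partial_amplitudes(triangular_envelope, 200.0, 8)
```

Changing `fundamental_hz` while keeping the same envelope moves the partials along the frequency axis but leaves the overall spectral shape in place.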

A sound's pitch can sometimes be inferred from its spectrum. For discrete spectra, the pitch is primarily encoded in the frequencies of the partials. Harmonic signals have a pitch determined by their fundamental frequency; for inharmonic ones, the pitch may be clear, ambiguous, or absent altogether, according to complex and incompletely understood rules. A noisy spectrum may also have a perceptible pitch if the spectral envelope contains one or more narrow peaks. In general, a sound's loudness and timbre depend more on its spectral envelope than on the frequencies in the spectrum, although the distinction between continuous and discrete spectra may also be heard as a difference in timbre.

Timbre, as well as pitch, may evolve over the life of a sound. We have been speaking of spectra here as static entities, not considering whether they change in time or not. If a signal's pitch and timbre are changing over time, we can think of the spectrum as a time-varying description of the signal's momentary behavior.

This way of viewing sounds is greatly oversimplified. The true behavior of audible pitch and timbre has many aspects which can't be explained in terms of this model. For instance, the timbral quality called ``roughness'' is sometimes thought of as being reflected in rapid changes in the spectral envelope over time. The simplified description developed here is useful nonetheless in discussions about how to construct discrete or continuous spectra for a wide variety of musical purposes, as we will begin to show in the rest of this chapter.


Miller Puckette 2006-12-30