On the perception of phase differences in acoustic signals

(1)

o

ON THE PERCEPTION

OF PHASE DIFFERENCES

IN ACOUSTIC SIGNALS

(2)

ON THE PERCEPTION OF PHASE DIFFERENCES

IN ACOUSTIC SIGNALS

yi o

o o

Cc O^ III ill inlilillll "'liMl|h

ii.ipii'WM

III 111 in II' Hi liiJik

M

,1 I I

I

nil'

'VERVALLBN

BIBLIOTHEEK TU Delft P 1950 4197

(3)

ON THE PERCEPTION

OF PHASE DIFFERENCES

IN ACOUSTIC SIGNALS

PROEFSCHRIFT

ter verkrijging van de graad van doctor in de technische wetenschappen

aan de Technische Hogeschoo! Delft, op gezag van de rector magnificus

dr. ir. H. van Bekkum, hoogleraar in de afdeling der scheikundige

tech-nologie, voor een commissie aangewezen door het college van dekanen

te verdedigen op woensdag 7 aprii 1976 te 14.00 uur

door

THEODORUS JOHANNES FRANCISCUS BUUNEN

natuurkundig ingenieur, geboren te Utrecht

(4)

Dit proefschrift is goedgekeurd door de promotoren

Prof. dr. G. van den Brink

(5)

Aan mijn ouders

Aan Tine

(6)

(7)

De resultaten van het in dit proefschrift beschreven onderzoek zijn

grotendeels reeds elders gepubliceerd of zullen gepubliceerd worden. De

gegevens hierover zijn aan het eind van dit voorwoord te vinden.

Het tot stand komen van dit proefschrift is mede mogelijk gemaakt

door de inspanningen van anderen. Het zal duidelijk zijn dat de

mede-auteurs van bovengenoemde publikaties, te weten dr.ir. F.A. Bilsen, prof.

dr. G. v.d. Brink, ir. J.M. Festen, dr. J.H. ten Kate en ir. J. Raatgever

hierbij een belangrijke rol hebben gespeeld. Ir. T. Eland en D. van

Val-kenburg hebben deelgenomen aan de luisterproeven. De dialoog met dr. M.

Rodenburg is belangrijk geweest voor de experimenten betreffende

"omhul-lende detektie". De apparatuur die beschikbaar was bestond gedeeltelijk

uit eigenbouw apparaten die in de loop der tijd zijn vervaardigd door

K. Bel, J. Faase, E.E.E. Frietman en W. Willems. Voor sotnmige metingen

is gebruik gemaakt van de hybriede rekenmachine van het Rekencentrum van

de T.H. Delft. Dr. R. Bathgate heeft korrekties aangebracht bij de

verta-ling in het Engels. Het type werk is verzorgd door C.E.M. Buunen-op de

Weegh en M. Mulder-van Nouhuys.

Allen die hebben bijgedragen ben ik dank verschuldigd.

Buunen, T.J.F., Festen, J.M., Bilsen, F.A. and van den Brink, G., (1974) "Phase effects in a three-component signal", J.Acoust.Soc.Am.55, 297-303

Buunen, T.J.F. and Bilsen, F.A., (1974) "Subjective phase effects and combination tones", in Models and Faats in Hearing, Eds E. Zwicker and E. Terhardt, pp 344-352.

Buunen, T.J.F., (1975) "Two hypotheses on monaural phase effects", Acustica 34, 98-105.

Buunen, T.J.F., (1975) "Dynamic properties of the auditory system in relation to masking by non-stationary signals", Publikatie ^ , van het Nederlands Akoestisch Genootschap, 49-57.

Buunen, T.J.F., ten Kate, J.H., Raatgever, J. and Bilsen, F.A., (1976) "A combined psychophysical and electrophysiological study on the role of combination tones for the perception of phase changes", submitted for publication in J.Acoust.Soc.Am.

(8)

CONTENTS

INTRODUCTION 1 Chapter 1 CONDITIONS FOR THE AUDIBILITY OF PHASE CHANGES 6

1.1 Literature 6 1.2 The, audibility of phase changes in three-component signals 8

1.2.1 Signal, method and apparatus 8

1.2.2 The audibility range as a function of L 10

1.2.3 The audibility range as a function of f 12

1.3 The audibility of phase changes in broadband signals 13

1.4 Conclusions 14

Chapter 2 THE NATURE OF SUBJECTIVE PHASE EFFECTS 15

2.1 Literature 15

2.1.1 Phase effects in three-component signals 15

2.1.2 Phase effects in broadband signals 16

2.2 The prominence of residue pitch in a three-component signal 17

2.2.1 Qualitative observations 17

2.2.3 The prominence of residue pitch as a function of 9 20

2.3 The audibility of separate harmonics in broadband signals 22

2.4 Conclusions 23 Chapter 3 MECHANISMS FOR THE EXPLANATION OF PHASE EFFECTS 24

3.1 Time structure detection 24

3.1.1 Envelope detection and fine-structure detection 24

3.1.2 Envelope changes as a function of phase in a

three-oomponent signal 25

3.1.3 The audibility of differences between signals with

identical envelopes 28

3.2 Auditory nonlinearity 31

3.2.1 Combination tones 31

3.2.2 A phase dependent internal spectrum 32

3.3 Conclusions 34 Chapter 4 COMBINATION TONES AND THE AUDIBILITY REGION OF PHASE

CHANGES 35 4.1 The boundaries of the audibility region 35

4.1.1 The lower boundary 35

4.1.2 The upper boundary 37

(9)

4.2.1 The relation between psychophysical and

electrophysio-logical experiments 39

4.2.2 Animal preparation and acoustic system 40

4.2.4 The detectability of CT's as a function of L and f 45

4.3 Detection of CT's in a forward masking experiment 53

4.3.2 The detectability of CT's as a function of L 54

4.4 Conclusions 58

Chapter 5 INTERACTION OF COMBINATION TONES AND ACOUSTIC

COMPONENTS 60 5.1 Estimation of the internal spectrum by means of

cancel-lation experiments 60

5.1.2 The internal spectrum as a function of B 61

5.1.3 Relation between the internal spectrum and the prominence

of residue pitch 64

5.2 The interaction of CT's and acoustic components measured in

a forward masking experiment 65

5.2.1 Verification of the results of the cancellation

experiments 65

5.2.2 Vector addition of a CT and an acoustic component 67

5.3 The interaction of CT's and acoustic components measured in

an electrophysiological experiment 69 5.4 Reinterpretation of de Boer's phase rule 70

5.5 Conclusions 71 Chapter 6 ENVELOPE DETECTION AND THE AUDIBILITY OF PHASE CHANGES 73

6.1 A model for envelope detection 73 6.2 Envelope detection for amplitude modulated tones 74

6.2.2 The threshold of envelope detection as a function of f , 76

6.3 Envelope detection and the masking by modulated noise 78

6.3.1 Literature 78

6.3.3 The effect of f .on the masked threshold 81

6.3.4 Relation between the masking by modulated noise and

models for the processing of amplitude Variations 84

6.4 Envelope detection and the audibility region of phase

changes 87 6.5 Conclusions ' 89

SUMMARY AND GENERAL CONCLUSIONS 91

(10)

INTRODUCTION

This thesis describes investigations on the auditory system arising from some problems about the audibility of phase changes in acoustic signals. In this introduction we will try to place the work against the back-ground of auditory research in general, and will briefly indicate the methods of investigation used.

Auditory research in general may have one of two goals. The first

is to describe the functioning of the auditory system by using a model

that simulates the signal processing without being realistic with

respect to the physiological structure. Such research will yield a set

of parameters of the ear that can be used e.g. in technological

applications. An example of work of this kind is the measurement of the

sensitivity of the ear to sinusoidal signals as a function of frequency.

The knowledge gained from these experiments may be helpful e.g. for the

design of transducers like loudspeakers and microphones in communication

equipment. Another example is the investigation of the integration time

of the auditory system in order to measure subjective attributes (e.g.

loudness or nuisance value) of sounds whose amplitude varies with time.

This type of research generally has a very specific, well defined goal.

However, it has the disadvantage that the results, formulated in a

"black-box model", are of very limited application: A different model

must be designed for each attribute of auditory sensation.

A more generally applicable model can only be obtained if

physiological data about the auditory system are also taken into account.

This brings us to the second goal of auditory research, to "know how the

auditory system works". The model that results from this type of

research will have to be compatible with all the black-box models. It

will be clear that this fundamental knowledge about the functioning of

the auditory system will not only have technical implications as do the

black-box models. For instance, it will give more insight into the

information processing in the nervous system in general. It also

provides a guide for the diagnosis of auditory complaints. It is obvious

(11)

cannot be strictly ascribed to one of the two classes described before.

The results of an experiment may be used for a black-box model even

though they were obtained in an experiment aimed at increasing

fundamental knowledge, and conversely it is often useful for the

under-standing of a system to describe experimental results in terms of a

black-box model. The distinction thus lies not in the nature of the

experiments but in the intention of the worker. The experiments

des-cribed in this thesis were intended to increase fundamental knowledge

of the auditory system.

The basic subject of this study is the audibility of phase changes

in acoustic signals. The reason why this topic is of interest for the

theory of hearing can only be understood with reference to former and

current theories about signal processing by the auditory system.

Therefore a very brief introduction into this theory will be given here.

It is commonly accepted that the peripheral auditory system

performs a frequency analysis of an acoustic signal. Clear evidence for

this is found e.g. in the masking of one sound by another (Wegel and

Lane, 1924). From this point of view the peripheral auditory system can

be represented by a set of bandpass filters with a form that is still a

subject of research. The output of the bandpass filters is coded in one

way or another to give pulsed electrical signals ("spikes") which can be

observed in the cells and fibres of the auditory nervous system. One of

the main objectives of auditory research is to determine which

proper-ties of the output of the bandpass filters are coded in the spike trains.

Electrophysiological measurements have demonstrated that the number

of spikes per second is related to the level of the acoustic signal. It

has also been demonstrated that the interval between successive spikes

is related to the periodicity of the signal (Kiang et al., 1965). It is

now important to know whether a change in the time structure of an

acoustic signal but not in the power at the output of the bandpass

filters changes the sensation evoked by the signal. If this was the case

it would show that time-structure information is not only preserved in

the spike trains on the auditory pathway but is also used to bring about

an aspect of the subjective sensation. It will be clear that

investi-gation of this problem is no easy matter, as it requires an acoustic

signal whose time structure can be changed without affecting its power

(12)

spectrum. In order to obtain a signal that fulfils this condition,

phase changes between components in an acoustic signal have to be

intro-duced .

Such reasoning was already applied a century ago by von Helmholtz

(1954) who investigated whether changes in the phases of the first 8

harmonics of 120 Hz were audible. Since then the audibility of phase

changes in acoustic signals, which will be called "subjective phase

effects" from now on, has been a permanent subject of investigation

mainly in view of the fundamental question as to which parameters of a

sound are coded in spike trains in the peripheral auditory system and

decoded in the central auditory system. This has also been the main

motive for the work described in this thesis. The main question is

whether the audibility of phase changes in acoustic signals justifies

the conclusion that parameters other than the output power of the

peri-pheral bandpass filters are coded and used in the auditory system.

It is self-evident that the sensation of a sound cannot be fully

described by its power spectrum alone, because the frequency analysis

performed by the auditory system is non-ideal ("short-time frequency

analysis"). This can be demonstrated by modulating the amplitude of a

pure tone with a 1 Hz sinusoid. The power spectrum of the resulting

signal consists of three components with frequency separations of 1 H z .

The sensation is that of a pure tone with a slowly changing loudness.

The detection of time structure by the ear in this case comes down to

the detection of amplitude changes. A distinction between those signals

in which amplitude changes are audible as such, because of the

short-time analysis by the ear, and those in which amplitude modulation is

manifest as changes in the power spectrum, is clearly crucial for the

explanation of the audibility of phase changes, since phase changes

affect the time structure of the signal.

The expression "phase changes in acoustic signals" used throughout

this work, should not be interpreted too broadly. In the first place

only signals presented to one ear (monaurally) have been investigated.

Phase changes between signals at different ears may lead to phenomena

like lateralization and binaurally masked level differences (Blauert,

1974) which are not considered here. The second limitation on the field

(13)

spectrum consisting of discrete components are studied. This rules out

signals with a continuous frequency spectrum like white noise sent

through a phase shifting network as described e.g. by Cramer and Huggins

(1958). It also excludes signals that have a transient character, like

Huffmann sequences (signals with a flat frequency spectrum but with well

defined phase jumps; Patterson and Green, 1970).

In view of the above limitations the sim^plest signal to use for the

investigation of phase effects is one consisting of two discrete

frequency components. Changes in the phase of one component relative to

the other are easily detectable in such signals. A vast amount of

literature has been published on this topic. The use of octave complexes,

with a ratio of 1:2 between the frequencies of the two components, is

very common. One of the most recent papers is by Lamore (1975) who found

that the pitch and loudness of the higher frequency component depended

on the phase relation between the components. The main problem

encoun-tered in attempts to explain these effects is the question, whether the

auditory system is coding and using information about the time structure

of the signal. The research performed on phase effects in two-component

signals has not yet led to unanimous conclusions and new hypotheses are

still being made.One of the controversial topics is the influence of the

distortion introduced by the ear. It has been demonstrated that the

signal processing by the auditory system cannot always be described by

a linear transformation. It was emphasized already in Helmholtz's time

that distortions like the sum and difference tones of two components

may give rise to a phase-dependent interaction between either these

distortion products themselves or between distortion products and

acoustic components. The problem has not yet been solved (see e.g.

Clack et al., 1972 and Lamore, 1975).

Phase effects in three-component signals have not received as much

attention in the literature as phase effects in two-component signals.

Most of the experiments described in this thesis deal with phase

effects in three-component signals; in some cases more complex signals

such as a periodic pulse have also been studied. It appears from

literature that phase changes in three-component signals are audible

(Mathes and Miller, 1947) but the origin of these phase effects is not

(14)

of time-structure detection and nonlinearities still appears to lie

at the root of the problem.

Two different experimental approaches have been used during this

study. The first is the psychophysical method in which a specific sound

is presented to an observer who is asked to adjust e.g. the threshold

of hearing for that signal, or to adjust a reference signal in such a

way that e.g. its pitch or its loudness is equal to that of the signal

under investigation. He may also be asked whether two signals are equal

or not. These psychophysical experiments were performed by several

observers. The number of observers used to investigate a certain effect

was usually rather small in the experiments reported here. Data on many

observers are only useful if one is interested in e.g. the mean

auditory threshold for normal observers. For the purposes of our

experiments it is often sufficient to know that a certain threshold or

effect exists. Knowledge about the mean value of the threshold for a

large number of observers seldom adds essential information about the

hearing mechanisms involved. As a consequence of a limited number of

observers used, the comparison of results of different types of

experiments will have to be made for one and the same observer.

The second experimental method of studying the hearing system is

to follow the signals from the peripheral processing in the cochlea to

the central processes on the cortex. This is commonly done by inserting

a micro-electrode into the auditory nerve centres of an animal. These

experiments are called electrophysiological experiments. In the

experiments described below both techniques have been applied for one

and the same problem, the perception of phase changes in complex

(15)

CHAPTER 1. CONDITIONS FOR THE AUDIBILITY OF PHASE CHANGES

This chapter reviews some of the existing literature on phase effects. It appears from the work cited that phase changes are only audible if the separation between the frequency components of the signal is small enough. Measurements on this audibility range for phase changes are presented.

1.1. Literature

The audibility of phase changes has been a subject of research

since 1830. A limited amount of literature on this subject was published

before 1950. Thereafter the number of papers increases. We need not give

a complete historical survey of the literature here because a recent

review may be found elsewhere (Plomp and Steeneken, 1969). However, a

selected number of publications will be discussed to provide a

back-ground for the items dealt with in this study.

One of the first discussions of the audibility of phase changes in

a complex signal was started about a century ago by Helmholtz (1954).

He investigated a signal consisting of 8 successive harmonics of 120 Hz,

the amplitude and phase of each tone being separately variable, and

concluded that phase changes are not audible in this stimulus. He could

not investigate higher harmonics so he left the question open as regards

these "unmusical" sounds. Other observers did hear phase effects and the

discussions about the audibility of phase changes have nearly always

been concentrated on the role of distortion in the apparatus or the ear

itself.

The audibility of phase changes in signals consisting of three

frequency components was investigated by Mathes and Miller (1947). They

concluded that phase changes of the central component (i.e. the

component with the middle frequency) are easily detectable provided the

frequency separation between the components does not exceed a certain

fraction of the central frequency. Experiments with three-component

signals by de Boer (1956) gave qualitatively the same results although

his results differ quantitatively from those of Mathes and Miller.

(16)

Goldstein (1967a) confirmed the statements about the limited audibility

of phase changes and repeated experiments first performed by Zwicker

(1952) on the subjective difference between AM (amplitude-modulated) and

QFM (quasi-frequency-modulated) signals which differ only with respect

to the phase of the central component. The nature of these phase effects

has been described as a change in "roughness", "harshness", "timbre" and

in the prominence of a "basic pitch" or "residue pitch". These

subjec-tive qualities will be discussed more thoroughly in chapter 2.

The fact that phase changes are audible at all confirms that the

frequency resolution in the auditory system must be subject to

limitations. Phase effects can only occur if there is interaction

be-tween the frequency components of the signal, also after the frequency

analysis performed by the basilar membrane. This reasoning led Zwicker

(1952) and Goldstein (1967a) to estimate this frequency resolution

(the "critical band") from the maximum frequency separations between the

components for which phase effects still occur.

The limited frequency separation between the components is not the

only condition to be fulfilled if phase changes are to be heard. There

is also a restraint on the value of the phase changes. De Boer (1956,

1961) has reported that no subjective effects occur for phase changes

in a multi-component signal which obey his "phase rule". This empirical

rule states that the subjective quality of a sound is unchanged if the

phases of the components are shifted by a constant amount and/or amounts

that are linearly dependent on the frequency of the components.

Experimental evidence was obtained from psychophysical experiments with

steady-state signals whose components (3, 5 or 7 in total) had equal

frequency spacing. De Boer's phase rule has consequences for the number

of phase permutations of a complex sound that give different sensations.

It can be shown mathematically that the number of variables is reduced

by two, because of the two arbitrary constants in the phase rule (see

section 3.1 below). This implies that in a three-component signal with

three variable phases, all phase changes resulting in different

sensation can be expressed in terms of one variable phase.

Phase effects in more complex sounds (i.e. sounds with more

frequency components) have been studied by Licklider (1957). He

(17)

limited audibility in terms of a minimum separation between components.

Schroeder (1959) also observed phase effects in signals containing up

to 31 harmonics. His comments on the subjective nature of phase effects

are especially interesting and will be discussed more extensively in

chapter 2. However, he does not mention any limits for the frequency

separation required to give phase effects. Bilsen (1968) compared a

low-pass filtered periodic pulse and periodic noise filtered in the

same way. The only difference between these signals is the phase

relation between their frequency components. The results indicate a

limited audibility of phase effects in broadband signals, since no

phase effects occurred if only lower-order harmonics passed through the

low-pass filter. A common aspect of all experiments on phase effects is

that they show that these effects exist not only for high levels but

also for relatively low absolute levels of the signal.

The above mentioned authors explained phase effects on the

assumption that the time structure of the signal determines an aspect

of the sensation and changes of the phases alter the time structure of

the signal. They further assumed that the nonlinearity of the auditory

system does not play an important role because phase effects also occur

at low sensation levels. The limited audibility of the phase changes

as a function of the frequency separation of the components has been

ascribed to the frequency analysing properties of the ear, since the

complex time structure of the signal is not represented in the movements

of the basilar membrane at large frequency separations. De Boer's phase

rule is a very strong argument for the theory of time-structure

detec-tion because it can be demonstrated that phase changes obeying this

phase rule give no changes in the envelope of the signal (de Boer,

1956). The conclusion might easily be that phase changes that preserve

the envelope are inaudible . It will be shown (in section 3.2) however

that this "inverse" of the phase rule is only valid for small frequency

separations between the components, say up to 30-60 Hz.

1.2. The audibility of phase changes in three-component signals

1.2«1.

Signal, method and apparatus

(18)

frequency component in a three-component signal are audible depending

on the frequency separation Af of these components. Mathes and Miller

(1947) found no phase effects if the frequency separation was more than

40% of the signals central frequency for a signal with a power spectrum

equal to that of a 100% SAM (sinusoidally-amplitude-modulated) pure tone.

It is thus possible to define an audibility range for phase effects in

terms of the values of Af for which they are audible. The maximum and

minimum values of Af for which subjective phase effects occur will be

called the upper and lower limit of the audibility range. The upper

limit of the audibility range was measured by e.g. de Boer (1956) for a

limited number of carrier frequencies and for a level difference of

-6 dB between the central component and the sidebands. As regards its

power spectrum, this signal is equivalent to an SAM pure tone with a

modulation depth of 100%. In the present study the audibility range of

phase effects for this type of signals has been measured over a wider

range of carrier frequencies and values of the level difference L,

between the central frequency component and the sidebands.

The stimuli used in the present experiments on the audibility

range were generated as follows: Two sinusoids of frequency f and f ,

(=Af) were multiplied in a four-quadrant multiplier with a carrier

suppression of more than 50 dB. The resulting signal consists of two

frequency components f -Af and f +Af. If a component with frequency

f'=f +2 Hz is added, the signal can be described as a three-component c c

signal, the central component of which has a continuously changing

phase. This signal evokes a clearly audible beat with a repetition rate

of 4 Hz, provided Af is relatively small. The fact that this repetition

rate is twice the frequency difference between f and f implies that c c

the beats cannot be due to incomplete suppression of f . As the value

of Af is raised the beats become less pronounced and finally disappear.

The method used here to measure a limit of the audibility range

for phase changes is an adjustment procedure. The observer was seated

in an acoustically insulated booth, where he received the

three-compo-nent signal with continuously changing phase of the central compothree-compo-nent

monaurally via a Telephonies TDH-39 headphone. He was asked to increase

Af from a value for which phase changes were audible, until no phase

(19)

value until phase effects were just distinguishable. The mean of these

two values represents his estimate of the upper limit of the audibility

range. Distortion products were at least 55 dB lower than f at the c

input of the headphone.

1.2.2. The audibility range of phase changes as a function of L

The audibility range of phase changes was measured for several

values of the level L of the sidebands in the three-component signal

relative to the level of the central component. The adjustment

proce-dure used has b e e n , described in the previous section. The observers

measured the audibility range for discrete values of L, starting at -6

dB and decreasing in steps of 5 dB. For high values of L , only an upper

limit to the audibility range was found. When Af was decreased from the

maximum value in this case, the beat remained audible until Af was of

the order of the repetition rate (4 H z ) . For low values of L, the

observers also measured a value of Af below which phase changes were

inaudible. Each upper and lower limit was measured at least four times

in different experimental sessions. The spread of the results was about

O.lAf. The mean values found by two observers at f =2000 Hz are c

presented in Fig.1,1.a and Fig.l.l.b.

dB

Fig.l.1.a.

The audibility region of phase changes indicated as a hatched area. The filled circles are the datapoints from which the audibility region has been derived.

0 008 016 024 Af/. >

The audibility region is defined as the hatched area. Phase changes are

audible within this region. The boundaries of the audibility region are

(20)

Fig.1.l.b.

The same as in Fig.1.a, for a different observer.

obtained by graphical interpolation between the experimental points. The

same measurem.ents were performed for f =100

observer. The results are shown in Fig.1.2,

same measurem.ents were performed for f =1000 Hz and f =4000 Hz by one

da

0 t^ =1000 Hz 45dB SL 008 AtA dB 0 35dBSL 0.08 Af/f; 016 — > 016 024 Fig.1.2.

The audibility region of phase changes for two values of f for one observer.

(21)

A common aspect of all experimental results is that phase effects do

not occur for values of Af larger than 18-28% of the central frequency,

depending on the observer and the value of f . The average of these

values corresponds reasonably well with the value of 20% given by

de Boer (1956) for a signal with a power spectrum equal to that of

an SAM pure tone.

The form of these audibility regions differs from what could be

expected on the hypothesis of time-structure detection, which predicts

weaker interaction of the frequency components for large frequency

separations, owing to the ears frequency resolution. Thus, if the

changes in the time structure caused by phase changes, become less

pronounced (as is the case with decreasing values of L ) , we would

expect no phase effects to occur for small values of L and large

frequency separations. The results of our measurements show on the

contrary, that the phase effects vanish at small frequency separations

as L is decreased. This result demonstrates that time-structure

detection and frequency analysis by the auditory system alone cannot

account for the form of the audibility region for phase changes.

1.2.3. The audibility of phase changes as a function of f

The audibility range for three-component signals as defined in

1.2.1. has also been measured with varying values of f in order to

investigate whether the audibility of phase changes depends on the

frequency range of the signal. The level of f was 30 dB SL for each

observer. The levels of all three components were equal in this

experi-ment, which corresponds to L=0 dB. The results are given in Fig.1.3.

Fig.1.3.

The audibility range of phase changes for three observers as a function of f . Phase changes are audible for values of Af/f below the

data-. c

p o i n t s .

D o b s T B o obs JR A obsTE Af/ 05 25 Goldstein(1967a) P . - - 0 - - Q . 10 20 40 8D kHz fr > 12

(22)

The audibility appears to be fairly independent of the frequency range.

Similar results were obtained by Goldstein (1967a) for a

three-component signal with L=-6 dB.He measured the maximum frequency

separation between the components for which a difference between an AM

and a QFM signal was audible.His results are given in Fig.1.3 as a

dotted curve.

1.3. The audibility of phase changes in broadband signals

There are very few reports in the literature of a limited

audibility range of phase changes in broadband signals.As mentioned in

section 1.1, Bilsen (1968) measured the audibility of the difference

between low-pass filtered periodic pulses and periodic noise.The

frequency components of a periodic pulse are all in cosine phase while

the phase relations are random for periodic noise.Bilsen's experiments

were not aimed at deciding precisely which of the higher components

gives the phase information.This may be ths component nearest to the

cut-off frequency,or one of the other higher components since the

high-frequency slope of the low-pass filter was only 35 dB/oct.

Therefore, an experiment was devised to investigate the audibility

range of phase changes in a periodic pulse. One of the components of the

signal was given an adjustable phase.This was realised by cancelling the

original component with a phase-lock generator(HP-3203A) which gives a

supression of at least 60 dB. A component of the same frequency and

adjustable phase is subsequently added by means of a second phase-lock

generator. The observer could manipulate the phase at will. By changing

the harmonic number of the component we investigated for which harmonic

number a phase change of the component was audible. The level of the

periodic pulse was chosen such that the level of the harmonic under

investigation was always 40 dB SL. The whole procedure was repeated for

several values of the pulse repetition rate, thus giving data on the

audibility range of phase changes in a broadband signal. As this

experiment was performed by one observer only,the results have to be

used with caution. The results are plotted in Fig.1.4 together with

those of Bilsen (1968). For our results f is the frequency of the

component for which a phase change was audible and Af is the periodicity

(23)

Fig.l.4.

The audibility range of phase changes in broadband signals as a function of the frequency region. The mean of our datapoints of Fig. 1.3 is also plotted.

25 .5 rO 2fl 4.0 8.0 kHz fc >

of the low-pass filter. The agreement is reasonably good. The mean of the audibility ranges found in section 1.2.2 for three-component signals has also been plotted in this figure.

1.4. Conclusions

The measurements on the audibility region of phase changes as a function of the sideband level L have demonstrated that an explanation based solely on time-structure detection is hardly tenable. The

measurements on the audibility range as a function of the frequency f do not help us in the search for an alternative explanation for the phase effects. The resolution of the frequency analysis,responsible for the limited audibility,can be estimated from these results. It appears to be proportional to the frequency region because the audibility range is approximately a constant fraction of the central frequency. This conclusion confirms statements by other investigators (e.g. Zwicker,1967) about the constant relative bandwidth of the auditory "bandpass filters". It is not surprising that the data on the limited audibility of phase changes agree reasonably well with the "critical band" as the latter is a rough estimate of the frequency resolution of the auditory system. However, our data are not in agreement with the conclusion of earlier workers in this field, that time-structure detection is the reason for

the ear's phase sensitivity.

^k

.05

_.^._

periodic three-co periodic

**fc^-*~~**

pulse (obsTB) mponent signal noise/pulse (Bilsen

^^^^.

1968) ^ 1 14

(24)

CHAPTER 2. THE NATURE OF SUBJECTIVE PHASE EFFECTS

The foregoing chapter concentrated on the mere audibility of phase changes. Conditions were found which have to be fulfilled if phase changes were to be audible as a change in the sensation evoked by a complex signal. In this chapter the nature of phase effects will be discussed by describing the aspects of the sensation that vary when a phase change in a complex signal is heard.The phase dependence of the prominence of the residue pitch is quantified.

2.1 . Literature

2.1.1. Phase effects in three-component signals

Mathes and Miller (1947) described phase effects in a three-component

signal as a change in the "roughness" or "harshness" of the sound as a

whole. Terhardt (1967) measured the change in roughness for one specific

phase change.The second effect reported by Mathes and Miller was a

change in the prominence of a "basic pitch". A doubling or tripling of

this basic pitch has also been found. Goldstein (1967a) described the

subjective differences between AM and QFM sounds. For modulation

frequencies larger than 20 Hz he described AM as being "buzzier" than

QFM. An increase in the modulation frequency reduced the "buzziness" and

both sounds became tonal musical complexes. Ritsma and Engel (1964) found

a pitch variation as a function of phase. These pitch differences were

differences in the "residue pitch" or "basic pitch",

Residue pitch is a low pitch,roughly corresponding to that of a

pure tone with a frequency equal to that of the common (but absent)

fundamental of the complex (Schouten,1940). This fundamental is not

present in the power spectrum of the acoustic signal, nor is it

generated by nonlinear effects (Licklider,1954). For a survey of the

literature on residue pitch the reader is referred to articles by

Schouten (1970) and Ritsma (1970). The influence of the phase in a

three-component signal on the residue pitch was measured by Ritsma and

(25)

corresponding to changes in the "time structure" of the signal. In this

case "time structure" means the position of pronounced peaks within the

envelope of the signal and not the change in the envelope itself. The

doubling and tripling of pitch as reported by Mathes and Miller was not

found by these authors. However, the results of Ritsma and Engel were not

confirmed by Patterson (1973) who repeated their experiment. Experiments

by Vightman (1973) did not show any influence of phase on the value of

the residue either. Licklider (1955) noticed that the possibility of

hearing the residue pitch at all in a five-component signal depends on

the phase relation between the components. Bilsen (1973) found that the

masking of the residue pitch of a three-component signal by white noise

depends on the phase relation of the components if Af/f <0.12.

2.1.2. Phase effects in broadband signals

The most prominent change in the subjective quality of a sound due

to phase changes is called a change in timbre in almost every paper on

phase effects. We have not tried to define the term "timbre" sofar; in

general we will follow the definition given in the American Standards

Association (1960), which reads: "Timbre is that attribute of auditory

sensation in terms of which a listener can judge that two sounds,

similarly presented and having the same loudness and pitch, are

dissimilar". Plomp (1970) argued that this definition is a negative one,

since every aspect of sensation that is not pitch nor loudness nor

duration must then be timbre. Stating that a phase effect in a stationary

signal is a change in timbre would thus mean that the effect may be

anything except a change in pitch or loudness.

Licklider (1957) reported that a change of phase in a 16-harmonic

complex sound gives rise to changes in timbre and changes in pitch.

Schroeder (1959) made more explicit statements about the nature of phase

effects in signals containing up to 31 harmonics. He reported that

changes in timbre related to the peak factor of the signal could be

heard, as could distinct tones caused by manipulation of the phase,

Another interesting statement by this author is that little or no

subjective change is produced by a variation of the phase which leaves

the envelope of the stimulus unchanged,

Very limited progress has been made in the quantitative measurement

(26)

of timbre changes. The difficulties encountered in measuring a change in

timbre may be due to the negative definition of this concept. Plomp and

Steeneken (1969) have used a promising approach, the "multi-dimensional

scaling method", which can be used for comparing the timbres of sounds

with different phase configurations. It has proved possible in this way

to express the effects of phase changes in a sound in terms of an

equivalent change in the slope of the amplitude spectrum.

The variations in the pitch of a complex tone as a function of

phase has been investigated by Patterson (1973) for 6- and 12-component

signals. These experiments were repeated by Wightman (1973). A change in

the phase relation of the components from cosine to random phase did not

have any effect on the pitch of the stimulus as a whole. The experiments

were performed by comparing the test signal with a reference stimulus.

Their results do not support the results of Licklider (1957). A common

aspect of all investigations on phase effects is that the explanation is

mainly based on the assumption that the time structure of a stimulus is

detected by the auditory system.

2.2. The prominence of residue pitch in a three-component signal

2.2.1. Qualitative observations

Our own observations on phase effects in three-component signals

have confirmed the statements in the literature about the changing

roughness of a sound as a function of phase. This is certainly the case

for small frequency separations between the components, say up to 100 H z .

The changing roughness was less apparent for frequeny separations

approaching the boundaries of the audibility region as found in section

1.2.1. The change may then be described as one in "timbre" or "tonality".

It appears from this that a different subjective criterion is used in

assessing phase changes for small and large frequency separations. For

small separations the roughness difference is very apparent, while for

large separations it is a matter of different "tonalities". Some

indication of the existence of the two different criteria may be found

e.g. in a paper by Goldstein (1967a), who reported that as the frequency

separation increased an AM and a QFM signal were initially very

(27)

It might be possible that the frequency separation where the judgement

changes from "dissimilar" to "similar, but still different", is caused

by a change in the subjective criterion.

We observed no obvious pitch changes in the residue pitch although

the prominence of the residue pitch does depend on the phase in a

three-component signal. No doubling or tripling of residue pitch could be

observed. However, at low values of the frequency separation between the

components,say Af=30 Hz, the periodicity of the "rattle" is doubled if

the signal is changed from AM to QFM. The only effect of phase on the

residue pitch was concerned with the prominence of the latter. Therefore

a series of experiments was devised to measure the prominence of residue

pitch as a function of the phase 6 in a three-component signal,

2.2.2. Signal, method and apparatus

The pitch of a stimulus consisting of three components may be

ambiguous. The same stimulus can lead to a residue-pitch sensation as

well as to a pitch sensation that corresponds to the frequency content

of the signal. If the same stimulus is presented to an observer N times,

in between other stimuli, then the fraction of the N responses

corres-ponding to residue-pitch perception is assumed to be a measure of the

prominence of this residue pitch. An estimate of this fraction, which is

subject-dependent, can be obtained by the following method. Stimuli are

presented in pairs to an observer according to the time diagram of

Fig.2.1.

Fig.2.1,

Timing diagram of the stimu-lus presentation used in the experiments on the prominence time > °f ^^^ residue pitch

Stimulus 1 consists of 9f , lOf and llf and stimulus 2 consists of o o 0

8g , 9g and lOg ; g being slightly higher (5% or 10%) than f , see

Fig.2.2.

The observer is asked to report which one of the two stimuli has the

higher pitch. If he finds stimulus 1 higher than stimulus 2 there are

several possible ways this judgement could have been reached. The most

obvious one is that his judgement is based on a correlate of the spectral

centres of gravity of both signals. But as the residue pitch itself

, fie --Kb,

, fie .

(28)

s t i

st2 frequency

-9'o 10fo 11'o

8 g , 9g^10g^

F i g . 2 . 2

Power spectrum of the signals used to measure the prominence of the residue pitch. The dashed lines represent the pitch of the absent fundamental.

is ambiguous (Schouten et al.,1962) the observer's judgement might also

be based on a residue pitch in stimulus 1, not corresponding to f and

slightly higher than a residue pitch in stimulus 2. If the observer

judges stimulus 2 higher than stimulus 1, then this is only possible

if he heard one of the residue pitches in stimulus 1. The frequency of

occurrence of this response,which has been called the "correct" response

in this study, is assumed to be a measure of the "prominence" of the

residue pitch. This method does not tell us anything about the perception

of stimulus 2, which is merely used as a reference. This stimulus

confi-guration is analogous to those used by Patterson (1969) and Smoorenburg

(1971). Pairs of stimuli were presented in a random order and in

alteration with other stimuli. The percentage of "correct" responses is

comparable with the "performance" used by Houtsma and Goldstein (1972),

The signals were generated with a hybrid computer (AD-4,IBM 1800)

and recorded with a tape recorder (Ampex FR-1800). Distortion products

and noise were always more than 45 dB below the signal level. The phase

relation of the components in stimulus 1 was varied; it was kept constant

in stimulus 2 in order to have a constant reference. The tape was played

back and the signals were presented monotically (via a Grason-Stadler

earphone type TDH-39) to the observer who was seated in an acoustically

insulated room. The sensation level was 40 dB SL on the average unless

stated otherwise. The same tape was played a number of times to get

about 80 presentations for each phase relation. Two types of reference

stimuli (g =1.05f and g =1.10f ) were used in a random order on the O O o o

same tape, in order to make the pitch changes less monotonous.

A three-component signal with amplitudes A, B and C, phases p , p .

(29)

carried out with a combination of the 9th, 10th and 11th harmonics of

200 Hz. Only the phase 6 (=p„-(p.+p-)/2) was varied during the experiment.

By definition we took 6=0 for all three components in cosine phase.

While the following topic is not really vital for a proper

under-standing of the results presented below, it should be realized that "de

Boer's phase rule" permits such a reduction of the number of phase

permutations to be investigated. This can be shown as follows. Let the

frequencies of the components be f -Af,f and f +Af and their phases

p., p . and p. respectively. The phase rule states that a phase change

equal to a+bf, where f is the frequency of the components, does not

change the sensation. Because a and b are arbitrary constants they may

be chosen such that the following relations hold:

Pj+a+b(f^-Af)=0 (2.1)

p +a+b(f^+Af)=0 (2.2)

In this case the phase of the central component changes from p„ to

p -(p +p-)/2. This implies that an arbitrary phase permutation is

equi-valent to one in which the phases of the sidebands are zero and the phase

of the central component is given by 9 which is equal to p -(p +p-)/2.

Thus, only phase variations for which 0<A6<180° involve a perceptive

change and have to be examined. In other words, it does not matter which

phase (p,, p~ or p.) is changed as long as 6 has the desired value.

2.2.3. The prominence of residue pitch as a function of 9

The results for a sensation level of 40 dB are given in Fig.2.3 for

three observers. As described in the previous section, each measured

point represents the percentage of "correct" responses for one particular

phase angle 9. Inspection of Fig,2,3 shows that one value of 6 is least

favourable for hearing the residue pitch of the stimulus; however the

value of 9 in question varies from observer to observer.

Results for 30 and 50 dB SL are shown in Fig.2,4 for one observer.

It may be noted that the minimum in the curve becomes less pronounced

and shifts towards higher 6 with increasing intensity. It may further be

concluded that the envelope of the external time structure cannot be

related directly to the prominence of the residue pitch, since the

minimum would be expected to be at 6=90 in that case. It would be

(30)

looses

% 100 if) (/)

150

in 01 A i A = o = A fi 5% 10%

• \

k

0—

S

? B 8 1

obs.TB 40dBSL Fig.2.3. Fraction of "correct" responses as a function of 9 for three obser-vers. The sensation level of the signal is 40 dB above threshold. The smooth curve is made by eye as a best fit to the datapoints. The 5%- and 10% points are obtained with a reference stimulus whose absent

fundamen-tal g was 5% resp. 10% higher than f . 60 120 180" % 100 t/1

Iso

UI "o 01 i-o 0 A = 5% o = . o > ^< A ^ A . 10%

h-7H

obsJF 40dBSL

>

I/'

°/

/

> o o 5 i o A I

r\J

_N

o i A 0 60 120 180"

difficult to explain the difference between the different observers and

the level dependence on this assumption.

Measurements carried out at f =400 Hz gave analogous results, which

have not been plotted because they give no additional qualitative

infor-mation. In this case the prominence of the residue pitch is diminished

and the minimum is found at a value of 9 different from that for

f =200 Hz. o

(31)

1^8f—a—«—a—8—a—fi—a—SH-*

Fig.2.4.

Fraction "correct" responses as a function „ of 9 for levels of 30 •'SO and 50 dB SL. The o—o—I smooth curve is a best

fit made by eye to the measured points.

2,3. The audibility of separate harmonics in broadband signals

If the phases of harmonics above the fifth are varied in a periodic

pulse, changes in timbre are clearly audible. A phase shift of a

harmo-nic above the 10th-15th also makes it easier for an observer to "pick"

this harmonic out. Plomp (1966) showed that an observer can identify the

individual harmonics in a periodic pulse up to the 10th. Observations

during the measurements of section 1.3. however, have shown that if e.g.

the 15th harmonic is shifted in phase by 90 then this harmonic is also

very easily audible. This qualitative finding confirms Schroeder's

(1959) results. It can be explained by the same reasoning as given by

Duifhuis (1972) to account for the audibility of a high harmonic in a

periodic pulse although the harmonic itself is removed from the signal.

Due to the limited bandwidth of the frequency analysis of the ear, the

harmonic is observable in the short-time frequency spectrum of the

signal. Cancellation of a component of the acoustic signal is equivalent

to the addition of a component with equal amplitude but opposite phase.

(32)

Duifhuis argued that this component becomes audible if the bandwidth of

the bandpass filters comprises several harmonics of the signal.

A phase change of p in a component of amplitude D is equivalent

to the addition of a component with the same frequency, an amplitude

equal to D/2-2cos (p) and a phase of p/2+90 . This component will be

audible by analogy to the situation described by Duifhuis. Duifhuis

(1972) gave a more detailed description of this effect.

An alternative explanation of the effect is given by Schroeder

(1975). His reasoning is based on changes in the long term spectrum due

to phase changes in the signal. A disadvantage of this explanation is

that the limited frequency selectivity of the ear and its short-time

analysis are not explicitly required for the effect. It is obvious

however that these must play a role because the underlying phase effect

is the same for the 15th harmonic of 200 Hz as for the 50th harmonic

of 10 Hz. In the latter case it is useless to consider the long-term

spectrum of the signal. For this reason the explanation as given by

Duifhuis (1972) is preferred because it also covers those cases where

time-structure detection clearly occurs.

2.4. Conclusions

In general, phase changes in an acoustic signal appear to affect

the roughness of a sound, the prominence of the residue pitch, the

timbre (roughness is not included in the term timbre, here) and the

possibility of picking a harmonic out of a broadband signal.

The measurements described in this chapter quantify the dependence

of the prominence of residue pitch on the phase 6. Further experiments

on the actual values of the "residue pitch" as a function of phase have

not been performed, mainly because no pitch change was found in an

exploratory experiment with the signals used in the previous

measure-ments.

Explanation of phase effects is made difficult by the fact that

the subjective effects cannot all be accounted for by one and the same

mechanism. For instance the roughness of a sound and its residue pitch

cannot be detected by the same mechanism because roughness is audible

in signals where residue pitch is absent. Consequently more than one

(33)

CHAPTER 3. MECHANISMS FOR THE EXPLANATION OF PHASE EFFECTS

Two possible mechanisms for monaural phase effects are discussed in this chapter, viz. time-structure detection by the auditory system and interaction of combination tones and acoustic frequency components within the ear. The experimental evidence indicates that both mechanisms contribute to the audibility of phase changes.

3.1. Time-structure detection

3.1.1. Envelope detection and fine-structure detection

The audibility of phase changes is frequently ascribed to the

capability of the ear to code and detect changes in the time structure

of the signal, since this is the only clue to explain phase effects in

a linear system (Mathes and Miller, 1947, de Boer, 1956 and Goldstein,

1967a). Two kinds of time-structure detection may be distinguished.

The first is "fine-structure detection". This implies the detection of

prominent peaks in the time function of the acoustic signal (Schouten

et al., 1962). Such a mechanism has been frequently invoked to explain

the residue pitch. The inverse of the time interval between two

promi-nent peaks of the signal is supposed to be equal to the frequency of a

sinusoid with the same pitch as the residue pitch. According to this

argument, detection of the fine structure would result in an audible

phase effect because the time function of the signal is profoundly

changed by a phase variation. Recent investigations, however, have

pro-duced some evidence against an explanation of residue pitch in terms of

time-structure detection. Terhardt (1972) has summed up some of these

arguments. Furthermore, Houtsm-a and Goldstein (1972) have shown that

monotic interaction of frequency components is not necessary for the

perception of residue pitch. Consequently, it is improbable that phase

effects are caused by fine structure-detection since the measurements

of chapter 2 have demonstrated that the prominence of the residue pitch

changes as a function of phase. Besides, de Boer (1956) has shown that

phase changes resulting in a different fine structure but a constant

envelope are not audible. This eliminates fine-structure detection as a

(34)

major source of phase effects in three-component signals. It does not

imply, however, that the fine structure of a signal cannot be coded by

the auditory system at all. It has been shown in the literature that

fine-structure detection is a predominant factor in the localization of

a sound source for signal frequencies up to 1500 Hz (see e.g. Blauert,

1974).

The second meaning of the term "time-structure detection" is

detec-tion of the envelope of a signal by the auditory system. The sensitivity

of the ear to envelope changes can be readily demonstrated by modulating

a pure tone of e.g. 2000 Hz with a sinusoid of 10 Hz. The instantaneous

amplitude variations are then audible. It is obvious that variations of

the envelope due to changes of the phases in a three-component signal

will also be audible if the frequency Af is of the order of 10 H z . It

will become increasingly difficult to detect the changes in the envelope

if the value of Af increases. Thus, for small values of Af envelope

detection is apparently an important source of phase effects. The shape

of the envelope of a three-component signal as a function of phase is

therefore studied in greater detail in the next section,

3.1.2. Envelope changes as a function of phase in a three-component

signal

A three-component signal with frequency components at equal

separations Af can be written as:

S(t)=Acos((io -Au)t+p.)+Bcos(u t+P2)+Ccos ( ((o^+Au) t+p„) (3.1)

where S(t) is the instantaneous value of the acoustic signal,

(J) =2IIf , A(jj=2nAf and A, B and C are amplitudes. The envelope of

the signal is given by:

E(t) = |Aexp(j((jj^-Au)t+jpj)+Bexp(ju^t+jp2)+Cexp(j(a)^+Aa))t+jp2)| (3.2)

E(t)=|Aexp(j(-Au)t+Pj))+Bexp(jp2)+Cexp(j(Aojt+P2))| (3.3)

The three terms of equation(3.3)may each be represented as a phasor in

a complex plane (see Fig.3.1). The varying amplitude of the envelope is

represented by the length of the resultant E which varies as its tip

travels round the ellipse (see Goldstein, 1967a). Obviously, this

length does not depend on a rotation of the coordinate axes in the

(35)

Fig.3.1.

Phasor diagram to illus-trate the time dependence of the envelope E(t) of a three-component signal given in equation(3.3) .

determined by the values of A, B and C and by the phase 9 where

9=p.-(p.+p,)/2, The invariance of the envelope for constant 0

corres-ponds to the inaudibility of phase changes under these conditions as

formulated in the phase rule of de Boer (1956),

In order to illustrate the type of envelope changes that occur as

a function of 9, Fig.3.2 gives the phasor diagrams, the spectra and the

envelopes of an AM and a QFM signal. For both signals A=B/2=C but 9=0

for AM and 6=90 for QFM sounds. The envelope of the QFM signal shows

twice the periodicity of the AM case. This doubling is audible as a

doubling of the tempo of the "rattle" or the "roughness" for small values

of Af. The changes in the envelope due to phase changes will be smaller

if A and C are small compared to B. This is the case for decreasing

values of L in the experiments on the audibility region in section 1.2.

de Boer (1956) showed that arbitrary changes in the phases p , p„

and p in a three-component signal leaves the envelope intact if Ae=0°.

However, this does not imply that the envelope is always changed if 9

is varied. Consider for instance a three-component signal with L=-6 dB

and change the phase 9 from +60 to -60 . The phasor diagram.s of Fig,3,3

show that in this case too the envelope is unaffected. This may be

generalized to all signals with equal amplitudes of the highest and the

lowest frequency component. A phase change from 6=9, to 9=-9 does not

affect the envelope. Closer consideration of the effect of such a

symmetrical change in 9 reveals that the change results in a reversal of

(36)

AMsignal ENVELOPE SPECTRUM PHASOR DIAGRAM

Im

9 = 0°

Re

QFM signal ENVELOPE SPECTRUM PHASOR DIAGRAM

Im

9=90"

Re

Fig.3.2. The envelopes of AM and QFM sounds, as derived from the phasor diagrams. In this case the ellipse of Fig.3.1 has become a straight line because A=C.

Fig.3.3.

Phasor diagrams showing that the time dependence of E, as

its tip travels along the dashed line, is the same for 9=+6, and 9=-e..

the envelope in time. Such a reversal does not affect the envelope if

the sidebands in a three-component signal are equal, because of the

symmetry of the envelope in that condition. These signals are suitable

for investigation of whether the audibility of a phase change is indeed

due to a change in the envelope, as might be concluded from de Boer's

phase rule. In other words, an experiment can be performed to

investi-gate whether de Boer's phase rule justifies the conclusion that the

(37)

envelope. If the subjective character of a sound is completely

deter-mined by the envelope of the acoustic signal and its frequency spectrum,

then we may expect that "symmetrical" phase changes, not obeying the

phase rule but leaving the envelope intact, will be inaudible. This will

certainly be the case for very small frequency separations between the

components of the complex (large period of the envelope) but it is not

self-evident for larger values of Af.

3.1.3. The audibility of differences between signals with identical

envelopes

Three-component signals with a centre frequency f and a frequency

separation Af were used in a psychophysical experiment on the audibility

of a difference between two signals for which the values of 6 were

opposite. The level of the sidebands was 6 dB below the level of the

central frequency component. The signals were generated with the set-up

given in Fig.3.4. sine generator program. generator phase lock generator •c Af ^Sl nhncD shift

'\

?

X

^c -01 phase shift

-|-tnree-comp signal e=+9i

+

three comp e i n o o l Q - Q. -"•J-"-" " " 1 Fig.3.4. Block diagram of the experimental set-up for inves-tigation of

differences between signals with

opposite values of

The observer was seated in an acoustically isolated booth and

manipula-ted a switch to select one of the two stationary signals. He also had

two push-buttons at his disposal to indicate whether the signals sounded

equal or different. He listened to the signals as long as he wanted. In

one experimental session 6 different values of Af were investigated viz.

Af=8, 16, 3 2 , 64, 128 and 256 H z . The frequency of a programmable

(38)

oscillator (Krone Kite 4131 R) was set by a DEC Lab-8E computer at one

of the randomly chosen possible values of Af. The observer's choices

were also registrated by this computer. Each value of Af was selected

6 times in one session. The reliability of the responses has been checked

by presenting not only the two signals with opposite phase but also

two completely identical signals. The results for these "reliability

signals" indicate whether the observer is biased in calling the two

signals different. Three different values of f were used, f =1000, c c

2000 and 4000 H z . Each observer made 5 complete runs for every value of

f . In all signals the level of the central frequency component has been

chosen at approximately 50 dB SL and 9=60 or 9=-60 . The results are

given in Fig.3.5 for three observers. The scores for the 5 sessions have

been added to give a total of 30 presentations for every value of Af.

The "reliability check" was performed 10 times for every value of Af.

The results do not show any appreciable bias of the observers. The

occasions on which two identical signals were judged different were so

rare that they can be ascribed to mistakes. A similar measurement was

carried out for 6=30 and f =2000 Hz in order to show that the results c

are not specific for 6=60 . The results for this signal and with two

observers are also given in Fig.3.5.

The data show that two signals with opposite phase 6 are

indistin-guishable if the frequency separation Af is smaller than 30 Hz.

Consequently, the sensation evoked by these signals is determined by the

frequency spectrum and the envelope only. There are two possible

expla-nations for the subjective difference between the signals if Af is

larger than 30 Hz. The first is that apparently the observed phase

effects are not due to envelope detection by the ear but to some other

mechanism based on nonlinear signal processing as elaborated in the next

section. Alternatively, the envelope, as coded in the action potentials

of the nerves, may not be the same as the envelope of the acoustic

signal. This reasoning is inspired by the finding, confirmed by all

observers independently that for e.g. f =1000 Hz and Af=64 Hz, a

dis-tinct difference in roughness is audible between the signals with

oppo-site values of 6. The roughness of a sound is apparently not uniquely

determined by the envelope of the acoustical signal but by some

(39)

30 20 10' 0 f^=1000Hz,9=60 obsJR yA

/

\

/ ^

« r - / - - 9 " ^ " ° " " ' ° f^=1000Hz,9z60 nl ObsTB 16 64 255 f^=1000Hz,9 = 60 ObsFB 64 256 15 64 256H2 30 20-lOj 0 fc=2000Hz,9=60 obsJR f^=2000H2,9=50 ObsTB 16 64 255 f^=2000Hz,9=60 O b s F B 15 64 2 5 5 H z f|,=4000Hz,9=50 obs J.R ₃₀ 20 10' 0 f^=4000Hz,9=60 ObsTB 16 64 255 fj.=4000Hz,9 = 6 0 -,1 O b s F B 256 30" 20 10 f^=2000Hz,9=30 ObsJR Fig.3.5. 256 f^=2000Hz,9=30 ObsTB 255

Results of the experiment on the audibility of a d i f -ference between signals with opposite values of 6. The abscissa is the frequency separation A f . The number of times an observer judged the two signals to be differ-ent is the ordinate. The dashed lines represdiffer-ent the results of the "reliability" signals (see t e x t ) .