|
|
||||||||
J Neurophysiol (March 1, 2003). 10.1152/jn.00271.2001
Submitted on Submitted 5 April 2001; accepted in final form 4 October 2002
Laboratory of Auditory Neurophysiology, Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205
| |
ABSTRACT |
|---|
|
|
|---|
Kadia, Siddhartha C. and Xiaoqin Wang. Spectral Integration in A1 of Awake Primates: Neurons With Single- and Multipeaked Tuning Characteristics. J. Neurophysiol. 89: 1603-1622, 2003. We investigated modulations by stimulus components placed outside of the classical receptive field in the primary auditory cortex (A1) of awake marmosets. Two classes of neurons were identified using single tone stimuli: neurons with single-peaked frequency tuning characteristics (147/185, 80%) and neurons with multipeaked frequency tuning characteristics (38/185, 20%), referred to as single- and multipeaked units, respectively. Each class of neurons was further studied using two-tone paradigms in which the frequency, intensity, and timing of the second tone were systematically varied while a unit was driven by the first tone placed at a unit's characteristic frequency (CF) if it was single-peaked or at one of multiple spectral peaks if it was multipeaked. The main findings were: 1) excitatory spectral peaks in the frequency tuning of the multipeaked units were often harmonically related. 2) Multipeaked units showed facilitation in their responses to combinations of two harmonically related tones placed at the spectral peaks of their frequency tuning. The two-tone facilitation was strongest for the simultaneously presented tones. 3) In 76 of 113 single-peaked units studied using the two-tone paradigm, facilitatory and/or inhibitory modulations by distant off-CF tones were observed. This distant inhibition differed from flanking (or side-band) inhibitions near CF. 4) In single-peaked units, the distant off-CF inhibitions were dominated by tones at frequencies that were harmonically related to the CF of a unit, whereas the facilitation by off-CF tones was observed for a wide range of frequencies. And 5) in both single- and multipeaked units, sound levels of two interacting tones determined whether the two tones produced excitation or inhibition. The largest facilitation was achieved by using two tones at their corresponding preferred sound levels. Together, these findings suggest that extracting or rejecting harmonically related components embedded in complex sounds may represent fundamental signal processing properties in different classes of A1 neurons.
| |
INTRODUCTION |
|---|
|
|
|---|
A fundamental concept
in neural processing of sensory information is the notion of
"receptive field." The receptive field of an auditory cortical
neuron (also referred to as the frequency response area in
the literature) is defined on a two-dimensional space of the frequency
and intensity (e.g., Shamma et al. 1993
; Suga
1965a
,b
). For neurons with single-peaked tuning
characteristics, the receptive field derived by single-tone stimulation
is typically characterized by an excitatory response area with a
threshold (the lowest sound level necessary to evoke a neuronal
response), a characteristic frequency (CF, the frequency corresponding
to the threshold), and a bandwidth (the range of frequencies
surrounding the CF to which the neuron responds, usually measured at 10 dB above threshold). For neurons that display nonmonotonic rate-level functions (Pfingst and O'Connor 1981
; Phillips
et al. 1985
; Wang et al. 1999
), a best or
preferred sound level is also used in the characterization. Finally,
side-band inhibition that flanks the excitatory response area could be
revealed by single-tone (if there is insufficient spontaneous activity)
or two-tone paradigms (Shamma et al. 1993
; Suga
1965a
,b
). The receptive field defined by these parameters is
referred to here as the classical receptive field.
A fundamental limitation in receptive field characterization is that
the nature of a receptive field is always limited or biased by the form
of acoustic stimuli used to define it. In the visual cortex, a
receptive field defined by spots of light or moving bars does not
necessarily reflect response properties of a neuron that integrates
inputs from a larger visual space. Often, such properties can only be
revealed by complex visual stimuli. Indeed, there has been growing
evidence that stimulus elements outside the classical visual receptive
field modulate a neuron's responses to both simple and complex visual
stimuli (Allman et al. 1985
; Gilbert and Wiesel
1990
; Kapadia et al. 1999
; Knierim and
Van Essen 1992
; Nelson and Frost 1985
;
Orban et al. 1987
; Tanaka et al. 1986
).
Accordingly, one may expect that the classical receptive field of an
auditory cortical neuron cannot adequately characterize response
properties when a broad-band stimulus is encountered (Nelken et
al. 1994
; Schulze and Langner 1999
; Suga and Tsuzuki 1985
).
In addition to neurons with single-peaked tuning characteristics,
neurons with multipeaked tuning characteristics have been found in the
auditory cortex of several species, including bats (Suga
1994
), cats (Abeles and Goldstein 1970
, 1972
;
de Ribaupierre et al. 1972
; Oonishi and Katsuki
1965
; Phillips and Irvine 1981
; Sutter
and Schreiner 1991
) and primates (Aitkin and Park
1993
). While the multipeaked tuning characteristics indicate
spectral integration of inputs outside the classical receptive field,
the extensive horizontal connections in the primary auditory cortex (A1) (Kadia et al. 1999
; Matsubara and Phillips
1988
; Ojima et al. 1991
; Reale et al.
1983
; Wallace et al. 1991
; Winer
1992
) suggest that integration of a broad range of spectral
inputs may also take place in neurons with single-peaked tuning
characteristics. In the mammalian A1, integration of multiple spectral
inputs has been explored extensively in bats (Suga 1994
)
and to a lesser extent in other mammals (Brosch and Schreiner
1997
, 2000
; Brosch et al. 1998
, 1999
;
Calford and Semple 1995
; Oonishi and Katsuki 1965
; Phillips et al. 1985
; Schulze and
Langner 1999
; Shamma et al. 1993
; Sutter
et al. 1999
). In the present study, we used a two-tone paradigm
to determine the extent of inputs to an A1 neuron from outside of the
classical receptive field. The assumption is that if a neuron receives
inputs from more than one frequency region, it may respond to multiple
spectral components in a manner not predictable from single-tone
responses. The two-tone paradigm has been used in many studies
(Brosch and Schreiner 1997
; Brosch et al.
1999
; Calford and Semple 1995
; Shamma et
al. 1993
; Suga 1965a
,b
; Sutter et al.
1999
) but has not been systematically tested in awake primates.
In some mammalian species such as bats, a class of specialized neurons
was found in the auditory cortex (Suga et al. 1978
, 1983
) and subcortical nuclei (Leroy and Wenstrup
2000
; Olsen and Suga 1991
; Wenstrup
1999
) that responds optimally to specific combinations of
spectral components in sonar signals that are harmonically related.
These neurons have been referred to as "combination-sensitive" neurons. In a broader sense, the combination sensitivity represents a
special case of contextual sensitivity. It has been suggested that
combination-sensitive neurons may account for other encoding problems
encountered by the auditory system including nonsonar signals such as
speech (Suga 1994
). However, evidence for such combination-sensitive neurons in the auditory cortex of primates has
been lacking (Schwarz and Tomlinson 1990
). An important
lesson from the studies of bat's auditory cortex is that well-chosen laboratory-generated stimuli can facilitate our understanding of
complex sounds such as species-specific communication calls.
Many natural and man-made sounds (e.g., species-specific animal
vocalizations, human speech, and music) contain harmonically related
spectral components. Because the peripheral auditory system functions
to segregate sounds into narrow frequency channels (von Bekesy
1960
), a fundamental task of central auditory systems is to
re-assemble these different channels into a coherent representation for
perception. How such a task is accomplished remains largely unknown. It
has been known from psychophysical studies that the perception of an
individual acoustic event depends on the spectrotemporal stimulus
context. In a natural acoustic scene, multiple frequency components
often co-exist and create spectral shapes with both coarse and fine
features. The perception of a single sound is profoundly influenced by
preceding, following, or concurrent sounds (Plack and Viemeister
1992
; Wright 1994
; Wright and Dai
1994
). For example, preceding or succeeding sounds can result
in elevated thresholds (Luscher and Zwislocki 1947
) or
alteration of subjective pitch and loudness (Stevens and Davis
1938
) of a single acoustic stimulus. Successive acoustic events
may be perceived as a single stream or segregated into different
streams depending on spectral composition, length, and temporal
separation of the events (Bregman 1990
). These
perceptual processes are possibly correlated with modulatory influences
on neuronal responses by stimulus components from "outside" of the
classical receptive field. Characterizing these influences and their
organizations are critical for bridging neurophysiology and perception.
We used a small, highly vocal New World monkey, the common marmoset
(Callithrix jacchus) as our animal model for the following reasons. Marmosets are highly vocal and have a well-described repertoire of communication calls (Agamaite and Wang
1997
; Epple 1968
; Wang 2000
;
Wang et al. 1995
). Marmoset vocalizations are composed
of distinct spectral components that are harmonically related. Such
harmonic structure is fundamental to the communication sounds of many
animal species. A large portion of the auditory cortex of marmosets is
located on the lateral surface of the temporal lobe, easily accessible
for electrophysiological studies (Aitkin et al. 1986
,
1988
). We conducted these experiments under the awake condition
to avoid potential complications imposed by anesthesia on cortical responses.
| |
METHODS |
|---|
|
|
|---|
Acoustic stimuli
Experimental stimuli included pure tones and two-tone
combinations. All acoustic stimuli were delivered in free-field
conditions by a speaker located ~1 meter in front of the animal. All
recording sessions were conducted within a double-walled, soundproof
chamber (Industrial Acoustics). The interior of the chamber was covered by 3-in acoustic absorption foam (Sonex, Illbruck). The speaker (B and
W Mode DM601) had a flat (±6 dB) frequency response from 50 Hz to 30 kHz. Stimuli were generated digitally on a computer using the full
range of a 16-bit D/A converter (DA3-4, Tucker-Davis Technologies) at
a 100-kHz sampling rate and attenuated to the desired sound pressure
level (PA4, Tucker-Davis Technologies). Stimuli from each set were
presented in random order. Typical tone duration was 100 ms (in some
cases 50-ms tones were used) with a rise-fall time of 10 ms.
Inter-stimulus intervals were
1 s. In the two-tone paradigms, the two
tones were generated separately through two channels of the D/A
converter and mixed before being sent to the speaker. We tested our
sound delivery system thoroughly and confirmed that harmonic artifacts
were at least 43 dB lower than the fundamental at 80 dB SPL (the
highest sound level used in this study). The difference grew as the
intensity of the fundamental decreased.
Preparation and recording procedure
Findings presented here were based on single-unit responses
recorded from A1 of two awake marmosets (n = 185). We
have developed a chronic recording preparation in this small primate
species (Lu et al. 2001
). Briefly, marmosets were first
adapted to sit quietly in a specially designed primate chair. Two
stainless steel head-posts were then implanted so that the animal's
head was immobilized during recording sessions. The skull over the
auditory cortex was exposed and covered with a thin layer of dental
cement and surrounded by a chamber that was sealed between recording
sessions. The auditory cortex was accessed using a single tungsten
microelectrode (A-M Systems, impedance 3-5 M
at 1 kHz) via a small
hole (~1mm) through the skull, the only opening on the skull at any
given time during the recording sessions. Each recording hole was
sealed by dental cement after several days of recordings. Daily
recording sessions, each lasting 3-5 h, were carried out for several
months in each animal. On average, two to four well-isolated single
units were recorded in each session. Spike waveforms were filtered, digitized, and detected using a template-matching discriminator (MSD,
Alpha-Omega Engineering) that is capable of sorting at least three
single units. We typically sorted one single unit, less often two
single units, from the signals recorded from a single electrode. The
advantage of this discriminator was that it allowed us to continuously
monitor the quality of recorded spikes in real time while recordings
were in progress and its template-matching rejected artifacts in the
signals produced by animal's movements.
In our experiments, the auditory cortex was approached laterally with electrodes entering perpendicularly. Spatial location of each penetration as well as the contact to the dura surface by the tip of an electrode was visually confirmed through an operating microscope at high magnification (typically ×40). Single units were encountered at all cortical layers, but the majority of the recorded data were from layers 2-4, judging by the depth and response characteristics. The location of A1 was determined by three criteria: tonotopic organization of A1, reversals in the gradient of CF at borders between A1 and the rostral and caudal fields, and change of response properties at the border between A1 and the lateral field (which was more responsive to noises). Neurons in A1 generally responded well to pure tone stimuli.
Data acquisition
Once a unit was isolated and its basic response properties were determined using single tones (e.g., CF, latency and rate-level characteristics), experimental protocols using two-tone stimuli were executed.
PROTOCOLS FOR DETERMINING CF AND RATE-LEVEL CHARACTERISTICS. For each isolated single unit, its CF and threshold to CF tones were first estimated manually and then determined precisely by a computer-controlled procedure. Tones at a sound level 10 dB above threshold were delivered over a range of frequencies spanning several octaves at 20 steps/octave. Each frequency was tested at least five times. This resulted in an accurate determination of the CF for the unit, calculated as the frequency that evoked the highest average discharge rate. This procedure was often repeated at several sound levels to obtain the frequency response area of the unit. Next, we measured a rate-level function (average discharge rate versus sound level) in each unit using the CF tone played at different sound levels. The threshold (Th) of the response was determined from the rate-level function. A preferred sound level (PL) was also calculated if the rate-level function was nonmonotonic.
PROTOCOLS FOR TWO-TONE STIMULI.
Two-tone interactions were studied by varying frequency, sound level,
and the onset delay between the two tones. The first tone (S1) was
fixed and the second tone (S2) was varied in frequency or sound level.
In most units, S1 and S2 were presented simultaneously. For a subset of
units studied, the onset delay between S1 and S2 was varied. The S1
frequency was usually fixed at the CF of units with single-peaked
tuning characteristics (Fig. 1) and at one of excitatory frequency peaks of units with multipeaked tuning characteristics (Fig. 2). The frequency
of the S2-tone was varied in small, linear steps over a large range of
frequencies (
2-3 octaves around the S1 frequency). The step size was
calculated by the formula CF/n (n = 12 or
24). The intensity of the S1 tone was generally kept at the preferred
sound level for nonmonotonic rate-level functions or 30-40 dB above
the threshold for monotonic rate-level functions. If the parameters
used in the preceding protocol did not result in any marked response
modulation (facilitation or inhibition) in two-tone responses as
compared with single tone responses, sound levels of the S1 tone and/or
S2 tone were varied and the two-tone protocol was repeated. Once we
identified the S2 frequency(ies) that elicited response modulation, we
then systematically tested these S2 frequencies across sound levels and
determined the best S2 sound level that produced the largest response
modulation. For all two-tone protocols, we tested each combination of
S1 and S2 for
10 (usually 20) repetitions. In addition, responses to the S1 tone played alone were recorded as a control. Stimuli in each
protocol were delivered in random order. However, it was not always
possible to evaluate the full range of parameters of two-tone
stimuli in each unit due to time constraints.
|
|
Data analysis
Single units were classified into two basic categories based on their single-tone responses: units with single-peaked tuning characteristics (referred to as single-peaked units hereafter) and units with multipeaked tuning characteristics (referred to as multipeaked units hereafter). Single-peaked units were responsive to single-tone stimulation in only one region of frequencies. Units that responded to more than one discrete region of frequencies were classified as multipeaked units. A multipeaked unit is distinct from a broadly tuned unit in that it has a region of nonresponsive frequencies separating the areas responsive to single-tone presentation.
The following measure was used to quantify the difference between
single-tone and two-tone responses in both single-peaked and
multipeaked units
|
In single-peaked units, two other measures, best facilitatory peak frequency and best inhibitory peak frequency, were used to quantify two-tone response modulations at particular S2 frequencies. Responses of A1 units to two-tone stimuli typically showed side-band inhibition at some S2 frequencies flanking the CF of a unit (as revealed by a reduction in discharge rates). These inhibitory regions are referred to as the flanking inhibitory regions. In some units, the inhibition could also be induced by S2 tones placed at frequencies outside the classical receptive field (i.e., CF-centered excitatory region plus flanking inhibitory regions). We referred to such inhibition as the distant inhibition. There was always a range of frequencies between the flanking inhibitory regions and distant inhibitory regions where S2 tones did not produce significant inhibition. The S2 frequency that caused the strongest distant inhibition was defined as the best inhibitory peak frequency. In some units, facilitation by the S2 tone could also be detected at frequencies outside the classical receptive field and is referred to as the distant facilitation. The S2 frequency that caused the strongest distant facilitation was defined as the best excitatory peak frequency. There could be more than one S2 frequency that caused facilitated or inhibited two-tone responses in a unit. For each unit, in addition to the best facilitatory (inhibitory) peak, there may be other secondary facilitatory (inhibitory) peaks where two-tone responses produced >30% increase (decrease) in discharge rate. We also calculated S2 frequencies corresponding to these additional facilitatory (inhibitory) peaks.
| |
RESULTS |
|---|
|
|
|---|
The results described in this report were from experiments performed in two awake marmoset monkeys. A total of 185 single units were recorded and classified into two categories based on their single-tone responses: units with single-peaked tuning characteristics (n = 147, 80% of total) and units with multipeaked tuning characteristics (n = 38, 20% of total).
Existence of neurons with multipeaked tuning characteristics
A1 neurons typically have one CF-centered excitatory response area, as shown by an example in Fig. 1. We observed in our experiments a substantial number of A1 units that had multiple, discrete excitatory response areas as revealed by single tones. Figure 2 shows three examples of such units, referred to as the multipeaked units. Each excitatory region can be described by the same parameters used to describe units with single-peaked tuning characteristics (i.e., CF, threshold, rate-level characteristics). In this report, we refer to the CF of the peak with the lowest frequency as CF1 in a multipeaked unit. The CFs of the additional peaks are referred to as CFn (n = 2, 3, ... ). A three-dimensional (3-D) response area for a multipeaked unit is depicted in Fig. 2Aa, where the discharge rate is plotted against both frequency and sound level. This unit responded strongly to tones of frequencies near 9.5 kHz (CF1) and 19 kHz (CF2) at 40 dB SPL (Fig. 2Ab, a cross-section of the 3-D frequency response area shown in Fig. 2Aa). Furthermore, this unit had nonmonotonic rate-level functions at both CF1 and CF2 (Fig. 2Aa) but with different preferred sound level (50 dB SPL at CF1, 60 dB SPL at CF2). Note that multipeaked units have regions of nonresponsive frequencies separating the excitatory regions across all sound levels. An important feature of multipeaked units was that the ratio of CFn and CF1 was often that of two integers (e.g., 3:2, 2:1, 5:2, 3:1, ...), which we considered as having a harmonic relationship or being harmonically related. For the unit shown in Fig. 2A, CF2 equals 2CF1.
Some multipeaked units had more than two peaks, such as the example shown in Fig. 2B. This unit had three excitatory peaks at 7 kHz (CF1), 10.5 kHz (CF2, ~1.5CF1) and 21 kHz (CF3, ~3CF1) respectively. There are some common characteristics of multipeaked units. First, the highest discharge rate evoked by a single tone could occur at any one of the peak frequencies (CFn, n = 1, 2, 3, ... ), not necessarily at CF1. Second, the minimum response latency could be different at each peak frequency. For example, responses near CF3 in the unit shown in Fig. 2B had shorter latencies than responses near CF1 and CF2. Third, the rate-level function at each peak frequency could be either monotonic or nonmonotonic. Fourth, response threshold at each peak frequency may differ as well.
An interesting observation of multipeaked units was that temporal discharge patterns at different peak frequencies were often not identical. By "temporal discharge pattern" we mean the distribution of spike times over a period of time of interest (usually the duration of a stimulus plus a short time period after the stimulus offset) as opposed to the count of spikes or mean firing rate over a period of time of interest. For example, offset responses were observed at CF3 for the unit shown in Fig. 2B and at CF2 for the one in Fig. 2C but not at other peak frequencies. Even in the same unit, temporal discharge patterns could vary across sound levels. Figure 3 shows responses of a multipeaked unit at two sound levels (10 and 40 dB SPL, respectively, Fig. 3, A and B). This unit had one excitatory peak at 10 dB SPL (CF1 = 14.5 kHz) and three excitatory peaks at 40 dB SPL (21.9, 28.1, and 35.3 kHz, corresponding approximately to 1.5CF1, 2CF1, and 2.5CF1, respectively) as indicated in Fig. 3C. While there were sustained discharges near 14.5 kHz at 10 dB SPL (Fig. 3A), the unit showed only onset firings at this frequency at 40 dB SPL (Fig. 3B).
|
Population properties of multipeaked units
Figure 4A shows the
relationship of the lowest response threshold versus
CF1 of 38 multipeaked units analyzed in this
study (open circles). In multipeaked units, the lowest response
threshold could occur at any peak frequency (CF1,
CF2, or CF3,). For
comparison, the distribution of response threshold across CF is also
shown in Fig. 4A for 113 single-peaked units (pluses)
recorded from the same animals. Two distributions are largely
overlapping. Most units had low thresholds in the range of 0-20 dB SPL
(single-peaked units: 82/113, multipeaked units: 31/38). We
concentrated our sampling in the central area of A1 near the range of
frequencies (6-8 kHz) where the first harmonic of many marmoset
vocalizations is centered (Agamaite and Wang 1997
).
|
The distribution of peak frequency ratios (CF2/CF1, CF3/CF2, CF3/CF1) for all multipeaked units is shown in Fig. 4B. A major peak near the ratio of 2 indicates that the most common relationship between the frequency peaks is that of an octave. The second largest peak in Fig. 4B is at the ratio of 1.5 (3/2) or half octave step. For 33 multipeaked units, we were able to determine the response threshold at all frequency peaks. Figure 4C shows the distribution of the threshold differences between response peaks at CF2 and that at CF1. In 18 of 33 units, thresholds at higher peak frequencies (CF2) were lower than that at CF1. The opposite was observed in 10 of 33 units. The remaining five units had approximately equal thresholds at CF1 and CF2.
Two-tone facilitation in multipeaked units
A natural question for multipeaked neurons is whether their
responses are facilitated when their multiple excitatory regions are
simultaneously stimulated. For example, the unit shown in Fig.
2A has two harmonically related peaks at ~10
(CF1) and 20 kHz (CF2). We
played the tones at these two peak frequencies simultaneously, fixing
the frequency of the first tone (S1) at CF1 and
varying the frequency of the second tone (S2) from 800 Hz to 30 kHz (in 36 logarithmic steps, Fig.
5, A and
B), both tones were delivered at 50 dB SPL. There were clear
modulations by the S2-tone at various frequencies (Fig. 5, A
and B). At frequencies near CF2, the
unit's response was facilitated, with the strongest facilitation at
CF2. The two-tone stimulation also revealed
flanking inhibitory regions near CF1 that have
been described in detail by others (Shamma and Symmes
1985
; Shamma et al. 1993
; Suga
1965a
,b
; Sutter et al. 1999
) as a part of the
response area of an A1 neuron. There are two features to note about the
facilitation near CF2 as shown in the dot raster
in Fig. 5A. First, the temporal profile of responses at
CF2 is different from that at
CF1: the latency is a little shorter at
CF2 and the response is sustained for a longer
period of time. Second, there are clear offset responses at
CF2 but little at CF1.
|
The two-tone facilitation was generally dependent on the sound level of the S2-tone. Figure 5, C and D, shows responses of the same unit as described in Fig. 5, A and B, when the CF2 sound level was varied between 10 and 80 dB SPL, while CF1 sound level was fixed at 60 dB SPL. Differences in temporal response profile are apparent in the PSTHs (Fig. 5C). In particular, discharges lasted longer for S2 at 50-80dB SPL. The corresponding rate-level function for the two-tone presentation is shown in Fig. 5D, where it is compared with the rate-level functions of CF1 and CF2 tones played separately. A line representing a linear summation is shown by adding the neural response at CF1 of 60 dB SPL to the rate-level function of the CF2 tone. It is evident that the two-tone interactions produced nonlinear facilitation at suprathreshold sound levels, where the responses to the two tones played together were stronger than the sum of the responses to both tones played individually.
In Fig. 5, E and F,, we explore further two-tone interactions at different sound levels in the same unit. Figure 5E shows the two-tone responses with CF2 tone at 10 dB and CF1 varying from 10 to 80 dB SPL. Although the CF2 tone by itself evoked no response, adding it to the CF1 tone increased the responses at all sound levels tested, as compared with the responses to the CF1 tone alone. This indicates that adding a harmonic, subthreshold tone can lower response threshold and enhance two-tone responses across sound levels in a multipeaked unit. Figure 5F shows the results from an experiment in which the roles of CF1 and CF2 were switched. Here, we fixed the CF1 sound level at 10 dB SPL and varied the CF2 sound level from 10 to 80 dB SPL. Two-tone responses again showed facilitation as compared with the rate-level function of CF2 tone alone, even though CF1 tone at 10 dB SPL did not produce any response. Furthermore, the rate-level function of the two-tone response has the similar nonmonotonic shape as the rate-level function of CF2 tone (Fig. 5F).
In 25 of 38 multipeaked units, we were able to carry out tests of two-tone influences. In most nonmonotonic units, we tested two-tone properties near the peak of the rate-level function of S1 tone to determine if there was a facilitatory response. The distribution of the percent facilitation measured in these units is shown in Fig. 6. A percent facilitation of 100% means that the two-tone responses are twice as strong compared with the single-tone responses. The properties of the two-tone facilitation for a wide range of CF1 frequencies tested indicate the generality of this mechanism at work in A1 neurons.
|
Facilitation by distant off-CF tones in single-peaked units
Using the two-tone paradigm, we observed in single-peaked units, facilitation by the second tone (S2) placed outside of a unit's central excitatory area and flanking inhibitory regions. We refer to this type of two-tone facilitation as the facilitation by distant off-CF tones. Two examples of such responses are shown in Fig. 7. The unit shown in Fig. 7, A and B, has a CF of 4.53 kHz, a threshold of 20 dB SPL and a preferred level of 40 dB SPL. Figure 7, A and B, shows the result from a two-tone test for this unit, representative of the tests given to other units. The two-tone responses were obtained for a range of S2 frequency (0.35-13 kHz). The two-tone paradigm revealed flanking inhibitory regions at frequencies near the CF (Fig. 7A). The reduction of discharge rate at CF was due to nonmonotonic rate-level characteristics of this unit (Fig. 7B). There were facilitatory responses away from the CF near a harmonically related frequency (7 kHz, ~1.5 times CF), where the two-tone responses were much stronger (~150%) than the single-tone response at CF (Fig. 7B). Note that the frequency of this facilitatory peak is outside the flanking inhibitory regions of the unit. In the single-tone test, there were no responses at these distant frequencies. Moreover, two-tone facilitation was also observed at several other higher frequencies (e.g., near 11.4 kHz) though much smaller in magnitude than the facilitatory responses near 7 kHz (Fig. 7, A and B). An interesting feature of the two-tone facilitation with the S2 tone near 7 kHz was that it resulted from an increase of sustained discharges (Fig. 7A). The response was largely onset when the S1 tone was played alone.
|
Figure 7, C and D, show data from another representative unit. The two-tone responses (Fig. 7D) revealed best facilitation with the S2 tone near 3 kHz (~2 times CF), where discharge rates were ~250% higher than the single tone response at CF. This facilitatory region is outside the flanking inhibitory region. The temporal profile of the two-tone facilitatory responses near 3 kHz differed from that of the single-tone response in that response latencies were much shorter and sustained responses were much stronger (Fig. 7C).
Distant inhibitory influences from outside the response area
Because spontaneous rates of most A1 neurons are generally low, it is not always possible to observe inhibitory side bands using single tone stimuli. Inhibitory response areas flanking the CF-centered excitatory response area could be revealed using simultaneously played two tones. In addition to the flanking inhibition, we observed inhibition at distant frequencies outside the response areas of a neuron. Figure 8 depicts three representative units in which an S2 tone at a distant frequency elicited inhibition. The unit shown in Fig. 8A had a CF of 4.42 kHz and was inhibited by S2 tones at frequencies near 2*CF. Sometimes inhibitory influences were observed at frequencies further than one octave away from CF, as shown by the example in Fig. 8B. This unit had a CF of 12.25 kHz. The S2 tone at 3*CF completely inhibited the response to the CF-tone (Fig. 8B). The example in Fig. 8C illustrates that in some single-peaked units, both facilitation and inhibition were observed at different S2 frequencies. This unit had a CF of 14.7 kHz and was inhibited at a harmonically related frequency (0.5*CF) but facilitated at another frequency (17.2 kHz).
|
The examples shown in Figs. 7 and 8 demonstrated both facilitatory and inhibitory influences from outside of the response area, often observed in the same unit. As is evident from some of these examples, in addition to changes in discharge rate, two-tone interactions also altered temporal discharge patterns (Fig. 7, A and C). Two additional examples shown in Fig. 9 further demonstrate this point. In Fig. 9A, two-tone responses revealed both facilitatory and inhibitory regions away from CF. Interestingly, a new, short-latency onset response was observed in two-tone conditions for S2 frequencies in the range of 6-7 kHz (indicated by a red square in Fig. 9A). This onset component did not exist at any frequency in single-tone conditions. Figure 9B shows a unit that had strong facilitation at S2 frequencies near 1.5*CF. The facilitation was manifested by increased sustained discharges near the facilitatory frequencies (Fig. 9B).
|
Distribution of spectral integration peaks in population of single-peaked units
A total of 76/113 single-peaked units showed some forms of response modulation (facilitation or inhibition) by S2 tones distant from the CF of a unit. In 51/113 units, we were able to determine a locus of facilitation distant from CF. In a separate but overlapping population of units (51/113), we were able to determine a locus of inhibition distant from CF. Both distant facilitation and inhibition were observed in 27/113 units. These numbers are probably an underestimate of the extent of two-tone modulations in single-peaked A1 units because it was not always feasible to study each unit with all possible combinations of two-tone frequencies and intensities.
Figure 10A shows the
distribution of best facilitatory peaks in single-peaked units (see
METHODS). In some units, the facilitation peaks had
harmonic relationships to the CF (0.5*CF, 1.5*CF, etc.), while in
others nonharmonically related frequencies evoked facilitation. Figure
10B shows distribution of best inhibitory peaks in
single-peaked units (see METHODS). Some units (26/51 = ~51%) had distant inhibition at harmonically related frequencies
(0.5*CF, 1.5*CF, 2*CF, 3*CF), most noticeably at 0.5*CF and 2*CF (Fig.
10B). Data in Fig. 10, A and B,
represent the most prominent facilitation and inhibition in each unit,
respectively. In some units, there was more than one facilitatory or
inhibitory peak at distant S2 frequencies. Frequency for these
secondary peaks were calculated in these units and included in the
overall distribution of all modulatory influences (139 peaks
facilitatory and inhibitory) in Fig. 10C. There are three prominent peaks in this distribution at 0.5*CF, 2*CF, and 3*CF,
respectively.
|
Sound level dependency of two-tone facilitation and inhibition
The two-tone facilitation and inhibition were found to depend on sound levels of both S1 and S2 tones. In general, this sound level dependency was closely related to the characteristics of rate-level functions of S1 and S2 tones. The preferred sound levels at CF1 and CF2 frequencies were often not the same. The unit shown in Fig. 11 responded after stimulus onset at 20-30 dB SPL to CF1 tone (2.46 kHz), but it responded after stimulus offset to CF2 tone (4.92 kHz) at higher sound levels (70-80 dB SPL; Fig. 11A). This offset response is an indication of release from inhibition that keeps the unit from responding to the CF2 tone at higher sound levels during the stimulus. We tested this unit over a range of CF2 sound levels using the two-tone paradigm with the CF1 tone set at its best level of 70 dB SPL (Fig. 11B). The strongest facilitatory responses with the shortest latencies were observed when the CF1 and CF2 tones were played together at their respective preferred sound levels (Fig. 11B). The two-tone response as a function of CF2 sound levels (Fig. 11C) mirrors the rate-level function of the CF2 tone alone (Fig. 11A). The two tones evoked facilitatory responses for S2 tones at 20-30 dB SPL but inhibitory responses for S2 tones at 50-80 dB SPL. This example shows that facilitatory or inhibitory interactions for two-tone paradigms are dependent on the sound levels of both tones and that when both tones are played at their respective preferred levels, the maximum facilitation is evoked.
|
Figure 12 shows the dependency of facilitation and inhibition on the sound level of two tones in another representative multipeaked unit. This unit had two frequency peaks at 8.37 kHz (CF1) and 16.7 kHz (CF2), respectively. The rate-level functions at both frequencies were nonmonotonic (Fig. 12A). We tested two two-tone paradigms for this unit. In Fig. 12B, the CF1 tone was played at 40 dB SPL (near the peak of its rate-level function, Fig. 12A) together with the CF2 tone at varying sound levels (0-80 dB). The two-tone response, largely facilitatory (Fig. 12B), follows the shape of the CF2 rate-level function (Fig. 12A). In Fig. 12C, the CF2 tone was played at 50 dB SPL (near the peak of its rate-level function, Fig. 12A) together with the CF1 tone at varying sound levels (0-80 dB). The two-tone response profile showed a nonmonotonic shape similar to the CF1 rate-level function (Fig. 12A). Two-tone responses were facilitatory for CF1 tones at 20-30 dB SPL but inhibited as compared with the one-tone condition when the CF1 tone was at 80 dB SPL (Fig. 12C). This example also demonstrates a general observation, i.e., the changes in two-tone responses over sound level closely follow the rate-level characteristics of the second tone that presumably drive the modulatory inputs to the unit.
|
The dependency of two-tone facilitation on S2 sound level described for multipeaked units in Figs. 11 and 12 was also observed in many single-peaked units. A representative example is shown in Fig. 13 (A and B) (the same unit as shown in Fig. 7, C and D). The S1 tone (at CF) and the S2 tone (at 2*CF) were played simultaneously with the S2 sound level varying from 30 to 80 dB SPL. The two-tone facilitation was observed over a specific range of S2 sound levels (60-80 dB SPL; Fig. 13B). Additional examples of two-tone facilitatory responses as functions of the S2 sound level are shown in Fig. 13C. In most cases when we were able to determine a range of frequencies away from CF for which two-tone stimulation resulted in facilitation, the facilitation was highly dependent on the sound level of the S2 tone.
|
Temporal aspects of two-tone facilitation
As the examples in the previous figures for both single- and
multipeaked units have shown, responses to two-tone stimuli may include
up to three components: onset, sustained, and offset discharges. Given
the variety in the temporal discharge patterns, it is important to know
how far two tones could be separated in time and still have
interactions to modify neural responses. We examined this issue in a
subset of units by varying the delay between the onsets of S1 and S2
tones. A typical example of a single-peaked unit is shown in Fig.
14A.
Responses to two simultaneously presented tones (with the S2 frequency
varying) for this unit were shown in Fig. 7, A and
B. The onset delay was varied over a range of 0-125 ms. At
the onset delay of 25 ms, the S2 tone (at a facilitatory off-CF
frequency) started 25 ms before the S1 tone (at CF of the unit) (Fig.
14Aa). The durations for both tones were 100 ms. The two
tones were completely overlapping at the onset delay of 0 ms, and
nonoverlapping at onset delays of
100 ms. Typically in such
experiments, we also recorded the control response (S1 alone) with each
experimental condition (S2 with delayed S1). Figure 14Ab
shows the percentage of change in discharge rate as a function of the
onset delay. The results indicate that the two tones have to overlap to
some extent in time to achieve any facilitation and that the
facilitation becomes weaker as the onset delay increases.
|
Figure 14B shows an example of a multipeaked unit. When both CF1 and CF2 tones (50 ms in duration) were played simultaneously, there was a strong facilitation. The single-tone response in this unit ended within the first 25-30 ms after the tone onset (Fig. 14Ba). When the onset delay between the two tones was <25-30 ms (i.e., the S1 tone was played before the response to the S2 tone was ended), two-tome facilitation was observed. Once the response to the S2 tone ended, the two-tone responses were weaker than the response to the S1 tone alone (Fig. 14Bb). In all the units where we tested the effects of varying the onset delay, the maximum two-tone response was always observed when S1 and S2 tones were delivered simultaneously, as demonstrated in the two examples in Fig. 14.
| |
DISCUSSION |
|---|
|
|
|---|
This study demonstrated excitatory and inhibitory modulations in A1 neurons arising from tones outside of the classical receptive field (defined by a central excitation area surrounded by flanking inhibitory areas). Approximately 20% of the units in marmoset A1 were found to have multipeaked tuning characteristics. The distribution of peak frequency ratios (e.g., CF2/CF1) showed maxima at 1.5 and 2 (Fig. 4), indicating a harmonic structure in the multipeaked A1 units. One of the functional properties that characterized units with multipeaked tuning characteristics was the facilitation in the response when these units were stimulated by two tones at peak frequencies with appropriate combinations of sound levels. Two-tone facilitation was strongest when the two tones were played nearly simultaneously. All units with multipeaked tuning characteristics that were tested for two-tone interactions exhibited facilitatory responses. We also found that responses of A1 units with single-peaked tuning characteristics (~80% of samples) could be modulated by tones over a wide range of frequencies away from CF. In the population of single-peaked units, the harmonic structure observed in two-tone interactions was closely linked to inhibition rather than facilitation (Fig. 10).
Technical considerations
Because unwanted harmonics are commonly generated by nonlinearity
in acoustic systems, it is important to rule out the possibility of
these artifacts in the observations of multipeaked tuning
characteristics. We tested our sound delivery system thoroughly and
found that harmonics were
43 dB lower than the fundamental for tones
delivered at the highest sound level used in this study (80 dB SPL). As sound level decreased, the magnitudes of the harmonics dropped at rates
faster than the fundamental. Moreover, several lines of evidence from
our data argued against the possibilities of artifacts. First, only a
subset of units recorded in our experiments had multipeaked receptive
fields (~20% of samples). The response thresholds of these units
were comparable to the thresholds of the single-peaked units (Fig.
4A). Nonlinearities generated by our stimulus delivery
system should affect all units with similar response thresholds.
Second, temporal discharge patterns produced by single tones at
different frequency peaks in the same unit were generally different
(e.g., Figs. 2 and 3). Harmonic artifacts should result in temporal
discharge patterns similar to that produced by the fundamental
component. Third, temporal profiles of two-tone responses can be
different (e.g., having longer sustained components or new offset
responses, Figs. 7 and 9) from temporal profiles of single-tone
responses. Fourth, differences in thresholds between frequency peaks
were as little as 0 dB and no more than 20 dB in most multipeaked units
(Fig. 4C). Larger threshold differences would be expected if
acoustic nonlinearity contributed to the observed multipeaked tuning characteristics.
The statistics for the occurrence of multipeaked units (~20% in our study) have some experimental biases that are worth mentioning. We recorded from A1 neurons indiscriminately. That is, we studied every unit from which we could record stably. While we tried to record from the full extent of A1, we may have missed parts of A1 where these multipeaked units might be more or less concentrated. However, our samples of multi- and single-peaked units appeared to be distributed over a wide range of CFs (Fig. 4A). We chose the awake marmoset as our experimental model to avoid confounding effects of anesthetics. In the reported experiments, the animals were not required to perform behavioral tasks when the acoustic stimuli were delivered. Whether the two-tone responses characterized in our study are effected by different behavioral states, especially when an animal attends to or discriminate these acoustic stimuli, needs to be studied in the future.
Multipeaked neurons versus single-peaked neurons
Cortical neurons with multipeaked tuning characteristics have been
previously observed in several mammalian species. In bats, combination-sensitive neurons with multipeaked tuning characteristics were found in both primary and secondary cortical areas
(Fitzpatrick et al. 1993
; Suga 1994
) and
have been implicated in echolocation behavior. In cats, multipeaked
neurons have been reported in A1 (Abeles and Goldstein 1970
,
1972
; de Ribaupierre et al. 1972
; Oonishi
and Katsuki 1965
; Phillips and Irvine 1981
;
Reale and Imig 1980
; Sutter 2000
;
Sutter and Schreiner 1991
). In marmosets, some examples
of multipeaked neurons were reported (Aitkin and Park
1993
). The rate of occurrence of multipeaked neurons varied between species and between investigations of the same species, the
latter may have resulted from sampling biases between the studies
(Sutter and Schreiner 1991
). In the present study, the majority of A1 units (~80%) were found to have single-peaked tuning characteristics.
Single- and multipeaked A1 units share some functional properties. They both can be modulated by distant off-CF inputs that could be facilitatory or inhibitory, depending on the specific frequency and sound level of the off-CF tone. However, these two types of A1 units appear to differ in some fundamental ways. The two-tone facilitation and inhibition observed in single-peaked units are not predictable from their single-tone responses because these units do not respond to off-CF tones when presented alone. More importantly, multipeaked units exhibited harmonically related two-tone facilitation (Figs. 4 and 5), whereas harmonically related inhibition was found to be closely associated with single-peaked units (Fig. 10). The anatomical substrates that give rise to the physiological properties in both types of neurons may be similar. The functional differences between these two types of neurons could arise from different distributions of synaptic weights among a neuron's multiple off-CF inputs. In this case, multipeaked units <