JN AJP: Lung Cellular and Molecular Physiology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Neurophysiol 98: 2747-2764, 2007. First published September 26, 2007; doi:10.1152/jn.00294.2007
0022-3077/07 $8.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
98/5/2747    most recent
00294.2007v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (1)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Shaevitz, S. S.
Right arrow Articles by Theunissen, F. E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Shaevitz, S. S.
Right arrow Articles by Theunissen, F. E.

Functional Connectivity Between Auditory Areas Field L and CLM and Song System Nucleus HVC in Anesthetized Zebra Finches

Sarita S. Shaevitz1 and Frédéric E. Theunissen1

1Department of Psychology and Helen Wills Neuroscience Institute, University of California, Berkeley, California

Submitted 15 March 2007; accepted in final form 21 September 2007


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
A key discovery that has emerged from studies of the vocal system in songbirds is that neurons in these regions respond preferentially to playback of the bird's own song (BOS). This BOS selectivity is not a general property of neurons in primary and secondary auditory forebrain regions, field L and caudolateral mesopallium (CLM). Moreover, anatomical studies have been unable to conclusively define a direct projection from field L and/or CLM to HVC, a central structure for integrating sensory and motor information in the vocal system. To examine the communication between these regions, we used simultaneous dual-electrode recording in anesthetized male zebra finches and cross-correlation analysis to estimate the functional connectivity between auditory areas, field L and CLM, and HVC. We found that ≥18% of neurons in field L and 33% of neurons in CLM are functionally connected to HVC, most with auditory forebrain leading-HVC latencies ranging from 0.5 to 15 ms. These results indicate that field L and CLM communicate extensively with HVC through both direct and indirect anatomical connections. To further explore the role of the auditory forebrain cells that are functionally connected with HVC, we assessed their responsiveness and selectivity for a variety of natural and synthetic auditory stimuli. We found that field L and CLM neurons that are functionally connected to HVC exhibit generic auditory forebrain properties including the lack of BOS selectivity. This finding puts further constraints on the neural architecture and the nature of the nonlinearity that leads to BOS-selective auditory responses in the vocal control nuclei.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Songbirds learn to produce complex vocalizations in a manner that is reminiscent of some aspects of speech learning in humans. Song learning is a two-stage process in which a juvenile songbird listens to and memorizes the song of a tutor bird (sensory phase), begins practicing his own vocalizations, and then uses auditory feedback to adjust his copy of the tutor's song until he is able to produce a stable song that is similar but not identical to the tutor's song (sensory-motor phase; Marler 1981Go). The system of interconnected brain nuclei specialized for the learning and production of song has been well characterized over the past several decades (Brenowitz et al. 1997Go; Nottebohm et al. 1976Go). This "song system" is composed of two pathways: the motor pathway, essential for song production, and the anterior forebrain pathway, important for learning and maintaining song (Fig. 1). HVC, the song system structure, is situated at the junction of both pathways, functioning in both sensory and motor processing. Electrophysiological recording studies have found that in anesthetized, sedated, and sleeping birds, HVC neurons respond selectively to playback of the bird's own song (BOS) over other auditory stimuli (sedated: Cardin and Schmidt 2003Go; anesthetized: Margoliash 1983Go; sleeping: Nick and Konishi 2001Go). This selectivity emerges during vocal development (Doupe 1993Go; Solis and Diupe 1995Go; Volman 1993Go) and is found also in HVC's efferent targets (Doupe and Konishi 1991Go). In contrast, during wakefulness, the responses of HVC neurons to passive presentation of sounds diminishes, becomes more variable, and is less selective (Cardin and Schmidt 2003Go, 2004Go; Rauske et al. 2003Go). At the same time, in awake birds, HVC neurons exhibit robust premotor activity during active singing (McCasland and Konishi 1981Go; Nick and Konishi 2001Go; Rauske et al. 2003Go; Yu and Margoliash 1996Go).


Figure 1
View larger version (17K):
[in this window]
[in a new window]

 
FIG. 1. Simultaneous recording of field L and HVC activity. A: simplified schematic of the anatomy of the song and auditory systems. B: field L and HVC raster plots of spontaneous activity recorded simultaneously from multiunit clusters (2–5 units). Each horizontal line represents one trial. Boxes highlight instances in which an increase in field L activity closely preceded an increase in HVC activity. Such instances could lead to a positive coherency if they persist after calculating the cross-covariance and the coherency functions for this pair of cell clusters. C: detailed schematic of the auditory and song systems.

 
These findings suggest that HVC may play a key role in modulating the bird's motor activity based on auditory feedback during the sensory motor phase of song learning. To understand how this high degree of selectivity arises and what its role could be in shaping song learning behavior, recent studies have focused on areas presynaptic to HVC. Such studies have centered on two main lines of inquiry: determining the primary source of auditory input to HVC, and assessing the level of stimulus-specificity found in said auditory area.

The field L complex is a large auditory region afferent to HVC (Kelley and Nottebohm 1979Go). This avian analog of primary auditory cortex contains several subregions (L1, L2a, L2b, and L3) that can be distinguished based on cytoarchitecture and connectivity (see Fig. 1; Fortune and Margoliash 1992Go, 1995Go; Vates et al. 1996Go). Auditory nucleus ovoidalis in the thalamus projects to subregions L2a and L2b both of which project to subregions L1 and L3. L1 and L3 make bidirectional connections with two secondary auditory areas in the pallium: nidopallium caudal medial (NCM) and caudolateral mesopallium (CLM) (Vates et al. 1996Go). NCM projects to CLM by caudal medial mesopallium (CMM). Ascending auditory information then passes from CLM to HVC through nucleus interfacialis (NIf) (Vates et al. 1996Go). To date, no direct connections between field L and HVC have been observed, but fibers of passage in the HVC shelf region have made such observations extremely difficult (Vates et al. 1996Go). Tentative observations indicate that sparse connections may exist between field L and HVC shelf and HVC shelf and HVC proper, but confirmation is still pending (Fortune and Margoliash 1995Go; Gurney 1981Go; Katz and Gurney 1981Go; Vates et al. 1996Go). Physiological studies of field L have found that whereas units in this region are responsive to complex auditory stimuli, such as conspecific song (Con), field L neurons are not selective for the BOS over other conspecific song (Amin et al. 2004Go; Bonke et al. 1979Go; Grace et al. 2003Go; Leppelsack and Vogt 1976Go; Lewicki and Arthur 1996Go). The four subregions of field L show little difference in stimulus-specificity (Amin et al. 2004Go; Grace et al. 2003Go).

Simultaneous dual-electrode recording studies in NIf and HVC have found that NIf neurons respond to the BOS, the bird's own song played in reverse (Rev), and conspecific song (Con) stimuli, but show a stronger response to the BOS than to any other natural stimulus (Cardin and Schmidt 2004Go; Coleman and Mooney 2004Go; Janata and Margoliash 1999Go). This type of BOS selectivity is not as strong as that seen in HVC, where cells typically respond nearly exclusively to the BOS. Dual-electrode studies of NIf and HVC have also measured auditory neural activity in HVC before and after deactivation of NIf. The results show that deactivating NIf with {gamma}-aminobutyric acid (GABA) or muscimol greatly reduces or completely abolishes auditory activity in HVC (Cardin and Schmidt 2004Go; Coleman and Mooney 2004Go). Such results point to NIf as the primary source of auditory input to HVC.

Although NIf's contribution to the selectivity seen in HVC is becoming clear, some questions remain regarding the extent to which field L and CLM also contribute. In particular, since connectivity with HVC was not explored in previous electrophysiological studies of field L and CLM, it is possible that there exists a subset of cells within these areas that are connected with HVC and display a preference for the BOS. This hypothesis was proposed by Amin et al. (2004)Go and is consistent with the idea that such cells could represent the putative sparse connections between field L, HVC shelf, and HVC discussed earlier, or a class of cells that have not yet been characterized anatomically. The present study sought to investigate these possibilities. Our goal was twofold. First, we set out to estimate the degree of functional connectivity between the field L complex and HVC and CLM and HVC by measuring the cross-correlation between spike trains recorded simultaneously from pairs of cells or cell clusters in each of these areas. A pair of cells or cells clusters was considered functionally connected if there was a significant peak in the normalized cross-correlation (see METHODS). We eliminated the possibility of finding significant cross-correlations that resulted exclusively from stimulus-driven activity through a rigorous normalization procedure (see METHODS) and therefore we use the term functional connectivity to refer to a pair of cells or cell clusters that are likely to be anatomically connected through one or a small number of synapses. Second, we sought to evaluate stimulus selectivity for the BOS in field L and/or CLM cells or cell clusters that were functionally connected to HVC.


    METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Animal procedures

SUBJECTS. The Animal Care and Use Committee at University of California Berkeley approved all animal procedures. Adult male zebra finches (Taenopygia guttata) were raised in social/family contexts within a multifamily colony room at UC Berkeley. Song learning was assessed in a subset of the colony families and found to be normal (Amin et al. 2004Go). Adult males over 100 days of age were used for all experiments.

SURGERY. Two days before the acute physiological recording experiments, birds were anesthetized with intramuscular injections of 0.03–0.04 ml Equithesin (0.85 g of chloral hydrate, 0.21 g of pentobarbital, 0.42 g of MgSO4, 8.6 ml of propylene glycol, 2.2 ml of 100% ethanol, with a total volume of 20 ml H2O) and immobilized in a stereotax with ear bars and a beak holder. Small sections of the outer layer of skull were removed above the two areas we wished to explore in each experiment. In most experiments, the two areas that we targeted were HVC and field L/CLM and ink dots were made on the lower layer of skull at the precise coordinates for these regions (2.4 mm lateral from the dorsal bifurcation point of the midsagittal sinus for HVC and 1.2 mm lateral and 1.2 mm rostral from the dorsal bifurcation point of the midsagittal sinus for the field L complex and for CLM). For comparative purposes, we also targeted HVC and NIf in five separate experiments and in these instances, ink dots were placed at 1.7 mm lateral and 1 mm rostral from the dorsal bifurcation point of the midsagittal sinus for NIf, and at the HVC coordinates mentioned earlier. A metal post was then glued to the skull with dental cement. After completion of the surgical procedure and a period of monitored recovery, the bird was placed in its own recovery cage in the breeding colony until the experiment was to take place.

On the day of the experiment, the bird was anesthetized intramuscularly with three doses (25–35 µl) of 20% urethane administered at 0.5-h intervals. The bird was stabilized by affixing the metal head post to a stereotaxic device. The lower layer of skull and the dura were removed from the area surrounding the ink-marked locations. Tungsten extracellular electrodes of resistance 1–4 M{Omega} were lowered into each of the two exposed brain regions using a microdrive. The bird was then placed in a double-walled anechoic sound-attenuated chamber at a distance approximately 20 cm away from the speaker used to present all stimuli during recording sessions. The volume of the speaker was set to deliver zebra finch song at peak levels of 65 to 80 dB SPL (Ban dK Sound Level Meter, RMS weighting type B). Body temperature was monitored and adjusted to about 37° using a heating pad and a thermometer was placed under the bird's wing.

STIMULI. The stimulus ensemble consisted of both natural and synthetic sounds. The natural sounds were 1) the bird's own song (BOS), 2) the bird's own song played in reverse (Rev), 3) the bird's own song played in reverse syllable order (Revorder), and 4) conspecific song (Con). The synthetic stimuli were 1) random pure tones (Pips); 2) compound pure tones in which 20 samples from the random pure tones group were added together (Tones); and 3) broadband white noise (WN). All synthetic stimuli were exactly 2 s in length and all natural stimuli were chosen to be approximately 2 s in length. We used 20 different samples from each synthetic stimulus class. The overall peak power of the synthetic stimuli was matched to the overall peak power of the natural stimuli. Further details about the design of the synthetic stimuli can be found in Grace et al. (2003)Go.

The BOS was recorded and digitized using TDT2 hardware (Tucker-Davis Technologies, Alachua, FL) with a custom-made graphical interface several days before the experiment. Song recordings were obtained by placing the bird in a sound-attenuated chamber equipped with a microphone for several hours until many song renditions were collected. If the bird did not sing over a 24-h period, a female was then introduced into the cage with the male for anywhere from a few minutes to 1 h to encourage song production. If a bird still did not sing, he was excluded from the study. Thereafter, the various song renditions were played and viewed in spectrographic form. The song containing introductory notes from the two to three most frequently sung motifs, and lasting approximately 2 s, was chosen as the playback stimulus. Undirected song was used in the majority of subjects, but directed song was also used in some experiments. The conspecific songs consisted of a set of 20 unrelated undirected adult male zebra finch songs. It has been shown that such a sample is a good representation of the spectral and temporal patterns occurring in zebra finch song (Singh and Theunissen 2003Go).

Experimental protocol and extracellular recording

Two different experimental procedures were implemented in this study. In procedure A, we ran a search stimulus set followed by a cross-correlation stimulus set. In procedure B, we ran a combined search–cross-correlation stimulus set and followed this, in a small number of cases, with a selectivity protocol (see following text). We first describe the details of procedure A. In procedure A, the search stimulus set was used to probe for auditory units or unit clusters (two to five units) in each area. The cross-correlation stimulus set was played after the search protocol and was used to collect data for later cross-correlation analysis. All stimuli in this procedure were interleaved and randomly presented with an interstimulus interval of 7 to 8 s. Two seconds of spontaneous activity were recorded before each stimulus to establish a baseline firing rate for the neuron or neuron cluster. In addition, 2 to 3 s of spontaneous activity were recorded after the stimulus and 1 to 3 s (uniform random distribution) was added between stimuli. Ten to 20 trials of two to four of the following stimuli were used as search stimuli: the BOS, the Rev, Revorder, Con, and WN. If the neuron or neuron cluster in one of the two regions was found to show either an increase or a decrease in spiking activity to either the BOS or WN compared with its spontaneous activity as determined by an on-line t-test, then the cross-correlation stimulus set was presented. In certain cases, we opted to present the cross-correlation stimulus set even though the on-line t-test was not significant. Visual inspection of the responses in these instances suggested a change in the spike patterning even though the mean rate was the same as the spontaneous rate. The cross-correlation stimulus set consisted of 50 trials of the BOS and 50 trials of a 2-s-silence stimulus interleaved and randomly presented with an interstimulus interval of 7 to 8 s. The silence stimulus gave us the opportunity to collect additional spontaneous activity necessary for calculating the cross-correlation (see Data analysis). On completion of the cross-correlation stimulus set, the cross-correlation during stimulus presentation and the cross-correlation during spontaneous activity were analyzed separately, off-line.

After analyzing the data collected from the cross-correlation data sets in procedure A, we found that we had obtained more than enough spikes to reliably calculate cross-correlations at each recording site. Consequently, to maximize our sample size and to minimize adaptation to any one particular stimulus in HVC, we changed the experimental protocol as follows. We consolidated the search stimulus set and the cross-correlation stimulus set, changed the number of trials, and increased the duration of the interstimulus interval. These changes allowed us to run experiments more efficiently and to maximize the number of field L–HVC and CLM–HVC paired sites recorded during each individual experiment. It also increased our yield of cells in HVC that were selective for the BOS. In this procedure (procedure B), 15 trials of the BOS, Rev, and WN were interleaved and randomly presented with an interstimulus interval of 8 to 9 s. Two seconds of spontaneous activity were recorded before each stimulus, 4 to 5 s of silence were tagged on after the stimulus, and an additional 1 to 3 s (uniform random distribution) was added between stimuli. This interstimulus interval change served two main functions. We have observed and it has been reported elsewhere (Margoliash et al. 1994Go; Sutter and Margoliash 1994Go) that HVC neurons tend to integrate over long periods of time; therefore increasing the interstimulus interval provided time for each HVC cell or cell cluster to complete its response to one stimulus before being presented with another. Additionally, this interval change permitted us to eliminate the use of silence as a stimulus because we were able to obtain enough spikes to calculate the cross-correlation using the spontaneous activity obtained during the 2 s preceding stimulus presentation. On completion of the 15 trials, we assessed the cells for auditory activity using an on-line t-test. If one of the units or unit clusters in each of the two areas was considered auditory, then the data were analyzed on-line for cross-correlation.

For a subset of the paired sites, we played another set of stimuli—the selectivity protocol—following the search–cross-correlation stimulus sets, to assess the selectivity of the auditory forebrain cells. In this stimulus set, 10 trials of the following were played: the BOS; the Rev; Revorder; three examples each of Con, Pips, and Tones; and two examples of WN. As in procedure A, these stimuli were interleaved and randomly presented with an interstimulus interval of 7 to 8 s. We opted to use a different interstimulus interval for the search–cross-correlation stimulus set and the selectivity stimulus set because each stimulus set focused on a different region. In particular, the search–cross-correlation stimulus set was used mainly to target HVC cells, whereas the selectivity set was used to target the auditory forebrain areas field L and CLM. As a result of their shorter integration times, auditory forebrain neurons can be presented with stimuli more rapidly than HVC neurons. This reduces the amount of time spent at each recording site and increases the data yield per experiment.

Window discriminators were used to obtain the arrival times for spikes recorded simultaneously from each of the two electrodes. Using a digital oscilloscope with memory and average functions, we estimated, based on visual inspection of spike shapes, whether our recordings were from single units, small multiunit clusters (two to five units), or an unclassifiable number of units in each area. Neural activity was systematically sampled by moving the electrode through 50-micron (in HVC or NIf) or 100-micron (in field L or CLM) interval depths until one or a small cluster of units was isolated in each area. We collected cross-correlation data from a single unit or a multiunit cluster in field L or CLM only once before moving to the next unit or set of units 100 microns away. However, for HVC, we often collected data from a single unit or a multiunit cluster for several hours at a time while we continued to move the other electrode through CLM or field L. We adopted this strategy for practical reasons because, given the relatively small depth of HVC, a systematic search in that nucleus would require a series of electrode penetrations that would quickly lead to tissue damage. Our estimates of connectivity are therefore estimates of connectivity between units in the auditory system and one particular region in HVC. Given the range and complexity of responses found in HVC (Leonardo and Fee 2005Go; Mooney et al. 2002Go), the actual percentage of functionally connected neurons between the two areas will therefore be higher than what can be assessed with this approach. On the other hand, extracellular recordings in HVC in anesthetized birds have also revealed a high degree of interconnectivity (Margoliash et al. 1994Go; Sutter and Margoliash 1994Go), justifying both the feasibility of the approach and the validity of the lower-bound estimate. Typically between one and six electrode penetrations in each area were made per recording day. At the end of each recording pass, two electrolytic lesions (100 µA for 5 s) spaced 200–400 microns apart and placed well beyond the last recording site were made to aid in the later reconstruction of the recording sites in each area.

Histology and anatomical reconstructions

At the end of each recording experiment, the bird was killed with 0.06 ml Equithesin and then transcardially perfused with 0.9% saline followed by 3.7% formalin in 0.25 M phosphate buffer. After perfusion, the brain was postfixed in 3.7% formalin overnight or for several days. Forty- to 50-micron parasagittal sections were prepared using a freezing microtome. Alternate sections were stained with cresyl violet and silver stain to aid in the visualization process. The borders of the relevant regions of the auditory forebrain and HVC, electrode tracks, and lesion sites in each area were viewed at 10x magnification through a dual-frequency interferometric confocal microscope (DICM) and drawn using a drawing tube (courtesy of J. Winer, University of California, Berkeley). Unless stated otherwise, all recordings reported here were determined to have taken place in HVC and either field L, CLM, or NIf on inspection of the histology.

Data analysis

ANALYSIS FOR RESPONSIVENESS, EXCITATION AND INHIBITION, AND SELECTIVITY. The analysis used for determining the responsiveness of a particular recording site and classifying it as either stimulus-excited or stimulus-inhibited has been described in a previous study (Amin et al. 2004Go). Briefly, recording sites in each area were considered responsive to auditory stimuli if the firing rate to the BOS or WN was significantly different from the spontaneous firing rate (P < 0.05, two-tailed paired t-test). To classify each site as stimulus excited or stimulus inhibited, we then calculated the Z scores for all responsive sites. The Z score represents the normalized difference between the stimulus-driven mean firing rate and the baseline spontaneous firing rate collected in the 2 s preceding stimulus presentation. We averaged all responses to a particular stimulus class (e.g., all three exemplars of Pips) to calculate the Z score. If a recording site's Z score to the BOS was >0, then the site was considered stimulus excited. Conversely, if a recording site's Z score to the BOS was <0, then the site was considered stimulus inhibited.

Finally, the psychophysical d' measure was used to quantify the selectivity of each recording site for one stimulus class over the other. This measure has been used previously to quantify neural selectivity in the avian brain (Amin et al. 2004Go; Janata and Margoliash 1999Go; Solis and Doupe 1997Go; Theunissen and Doupe 1998Go). The d' measure for preference between two stimuli, A and B, is calculated as follows

Formula 1(1)
where µA and µB are the mean responses to stimulus A and stimulus B, respectively, and {sigma}2 is the variance of the response. A d' value was calculated for all pairwise comparisons before averaging a unit's response to one particular stimulus comparison. For example, we obtained responses to one exemplar of the BOS and three exemplars of Con, yielding three d' values for the BOS–Con comparison. To obtain one final d' value for all BOS–Con comparisons, we averaged the three d' values originally calculated for that particular unit. Recording sites in each area were considered responsive to auditory stimuli if the firing rate to the BOS or WN was significantly different from the spontaneous firing rate (P < 0.05, two-tailed paired t-test). A group of cells in our study was considered selective for the BOS if the average d' to the BOS was significantly >0 (P < 0.05, one-tailed paired t-test). A criterion of d' >0.5 has sometimes been used to classify single units as selective (Solis and Doupe 1997Go). However, when a mean d' is calculated for a small number of single units that are considered selective on their own, it is possible that the statistics will show that the means d' of the group is not significantly different from zero. The statistics reported here are with respect to the subset of cells that are functionally connected with HVC and not the individual units within the group.

CROSS-CORRELATION ANALYSIS. Data collected from the presentation of each cross-correlation stimulus set were analyzed for synchronized activity by calculating the coherency (Rosenberg et al. 1989Go), which is based on the cross-covariance function (Aertsen et al. 1989Go; Perkel et al. 1967Go), normalized by the product of the autocovariance. The cross-correlation of a spike train rB(t) relative to a second spike train rA(t) as a function of {tau} [time delay relative to spikes in rA(t); we examined {tau} values of ≤100 ms] is given by

Formula 2(2)
where T is the duration of the signal being analyzed and < > indicates that the measure is averaged across all trials. A schematic representation of this calculation can be seen in Fig. 1B.

The cross-covariance corrects for mean firing rates in each neuron, effectively measuring how deviations in firing rate from the expected mean in one recording site are correlated with deviations in firing rate from the expected mean in another recording site. The cross-covariance between neurons A and B is given by

Formula 3(3)
where Formula 3A(t) and Formula 3B(t) are the time-varying mean firing rates of the neurons. Both the cross-correlation and the cross-covariance are in units of (spikes/s)2, and their absolute values depend on the firing rates of each cell (in the case of the cross-covariance, the mean firing rates). To obtain a normalized measure, the cross-covariance (or the cross-correlation) can be divided by the variance in the firing rates of each cell, effectively obtaining a cross-correlation coefficient measure. The cross-correlation coefficient is given by

Formula 4(4)
where

Formula 4
and similarly for {sigma}B2. This cross-correlation coefficient represents the probability of firing in one cell (the "target" neuron) relative to the firing in the "reference" cell, and varies between –1 and 1, with 1 reflecting perfect correlation and –1, anticorrelation. A cross-correlation coefficient of zero indicates zero linear correlation between the two trains of spikes.

When using cross-correlations to assess functional connectivity, it is critical to correct for correlated firing that results simply from direct stimulus effects causing correlated fluctuations in time-varying mean firing rates (i.e., neurons in two entirely unconnected brain areas might show a correlation if they both fired to BOS). The cross-covariance corrects for these fluctuations because it measures only how trial-to-trial deviations from the time-varying mean rates of each cell are correlated with each other. The cross-covariance can be estimated by calculating the shuffle-corrected cross-correlogram. We calculated the shuffle corrector by correlating the response from A during the ith trial (of N total trials) with the response from B during the i + 1 trial. For i = N, i + 1 is set to be 1. We also calculated the average of all permutations of the shuffled corrector and found that the resulting distribution of coherency peaks quantified by their time delays, widths, and average strengths was very similar to what we observed when we used only one shuffle permutation. We therefore used the single permutation of shuffle corrector for the data here. This shuffle corrector is an estimate of how the mean time-varying rate in neuron A covaries with the mean time-varying rate in neuron B, across trials. In other words, it estimates the second term on the right side of Eq. 3

Formula 4

In practice, the integrals in Eqs. 2 and 3 are estimated by summing over small time bin windows, dt. In our study we used dt values of 10 and 5 ms. The results observed for dt values of 10 ms were similar to those seen for dt values of 5 ms; thus we report only the results for dt values of 10 ms. Smaller dt values yield cross-covariance curves with higher resolution but require more data. Given the bin window dt, the number of trials N, and Tn, the length of the signal in integer units of dt, the shuffle-corrected cross-correlogram is

Formula 5(5)
where rAi(j) is the number of spikes recorded from neuron A during trial i in the j th time bin and, similarly, rBi(j + k) in the (j + k) th time bin for neuron B. The shuffle-corrected cross-correlogram can then be normalized by the variance of spike firing rates as shown in Eq. 4, to provide a measure between –1 and 1.

Another possible source of cross-covariance between two neurons that does not reflect true neuronal interaction between these cells is the temporal structure of firing within each response. For instance, assume a spike in neuron A triggers a spike in neuron B; however, neuron A is a bursting neuron and has a high probability of firing again after it has fired once. Thus the second spike in A's burst will also be correlated to the spike in B, although it was actually triggered by the first spike in A. To correct for this type of correlation, we calculated the coherency function (Rosenberg et al. 1989Go). The coherency function extends the normalization by replacing the variance in the denominator of Eq. 4 by the autocovariance function of each of the two spike trains. This additional normalization takes into account bursting or other temporally structured behavior in either neuron A or B (or both) that would otherwise result in additional, or artificially large and wide peaks in the cross-covariance function. In practice the coherency is calculated in the frequency domain. The coherency is given by

Formula 6(6)
where CAB({omega}) is the Fourier transform of the cross-covariance between the responses from A and B, and CAA({omega}) and CBB({omega}) are the Fourier transform of the autocovariance of activity from neurons A and B, respectively. For plotting purposes, the coherency in the time domain is then calculated by taking the inverse Fourier transform of Eq. 6.

STRENGTH OF CORRELATED ACTIVITY. The peak amplitude or the area underneath the peak of the cross-correlation function is often used to estimate the strength of the correlation (Abeles et al. 1993Go; Bair et al. 2001Go; Brecht et al. 1998Go). However, a better estimate of degree of association is to calculate the average strength across all time delays within the peak. Because correlations at different time delays are not independent in the time domain, this is a complicated calculation in the time domain but it is relatively simple in the frequency domain. To calculate this average for the coherency, one takes the root mean square of the average coherency square in the frequency domain for frequencies below the Nyquist limit given by dt (the time bin window). From Parseval's theorem, however, the mean square of the coherency can also be obtained in the time domain by integrating the square of the coherency over the time bins. To estimate the mean square coherency for each peak, the area under the square of the coherency for that peak was divided by the time bin dt. The area under the coherency squared was estimated from the amplitude square of the peak multiplied by 2.5 times the width of the peak (the factor 2.5 is required to estimate the area under a Gaussian curve). Thus the average coherency strength represented by a peak is

Formula 6
The average coherency strength as a measure of the association between two time series is essentially equivalent to the correlation coefficient between two variables and indicates the degree of linear relationship between the variability of two firing rates. Like correlation coefficients, this measure is unitless. It should be noted that, in general, measures of correlation strength are strongly dependent on the size of the time bin, and this must be taken into account when comparing such values across different studies. The coherency for each pair was calculated across trials separately for spontaneous activity and stimulus-evoked activity during the cross-correlation stimulus set.

Finally, for all the cross-correlation measures, the sampling error was estimated using the jackknife resampling technique (Thomson and Chave 1991Go). In brief, for experimental data based on N trials, one estimates N values of the cross-correlation measures each based on N – 1 trials. The variance in the estimate is then obtained using Tukey's formula

Formula 6
where Formula 6i is the estimate of the cross-correlation measure with the i th trial deleted, Formula 6All is the estimate obtained with all the trials, and

Formula 6
Paired sites were considered to be significantly correlated if peaks in the cross-coherency exceed 3SE.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Our goal in this study was to investigate the possibility that a subset of neurons in two regions of the auditory forebrain, the field L complex and caudolateral mesopallium (CLM), is functionally connected to HVC and that the cells in this subset have different vocalization response properties than other neurons in field L and CLM. In particular, we hypothesized that functionally connected auditory forebrain cells would show a preference for the BOS over Con. Previous electrophysiological studies of field L have shown that although neurons in this region are responsive to complex auditory stimuli including Con, they do not respond preferentially to the BOS over Con (Amin et al. 2004Go; Bonke et al. 1979Go; Grace et al. 2003Go; Lewicki and Arthur 1996Go; Muller and Leppelsack 1985Go). We estimated the degree of functional connectivity between HVC and the field L complex or CLM, by measuring the coherency between spike trains recorded simultaneously from HVC and from one of two auditory forebrain areas, field L or CLM (Fig. 1). Based on the coherency peaks, we evaluated the directionality and strength of connections between the two areas. Although the latencies of the coherency peaks do allow for assessment of the directionality of connectivity between the paired sites in our study, they do not allow for conclusive interpretations regarding direct or indirect anatomical connectivity. For this reason, apart from directionality, we limit our discussion of the timing information provided by our analysis to a comparison between short and long latencies and a speculation about how these differences might relate to anatomical connectivity. For a subset of the paired sites that were found to be functionally connected, we also assessed the BOS selectivity of auditory forebrain cells.

Field L and HVC and CLM and HVC show functional connectivity during spontaneous activity

We recorded simultaneously from 338 paired sites of cells or cell clusters in HVC and a putative field L or CLM area in 45 anesthetized male zebra finches. To assess functional connectivity, we calculated first, the cross-covariance, and then the coherency of HVC and field L/CLM activity (see METHODS). We calculated the cross-covariance and the coherency during both spontaneous and stimulus-evoked activity in all paired sites of cells or cell clusters. For simplicity, throughout the paper, we will use the terms "functional connectivity," "correlation," "correlated," and "functionally connected" to describe the results of our coherency analysis. Unless specified otherwise, it should be assumed that we are using these terms to refer only to the coherency peaks and not to the raw cross-correlation or nonnormalized cross-covariance peaks. As will be further elaborated in the DISCUSSION, we consider functionally connected cells to be cells that are probably connected through one or a small number of synapses.

Fifty-nine of the 338 paired sites (17.45%) were found to be significantly correlated during spontaneous activity. Histological reconstruction confirmed that 54% of the correlated paired sites (32/59) were field L–HVC paired sites, 32% (19/59) were CLM–HVC paired sites, and 2% (1/59) were NIf–HVC paired sites (see Table 1). The remaining 12% (7/59) were HVC–unknown area paired sites and were excluded from further analysis. We recorded from a total of 181 field L–HVC paired sites, 57 CLM–HVC paired sites, and 2 NIf–HVC paired sites. Seven paired sites were found to be on the border of CLM and L1 (CLM/L1–HVC paired sites), one was on the border of NIf and L2a (NIf/L2a–HVC paired sites), and the remaining 89 paired sites (HVC–unknown area paired sites) could not be fully characterized as a result of the loss of the brain sections belonging to several subjects in which histological reconstruction had not yet been completed.1 Due to the small number of NIf–HVC paired sites, and paired sites where one of the cells or cell clusters was on the border of two different regions, CLM/L1–HVC and NIf/L2a–HVC, we chose to exclude these paired sites from our analyses and instead focus on the CLM–HVC and field L–HVC paired sites we found.


View this table:
[in this window]
[in a new window]

 
TABLE 1. Spontaneous activity correlations separated by unit type

 
Typical data examples of field L–HVC and CLM–HVC coherency functions can be seen in Fig. 2. In these sites (Fig. 2, A and B), we found significant coherency peaks during spontaneous activity but not during stimulus-evoked activity. This effect was typical of our data set (46 or 78%). Significant spontaneous activity coherency peaks were well fit by a Gaussian function (mean r2 = 0.87 ± 0.13, n = 59). Figure 2C shows an instance in which we found a significant coherency peak during both spontaneous and pooled-stimulus–evoked activity. This result will be subsequently discussed in more detail.


Figure 2
View larger version (15K):
[in this window]
[in a new window]

 
FIG. 2. Coherency functions for 3 pairs of unit clusters calculated during spontaneous and pooled-stimulus–evoked activity. A: coherency calculated for a field L–HVC pair of cell clusters. A significant peak is seen only during spontaneous activity. B: coherency calculated for a caudolateral mesopallium (CLM)–HVC pair of cell clusters. A significant peak is seen only during the spontaneous activity. C: coherency calculated for a CLM–HVC pair of cell clusters. Significant peaks are seen during spontaneous and pooled-stimulus–evoked activity. y-axis is the same for each spontaneous activity and evoked activity pair of plots. The solid line represents the coherency and the dotted line reflects the 3x jackknife SD estimate.

 
The correlation strengths of field L–HVC and CLM–HVC paired sites did not differ significantly (Wilcoxon signed-rank test, P = 0.61). Strengths ranged from 0.025 to 0.447 for field L–HVC paired sites and from 0.029 to 0.376 for CLM–HVC paired sites, with most strength values clustering around the lower end of the range (Fig. 3). The correlation latencies of field L–HVC and CLM–HVC paired sites had a very wide range (–22.98 to 29.60 ms in field L–HVC paired sites and –20.04 to 14.65 ms in CLM–HVC paired sites), but most of the peaks fell within a few milliseconds of zero. Peaks displaced positively in time occurred 76% of the time. Positive time delays are consistent with the anatomical evidence, suggesting that field L and CLM project either directly or indirectly to HVC (Vates et al. 1996Go). In 4 of 51 cases, coherency peaks with long negative latencies (greater than –5 ms) were observed in both field L–HVC and CLM–HVC paired sites. Peaks with these features are consistent with longer feedback loops through the outer areas of HVC, RA, and nucleus ovoidalis (Mello et al. 1998Go; Vates et al. 1996Go).


Figure 3
View larger version (16K):
[in this window]
[in a new window]

 
FIG. 3. Distribution of coherency strengths and latencies for all field L–HVC and CLM–HVC paired sites with significant correlations during spontaneous activity. A: histogram of the strengths for all paired sites with a significant coherency. B: histogram of latencies for all paired sites with a significant coherency. Positive latencies suggest an increase in HVC activity after an increase in activity in either field L or CLM.

 
The effect of unit type on correlated activity in field L and CLM

We recorded from both single units and multiunit clusters in the auditory forebrain and HVC (see Table 1 for a detailed breakdown). In HVC, most of the recordings (263/338 or 78%) consisted of multiunit data. In the auditory forebrain 187/338 (55%) were multiunit, 101/338 (30%) were single units, and 50/338 (15%) were unclassified. Units were considered unclassified if visual inspection of spike shapes using window discriminators did not allow for the experimenter to clearly discriminate between one or more spike waveforms.

The nature (single unit vs. multiunit) of the recording has the potential to affect the results as a consequence of the correlation among the units in the cluster recorded in the multiunit data. In the case of a single direct connection between the units recorded from the two sites, the correlation signal is attenuated by multiunit recordings and this attenuation is greater if the local units are independent (Gerstein 2000Go). On the other hand, if the units in the cluster are not independent and if multiple direct or indirect connections between units in the two recording sites exist, then the correlation signal relative to the noise can add up and facilitate detection. In the first scenario, multiunit recordings would lead to an underestimate of functional connectivity, whereas in the second scenario it would facilitate the detection of true positives (or reduce the chance of a Type II error). Our data suggest that we might be dealing with the second scenario because the percentage of connected neurons is higher for the multiunit recordings than for the single-unit recordings: for the 253 paired recordings where we had multiunit activity in HVC, 8/79 (10%) recordings showed a significant correlation when single-unit activity was obtained in the forebrain and 34/174 (20%) showed a significant correlation when multiunit activity was obtained in the forebrain. A chi-square test for independence indicates that this difference is significant at the 5% level ({chi}2 = 3.84, df = 1, P = 0.05). An estimate of functional connectivity based solely on single-unit recordings might therefore be an underestimate of the actual number of functional connections because of Type II errors. For the cases in which we detected significant cross-correlations, we found no significant differences between either the strengths or the latencies of field L–HVC [strengths: F(3,27) = 0.835, P = 0.486; latencies: F(3,27) = 0.915, P = 0.447], or CLM–HVC sites [strengths: F(2,14) = 1.05, P = 0.377; latencies: F(2,14) = 1.05, P = 0.576] whether recordings were from single–single, multi–single, multi–multi, or unclassified–unclassified recordings.

The effect of stimulus excitation and inhibition on correlated activity in field L and CLM

At each recording site, a search or a search–cross-correlation stimulus set was played to assess the responsiveness of each cell or cell cluster in the pair. Because we were interested in assessing 1) the overall amount of functional connectivity between HVC and auditory forebrain areas field L and CLM, irrespective of stimulus preference, and 2) the BOS selectivity of auditory forebrain cells or cell clusters that are functionally connected with HVC cells or cell clusters, we did not limit our recordings to cells or cell clusters that were responsive to our particular stimulus set. As a result, many recordings were from putative auditory forebrain–HVC stimulus-excited–unresponsive paired sites (EN, 44; see Table 2). The greatest number of paired sites involved stimulus-excited units in both the putative auditory forebrain areas field L or CLM and HVC (EE, 155). Histological analysis confirmed that there were 92 field L–HVC EE paired sites, 44 field L–HVC EN paired sites, 30 CLM–HVC EE paired sites, and 12 CLM–HVC EN paired sites. Seventeen percent (16/92) of the field L–HVC EE paired sites and 27% (12/44) of the EN paired sites showed correlated activity. Higher percentages of correlations were found in CLM–HVC paired sites, with 37% (11/30) of EE paired sites and 42% (5/12) of EN paired sites showing significant correlations. There were no significant differences between the correlation strengths found in EE paired sites and EN paired sites in either field L–HVC [t(26) = 0.210, P = 0.836] or CLM–HVC correlations [t(14) = 0.565, P = 0.581]. There was also no difference between the latencies of EE or EN paired sites for field L–HVC [t(26) = –0.5315, P = 0.600] or CLM–HVC cases [t(14) = –1.4292, P = 0.1749].


View this table:
[in this window]
[in a new window]

 
TABLE 2. Spontaneous activity correlations separated by responsiveness, stimulus excitation, and stimulus inhibition

 
Differences in correlations across subregions of the field L complex

Field L is a large auditory region with several distinct subregions (L1, L2a, L2b, and L3). To fully characterize field L–HVC functional connectivity, we sampled each of the individual subregions of field L over the course of our recordings. The highest percentage of significant correlations during spontaneous activity was in L1 with 13 of 49 (27%) L1–HVC paired sites showing functional connectivity. L1–HVC paired sites had latencies that were fairly evenly spread from 0.4 to 9 ms with a median value of 3.7 ms and one outlier on each end of the distribution (–2.2 and 10.6; see Fig. 4). These L1–HVC paired sites also had a median coherency strength of 0.12 with a distribution skewed toward much higher values. These strength values were generally greater than those seen in field L–HVC paired sites localized to other subregions of field L.


Figure 4
View larger version (10K):
[in this window]
[in a new window]

 
FIG. 4. Distribution of coherency strengths and latencies across field L subregions and CLM for paired sites with significant correlations during spontaneous activity. A: coherency strength box and whisker plot. B: coherency latency box and whisker plot. Length of each box represents the 25th to the 75th percentiles. Median is represented by a horizontal line on each box. Lines at the top and bottom of each whisker are the highest and lowest values in each group that are not extreme. + represents an outlier.

 
We recorded from 61 L2b–HVC paired sites and found only 9 (15%) significant correlations. L2b–HVC paired sites had correlation latencies that completely overlapped with the L1–HVC latency distribution (median = 1.4 ms, range = –1.7 to 5.6 ms), although the distribution was less evenly spread. The strengths of these correlations ranged from 0.03 to 0.29 (median = 0.06) and were generally lower than those seen in L1–HVC paired sites.

Four of 37 (11%) L3–HVC paired recordings demonstrated functional connectivity. These L3–HVC paired sites generally showed longer positive latencies than those seen in any of the other field L–HVC paired sites (range = 5.6–29.6; median = 8.36). The strengths of these L3–HVC paired sites were typical of most of the field L–HVC paired sites we found (excluding L1–HVC paired sites), ranging from 0.06 to 0.14 with a median of 0.10. The longer latencies seen in these L3–HVC paired sites suggest a multistage route of information travel from field L to HVC.

Next to L1, L2a had the highest percentage of sites showing significant correlations (3 of 18, or 17%). The strengths of all three correlations were typical of those found in field L subregions L2b and L3 (range = 0.04–0.16). Two of the three L2a correlated sites had long negative latencies (–16.88 and –22.98 ms), whereas the third site had a latency fairly close to zero (0.013). Thus only 1/18 (~5%) of L2a cells could be considered as providing input to HVC. The long negative latencies were unexpected and could represent a feedback route that travels from HVC through the outer regions of HVC (shelf), RA (cup), and Ov (shell) before reaching field L (L1, L2b, or L3) and then, finally, L2a (see Fig. 1).

Our histological review revealed a small number of field L–HVC paired sites on the border of L1 and L2a and on the border of L2a and L3. These cases showed that one of three (33%) L1/L2a–HVC paired sites and two of five (40%) L2a/L3–HVC paired sites were significantly correlated. Because these border field L–HVC paired sites could not be localized to any particular subregion of field L, they were excluded from statistical analysis of field L subregions. However, because these cells were certainly in the field L complex, they were included in statistical analysis that considered the entirety of the field L complex.

Based on putative anatomical connections between L1 and HVC, and L3 and HVC (Fortune and Margoliash 1995Go; Vates et al. 1996Go), we expected to find more paired sites with correlations in these areas than in any of the other subareas of field L. Despite a trend toward a greater number of correlations in L1–HVC paired sites, statistical analysis indicated that there was no significant difference in the number of correlations across field L subregions [{chi}2 (3, n = 165) = 4.23, P > 0.05 when all three significant correlations in L2a are taken into account; {chi}2 (3, n = 165) = 6.19, P > 0.05 when only the positive delay correlation in L2a is counted].

Field L–HVC and CLM–HVC functional connectivity changes between spontaneous and stimulus-evoked firing

Of the 32 field L–HVC paired sites with significant correlations during spontaneous activity, only 9 were significantly correlated during stimulus-evoked activity when the data were pooled across stimuli in each cross-correlation set. Likewise, of the 19 CLM–HVC paired sites with significant correlations during spontaneous activity, only 4 of them showed a significant correlation during pooled-stimulus–evoked activity. There were no instances in which a paired site showed a significant correlation peak during pooled-stimulus–evoked activity and not during spontaneous activity. In general, significant spontaneous activity coherency peaks and stimulus-evoked activity coherency peaks closely resembled each other with no significant differences between the strengths or the latencies of the coherency functions in the two categories for field L–HVC [strengths: t(39) = 0.0241, P = 0.9809; latencies: t(39) = –0.3896, P = 0.6989] or CLM–HVC paired sites [strengths: t(21) = –0.2486, P = 0.8061; latencies: t(21) = –1.0230, P = 0.3180].

Figure 2C shows one case in which a CLM–HVC paired site was significantly correlated during both spontaneous and pooled-stimulus–evoked activity. Paired sites that showed a significant correlation during both spontaneous and stimulus-evoked activity exhibited stronger peaks in their spontaneous activity coherency functions than those that showed a significant correlation only during spontaneous activity (Wilcoxon signed-rank test, P = 1.98 x 10–4). There was no significant difference between the latencies (Wilcoxon signed-rank test, P = 0.56) or the spreads of the spontaneous activity coherency functions in each of the two groups (Wilcoxon signed-rank test, P = 0.95). Paired sites that demonstrated functional connectivity during both spontaneous activity and stimulus-evoked activity had peaks in their spontaneous activity coherency functions that were better fit by Gaussian functions than those that showed functional connectivity during spontaneous activity alone (Wilcoxon signed-rank test, P = 0.0012).

To test for the effect of stimulus type on functional connectivity, correlations were calculated by stimulus type for all paired sites presented with more than one stimulus type that were found to be significantly correlated during pooled-stimulus–evoked activity. We found that correlations that were significant during pooled-stimulus–evoked activity were not necessarily significant during playback of each individual stimulus. Three of the six field L–HVC paired sites and two of the four CLM–HVC paired sites that showed a significant correlation during pooled-stimulus–evoked activity also demonstrated a significant correlation during playback of the BOS. One of the six field L–HVC paired sites did not display a significant correlation during the BOS, but displayed one during playback of the Rev. None of the paired sites that were significantly correlated during pooled-stimulus–evoked activity demonstrated functional connectivity during playback of white noise. Moreover, only one field L–HVC pair revealed a significant coherency peak during playback of the BOS and the Rev stimulus.

Although not all paired sites that were significantly correlated during pooled-stimulus–evoked activity had coherency peaks that reached significance during playback of individual stimuli, many of them still contained peaks from which the latencies and strengths of the correlations could be measured. The average difference between the coherency strength of the correlation found during each individual stimulus (the BOS, the Rev, and WN) and the pooled-stimulus–evoked activity for eight paired sites that were played the same stimuli across 15 trials is shown in Fig. 5. The average BOS–pooled stimulus coherency strength difference was significantly different from the average Rev–pooled stimulus coherency strength difference and the average WN–pooled stimulus coherency strength difference [repeated-measures ANOVA: F(2,7) = 6.40, P = 0.011]. This suggests that for the small number of paired sites that maintain functional connectivity while processing auditory stimuli, this connectivity is stronger during processing of the BOS than during the processing of each of the other stimuli.


Figure 5
View larger version (17K):
[in this window]
[in a new window]

 
FIG. 5. Average differences in coherency strengths for field L–HVC and CLM–HVC paired sites with significant correlations during spontaneous and pooled-stimulus–evoked activity. Differences between the coherency strengths during playback of the bird's own song (BOS), the bird's own song played in reverse (Rev), and white noise (WN) and pooled-stimulus–evoked activity were calculated and averaged for 8 field L/CLM–HVC pairs that were significantly correlated during spontaneous and pooled-stimulus–evoked activity.

 
Assessing stimulus selectivity in HVC and field L

To assess the responsiveness of each particular cell or cell cluster in a pair, we played a search or a search–cross-correlation stimulus set at each recording site. We classified a cell as either responsive or unresponsive based on the difference between its response to either the bird's own song or white noise and its spontaneous activity (see METHODS). In 90% of the histologically identified auditory forebrain–HVC paired sites (222/248), at least one member of the pair was classified as responsive (see Table 2). The remaining 10% were classified as unresponsive in both regions and were excluded from further analysis. Of the 222 responsive paired sites, 99 were responsive in both the auditory forebrain and HVC, 89 were responsive in the auditory forebrain and unresponsive in HVC, and 44 were responsive in HVC and unresponsive in the auditory forebrain. We attribute this unusually high number of unresponsive HVC units to the fact that in procedure A, our interstimulus interval was not optimized for HVC's temporal integration time (see METHODS). Once we modified our protocol to improve stimulus timing, we saw a decrease in the number of unresponsive HVC cells; 78 of 157 or about 50% of the HVC cells that were part of procedure A were classified as responsive, whereas 165 of 181 or 91% of the HVC cells that were part of procedure B were classified as responsive. In general, responsiveness was usually indicative of excitation to the BOS. In only 17 cases did the auditory forebrain unit or unit cluster show stimulus inhibition and in only 7 instances did the HVC cell show stimulus inhibition. For this reason, our discussion will focus on the selectivity analysis of stimulus-excited cells.

A particularly robust example of an EE pair of simultaneously recorded auditory forebrain–HVC cell clusters is shown in Fig. 6. Qualitatively speaking, the auditory forebrain cell cluster responds to several different natural stimuli with very consistent trial-to-trial spiking. This particular unit cluster was localized to CLM, but similar responses were observed in the various subregions of field L. In contrast, by visual inspection, one notes that HVC cells respond vigorously only to the bird's own song and show greater trial-to-trial variability in spike timing. As mentioned earlier, auditory forebrain–HVC EE and EN paired sites accounted for the majority (44) of significant correlations found during spontaneous activity. More specifically, 16 field L–HVC and 11 CLM–HVC EE paired sites were significantly correlated during spontaneous activity, whereas 12 field L–HVC and 5 CLM–HVC EN paired sites also showed a significant correlation (see Table 2). The remaining correlations were distributed among the three categories of responsiveness (stimulus-excited, stimulus-inhibited, and unresponsive) and were excluded from further analysis.


Figure 6
View larger version (19K):
[in this window]
[in a new window]

 
FIG. 6. Responses for a pair of unit clusters recorded simultaneously from HVC and CLM during playback of the BOS, the Rev, and conspecific song (Con). Top panel for each stimulus shows the spike raster with each horizontal line representing the response for one trial of the stimulus set. Middle graph: peristimulus time histogram. Bottom: sound waveform for each stimulus.

 
Stimulus selectivity in HVC

We used the search and search–cross-correlation stimulus sets to identify BOS-selective cells in HVC. After the completion of each experiment, stimulus-excited HVC cells were analyzed for stimulus selectivity by calculating the d' of pairwise comparisons between the BOS and the Rev, and the BOS and WN. The d' measures the normalized difference between responses to two stimuli in pairwise comparisons. Twenty-seven HVC cells or cell clusters were part of a significantly correlated (during spontaneous activity) pair that was played the BOS, Rev, and WN during the search or search–cross-correlation stimulus set.

We calculated the mean d' for the BOS–Rev comparison in these 27 HVC units and found a d' significantly greater than zero (d' = 1.68 ± 0.31, P < 0.05). We calculated the mean d' for the BOS–WN comparison in these 27 cells as well as a cell presented with BOS and WN, but not Rev, and similarly found a significantly positive mean d' (BOS–WN d' = 1.96 ± 0.36, P < 0.05), indicating that, on average, an HVC cell or cell cluster preferred the bird's own song over either the Rev or WN (Fig. 7). This BOS selectivity is similar to that reported in previous studies (for a summary, see Theunissen et al. 2004Go). There was no difference between the BOS selectivity found in HVC cells or cell clusters with significant auditory forebrain correlations and those in HVC cells or cell clusters that had the same stimuli played to them and were not significantly correlated during spontaneous activity [t(95) = 1.0014, P = 0.3192].


Figure 7
View larger version (15K):
[in this window]
[in a new window]

 
FIG. 7. Mean d' values were calculated in HVC to quantify the difference between the BOS and the Rev and the BOS and WN and to compare the selectivity in units shown to be members of a stimulus-excited–stimulus-excited paired site that was either correlated or uncorrelated with field L or CLM. HVC d' was calculated using the data collected from the search protocol in procedure A and the search–cross-correlation protocol in procedure B (see METHODS). Error bars are plotted for 2SE.

 
Stimulus selectivity for natural over synthetic stimuli in field L and CLM

One of the goals of this study was to assess the selectivity of field L cells that were functionally connected to HVC. To accomplish this, we presented a selectivity stimulus set containing natural and synthetic stimuli, similar to that presented in previous studies (Amin et al. 2004Go; Grace et al. 2003Go), to a subset of the simultaneously recorded paired sites. This set contained the song stimuli BOS, the Rev, Revorder, and Con, and the synthetic stimuli Pips, Tones, and WN (see METHODS). The song stimuli can be used to directly measure selectivity for the BOS, whereas analysis of the responses to synthetic stimuli provides an additional assessment of neuronal tuning properties and selectivity for conspecific songs in general. We evaluated the preference for Con over synthetic stimuli, Pips, Tones, and WN, in stimulus-excited auditory forebrain cells or cell clusters that were significantly correlated with HVC by calculating the d' relative to Con and comparing it to d' values calculated for cells that were not significantly correlated with HVC (Fig. 8). For the most part, our results were similar to those reported in a previous study (Grace et al. 2003Go). We found a mean d' significantly greater than zero for the Con-Pips comparison in both field L and CLM cells, regardless of functional connectivity (field L correlated, d' = 3.46 ± 0.85 vs. uncorrelated, d' = 2.31 ± 0.69; CLM correlated, d' = 1.57 ± 0.59 vs. uncorrelated, d' = 2.34 ± 0.88). We also found a preference for Con over Tones in field L cells that were part of a significantly correlated pair (d' = 1.88 ± 0.45) and in CLM cells that were part of a pair that did not show a significant correlation (d' = 2.27 ± 1.21). The mean d' was similarly positive for the Con–Tones comparison in the other two groups (field L uncorrelated, d' = 1.16 ± 0.57; CLM correlated, d' = 1.55 ± 0.79), but was not significantly different from zero at the 1% level. There were no significant differences between the responses to Con and the responses to WN in any of the four groups (field L correlated, d' = 0.54 ± 0.93 vs. field L uncorrelated, d' = –0.30 ± 0.82; CLM correlated, d' = 1.06 ± 1.29 vs. CLM uncorrelated, d' = 0.55 ± 1.11). A similar result was reported previously (Grace et al. 2003Go) and was attributed mainly to the strong onset response characteristic of field L and CLM cells presented with white noise. Similar onset responses were also observed in the present study.


Figure 8
View larger version (15K):
[in this window]
[in a new window]

 
FIG. 8. Mean d' values were calculated in field L and CLM to quantify and compare the differences between conspecific song and matched synthetic stimuli; random pure tones (Pips), compound tones (Tones), and WN in units shown to be either correlated or uncorrelated with HVC. Field L and CLM show selectivity for conspecific song over Pips. Field L and CLM d' values were calculated using the data collected from the selectivity protocol. d' was calculated for both stimulus-excited and stimulus-inhibited sites, but only the d' for stimulus-excited sites is shown. Error bars are plotted for 2SE.

 
Stimulus selectivity for the bird's own song in field L and CLM

We calculated the mean d' for the BOS–Rev and BOS–Con comparisons for all stimulus-excited field L (23 correlated, 21 uncorrelated) and CLM cells (13 correlated, 3 uncorrelated) that were played the stimulus selectivity set. The results showed that field L and CLM cells that were members of a significantly correlated auditory forebrain–HVC pair had a slight preference for responding to the BOS over the Rev (field L, d' = 0.46 ± 0.16; CLM, d' = 1.03 ± 0.36; Fig. 9). Field L stimulus-excited cells that were members of an uncorrelated auditory forebrain–HVC cell pair did not show this preference (d' = –0.05 ± 0.15), whereas CLM stimulus-excited cells that were members of an uncorrelated pair had a positive mean d' (d' = 0.67 ± 0.33), but it was not significant at the 0.01 level. The d' for the BOS–Con comparison was not significantly different from zero in any of the four groups (field L correlated, d' = –0.66 ± 0.37; CLM correlated, d' = –0.21 ± 0.61; field L uncorrelated, d' = –0.19 ± 0.39; CLM uncorrelated, d' = –0.99 ± 0.76), indicating suppression of the response to the BOS relative to Con. This trend has been reported and discussed in a previous study (Amin et al. 2004Go). Although the slight preference for BOS over Rev may be interpreted as BOS selectivity in brain regions that do not respond well to conspecific songs other than the bird's own song played in the forward condition (e.g., the song system nuclei), in field L and CLM where strong responses to forward conspecific song are typical, this "preference" for the BOS over the Rev is more readily attributed to a preference for the natural order found in conspecific song (Amin et al. 2004Go; Woolley et al. 2006Go).


Figure 9
View larger version (20K):
[in this window]
[in a new window]

 
FIG. 9. Mean d' values were calculated for the BOS–Rev and BOS–Con comparison in field L and CLM to quantify and compare the selectivity in units shown to be either correlated or uncorrelated with HVC. A: mean d' values were calculated for the BOS–Rev and BOS–Con comparison in all stimulus-excited field L and CLM units. There is a significant preference for the BOS over the Rev in both field L and CLM cells that are significantly correlated with HVC. B: mean d' values were calculated for the BOS–Rev and BOS–Con comparison in all stimulus-excited field L and CLM units or unit clusters that were paired with a stimulus-excited HVC cell or cell cluster. C: mean d' values were calculated for the BOS–Rev and BOS–Con comparison in stimulus-excited auditory forebrain cells or clusters that were paired with BOS-selective HVC cells or clusters. Field L and CLM d' values were calculated using the data collected from the selectivity protocol. d' was calculated for both stimulus-excited and stimulus-inhibited sites, but only the d' for stimulus-excited sites is shown. Error bars are plotted for 2SE.

 
Fourteen field L cells and nine CLM stimulus-excited cells were part of a significantly correlated EE paired site. Mean d' calculations for the BOS–Rev comparison in these instances yielded similar results to those of all stimulus-excited auditory forebrain cells with a slightly positive d' for the BOS–Rev comparison in the field L and CLM cells that were part of a significant<