Open access peer-reviewed chapter

The Importance of Acoustic Reflex in Speech Discrimination

By Kelly Cristina Lira de Andrade, Silvio Caldas Neto and Pedro de Lemos Menezes

Submitted: October 21st 2010Reviewed: March 18th 2011Published: June 13th 2011

DOI: 10.5772/17480

Downloaded: 4177

1. Introduction

Communication is essential for social interaction and may interfere significantly with school, professional performance or social communication. When we speak we allow others to know our thoughts, feelings and needs and them to know ours.

Hearing, in turn, is the main sense responsible for acquiring speech and language in children. Impairment of this function may compromise not only language, but also social, emotional and cognitive aspects (Tiensoli et al., 2007).


2. Noise

The word noise comes from the latin “rugitus”, which means roar. Acoustically it is composed of various sound waves in relation to anarchically distributed amplitude and phase, causing an unpleasant sensation, in contrast to music. It is any undesirable sound disturbance that interferes with what one wishes to hear. From the physics standpoint, noise is defined as a sound composed of a large number of randomly distributed acoustic vibrations with amplitude and phase relationships (Ferreira, 1975).

Noise can be divided into two categories: impact or impulsive and continuous or intermittent. In the first case, it represents peaks of acoustic energy of less than one second, at intervals greater than one second. Intermittent noise is all and any noise that is not classified as impact or impulsive.

Nepomuceno (1994) defines noise as an audible phenomenon whose frequencies cannot be discriminated because they differ between one another by lower values than those detected by the auditory device. It is responsible for the production of unpleasant auditory sensations and therefore different from sound. Noise is an interesting study subject because it directly affects the health of people, disqualifies the environment in which they live and causes social problems, in that its effects alter and degrade social relationships (Souza, 2004).

Speech intelligibility is defined as the relationship between words spoken and understood, expressed in percentage. For communication to be effective and intelligible, speech intelligibility must be higher than 90% (Nepomuceno, 1994). In noisy environments such as companies, schools, means of transportation, recreation areas or any other place with a large number of people, communication is significantly compromised.


3. Acoustic reflex

The hearing mechanism is described as consisting of three divisions: outer, middle and inner ear. Each compartment of the ear has the particular function of allowing sound to be transmitted, amplified and finally transformed into electrical stimulus that are conveyed to the cortex by the auditory nerve. Middle ear structures include the tensor tympani muscle and the stapedius muscle, the smallest striated muscles in the body.

Figure 1.

Middle ear

Contraction of stapedius and tympani muscles may also occur by extra acoustic excitation such as painful and tactile sensations. Voluntary contraction might occur in some subjects. Bujosa (1978) found that stimuli triggering stapedius muscle reflexes can be sonorous (unilateral, contralateral, ipsilateral and bilateral), tactile (in the skin of the tragus and upper eyelid) and electric (these are not observable in otitis media, osteosclerosis, facial paralysis, and sonorous cases). Stimuli that trigger tensor tympani reflexes are sonorous, tactile (airflow) and nasal mucosa stimulation with ammonia vapors.

Several studies have been conducted on the function of intratympanic muscles and some correlate it with activities of hearing centers. Lidén, Peterson & Hartford (1970) state that for acoustic stimuli, the primary response is from the stapedius muscle and that it exhibits lower reflex threshold to sound pressures than the tympani tensor. Observing the activity of the middle ear muscles is the function of the method used and, using imitanciometry, only stapedius muscle function can be determined with precision. For this reason, the acoustic reflex is denominated the stapedius reflex (Lopes Filho, 1975).

The neural complex of the acoustic reflex is located in the lowest part of the brainstem. In the ipsilateral pathway, the acoustic stimulus is transmitted from the cochlea to the acoustic nerve and then to the ipsilateral ventral cochlear nuclei, and in turn, through the trapezoid body to the facial motor nucleus and ipsilateral stapedius. In the contralateral pathway, transmission is from the ventral cochlear nuclei to the medial superior olivary complex, crossing to the facial motor nucleus, then proceeding to the facial nerve and contralateral stapedius. It is believed that this description, although the most accepted, involves much more complex multisynaptic connections, where upper auditory pathways act on the acoustic reflex, inhibiting or increasing it (Borg, 1973; Robinette & Brey, 1978; Downs & Crum, 1980; Northern, Gabbard, Kinder, 1989; Northern & Downs, 1989).

Motor neurons are located near, but not within the facial nucleus. They are separated anatomically into functional bundles and are capable of ipsi, contra or bilateral stimulus responses. By stimulating one ear, these connections can obtain bilateral responses from the facial nerve and consequently from the stapedius. In 1992, Colleti et al observed that pathologies of the ascending auditory pathways may interfere in the acoustic reflex, where activity is also thought to be regulated by cortical and subcortical structures. Thus, the acoustic reflex and the cochlear-olivary system would receive tonic facilitative influence from hearing centers (Carvallo, 1997a). Therefore, for acoustic reflex to occur, there must be integrity of the afferent, association and efferent pathway.

Figure 2.

Noise spectrum of speech

Andrade et al (2011) conducted a study involving 18 female participants, divided into two groups: acoustic reflexes present (20 ears) and absent (16 ears). A total of 180 disyllabic words were presented, 90 for each ear, randomly emitted at a fixed intensity of 40 dB above the three-tone average. At the same time, three types of noise were presented ipsilaterally (white, pink and speech), one at a time, at three intensities: 40, 50 and 60 dB above the three-tone average.

The three types of noises presented at intensities of 40 and 50 dB above the three-tone average showed better efficiency in discriminating speech in the group with acoustic reflexes present. For the intensity of 60 dB, using white and pink noise, a similar percentage of hits was observed between the two study groups. The better performance in discriminating at all intensities, including at 60dB, was found with the use of speech noise. The study concluded that the presence of reflex helps discriminate among speech sounds, especially in noisy environments.

White noise, described in the aforementioned study, consists of frequencies between 10 and 10,000 Hz, with equal intensity and energy maintained at high frequencies and efficient up to 6000 Hz. Pink noise is filtered white noise composed of frequencies between 500 and 4000 Hz, a band where it is most effective (Menezes, 2005).To construct speech noise, a recording was made in which several individuals were talking at the same time, simulating a noisy environment. Frequencies ranged only from 0 to 4 kHz. The noise spectrum was analyzed and fast Fourier transform is shown below.


4. New findings

A recent study conducted by Andrade et al (2011) compared three groups of normal hearers with pure-tone auditory threshold up to 20 dBHL for frequencies of 250, 500, 1000, 2000, 4000, 6000 and 8000 Hz dBNA (ANSI -1969), with inter-ear differences for frequencies less than or equal to 10 dB and age between 18 and 55 years. The following exclusion criteria were established: exposure to occupational or leisure noise, ear surgery, more than three ear infections within the last year, use of ototoxic medication and hereditary cases of deafness. In the control group the subjects showed reflexes up to 90 dBSPL (70 to 90 dB). In the second group, the normal hearers exhibited reflexes between 95 and 110 dBSPL (Group 2) and in the third group, the subjects had no stapedius reflex (Group 3). Each group consisted of 8 female participants, totaling 16 ears per group. As with Andrade (2011), speech noises were used at intensities of 40, 50 and 60 dB above the three-tone average.

Using TDH39 headphones in acoustic cabin, words and noises were emitted ipsilaterally and only hits were considered, that is, words correctly repeated. This test used 120 disyllable words, 60 for each ear. These were emitted randomly at a fixed intensity of 40dB above the three-tone average. Noise was used at intensities of 40, 50 and 60 dB above the three-tone average, that is, 20 words per intensity were presented. The responses characterized as distortions were stored in a databank and will be analyzed in future studies. If the participant did not respond or responded incorrectly, the word was repeated once again after the following word on the list was presented.

The sample studied was composed of 24 female volunteers. The age of control group subjects varied from 28.5 to 39.5 years, with mean age of 32.1 years and standard deviation of 9.8. In the second group, the age range was between 27.1 and 40 years, with mean age o 33.1 years and standard deviation of 10.8. In the group with no reflexes present, the age range was between 26.3 and 37.2 years, with mean age of 30.7 years and standard deviation of 6.5.

An evaluation of the different intensities of speech sound showed similar behavior to that of the means presented. Thus, it was decided to only show these mean results.

The results of this new study, as illustrated in graph 1, show a slightly higher mean percentage of hits for the control group (normal hearers with reflexes up to 90 dBSPL), as compared to Group 2 (normal hearers with reflexes between 95 and 110 dBSPL). However, the Mann Whitney U test did not reveal statistically significant differences (p=0.28). However, when the two aforementioned groups were compared (Control and 2) with the group of normal hearers with no stapedius reflexes (Group 3), the difference was significant, with p-values of 0.005, 0.000 and 0.031, as shown in the following table:

Noise (+40 dB)Noise (+50 dB)Noise (+60 dB)
Mann-Whitney U83.50054.000103.000
Wilcoxon W219.500190.000239.000
P value0.0050.0000.031

Table 1.

P-value for noise intensities

Figure 3.

Mean percentage of hits per group in the speech discrimination test with noise, irrespective of ear or noise intensity used.

Thus, enhanced discrimination ability can be observed at all intensities, using speech noise, including at 60 dB, for the group with reflexes present, regardless of intensity, as compared to Group 3, with no reflexes present. That is, a significant difference occurs only when we compare groups of individuals without acoustic reflex in discriminating sounds in the presence of speech noise (Andrade, 2011).


5. Speech tests

Based on the previously described studies, it is important to point out that word recognition tests are very important in audiological diagnosis. The audiological battery is considered incomplete without measures of speech recognition.

The ability to understand speech is one of the most important measurable aspects of human auditory function. Logoaudiometry is a means to evaluate speech recognition under adequately controlled conditions (Penrod, 1999). The tests used to measure auditory performance of individuals performing speech recognition tasks use isolated stimuli, monosyllable and disyllable words being the most widely used (Lacerda, 1976).

Speech recognition is accompanied by a combination of acoustic, linguistic, semantic and circumstantial cues (Gama, 1994). However, when an excess of some of these cues are heard under favorable conditions, they can be disregarded. For message transmission to be effective, there must be redundancy of the acoustic cues that the hearer draws on, according to the situation and context of the communication. This is what occurs, for example, with conversations in noisy environments.

In audiological practice, it is common to find subjects with the same degree and configuration of sensorineural auditory loss who exhibit substantially different skills with respect to speech perception. There is a relatively weak relationship between tone auditory thresholds and speech intelligibility for individuals with sensorineural auditory loss. Factors other than auditory sensitivity likely interfere in speech perception (Yoshioka, 1980).

The following have been offered to explain the difficulty in understanding speech in noise, for patients with sensorineural auditory loss: noise, which has a masking function; loss of binaural integration, which increases the signal-to-noise ratio by 3 dB or more; difficulties in temporal resolution and frequencies; decrease in the dynamic field of hearing and the effect of masking low frequency energy (vowels) on middle and high frequency threshold (consonants). The negative influence of noise may be due to the following factor: the monaural entry to the auditory system does not allow the noise reduction processing possible in a binaural auditory system (Almeida, 2003).

Most individuals with auditory loss at high frequencies (above 3000 Hz) have little or no difficulty in understanding speech in silent environments, since these situations contain a series of excess cues that they can use to understand speech. However, in noisy environments or under adverse conditions, such as when speech is distorted, the subject may have innumerable difficulties regarding speech intelligibility, given that the number of cues falls drastically, leading to the use of only the cues available in the particular situation.

Thus, it is extremely important to study hearing performance less favorable conditions, such as simulating conversation in an environment where several individuals are talking at the same time and determining which processes interfere in the speech perception of these subjects. This justifies the concern about not only measuring speech recognition capacity in situations of acoustic isolation, in which concurrent stimuli are under control, but also in situations resembling real life (Schochat, 1994).

Two types of noise are recommended in the evaluation of speech perception: competitive speech noise and environmental noise, with competitive speech noise exerting a more significant effect on speech perception than environmental noise in general (Sanders, 1982).

In 2004, Caporali & Silva conducted a study to determine the effects of hearing loss and age on speech recognition in the presence of noise, using two types of noise. Three experimental groups were organized, one composed of adults without auditory alteration, another by adult subjects with hearing loss at high frequencies and a third group of elderly with similar audiometric configuration to that of the group with loss. All subjects performed speech recognition task in silence, in the presence of an amplified white noise spectrum and “cocktail party” noise, at the same signal-to-noise ratio (0 dB), in both ears. The results showed that noise interferes negatively on speech recognition in all the groups. Performance of normal-hearing subjects was better than that of the groups with hearing loss. However, the group of elderly performed worse, especially with respect to “cocktail party” noise. It was also observed that all the subjects exhibited better results in the second ear tested, showing a learning effect. Findings demonstrate that both age and hearing loss contributed to the low performance exhibited by the elderly in speech perception in the presence of noise and that “cocktail party” noise was suitable for this investigation.


6. Noise in the work environment

It is well known that physical and mental well being is important for good performance, in both professional and social activities. Many measures have been taken to provide better occupational conditions to workers; however, there are a number of aspects that are not recognized or valued.

It is very important to carry out research to clarify the extra auditory effects of noise on human beings, increase concern and efforts to eliminate this risk agent and adopt effective preventive and curative measures that provide better quality of life to workers.

Hearing loss may result if individuals remain in high noise environments. Given that it occurs slowly and gradually, prevention has not been given proper attention (Ayres & Corrêa, 2011). Recruitment, buzzing and poor speech signal discrimination are also observed in unfavorable environmental conditions. In addition to these auditory problems, other disorders caused by elevated noise may occur, possibly affecting other organs in the body, provoking problems such as: headaches; digestive disorders; restless sleep; lack of sleep; impaired attention and concentration; buzzing in the ears or head; vertigo and loss of balance; cardiac and hormonal alterations; anxiety, nervousness and increased aggressiveness (Kwitko, 2001).

In a retrospective study, Miller (1972) observed that groups of workers with different periods of exposure to noise (in years) showed greater losses at 4000 Hz. These initial alterations would not be detected by individuals themselves. With an increase in exposure time, other frequencies would be involved (500 Hz to 3000 Hz) with damaging consequences for speech reception.

The physical working environment must be well planned, obeying health and safety norms, have adequate illumination and space and be acoustically treated, which does not always occur. In most companies, workers are exposed to an unfavorable acoustic environment owing to background noise generated by different sound sources, such as air conditioning, conversation, movement of people, equipment and non-acoustically treated room, promoting sound reverberation.

For workers with hearing loss, or even for those with normal hearing, attempting to avoid background noise in the work environment is undeniably challenging and often frustrating. Symptoms related to noise exposure include anger, anxiety, irritability, emotional stress, lower morale and motivation, distraction, mental fatigue and sleep disturbances (Kryter, 1971). The performance of tasks may also be significantly affected by the presence of sound. Several studies suggest that sound reduces overall precision instead of total amount of work, and is more apt to affect the performance of complex tasks rather than simple ones (Miller, 1974).

The absence of acoustic reflexes is another negative factor in noisy work environments, given that the speech sound discrimination capacity of individuals with this impairment, added to all the previously described problems, is more affected. It is important that more studies correlating the aforementioned variables be conducted in order to contribute to improved working environments for individuals with abnormal acoustic reflex.


7. Conclusion

Given the recognized importance of acoustic reflex for communication, new studies should make a thorough investigation of the nuances of this relationship, in order to develop new technologies that allow individuals to communicate adequately in noisy work environments, thereby avoiding accidents and ultimately improving their quality of life.



We are grateful to Professors of UNCISAL for auditory evaluating of the patients and we thank all students of the Acoustic Instrumentation Laboratory.

© 2011 The Author(s). Licensee IntechOpen. This chapter is distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike-3.0 License, which permits use, distribution and reproduction for non-commercial purposes, provided the original is properly cited and derivative works building on this content are distributed under the same license.

How to cite and reference

Link to this chapter Copy to clipboard

Cite this chapter Copy to clipboard

Kelly Cristina Lira de Andrade, Silvio Caldas Neto and Pedro de Lemos Menezes (June 13th 2011). The Importance of Acoustic Reflex in Speech Discrimination, Speech Technologies, Ivo Ipsic, IntechOpen, DOI: 10.5772/17480. Available from:

chapter statistics

4177total chapter downloads

1Crossref citations

More statistics for editors and authors

Login to your personal dashboard for more detailed statistics on your publications.

Access personal reporting

Related Content

This Book

Next chapter

Single-Microphone Speech Separation: The use of Speech Models

By S. W. Lee

Related Book

First chapter

Towards Efficient Translation Memory Search Based on Multiple Sentence Signatures

By Juan M. Huerta

We are IntechOpen, the world's leading publisher of Open Access books. Built by scientists, for scientists. Our readership spans scientists, professors, researchers, librarians, and students, as well as business professionals. We share our knowledge and peer-reveiwed research papers with libraries, scientific and engineering societies, and also work with corporate R&D departments and government entities.

More About Us