Misophonia is a relatively unexplored chronic condition in which a person experiences autonomic arousal (analogous to an involuntary “fight-or-flight” response) to certain innocuous or repetitive sounds such as chewing, pen clicking, and lip smacking. Misophonics report anxiety, panic, and rage when exposed to trigger sounds, compromising their ability to complete everyday tasks and engage in healthy and normal social interactions. Across two experiments, we measured behavioral and physiological characteristics of the condition. Interviews (Experiment 1) with misophonics showed that the most problematic sounds are generally related to other people's behavior (pen clicking, chewing sounds). Misophonics are however not bothered when they produce these “trigger” sounds themselves, and some report mimicry as a coping strategy. Next, (Experiment 2) we tested the hypothesis that misophonics' subjective experiences evoke an anomalous physiological response to certain auditory stimuli. Misophonic individuals showed heightened ratings and skin conductance responses (SCRs) to auditory, but not visual stimuli, relative to a group of typically developed controls, supporting this general viewpoint and indicating that misophonia is a disorder that produces distinct autonomic effects not seen in typically developed individuals.

General Introduction

Misophonia, literally translated to “hatred of sound,” is a chronic condition in which specific sounds provoke intense emotional experiences and autonomic arousal within an individual. Trigger stimuli include repetitive and social sounds typically produced by another individual, including chewing, pen clicking, tapping, and lip smacking. These experiences are not merely associative in nature, but drive the sufferer to avoid situations in which they may be produced, limiting one's ability to interact with others and often leading to severe problems in their social and professional lives. Also known as selective sound sensitivity syndrome, the term “misophonia” was first coined by Jastreboff (Jastreboff, 2000; Jastreboff and Jastreboff, 2001a,b, 2003) and little remains known about the condition. To our knowledge only two case studies (Hadjipavlou et al., 2008; Schwartz et al., 2011) and one clinical study (Schröder et al., 2013) have examined misophonia. In the latter study, psychiatrists presented questionnaires and administered interviews to 42 misophonics, an essential first step in showing that misophonia is a primary disorder with no obvious comorbidity with other known psychological or neurological conditions (Schröder et al., 2013).

The prevalence of misophonia is under active investigation but there exist several online support groups with thousands of members (Misophonia UK, Facebook and Yahoo). Sufferers of misophonia are fully aware of its presence and the abnormal responses they have to their trigger sounds. In addition, many sufferers have identified the condition in at least one close relative, suggesting a possible hereditary component. While effective treatments for misophonia remain elusive, individuals report utilizing coping mechanisms to minimize their exposure and response to triggering stimuli (discussed at length below). Further, misophonia appears to exhibit some general similarities to tinnitus. Jastreboff and Hazell (2004) propose that misophonia and tinnitus are both associated with hyperconnectivity between the auditory and limbic systems, suggesting that both conditions would evoke heightened reactions to their respective sounds. However, despite these general similarities, misophonia differs from tinnitus considerably, particularly in terms of how the condition is localized around certain human-produced sounds and situations as opposed to internally perceived, abstract sounds.

While the majority of typically developing individuals experience general and unelaborated emotional reactions to a range of sounds (Halpern et al., 1986), these widespread negative associations remain non-debilitating and at most an annoyance to the listener. One critical possibility is that the valenced associations present in typically developing individuals are matched to those with misophonia, with the latter merely experiencing a more extreme physiological response. Indeed, the sound of fingernails on a chalkboard is an emotionally evocative stimulus that elicits extreme discomfort in the typical population (Zald and Pardo, 2002; Kumar et al., 2012) and misophonic individuals often reference this stimulus in illustrating the extreme nature of their trigger sensations. In this study, we further elaborate on the symptoms and behaviors associated with misophonia as well as examine whether misophonics' physiological responses support their subjective reports of feeling autonomic arousal in response to certain sounds.

Experiment 1

Introduction

We first received information about misophonia in December of 2011 through members of an online misophonia support group. From initial descriptions, the condition appeared to have many intriguing qualities in addition to being quite unknown and unexplored. Misophonic individuals were invited to the lab for preliminary interviews with the hope of gaining a more concrete understanding of their experiences with the condition.

Materials and Methods

Participants

Eleven individuals with misophonia from the San Diego and Los Angeles areas were recruited from the University of California, San Diego campus, through self-identified contact of our lab as well as through an online misophonia support group (4 males and 7 females, mean age = 35.82; range = 19–65).

Procedure

Thirty to sixty minute semi-structured interviews were conducted by members of our research group on the University of California, San Diego campus. As no set diagnostic criteria for misophonia exists for misophonia, eligibility for study inclusion was based on severity of symptoms paired with experiential descriptions reported by the subject. The five initial interviews were exploratory in nature and included a range of topics, including approximate age of onset, lists of sounds that elicit varying degrees of discomfort, whether or not certain individuals exacerbate the condition, coping mechanisms, common thoughts when experiencing symptoms, physical responses to the trigger sounds, effect of the condition on their daily lives, and other potentially comorbid medical conditions. From these interviews we were able to generate a core set of questions to create the general framework of the subsequent six interviews that were held.

Results and Discussion

After conducting all 11 interviews, it was apparent that the experiences of the misophonics, though intrinsically variable between subjects, contained noticeable trends and similarities. The most salient categories of assessment and their traits are documented in Table 1. In addition, it should be noted that all diagnostic criteria listed by Schröder et al. (2013) were present in the reports of our misophonic subjects (see Table 1) even though these interviews were conducted prior to the publishing of that article.

TABLE 1

Table 1. Summary of qualitative data gathered from interviews of the 11 misophonic subjects (4 males and 7 females, mean age = 35.82; range = 19–65) in Experiment 1, broken down into 18 of the most salient diagnostic categories.

The most important criterion in misophonia is that particular sounds will evoke a disproportional aversive reaction. Our subjects were recruited based on their reports of this characteristic. In accordance with previous reports, our misophonics reported that the worst trigger sounds are chewing, eating, and crunching sounds, followed by lip smacking, pen clicking, and clock ticking (see Table 1). Other notable trigger sounds include low frequency bass sounds, footsteps, finger tapping, whistling sounds, and typing (see Table 1). Nine of our 11 misophonics reported that sounds repetitive in nature were particularly bad. In addition, six of our misophonics indicated that spoken “Ss” sounds were unpleasant, although not quite on the same level as trigger sounds.

In terms of aversive responses to these sounds, misophonics report a range of negative feelings, thoughts, as well as physical reactions. Some of the negative feelings experienced include intense anxiety, panic, anger, extreme irritation, and even rage (see Table 1). Additionally, in the context of our study, it is important to distinguish anxiety from fear. Specifically, while our subjects report feeling extreme stress and anxiety in response to trigger sounds, they did not report being afraid or fearful of them. Nine of our 11 misophonics reported trigger sounds as being invasive, intrusive, disgusting, or rude. They also reported feeling offended or violated by these sounds to the point where negative thoughts such as “I hate this person,” “Stop it, I can't stand it,” and “Don't you know what you sound like?” enter their minds. However, on top of the strong psychological effects, misophonics also report experiencing strong physical effects in response to trigger sounds. The most commonly reported physical effects were pressure in the chest, arms, head, or entire body as well as clenched, tightened, and tense muscles. Some misophonics reported an increase in blood pressure, heart rate or body temperature, sweaty palms, physical pain, and even difficulty breathing in response to trigger sounds (see Table 1). The aforementioned aversive responses evoked by trigger sounds are characteristic of a typical, autonomic nervous system response. In line with this, the worst situations for misophonics are often ones where they feel trapped and unable to escape, including long trips in cars or planes. Similarly, two misophonics report that trigger sounds at school or at home are worse than in places one can easily leave, such as a public place. However, despite extreme discomfort, misophonics generally do not physically act out on feelings of aggression. Some report instances of snapping at others while others internalize their frustration (see Table 1).

A final indication that misophonia produces physical and autonomic responses is the suggestion that pharmacological agents affect the condition. Four of our misophonics indicated that caffeine intensifies misophonic experiences while seven of misophonic individuals indicated that alcohol decreases symptomatology; these subjects describe that while under the influence of alcohol they can still hear the sound but their aversive response is not as strong.

In response to their aversive reactions to trigger sounds, misophonic individuals have developed a number of coping strategies including: avoiding or removing themselves from certain situations, mimicking trigger sounds, or the action producing it to “cancel out” or “retaliate,” utilizing earplugs, headsets or listening to music, distracting oneself, reciting positive internal dialog to help calm themselves, asking others to stop making the sounds, as well as being conscientious about their own sounds (see Table 1).

The degree to which quality of life is affected varied between our misophonic participants. One subject reported that misophonia “… does not affect the quality of my life too much. But it seems ridiculous and I would like to get rid of it” while another subject reported that misophonia had in the past evoked thoughts of suicide. These reports indicate there might be different degrees of the misophonic condition, ranging from mildly hindering to severely debilitating.

Misophonic individuals most commonly describe onset of the condition in childhood. Two subjects reported that with age, they learned to better cope with their misophonia, five subjects reported that it worsened over time (due to increasing aversiveness as well as increasing number of triggering stimuli) and three recalled no change over time. It is not fully understood why differences in trigger accumulation and severity develop between misophonics but it appears that prolonged and repeated exposure to a sound may be a contributing factor. For example, one of our misophonic subjects related this to the “honeymoon” period in a new job or relationship, in which for a few years new sounds caused little irritation. However, over time the negative affect of these sounds intensified to become triggers as well.

Six of our misophonics reported that one or several close family members display misophonic-like symptoms and behaviors. Two subjects had no information on this topic and three reported that they do not believe that misophonia runs in their families. While these reports are only anecdotal, they suggest there may be a familial or genetic component to misophonia, calling for further investigation in future studies.

Interestingly, misophonic individuals further report that responses evoked by trigger sounds appear to be modulated by prior knowledge, context, and sound source, implying that the condition is not driven simply by the physical properties of sound alone. For example, nine of our misophonics indicated that their misophonia is isolated to or exacerbated by certain individuals, usually close friends, coworkers, or family members whom they are exposed to frequently (see Table 1). Another curious characteristic described by 10 of our misophonics is the fact that self-induced trigger sounds (trigger sounds produced by the misophonic individual themselves) will not evoke nearly as much of an aversive response as when produced by others. In fact, as mentioned earlier, mimicking trigger sounds is one of the coping strategies utilized by misophonics to “overwrite” the disturbing sound being produced by another individual. Several misophonics even report eating foods in synchrony with the other person. However, mimicking is also mentioned as a way to retaliate against the offending individual producing the sounds, thus acting as a way to cope with the anger evoked by the condition.

The interviews further revealed an interesting effect of the role of context on aversive responses. For instance, eight of our misophonics report eating and chewing sounds (severely offensive triggers associated with rudeness when produced by human adults) will not bother them nearly as much if produced by animals or babies (see Table 1). One individual described that, as these individuals have little control over their actions and “don't know any better,” it helps in cancelling out strong aversive feelings. These results suggest that the aversive responses experienced by misophonics are explicitly tied to other individuals, implying an underlying social component to the condition. Accordingly, even though our subjects fit in with Schröder et al.'s (2013) diagnostic criterion of misophonics being aware of their condition, and recognizing their feelings as “excessive, unreasonable, or out of proportion,” they will still comment on the inappropriateness of another person's behavior nonetheless.

Another recurring topic from the interviews is the role of attention in misophonia. Nine of our misophonics report being hyper-focused on sounds that normally exist as background noise. One misophonic subject described the inability to tune out background noises as being like an “involuntary cocktail party effect” while another mentioned that “noises are never in the background. People sounds crash right through jet engine sounds.” Eight of our misophonics described being unable to pay attention to a movie or lecture when individuals around them produce trigger sounds, with partial remediation by distracting themselves and directing their attention elsewhere. In addition, it is possible that through understanding the role of attention in misophonia, potential treatments may be able to be developed.

In accordance with Schröder et al. (2013), our subjects reported a few symptoms shared with other diagnoses, however the complete symptomology of misophonia does not fit with any of the diagnostic categories in the diagnostic and statistical manual of mental disorders (DSM-IV). In their interviews, subjects described symptoms related to obsessive-compulsive disorder (OCD), attention deficit disorder (ADD), post-traumatic stress disorder (PTSD), auditory processing disorders as well as tinnitus and hyperacusis (see Table 1). However, these symptoms did not cover the full range of complaints, including the critical symptom of misophonia (a strong aversive response to particular sounds). Two of our misophonics reported being treated with medications, including antianxiety medications and antidepressants, that were intended to alleviate some of the effects of misophonia but as it stands, a treatment to fully address the root of the problem still remains elusive. Thus, our results are in line with the previous conclusion that misophonia is not part of another clinical, psychiatric, or psychological disorder (Schröder et al., 2013).

Experiment 2

Introduction

Qualitative assessments of misophonic subjects demonstrated the consistent association between specific sounds and intense emotional experiences. In order to confirm the presence of these emotional reactions and further examine their relationship to sound preferences present in the general population, we measured skin conductance response (SCR) while misophonic participants and typically developed individuals were exposed to aversive and non-aversive auditory, visual, and auditory-visual stimuli. SCR measures the electrical conductance of the skin and consequently the amount of sweat produced. Because sweat production is not under volitional control, SCR is widely accepted to indicate arousal of the sympathetic nervous system (Critcheley, 2002). For these reasons, we believe SCR to be an appropriate method of measuring autonomic arousal to various emotion-eliciting stimuli.

Materials and Methods

Participants

Six misophonic subjects who also participated in Experiment 1 (2 males and 4 females; mean age = 22.8; range = 19–30) and five controls (mean age = 22; range = 19–29) matched on age and gender participated in the experiment; A sixth control was excluded due to an error during data collection. Controls were recruited from the student population at the University of California, San Diego. All participants reported normal hearing and vision, gave signed, informed consent prior to the experiment, and participated either for cash or in fulfillment of a course requirement. The study was reviewed and approved by the university's Human Research Protections Program. Total experiment time was less than 1 h.

Procedure and stimuli

Participants were seated 20 inches from an 18 inch monitor and provided Sennheiser® headsets. SCR recordings were acquired with BIOPAC System (MP100A-CE) and AcqKnowledge 4.1 recording software. A pair of Ag-AgCl electrodes was attached to the palmar surface of the middle and ring fingers of the participant's dominant hand. Prior to attachment, participants' hands were cleansed with an alcohol wipe and a skin conductance gel was applied to each electrode. SCR was recorded in micro Siemens at a rate of 30 samples/s. Participants were instructed to relax with their dominant hand placed palm up on their thigh and to minimize movement throughout the duration of the experiment. SCR was examined in subjects prior to experimental testing for typicality; absence of a normal response precluded a subjects' participation in the rest of the study.

Stimuli included 31 video clips either acquired from YouTube or recorded in the lab. Video content varied in order to cover a range of sounds and predicted emotional responses in misophonic subjects, selected based on interview data from Experiment 1. Example stimuli included birds singing, children laughing, whale song, nails on a chalkboard, lips smacking, gum chewing, etc. Each clip lasted for 15 s. Auditory and visual components of these videos were separated to generate auditory alone, visual alone, and auditory-visual conditions. Each auditory, visual, and auditory-visual stimulus was presented once for a total of 93 trials. Trial order was randomized into two orders and order was counterbalanced across participants. Critically, as each specific video was presented a total of three times (once in each auditory, visual, and auditory-visual condition), a consistent ordering of the presentation of each stimulus was maintained for each type: auditory alone, visual alone, followed by auditory-visual. Stimuli were presented with E-Prime® version 2.0.

On each trial, participants viewed a centrally presented fixation cross for a 5-s period, followed by either an auditory clip (A), visual movie (V), or auditory-visual movie (AV) for 15 s, concluded with an inter-trial interval of 10 s; during this 10-s interval subjects provided a verbal aversiveness rating on a scale of 0–4 based on how much discomfort they experienced in response to the preceding trial. Participants were informed that a rating of 0 would signify no discomfort at all and a rating of 4 would signify an extreme amount of discomfort, anxiety, or an urge to leave the room. Each aversiveness rating was recorded by the experimenter.

Data preprocessing

As our stimuli were presented in quick succession, a linear downward trend was observed throughout the recording session. To account for this artifact, separate linear regressions were fitted to the 5-s fixation period at the start of each trial through a line of best fit. Each observed value during the stimulus epoch was re-plotted as the residual of this line of best fit, normalizing for the pre-stimulus baseline period and removing artifact trends present throughout the epoch. A consistent pattern of results was additionally observed on non-detrended data.

Data analysis

SCR onset was time-locked to pre-stimulus fixation cross. Mean SCR was calculated from the 15-s stimulus epoch for each trial, following the fixation cross. Mean values exceeding three standard deviations from the mean SCR across all trials for each participant were deemed outliers and consequently removed from the dataset; an average of 1.9% of trials were removed per participant.

Statistical analyses

First, we conducted repeated measures ANOVAs across factors of Group (misophonics, controls), Measurement (SCR, aversiveness rating), and Condition (auditory, visual, auditory-visual) to observe overall effects. Follow-up ANOVAs, non-parametric independent samples tests and descriptive analyses were conducted to explore group differences. Follow-up correlations revealed further group differences as well as similarities. Greenhouse-Geisser corrections were used where appropriate, but we report the original degrees of freedom for clarity.

Results

Overall group effects

As an overall examination of the data, we conducted a repeated measures ANOVA with factors Group (misophonics, controls), Measurement (SCR, subjective rating), and Condition (auditory, visual, auditory-visual). Results showed significant main effects of Group [F (1, 9) = 17.5, p < 0.005], Condition [F (2, 18) = 47.3, p < 0.001], and Measurement [F (1, 9) = 48.5, p < 0.001], as well as significant interactions between Group × Condition [F (2, 18) = 18.8, p < 0.005], Group × Measurement [F (1, 9) = 13.7, p < 0.01], Measurement × Condition [F (2, 18) = 40.5, p < 0.001], and Group × Measurement × Condition [F (2, 18) = 16.2, p < 0.005].

However, as the primary goal of this study was to examine unisensory responses to stimuli in both groups, subsequent tests for group effects excluded multisensory (auditory-visual) trials and included only auditory and visual conditions. Figure 1 shows misophonic and control subjects' average SCR data in auditory and visual conditions as a function of time. A repeated measures ANOVA with factors of Group (misophonics, controls), Measurement (SCR, subjective rating), and Condition (auditory, visual) similarly identified significant main effects of Group [F (1, 9) = 14.3, p < 0.005], Condition [F (1, 9) = 47.5, p < 0.001], and Measurement [F (1, 9) = 40.7, p < 0.001], as well as significant interactions between Group × Condition [F (1, 9) = 17.5, p < 0.005], Group × Measurement [F (1, 9) = 10.1, p < 0.05], Measurement × Condition [F (1, 9) = 44.0, p < 0.001], and Group × Measurement × Condition [F (1, 9) = 16.1, p < 0.005]. This overall ANOVA validated the use of follow-up analyses to test specific hypotheses.

FIGURE 1

Figure 1. Average misophonic and control participants' skin conductance response to auditory and visual stimuli as a function of time.

Group differences

We conducted additional follow-up repeated measure ANOVAs with factors of Group (misophonics, controls) and Condition (auditory, visual), first for subjective aversiveness ratings alone. Results showed main effects of Group [F (1, 9) = 12.4, p < 0.01] and Condition [F (1, 9) = 46.5, p < 0.001], and critically an interaction between the two [F (1, 9) = 17.1, p < 0.005] supporting the differences between the groups (see Figure 2A). This difference between the groups was largely due to controls rarely rating stimuli as greater than 2 on the aversiveness scale (ranging from 0 to 4; see Figures 3A,B). Examining this model for SCR data yielded a similar pattern of results with main effects of Group [F (1, 9) = 6.77, p < 0.05] and Condition [F (1, 9) = 11.9, p < 0.01], and a marginally significant interaction between the two [F (1, 9) = 4.53, p = 0.06] (see Figure 2B).

FIGURE 2

Figure 2. Group means of controls and misophonics, per presentation condition (auditory and visual) for (A) subjective reports and (B) SCR.

FIGURE 3

Figure 3. Percentage of trials per index on the 5-point aversiveness scale, for controls and misophonics, during (A) auditory and (B) visual conditions.

Given the small sample size of these groups, follow-up non-parametric independent-samples Mann-Whitney U-tests were used to compare groups across these critical conditions. Misophonics reported significantly higher ratings than control subjects in response to auditory stimuli, U (9) = 29.0, p < 0.01, but not visual stimuli, U (9) = 23.5, p = 0.13. The median rating of auditory trials was 1.82 (SD = 1.38) for misophonics and 0.42 (SD = 0.77) for controls while the median rating of visual trials was 0.29 (SD = 0.98) for misophonics and 0.19 (SD = 0.55) for controls. This pattern of results was consistent with SCR responses, with misophonics producing larger SCR responses than controls to auditory stimuli, U (9) = 28.0, p < 0.05, but not visual stimuli, U (9) = 21.0, p = 0.33. The median SCR of auditory trials was 0.15 micro Siemens (SD = 0.40) for misophonics and 0.03 micro Siemens (SD = 0.11) for controls while the median SCR of visual trials was 0.07 micro Siemens (SD = 0.39) for misophonics and 0.00 micro Siemens (SD = 0.08) for controls. The same pattern of results for these tests was observed with parametric independent samples t-tests.

In order to determine if higher SCR is directly correlated with higher aversiveness ratings, we examined individual subjects' aversiveness ratings relative to average SCR activity from all auditory, visual, and auditory-visual trials. Results identified a significant positive correlation between average aversiveness ratings and average SCR across all participants (see Figure 4), (r s = 0.700, N = 11, Z = 2.21, p < 0.05), indicating that stimuli subjectively thought of as aversive generally evoked a proportional SCR.

FIGURE 4

Figure 4. Correlation of average aversiveness ratings and average SCR (in micro Siemens) for all trials across all subjects.

Group similarities

As an examination of whether the stimuli that trigger aversive experiences in misophonic individuals are idiosyncratic to the condition or consistent to, though more extreme than, preferences present in the general population, we examined the consistency of ratings across the groups. Findings indicated a significant positive correlation between misophonic and control aversiveness ratings across all three types of stimuli, (r s = 0.605, N = 93, Z = 5.80, p < 0.001); this correlation is additionally present when examining the correlation between the groups for only auditory trials, (r s = 0.413, N = 31, Z = 2.26, p < 0.05; see Figure 5) suggesting that misophonics and controls find similar stimuli to be aversive and non-aversive.

FIGURE 5

Figure 5. Correlation of average aversiveness ratings of stimuli (x- and y-axis) across conditions in misophonics and controls. Select stimuli identified by proximal text.

Discussion

Experiment 2 provides, to the best of our knowledge, the first experimental investigation on misophonia, serving to validate the severity of this chronic condition beyond anecdotal description. Misophonic subjects rated auditory stimuli as more aversive than the same visual stimuli, and this pattern was consistent with SCR measurements. Furthermore, SCR and subjective ratings to auditory stimuli were greater in misophonic individuals than controls, supporting the specificity of aversive reactions in misophonia. Nevertheless, misophonic subjects demonstrated increased ratings and SCR regardless of stimulus type, as revealed by observed main effects of group, possibly denoting generalized anxiety to the stimuli used in the present study.

The significant positive correlation between average aversiveness ratings and mean SCR across all participants importantly confirms the validity of each subject's ratings during the task. Therefore, participant's physiological responses to stimuli were consistent with their subjective ratings. However, as shown in Figure 4, this positive correlation seems most likely driven by group differences between misophonics, (represented in green) and controls (represented in blue).

The significant positive correlation between misophonic aversiveness ratings and control aversiveness ratings reflects a general agreement of the relative valence of the inducing stimuli across the groups. In other words, misophonics and controls find similar stimuli to be aversive and non-aversive on a subjective level, suggesting that misophonics may experience an extreme form of the discomfort most individuals experience to normally aversive or irritating stimuli. This raises the important possibility that there is nothing intrinsically different about misophonic individuals from those in the general population and misophonic individuals are merely at the tail end of the distribution.

General Discussion

In a preliminary examination of individuals with misophonia, we report qualitative and physiological investigations of the condition and its relationship to responses in the typical population. Experiment 1, which is comprised of qualitative assessments on eleven misophonic subjects, examined the qualities associated with misophonia in order to help develop reliable diagnostic criteria and understand the complex social factors involved. Results were consistent with early reports of the phenomenon, such as the critical characteristic of misophonia being a disproportionately aversive reaction is in response to common sounds in everyday life. Additionally, a visceral autonomic response is physically felt in misophonics in response to trigger sounds. In Experiment 2, physiological measurements were acquired on six misophonic individuals using SCR to provide an objective corroboration of misophonics' reports that specific sounds evoke intense emotional reactions. Results showed an increased autonomic response to trigger sounds, but not visual stimuli, in misophonics as compared with non-misophonic controls.

Administering semi-structured interviews proved to be an effective way of determining the most critical symptoms, triggers and experiences associated with misophonia as well as the degree to which these varied across subjects. In addition to reporting psychological symptoms, all of our misophonics reported physical symptoms synonymous with autonomic arousal in response to trigger sounds. Furthermore, our qualitative results are in line with all of the diagnostic criteria proposed by Schröder et al. (2013) which, shortly summarized are: (A) aversive and angry feelings evoked by particular sounds, (B) rare potentially aggressive outbursts, (C) recognition by the misophonic individual that his/her behavior is excessive, (D) avoidance behavior, (E) distress and interference in daily life, and lastly, (F) the lack of another condition to account for all symptoms. Additionally, our principal finding that misophonic individuals experience physical, autonomic arousal that is measurable by SCR, provides empirical validation for some of the aforementioned critical criteria proposed by Schröder et al. (2013), particularly criterion A. Through conducting interviews, we also identified other interesting aspects of misophonia that were not previously apparent. In particular, subjects reported that misophonia can be modulated by social expectations as well as situational context, indicating that the condition may be more complicated than merely an aversive response to the purely physical properties of sounds. Additionally, the finding that misophonic individuals report involuntary, physiological distress in response to a very specific subset of social sounds supplements research on complex mind-body interactions, with high-level knowledge demonstrating prolonged and specific physiological reactions (e.g., as in placebos; Margo, 1999). However, at this time, these speculations remain based on anecdotes and need to be properly tested in the future before firm conclusions can be drawn.

To date, no research has examined the neurological origin of misophonia, and preliminary investigations suggest it is not due to any primary neurological or psychological disorder or trauma (Schröder et al., 2013). Nevertheless, misophonia displays similarities to a genetic condition known as synesthesia. In synesthesia, as in misophonia, particular sensory stimuli evoke particular and consistent, additional sensations and associations. Well-known forms of synesthesia include letters evoking a particular color, or sounds/music evoking colors (Cytowic, 1989; Baron-Cohen et al., 1996; Simner et al., 2006) but there are in fact many different subtypes of synesthesia, with a variety of “inducers” (e.g., music, taste, words, sequences) evoking certain “concurrents” (e.g., color, shapes, taste). While most synesthesia research has examined the perceptual sensations related to synesthesia, the condition seems to have an affective component as well. First, synesthetic congruency (e.g., when a grapheme-color synesthete sees a letter in the “correct” color) is related to positive affect (e.g., Callejas et al., 2007). Furthermore, both inducers (Ward, 2004; Ramachandran et al., 2012) and concurrents (Simner and Holenstein, 2007) can be of emotional rather than perceptual nature. Interestingly, the latter indicates that for certain subtypes of synesthesia, similar to misophonia, inducers evoke a particular feeling or emotion rather than a pure perceptual sensation. This has been studied in tactile-emotion synesthesia (e.g., feeling sandpaper evokes a feeling of jealousy; Ramachandran and Brang, 2008). Synesthetic associations, like misophonic experiences, are automatic (in the sense that they do not take effort or conscious deliberation), are consistent within an individual and persist throughout life, and seem to run in families (Asher et al., 2009; Tomson et al., 2011; for a review see Brang and Ramachandran, 2011). Given these similarities, neuroimaging findings in synesthetes may provide us with hypotheses on the neural basis of misophonia. First, associated sensations in synesthesia are found to be associated with co-activation in relevant (associated) brain areas (Nunn et al., 2002; Hubbard et al., 2005; Rouw et al., 2011). Furthermore, previous studies support a direct linking of relevant sensory regions in synesthesia (Hubbard and Ramachandran, 2001), mediated by an actual increase of anatomical connectivity (Rouw and Scholte, 2007; Zamm et al., 2013). Similarly, altered connections from a lesioned thalamus to the cerebral cortex (Ro et al., 2007; Beauchamp and Ro, 2008) led to a type of acquired synesthesia in which auditory stimuli produced tactile percepts. Differing in the level of specificity and complexity of evoked responses observed in synesthetes, individuals with misophonia display basic and non-elaborated responses to triggering stimuli, varying largely in the intensity of the response. Nevertheless, the underlying neurological cause of this condition may be similar to that of synesthesia in terms of enhanced connectivity between relevant brain regions. In short, a pathological distortion of connections between the auditory cortex and limbic structures could cause a form of sound-emotion synesthesia.

This study also provides the critical finding of a relationship between aversive stimuli in misophonia and mildly aversive stimuli in the general population. That is, in Experiment 2 we observed a significant correlation between aversive ratings across the groups, suggesting that misophonia may be based on mechanisms fundamentally present in the general population, but simply exaggerated in misophonia. Critically, as observed in the interviews in Experiment 1, many of the common aversive stimuli in misophonia are also deemed as socially inappropriate in western society (e.g., lip smacking, repetitive tapping, etc.). While speculative at present, this consistent pattern raises the possibility that the aversive nature of these stimuli to all individuals may be based on the same driving factors (though notably more mild) as in misophonia, leading to the development of these cultural norms.

The present paradigm was designed to include a range of aversive stimuli for misophonic individuals based on our preliminary interviews in Experiment 1. Accordingly, misophonic individuals reported a large number of the stimuli as aversive: mean 24.2% and median 24.7% stimuli with a rating of 3 or 4. In contrast, control participants reported very few stimuli as very aversive: mean 2.4% and median 0.0% stimuli with a rating of 3 or 4 (Figures 3A,B). Potential future studies are suggested to examine if this same pattern of group differences is consistent with stimuli that evoke a broader range of aversive responses in typically developed individuals.

As the current study is exploratory in nature and included a small sample of participants, there are several limitations to acknowledge. One limitation is that the presentation of stimuli in a controlled laboratory setting lacked the ecological validity of how these stimuli occur in the real world. As such, several misophonics reported that because they knew each clip would end in a matter of seconds, their physiological reactions were tempered, consistent with self-reports in Experiment 1 showing that contextual information about these cues mediated subjects' responses. We predict naturalistic observational studies of physiological reactions in misophonic individuals will show a similar but more extreme pattern of results to those observed here. A second limitation is that while SCR is a good measure of autonomic arousal in response to emotion-eliciting stimuli, it does not indicate what specific emotion is being experienced at the time. Instead it only indicates a very general, physiological arousal that can be interpreted in many ways. For example, SCR would not be able to differentiate anxiety and aggression. However, information as to what exactly a subject was feeling during each stimulus can potentially be inferred by obtained self-reports after each trial. A third limitation is the fact that no rigorous diagnostic tests or screenings were utilized during interviews to completely exclude the possibility that subjects' symptoms were being driven by another underlying condition. Also, interviews were conducted by members of our research group and not by psychiatrists. Potentially comorbid conditions were therefore determined from the self-reports of subjects (some of whom had previous, official diagnoses), and the discretion of the researchers. However, because these interviews were not conducted with the intent of being clinical or diagnostic in nature, but rather to gain more insight into the phenomenological experiences of individuals who identify with having misophonia, we believe these findings are still of considerable value to the research community and misophonic individuals alike. A fourth limitation of the study is the small sample size. As research on misophonia is limited to the last few years and little remains known about the condition, obtaining a large sample size for this study was not feasible. Nevertheless, while these results should be validated on a larger group of subjects, we believe they reflect properties of the condition generalizable to the misophonia community in general.

While these data serve to support the veracity of the subjective reports in misophonia as an intrusive and labile condition, numerous additional avenues remain for future research. Critically, as this condition appears to be chronic, the nature of how subjects' triggers evolve over time should be investigated. How does context contribute to and modulate misophonia and can contextual information or expectation effects bias subjects' responses to aversive stimuli? Critically, what are the mechanisms (genetic, neurological, and/or psychological) that underlie the condition? While speculative at present, one potential neural mechanism for misophonia may lie in aberrant anatomical or functional connections between auditory and limbic regions, akin to the finding of increased structural connectivity in synesthesia. Regardless of the mechanisms that underlie misophonia, the present research supports its validity as an intrusive condition and highlights the need for additional research into contributing factors and potential treatments.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors thank Laura Case for contributing to the experimental design, Dr. Julian Parris for assisting with data analysis, Ronald E. Robertson and Marli Gitelson for assisting in subject recruitment and data collection, and all of our subjects who kindly agreed to participate.

References

Cytowic, R. E. (1989). Synaesthesia: A Union of the Senses. New York, NY: Springer-Verlag.

Hubbard, E. M., and Ramachandran, V. S. (2001). Cross wiring and the neural basis of synaesthesia. Invest. Ophthalmol. Vis. Sci. 42, S712.

Jastreboff, M. M., and Jastreboff, P. J. (2001a). Components of decreased sound tolerance: hyperacusis, misophonia, phonophobia. ITHS News Lett. 2, 5–7.

Jastreboff, M. M., and Jastreboff, P. J. (2001b). Hyperacusis. Audiology Online. June 2001. Available online at: http://www.audiologyonline.com

Jastreboff, P. J. (2000). “Tinnitus habituation therapy (THT) and tinnitus retraining therapy (TRT),” in Tinnitus Handbook, ed R. S. Tyler (San Diego, CA: Singular, Thomson Learning), 357–376.

Jastreboff, P. J., and Hazell, J. (2004). Tinnitus Retraining Therapy: Implementing the Neurophysiological Model. New York, NY: Cambridge University Press. doi: 10.1017/CBO9780511544989 CrossRef Full Text