Turn off the music in a horror game and two things happen. One, you stop being scared. Two, you get better at it. The link between those two facts is worth examining.

Sound in games is usually discussed in terms of immersion — the degree to which audio pulls you into the experience. That framing is accurate but incomplete. Audio also directly affects performance: how fast you react, how accurately you predict what's about to happen, how stressful a given moment feels relative to its mechanical demands. Sound design is, in a precise sense, a difficulty system.

The arousal-performance curve

There's an established relationship between arousal — broadly, physiological and psychological activation — and performance on complex tasks. Very low arousal produces poor performance; the person isn't engaged enough. Very high arousal also produces poor performance; they're too activated, tunnel vision sets in, precision degrades.

Music and sound effects are one of the more direct tools available for shifting player arousal. An escalating percussion track increases heart rate and breathing rate. High-pitched, dissonant sound design creates tension that persists even when no direct threat is visible. The player's body responds to the audio environment in ways that aren't fully under conscious control.

This means a designer can take a mechanically moderate challenge and make it feel genuinely hard by using the audio to push the player toward the over-aroused end of the curve. And conversely, a mechanically demanding sequence paired with calm, quiet audio can feel easier than it is — not because the player is performing better, but because their system is running cooler.

How audio cues function as information

Separate from arousal, audio carries information. Enemy footsteps tell you direction and proximity. An ambient shift tells you the encounter is about to escalate. A specific sound associated with a specific attack tells you to dodge now rather than react to the visual.

When that information is accurate and consistent, players learn to rely on it and their performance improves — not because the game got easier, but because they're using an additional channel of input. When the information is inconsistent or unreliable, players stop attending to it. The result looks like slower reaction times and worse anticipation, because players are operating from visual information alone when they could be operating from audio plus visual.

This is one reason why the same encounter can feel dramatically different across different audio setups. Headphones with accurate directional audio don't just make a game more immersive — they give the player information that genuinely changes the challenge. Playing the same game on TV speakers in a noisy room, the spatial information is degraded and the player is effectively playing a harder version without any indicator that this is happening.

The case of silent playthroughs

Players who mute game audio and replace it with their own music sometimes report two things: reduced immersion and improved performance. This isn't contradictory — it makes precise sense under the arousal model. Their music maintains engagement without pushing arousal into the degraded-performance range. They've removed an external arousal driver and replaced it with one they can control.

It also removes the information channel, which is worth noting. The performance improvement suggests that for some players, in some game types, the arousal cost of the original audio exceeds the information value. The net effect of high-tension audio design is actually negative performance. Calmer music means they perform better on net, even with less signal.

Designing audio difficulty intentionally

Most accessibility menus have an audio section. It almost always contains volume sliders — music, sound effects, speech — and sometimes subtitle options. Almost none of them address the arousal function of audio design.

A small number of games have experimented with what you might call audio tension scaling: separate controls for how aggressively the music responds to threat states, or the option to use a less dissonant ambient track without removing the cue system entirely. This is meaningful design — it treats audio difficulty as a real variable rather than an afterthought.

The missing insight in most accessibility thinking about audio is that turning down volume and adjusting arousal intensity are different interventions with different effects. Lower music volume doesn't reduce the physiological response to dissonant composition; it just makes you less aware of the source. A genuinely lower-tension audio option requires different source material — a different emotional register, not just a different level.

For designers who care about what their game actually asks of players, audio is underexplored territory. It's one of the few design elements that affects performance independently of mechanical skill, which makes it both powerful and worth treating carefully.