Glossary

Intermittent reinforcement.

Intermittent reinforcement is the conditioning schedule in which rewards are delivered unpredictably rather than every time a behavior occurs. It is among the most behaviorally addictive schedules known to learning theory. It is also the schedule that narcissistic-abuse cycles reliably implement — and is the neurochemical foundation underneath the term trauma bond.

0:00 / 0:00

Audio readout.

Definition

The framework comes from B. F. Skinner's operant-conditioning research in the mid-twentieth century. Skinner studied the effects of different reward schedules: continuous (every behavior rewarded), fixed-ratio (every nth behavior rewarded), variable-ratio (rewarded on a random schedule), and variations on time-based versions of each. The robust finding across decades of subsequent replication is that variable-ratio reinforcement produces the highest rates of behavior, the longest persistence after rewards stop, and the most extinction resistance — which is to say, it produces behavior that is very hard to stop.

This is the schedule slot machines run on. It is the schedule that makes gambling addictive in a way that lottery tickets (low-frequency rewards) are not. It is also, by accident or by design, the schedule that pathological narcissistic relationships implement.

How the narcissistic cycle implements it

The relationship oscillates unpredictably between idealization (high reward — attention, warmth, intimacy) and devaluation (no reward, or worse — withdrawal, criticism, contempt). The survivor cannot predict which version will appear at a given moment, but knows from experience that the idealization phase does sometimes return. The result is a learned behavior pattern of working harder to elicit the rewarding behavior, which is exactly the behavior the schedule selects for.

This is why “just leave” is not the helpful instruction it sounds like. Leaving a narcissistic relationship is closer in difficulty profile to leaving a substance addiction than to ending an ordinary unhappy partnership, and the literature on what helps comes from the addiction field as much as from the abuse-recovery one.

What helps

The only schedule that breaks the conditioning is one where the variable rewards stop appearing. No contact is, in operant terms, the extinction protocol — the survivor's behavior is no longer being reinforced, even occasionally, and the conditioned response eventually weakens. Every contact, even one, resets the extinction clock. This is the neurobiological reason that the first weeks of no contact are the hardest and the months that follow get progressively easier — the conditioning is being unwound.

← Glossary