Operant Conditioning
“Reward it and it grows. Punish it and it hides — but never truly disappears.”
Interactive Demo
Press the lever and observe the reinforcement schedule.
Fixed ratio: You know exactly when the reward comes. Predictable but less addictive.
The Psychology
Operant conditioning is a method of learning where behavior is controlled by its consequences. Behaviors followed by favorable outcomes are strengthened (reinforced), while behaviors followed by unfavorable outcomes are weakened (punished). Unlike classical conditioning, which pairs stimuli, operant conditioning is about the relationship between voluntary behavior and its results.
B.F. Skinner formalized the theory in the 1930s and 1940s, building on Edward Thorndike's earlier "Law of Effect." Skinner designed the now-famous "Skinner Box" — an apparatus where an animal (typically a rat or pigeon) could press a lever to receive food, or learn to avoid pressing it to prevent a mild shock. Through meticulous experimentation, Skinner identified four key principles: positive reinforcement (adding something pleasant to increase behavior), negative reinforcement (removing something unpleasant to increase behavior), positive punishment (adding something unpleasant to decrease behavior), and negative punishment (removing something pleasant to decrease behavior).
One of Skinner's most influential discoveries was the power of variable reinforcement schedules. Behaviors reinforced on an unpredictable schedule — sometimes rewarded, sometimes not — become far more resistant to extinction than those reinforced every time. This principle explains why slot machines are so addictive (unpredictable rewards), why intermittent social media notifications keep us checking our phones, and why inconsistent parenting can actually strengthen unwanted behavior in children.
Real-World Examples
Social media platforms use variable-ratio reinforcement schedules — you never know when you'll get a like, comment, or viral post, so you keep checking. Gamification in apps (streaks, badges, points) is pure operant conditioning. In parenting, praising specific behaviors ('Great job sharing your toys!') rather than general traits ('You're a good kid') is more effective because it reinforces the exact behavior you want to see repeated.
Based on B.F. Skinner's research (1938): The Behavior of Organisms: An Experimental Analysis