Want to create or adapt books like this? Learn more about how Pressbooks supports open publishing practices.
Skinner’s Operant Conditioning
Traci Geiser
Learning Objectives:
1-Identify specific activities that instructional designers may use that incorporate Operant Conditioning.
2-Identify and differentiate between examples of the four types of reinforcement and punishment in operant conditioning.
Introduction to the Learning Theory
Operant conditioning, rooted in the behaviorist movement, emphasizes how consequences influence behavior, making it more or less likely to occur in the future. Developed by B.F. Skinner (1904–1990), widely regarded as the father of operant conditioning, the theory asserts that behavior is entirely shaped by the environment. Skinner focused solely on observable actions, disregarding internal mental processes. According to his perspective, learning is driven by rewards and punishments, which reinforce or discourage specific behaviors.
Influential in the development of Skinner’s operant conditioning, Edward Lee Thorndike (1874-1949) proposed the law of effect (Thorndike, 1905). The law states that if our behavior produces a favorable consequence, we will be more likely to make the response again in the future with the same favorable behavior because we expect the same favorable consequence. Likewise, if our actions lead to dissatisfaction, we will not repeat the same behavior in the future. While Thorndike’s Law of Effect was a foundation for operant conditioning and explains the basic tendency of behavior to be shaped by consequences, Skinner’s operant conditioning expanded and refined it into a comprehensive learning theory that includes reinforcement and punishment schedules.
Operant conditioning is a foundational learning theory that has had a major impact on both education and psychology by explaining how behavior is learned, maintained, and modified through reinforcement and punishment. It provides a scientific, observable framework for understanding learning and behavior change, making it highly applicable in real-world settings. Operant conditioning is essential in both education and psychology because it provides a structured, evidence-based approach to modifying behavior, improving learning outcomes, and supporting psychological interventions. It explains how behaviors are acquired, maintained, and changed, making it a key framework for teaching strategies, behavior therapy, and developmental psychology.
Origins of the Learning Theory
Operant conditioning emerged within the behaviorist movement, which dominated psychology in the early 1900s. This was a reaction against introspective methods and psychoanalysis, focusing instead on observable behavior. Psychologist B. F. Skinner saw that classical conditioning is limited to existing behaviors that are reflexively elicited, and it doesn’t account for new behaviors such as learning to ride a bike. He proposed a theory about how such behaviors come about. Skinner believed that behavior is motivated by the consequences we receive for the behavior: the reinforcements and punishments. His idea that learning is the result of consequences is based on the law of effect, which was first proposed by psychologist Edward Thorndike after his puzzle box experiments. Thorndike placed cats inside specially designed boxes with a simple escape mechanism (e.g., a lever, rope, or button that opened the door). A reward (food) was placed outside the box to motivate the cat to escape. The cat had to figure out how to manipulate the mechanism to open the door and get to the food. According to the law of effect, behaviors that are followed by satisfying consequences are more likely to be repeated, and behaviors followed by unpleasant consequences are less likely to be repeated (Thorndike, 1911).
Working with Thorndike’s Law of Effect as his foundation, Skinner began conducting scientific experiments on animals (mainly rats and pigeons) to determine how organisms learn through operant conditioning (Skinner, 1938). He placed these animals inside an operant conditioning chamber, which has come to be known as a “Skinner box”. A Skinner box contains a lever (for rats) or disk (for pigeons) that the animal can press or peck for a food reward via the dispenser. Speakers and lights can be associated with certain behaviors. A recorder counts the number of responses made by the animal.
Diagram of a Skinner Box, with elements like a mouse, lever, air vent, stimulus light, food dispenser, speaker, and observation window labeled.
From his Skinner box experiments, Skinner learned how behavior is influenced by its consequences. Behavior that is reinforced tends to be repeated (i.e., strengthened); behavior that is not reinforced tends to die out or be extinguished (i.e., weakened).
Operant conditioning aligns with or differs from other prominent learning theories in several ways, particularly in how it explains behavioral change, cognition, and social influences in learning. Operant conditioning is a powerful tool for understanding learning and behavior modification, but it is best used in combination with other theories. It excels at shaping behaviors and reinforcing learning, but cognitive, social, and constructivist approaches provide deeper insights into mental processes, motivation, and personal engagement in learning.
Fundamental Tenets of the Theory
The four consequences in operant conditioning include positive reinforcement, negative reinforcement, positive punishment, and negative punishment.
Reinforcement, which increases behavior, is divided into Positive Reinforcement and Negative Reinforcement .
Definitions and examples of how Skinner box experiments helped identify these consequences are found below. A video link with an example of each is included for clarification.
Positive Reinforcement is adding something pleasant to increase the likelihood of a behavior. In the Skinner box, a rat is placed inside and learns that when it presses a lever, a food pellet is dispensed. The food pellet is added after the rat presses the lever. Since the rat enjoys the food, it is likely to repeat the behavior (pressing the lever) in the future to receive more food. This is positive reinforcement because something pleasant (the food) is added to encourage the rat to continue pressing the lever.
Negative Reinforcement is removing something unpleasant to increase the likelihood of a behavior. For example, a rat inside the Skinner box had a mild electric shock continuously applied until the rat discovers that if it presses the lever, the shock stops. The removal of the unpleasant shock when the lever is pressed acts as negative reinforcement because the rat is more likely to press the lever in the future to escape or avoid the unpleasant stimulus (the shock). Here, something unpleasant (the shock) is removed to increase the desired behavior (lever pressing).
Punishment is the opposite of reinforcement since it is designed to weaken or eliminate a response rather than increase it. It is an aversive event that decreases the behavior that it follows. It is divided into positive punishment and negative punishment.
Positive punishment involves adding something unpleasant to decrease a behavior. For example, when the rat presses the lever, a mild electric shock is administered. This is an example of positive punishment because something unpleasant (the shock) is added following the behavior of pressing the lever. The shock serves to decrease the likelihood that the rat will press the lever in the future.
Negative punishment involves removing something pleasant to decrease a behavior. n Skinner’s box, a rat might be trained to press a lever to receive a food pellet. However, if the rat presses the lever and then an unpleasant noise (like a loud sound) is introduced, the food reward is removed. This would be an example of negative punishment because the unpleasant stimulus (the sound) is taking away the food, which the rat values, in response to its behavior (lever pressing). The goal is to decrease the behavior of lever pressing by removing something desirable (food).
In short, these four types of operant conditioning are different ways of using consequences (either reinforcement or punishment) to either increase or decrease a behavior.
Note: It is not always easy to distinguish between positive punishment and negative reinforcement. Below you will find explanation of the differences (with examples).
Positive Punishment VS Negative Reinforcement
1. Positive Punishment:
Positive punishment involves adding an unpleasant stimulus to decrease the likelihood of a behavior happening again.
Example: A teacher gives a student extra homework (unpleasant stimulus) because the student was talking during class. The goal is to decrease the student’s talking behavior in the future.
Why it’s positive punishment: The teacher is adding something unpleasant (extra homework) to discourage the behavior (talking in class).
2. Negative Reinforcement:
Negative reinforcement involves removing an unpleasant stimulus to increase the likelihood of a behavior happening again.
Example: A person wears sunscreen to avoid getting sunburned. Every time they apply sunscreen, they are removing the potential for the unpleasant experience of sunburn.
Why it’s negative reinforcement: The person is removing the unpleasant stimulus (the possibility of getting sunburned) by performing the behavior (wearing sunscreen), which increases the likelihood that they will continue to apply sunscreen in the future.
The key idea is that the behavior (wearing sunscreen) is reinforced because it helps avoid something unpleasant (sunburn).
Summary of the Key Difference:
Positive Punishment: Adding something unpleasant to reduce behavior.
Negative Reinforcement: Removing something unpleasant to encourage behavior.
Strengths and Limitations of the Theory
Operant conditioning has several strengths, particularly its practical applications in education, behavior therapy, and training. It provides a clear, observable framework for shaping behavior through reinforcement and punishment, making it highly effective for behavior modification programs, classroom management, and habit formation.
However, the theory has limitations, including its overemphasis on external reinforcement while neglecting internal cognitive processes like thinking, motivation, and problem-solving. Critics argue that intrinsic motivation may be undermined by excessive reliance on rewards, and the theory does not fully account for social or emotional influences on learning, as emphasized in cognitive and social learning theories. Despite these limitations, operant conditioning remains a valuable tool in understanding and shaping human behavior.
Instructional Design Implications
Operant conditioning is used in instructional design as it helps us understand how rewards, consequences, and feedback shape behavior. This is a critical skill in instructional design. By providing instant feedback, using gamification to make learning more interactive and fun, and breaking tasks down into smaller steps, students are more motivated to learn. The presentation below gives an overview of operant conditioning and how it is used in instructional design.
Below are examples of the ways each of the four tenets of operant conditioning can be used in instructional design:
Positive Reinforcement Activities
Rewarding desired behavior
Gamified Learning with Points & Badges – Learners earn points, badges, or certificates for completing training modules, engaging in discussions, or achieving high quiz scores.
Immediate Feedback in E-Learning – Correct answers trigger positive reinforcement, such as praise (“Great job!”) or virtual rewards.
Leaderboards & Incentives – Encouraging healthy competition through rankings and tangible rewards for top performers.
Token Economy in Training – Employees earn tokens, which can be exchanged for privileges or prizes, for consistently applying learned skills in real-life scenarios.
Negative Reinforcement Activities-
Removing an aversive stimulus when desired behavior occurs
Adaptive Learning Paths – Learners who score well on pre-assessments can skip basic content and move directly to advanced material, removing unnecessary repetition.
Time Reduction for Mastery – Learners demonstrating proficiency in a skill can opt out of additional practice exercises or receive early course completion.
Minimized Error Messages in Software Training – When learners correctly perform tasks, they avoid error pop-ups or additional corrective steps.
Positive Punishment Activities
Adding an aversive stimulus to reduce undesired behavior
Error Messages & Corrective Feedback – When learners answer incorrectly, they receive an error message with an explanation, prompting them to rethink their approach.
Scenario-Based Consequences – In workplace ethics training, learners who make poor choices in a scenario may see virtual consequences, such as a simulated reprimand from a manager.
Penalties in Learning Games – Losing points or virtual currency for incorrect responses in a simulation or game.
Negative Punishment Activities
Removing a reward to reduce undesired behavior
Loss of Access to Perks – In a corporate training setting, employees who do not complete mandatory compliance training on time lose access to certain work tools or privileges.
Score Deduction for Incorrect Answers – Learners lose points for guessing too many times instead of attempting to apply knowledge.
Restricted Access to Higher-Level Content – If learners fail to demonstrate mastery in prerequisite modules, they are temporarily restricted from progressing to advanced lessons.
Conclusion
Learning extends beyond content; it is deeply connected to engagement and motivation. Operant conditioning provides valuable insight into how behaviors are shaped through reinforcement and punishment, making it a key principle in instructional design. By applying its strategies, we can create engaging and impactful learning experiences that enhance retention and drive participation. Ultimately, operant conditioning enables the development of effective training programs that sustain learner motivation and improve outcomes.