Wednesday, 06 January 2010 19:53

Operant Conditioning

Written by Keiron Walsh
Rate this item
(0 votes)
Operant Conditioning is a process of learning whereby the likelihood of a particular behaviour being repeated is determined by the consequences of previous occurences of the behaviour. Initially the behaviour occurs by chance and is followed by either reinforcements, punishments or neutral consequences. Reinforcements increase the frequency of the behaviour being repeated, punishments decrease the frequency and neutral consequences have no effect.

Edward Thorndike (1898) was one of the first researchers to study what was later to be called "operant conditioning". Thorndike put hungry cats in puzzle boxes and put pieces of tasty fish outside them to motivate them to escape. To escape, the cats had to push a lever and pull on a string. When first put in the box, the cats would behave in a fairly random way, pacing up and down, scratching at the sides etc., but eventually, by chance, they would push the lever and pull on the string in the correct sequence. They would then be rewarded by being able to eat the fish.

Thorndike found that the cats would become faster and faster at escaping, while never showing any insight into how the puzzle box was solved. There was no sudden drop in escape time, which would be expected if the cat had solved the puzzle mentally or realised how its actions led to escape; this provides support for the behaviourist view that only behaviours are important and not cognitions: the escape behaviour was simply strengthened by the reward of a piece of fish. From this Thorndike developed his 'law of effect', that behaviours that have a pleasant outcome are stamped in and those that have unpleasant consequences are stamped out.

Click here for a video of Thorndike's experiment

Burrhus F. Skinner (1938) Elaborated on Thorndike’s “law of effect”. He criticised Thorndike’s use of “Stamps out” and “Stamps in” as being “mentalist”. He said the terms “Strengthen” and “Weaken” should be used instead: Responses that have positive outcomes are strengthened and responses that have negative outcomes are weakened.

Skinner was very critical of "mentalism". Mentalism is when inferences are made about what an organism is thinking. Skinner, like other behaviourists such as John B. Watson believed that since mental events, such as thoughts cannot be observed directly they should not be studied. Instead, according to Skinner, psychologists should study only antecedents of behaviour, the behaviours themselves and the consequences of the behaviour. This is the ABC of operant conditioning.

The Skinner Box

Skinner invented the “Skinner Box” as an alternative to Thorndike’s puzzle box. One reason for this was because it allowed tighter control of extraneous variables by automating the experimental process (e.g., rewards, such as food pellets or water could be dispensed and punishments (electric shocks) could be given. It also recorded the animals’ responses eliminating any experimenter bias. In addition, it allowed Rats or Pigeons to do things instead of just escaping; for example, rats could pull a lever or pigeons could peck a disc.

Reinforcement and Punishment

In Operant Conditioning, reinforcements increase the likelihood of behaviour being repeated. Punishments decrease the likelihood of it being repeated. Whether an outcome is a reinforcement, punishment or neither can only be judged after observing whether the frequency of the behaviour increases, decreases or does not change.


Positive and Negative Reinforcements and Punishments

Reinforcements and punishments can be either positive or negative (this is often a source of confusion). Positive and negative do not refer to whether the outcome is good or bad, but to whether the outcome involves presenting something (positive) or taking something away (negative).

Positive reinforcements and punishments produce their effects by being presented

Negative reinforcements and punishments produce their effects by being removed


Here are some examples:

Outcome Type of Outcome
Natalie’s parents take her TV out of her room Negative Punishment
Seka’s parents give her £1,000 Positive Reinforcement
Ayesha’s parents tell her off Positive Punishment
When the rat presses the lever, Skinner turns off the electric shock. Negative Reinforcement

Note: The outcomes in the table are only correct if they have the expected effect on the behaviour.


Reinforcement Punishment
Positive Leads to increase in target behaviour when presented Leads to decrease in target behaviour when presented
Negative Leads to increase in target behaviour when removed Leads to decrease in target behaviour when removed

Click here for classical conditioning

Last modified on Thursday, 07 January 2010 11:31

1 Comment

  • Comment Link email extractor Thursday, 29 July 2010 16:20 posted by email extractor

    Operant conditioning is the use of consequences to modify the occurrence and form of behavior. Operant conditioning is distinguished from classical conditioning (also called respondent conditioning, or Pavlovian conditioning) in that operant conditioning deals with the modification of "voluntary behavior" or operant behavior. Operant behavior "operates" on the environment and is maintained by its consequences, while classical conditioning deals with the conditioning of respondent behaviors which are elicited by antecedent conditions. Behaviors conditioned via a classical conditioning procedure are not maintained by consequences.

    This e-mail address is being protected from spambots. You need JavaScript enabled to view it

Add comment


Keiron Walsh

Keiron Walsh

If you have any tips, suggestions or would like to contribute to the site, email me at this address.

Website: alevelpsychology.co.uk E-mail: This e-mail address is being protected from spambots. You need JavaScript enabled to view it
More in this category: Classical Conditioning »