B.F. Skinner and Operant Conditioning



B.F. Skinner

By the 1920s Watson had left academic psychology and other behaviourists were becoming influential, proposing new forms of learning other than classical conditioning. Perhaps the most important of these was Burrhus Frederic Skinner.

Skinner's views were slightly less extreme than those of Watson. Skinner believed that we do have such a thing as a mind, but that it is simply more productive to study observable behaviour rather than internal mental events.

Skinner believed that the best way to understand behaviour is to look at the causes of an action and its consequences. He called this approach operant conditioning.


Thorndike's Law of Effect

Whereas classical conditioning depends on developing associations between events, operant conditioning involves learning from the consequences of our behaviour. Skinner wasn’t the first psychologist to study learning by consequences. Thorndike (1898) studied learning in cats. He placed cats in a puzzle box, just out of reach of which was a scrap of fish. Thorndike would put a cat into the box and time how long it took to escape. The cats experimented with different ways to escape the puzzle box and reach the fish. Eventually they would stumble upon the lever which opened the cage. When it had escaped it was put in again, and once more the time it took to escape was noted. In successive trials the cats would learn that pressing the lever would have favourable consequences and they would adopt this behaviour, becoming increasingly quick at pressing the lever (see Fig 1). Thorndike put forward a “Law of effect” which stated that any behaviour that is followed by pleasant consequences is likely to be repeated, and any behaviour followed by unpleasant consequences is likely to be stopped.

Thorndike Puzzle Box

Fig 1: Simplified graph of the result of the puzzle box experiment.


B.F. Skinner: Operant Conditioning

Skinner is regarded as the father of Operant Conditioning, but his work was based on Thorndike’s law of effect. Skinner introduced a new term into the Law of Effect - Reinforcement. Behaviour which is reinforced tends to be repeated (i.e. strengthened); behaviour which is not reinforced tends to die out-or be extinguished (i.e. weakened).

Skinner coined the term operant conditioning; it means roughly changing of behaviour by the use of reinforcement which is given after the desired response. Skinner identified three types of responses or operant that can follow behaviour.

Skinner coined the term operant conditioning; it means roughly changing of behaviour by the use of reinforcement which is given after the desired response. Skinner identified three types of responses or operant that can follow behaviour.

Neutral operants: responses from the environment that neither increase nor decrease the probability of a behaviour being repeated.

Reinforcers: Responses from the environment that increase the probability of a behaviour being repeated. Reinforcers can be either positive or negative.

Punishers: Response from the environment that decrease the likelihood of a behaviour being repeated. Punishment weakens behaviour.

We can all think of examples of how our own behaviour has been affected by reinforcers and punishers. As a child you probably tried out a number of behaviours and learnt from their consequences. For example, if when you were younger you tried smoking at school, and the chief consequence was that you got in with the crowd you always wanted to hang out with, you would have been positively reinforced (i.e. rewarded) and would be likely to repeat the behaviour. If, however, the main consequence was that you were caught, caned, suspended from school and your parents became involved you would most certainly have been punished, and you would consequently be much less likely to smoke now.

Skinner studied operant conditioning by conducting experiments using animals which he placed in a “Skinner Box” (see fig 2) which was similar to Thorndike’s puzzle box.

Skinner BoxB.F. Skinner

Fig 2: A Skinner Box.

Reinforcement (strengthen behaviour)

Positive reinforcement strengthens a behaviour by providing a consequence an individual finds rewarding. For example, if your teacher gives you £5 each time you complete your homework (i.e. a reward) you are more likely to repeat this behaviour in the future, thus strengthening the behaviour of completing your homework.

Skinner showed how positive reinforcement worked by placing a hungry rat in his Skinner box. The box contained a lever in the side and as the rat moved about the box it would accidentally knock the lever. Immediately it did so a food pellet would drop into a container next to the lever. The rats quickly learned to go straight to the lever after a few times of being put in the box. The consequence of receiving food if they pressed the lever ensured that they would repeat the action again and again.

Skinner Box Video Clip

The removal of an unpleasant reinforcer can also strengthen behaviour. This is known as Negative Reinforcement because it is the removal of an adverse stimulus which is ‘rewarding’ to the animal. Negative reinforcement strengthens behaviour because it stops or removes an unpleasant experience. For example, if you do not complete your homework you give your teacher £5. You will complete your homework to avoid paying £5, thus strengthening the behaviour of completing your homework.

Skinner showed how negative reinforcement worked by placing a rat in his Skinner box and then subjecting it to an unpleasant electric current which caused it some discomfort. As the rat moved about the box it would accidentally knock the lever. Immediately it did so the electric current would be switched off. The rats quickly learned to go straight to the lever after a few times of being put in the box. The consequence of escaping the electric current ensured that they would repeat the action again and again.

In fact Skinner even taught the rats to avoid the electric current by turning on a light just before the electric current came on. The rats soon learned to press the lever when the light came on because they knew that this would stop the electric current being switched on.

These two learned responses are known as Escape Learning and Avoidance Learning.

Punishment (weakens behaviour)

Punishment is defined as the opposite of reinforcement since it is designed to weaken or eliminate a response rather than increase it. Like reinforcement, punishment can work either by directly applying an unpleasant stimulus like a shock after a response or by removing a potentially rewarding stimulus, for instance, deducting someone’s pocket money to punish undesirable behaviour.

Note: It is not always easy to distinguish between punishment and negative reinforcement.


Operant Conditioning Summary

Looking at Skinner's classic studies on pigeons’ behaviour we can identify some of the major assumptions of behaviourists approach.

• Psychology should be seen as a science, to be studied in a scientific manner. Skinner's study of behaviour in rats was conducted under carefully controlled laboratory conditions.

• Behaviourism is primarily concerned with observable behaviour, as opposed to internal events like thinking and emotion. Note that Skinner did not say that the rats learnt to press a lever because they wanted food. He instead concentrated on describing the easily observed behaviour that the rats acquired.

• The major influence on human behaviour is learning from our environment. In the Skinner study, because food followed a particular behaviour the rats learned to repeat that behaviour, e.g. classical and operant conditioning.

• There is little difference between the learning that takes place in humans and that in other animals. Therefore research (e.g. classical conditioning) can be carried out on animals (Pavlov’s dogs) as well as on humans (Little Albert). Skinner proposed that the way humans learn behaviour is much the same as the way the rats learned to press a lever.

So, if your layperson's idea of psychology has always been of people in laboratories wearing white coats and watching hapless rats try to negotiate mazes in order to get to their dinner, then you are probably thinking of behavioural psychology.

Behaviourism and its offshoots tend to be among the most scientific of the psychological perspectives. The emphasis of behavioural psychology is on how we learn to behave in certain ways. We are all constantly learning new behaviours and how to modify our existing behaviour. Behavioural psychology is the psychological approach that focuses on how this learning takes place.


Operant Conditioning Downloads

Listen to a MIT undergraduate lecture on Conditioning Behaviourism Audio Broadcast

Operant Conditioning

Learning and Behaviour

Behaviourism | Classical Conditioning | Behaviour Therapies | Behaviourism Crossword | Behaviourism Essay


Back to Homepage