SKINNER'S OPERANT CONDITIONING (INSTRUMENTAL CONDITIONING)
SKINNER'S OPERANT CONDITIONING (INSTRUMENTAL CONDITIONING)
B. F Skinner is regarded as the father of Operant Conditioning, but his work was based on Thorndike's (1898) law of effect. According to this principle, behavior that is followed by pleasant consequences is likely to be repeated, and behavior followed by unpleasant consequences is less likely to be repeated.
OPERANT CONDITIONING - is sometimes referred to as instrumental conditioning. It is a method of learning that employs rewards and punishments for behavior. Through operant conditioning, an association is made between a behavior and a consequence (whether negative or positive) for that behavior.
Skinner identified three types of responses, or operant, that can follow behavior.
• Neutral operant: responses from the environment that neither increase nor decrease the probability of a behavior being repeated.
• Reinforcers: Responses from the environment that increase the probability of a behavior being repeated. Reinforcers can be either positive or negative.
• Punishers: Responses from the environment that decrease the likelihood of a behavior being repeated. Punishment weakens behavior.
Operant Conditioning is not just something that takes place in experimental settings while training lab animals. It also plays a powerful role in everyday learning. Reinforcement and punishment take place in natural settings all the time, as well as in more structured settings such as classrooms or therapy sessions.
COMPONENTS OF OPERANT CONDITIONING
Reinforcement in Operant Conditioning - Reinforcement is any event that strengthens or increases the behavior it follows. There are two kinds of reinforcers. In both of these cases of reinforcement, the behavior increases.
Positive Reinforcers - are favorable events or outcomes that are presented after the behavior. In positive reinforcement situations, a response or behavior is strengthened by the addition of praise or a direct reward. If you do a good job at work and your manager gives you a bonus, that bonus is a positive reinforcer.
Negative Reinforcers - involve the removal of an unfavorable events or outcomes after the display of a behavior. In these situations, a response is strengthened by the removal of something considered unpleasant. For example, if your child starts to scream in the middle of a restaurant, but stops once you hand them a treat, your action led to the removal of the unpleasant condition, negatively reinforcing your behavior (not your child's).
Punishment in Operant Conditioning - Punishment is the presentation of an adverse event or outcome that causes a decrease in the behavior it follows.
Positive Punishment - sometimes referred to as punishment by application, presents an unfavorable event or outcome in order to weaken the response it follows. Spanking for misbehavior is an example of punishment by application.
Negative Punishment - also known as punishment by removal, occurs when a favorable event or outcome is removed after a behavior occurs. Taking away a child's video game following misbehavior is an example of negative punishment.
Reinforcement Schedules
Reinforcement is not necessarily a straightforward process, and there are a number of factors that can influence how quickly and how well new things are learned. Skinner found that when and how often behaviors were reinforced played a role in the speed and strength of acquisition. In other words, the timing and frequency of reinforcement influenced how new behaviors were learned and how old behaviors were modified.
Skinner identified several different schedules of reinforcement that impact the operant conditioning process:
1. Fixed-ratio schedules - are a type of partial reinforcement. Responses are reinforced only after a specific number of responses have occurred. This typically leads to a fairly steady response rate.
2. Fixed-interval schedules - are another form of partial reinforcement. Reinforcement occurs only after a certain interval of time has elapsed.
3. Variable-ratio schedules - are also a type of partial reinforcement that involve reinforcing behavior after a varied number of responses. This leads to both a high response rate and slow extinction rates.
4. Variable-interval schedules - are the final form of partial reinforcement Skinner described. This schedule involves delivering reinforcement after a variable amount of time has elapsed. This also tends to lead to a fast response rate and slow extinction rate.