trong>Questions –When the Referee Sees Red…
1. Which data are more variable – the age of the referees or their years of experience?How do you know?
2.What is the IV in this study?What is the DV?
3.In this study, why is a dependent t-test more appropriate than an independent t-test to analyze the data?Make sure your answer is specific with respect to what is being compared.
4.Was the number of points awarded to the competitor wearing red protective gear significantly higher than those awarded to the competitor wearing blue protective gear?How do you know, statistically speaking?
5.Was the number of points awarded to a blue competitor who was digitally transformed into a red competitor significantly higher?How do you know, statistically speaking?
6.For which comparison was the effect size the largest?How would the value be interpreted?
7.What one change or addition would you make to this study? (can’t say “none”!)
PS YC HOLOGICA L SC IENCE
Short Report
When the Referee Sees Red . . .
Norbert Hagemann, Bernd Strauss, and Jan Leiing
Westfälische Wilhelms-Universität Münster
Hill and Barton (2005) showed that wearing red sports attire has
a positive impact on one’s outcome in a combat sport (e.g., tae
kwon do or wrestling). They suggested that this effect is due to an
evolutionary or cultural association of the color red with dominance and aggression, proposing that this association triggers
a psychological effect in an athlete who wears red (or in his or
her opponent; e.g., Cuthill, Hunt, Cleary, & Clark, 1997;
Milinski & Bakker, 1990; Setchell & Wickings, 2005). Rowe,
Harris, and Roberts (2005) criticized this argument and instead
attributed the bias evident in these and other data (judo) to
differences in opponents’ visibility.
We disagree with both interpretations (see also Barton & Hill,
2005), arguing that this phenomenon is actually due to a perceptual bias in the referee. That is, we propose that the perception of colors triggers a psychological effect in referees that
can lead to bias in evaluating identical performances. Referees
and umpires exert a major influence on the outcome of sports
competitions (Plessner & Haar, 2006). Athletes frequently make
very rapid movements, and referees have to view sports competitions from a very disadvantageous perspective, so it is extremely difficult for them to make objective judgments (Oudejans et al., 2000). As a result, their judgments may show biases
like those found in other social judgments (Frank & Gilovich,
1988; Plessner & Haar, 2006; Ste-Marie & Valiquette, 1996).
Therefore, we believe that it is the referees who are the actual
cause of the advantage competitors have when they wear red.
Because the effect of red clothing on performance and on the
decisions of referees may well have been confounded in the
original data, we conducted a new experiment and found that
referees assign more points to tae kwon do competitors dressed
in red than to those dressed in blue, even when the performance
of the competitors is identical.
referee 5 8.02 years, SD 5 6.27) individually watched videotaped excerpts from sparring rounds of five different male
competitors of similar abilities. Each of two blocks contained 11
clips, with an average length of 4.4 s. The video images measured 1,024 768 pixels and were displayed on a notebook
computer with a 15.4-in. screen.
In each video, one competitor was wearing red protective gear,
and the other was wearing blue protective gear. (Underneath this
gear, each competitor wore a white tae kwon do uniform.) The two
blocks contained the same clips, but with the colors of the competitors reversed. We reversed the colors using digital graphics,
animation, and image-compositing software (Adobe After Effects
7.0).
After viewing each clip, participants indicated how many
points they would award the red and the blue competitors.
Following the rules of the World Taekwondo Federation,1 participants awarded points when permitted techniques were used
to deliver attacks to the legal scoring areas of the body: Specifically, 1 point was awarded for an attack to the trunk protector
(fist and foot techniques), and 2 points were awarded for an attack to the face (only attacks by foot technique are permitted).
Additional points could be awarded if a contestant knocked
down his opponent. Prohibited acts could be counted as a deduction of 1 point.
The video clips were presented in random order within each
block, and the order of the blocks was counterbalanced across
participants. For each referee, we calculated the total number of
points for the red and blue competitors, and these values were
subjected to separate dependent t tests. We used Cohen’s d as
our measure of effect size. We expected that changing the color
of the protective gear from blue to red would lead to an increase
in points awarded, whereas changing the color from red to blue
would have the opposite effect.
METHOD
We investigated the effect of the color of the protective gear
(trunk and head protectors) in tae kwon do on the decisions of
referees. A total of 42 experienced referees (13 female, 29 male;
mean age 5 29.31 years, SD 5 10.56; mean experience as a
Address correspondence to Norbert Hagemann, Department of Sport
Psychology, University of Münster, Horstmarer Landweg 62b, 48149
Münster, Germany, e-mail: nhageman@uni-muenster.de.
Volume 19—Number 8
RESULTS AND DISCUSSION
Figure 1 shows the effect of the color of the protective gear. The
competitor wearing red protective gear was awarded an average
1
In a regular competition, the match score is the sum of points in three 2-min
rounds. Unless there is a knockout, withdrawal, or disqualification, the winner
is usually determined by points: The winner has the higher final score, exceeds
the opponent’s score by 7 points, or reaches the maximum of 12 points.
Copyright r 2008 Association for Psychological Science
769
Seeing Red
10
9
8
Mean Points
7
6
5
4
3
Competitors A Competitors B
(red)
(blue)
Competitors A Competitors B
(blue)
(red)
Original
Color-Reversed
2
1
0
Version
Fig. 1. Mean number of points awarded to tae kwon do competitors in the original and color-reversed versions of the video
clips. Each clip depicted a sparring round with one competitor dressed in red and the other dressed in blue. Competitors A
wore red in the original clips and blue in the color-reversed clips, and Competitors B wore blue in the original clips and red
in the color-reversed clips. Error bars indicate standard errors.
of 13% (0.94 points) more points than the competitor wearing
blue protective gear, t(41) 5 2.85, p < .01, d 5 0.35. The
number of points awarded increased for a blue competitor who
was digitally transformed into a red competitor, t(41) 5 2.45,
p < .01 (one-tailed), d 5 0.36, and decreased for a red competitor who was digitally transformed into a blue competitor,
t(41) 5 1.66, p < .05 (one-tailed), d 5 0.25. The gender of the
referee, total number of points awarded in the two versions of the
video clip (original vs. color-reversed), and the order in which
the two versions were presented had no effect on the referees’
decisions.
Thus, competitors dressed in red are awarded more points
than competitors dressed in blue, even when their performance
is identical. The effect found in this experiment can also explain
why the effect of clothing color on the outcome is stronger when
competitors have similar abilities than when there is a large
asymmetry in their abilities (Hill & Barton, 2005). Referees’
decisions will ‘‘tip the scales’’ when athletes are relatively well
matched, but have relatively small influence when one is clearly
superior. Even though the color of athletes’ sportswear may well
exert an influence on their performance (through associations
with dominance or differences in visibility of the opponent), we
argue that the referees are responsible for the advantage conveyed to athletes who wear red. Although there is a need for
further research (including research on the effects of different
770
colors), our results suggest a need to change the rules (i.e., forbid
red sports attire) or support referees by providing electronic
decision-making aids (e.g., electronic trunk protectors) in those
sports in which this color bias may be a problem.
REFERENCES
Barton, R.A., & Hill, R.A. (2005). Sporting contests: Seeing red?
Putting sportswear in context (reply). Nature, 437, E10–E11.
Cuthill, I.C., Hunt, S., Cleary, C., & Clark, C. (1997). Colour bands,
dominance, and body mass regulation in male zebra finches
(Taeniopygia guttata). Proceedings of the Royal Society B: Biological Sciences, 264, 1093–1099.
Frank, M.G., & Gilovich, T. (1988). The dark side of self- and
social perception: Black uniforms and aggression in professional
sports. Journal of Personality and Social Psychology, 54, 74–85.
Hill, R.A., & Barton, R.A. (2005). Red enhances human performance
in contests. Nature, 435, 293.
Milinski, M., & Bakker, T.C.M. (1990). Female sticklebacks use male
coloration in mate choice and hence avoid parasitized males.
Nature, 344, 330–333.
Oudejans, R.R., Verheijen, R., Bakker, F.C., Gerrits, J.C., Steinbruckner, M., & Beek, P.J. (2000). Errors in judging ‘offside’ in
football. Nature, 404, 33.
Plessner, H., & Haar, T. (2006). Sports performance judgments from a
social cognitive perspective. Psychology of Sport and Exercise, 7,
555–575.
Volume 19—Number 8
Norbert Hagemann, Bernd Strauss, and Jan Leiing
Rowe, C., Harris, J.M., & Roberts, S.C. (2005). Sporting contests:
Seeing red? Putting sportswear in context. Nature, 437, E10.
Setchell, J.M., & Wickings, E.J. (2005). Dominance, status signals and
coloration in male mandrills (Mandrillus sphinx). Ethology, 111,
25–50.
Volume 19—Number 8
Ste-Marie, D.M., & Valiquette, S.M. (1996). Enduring memory-influenced biases in gymnastic judging. Journal of Experimental
Psychology: Learning, Memory, and Cognition, 22, 1498–1502.
(RECEIVED 12/16/07; REVISION ACCEPTED 2/13/08)
771
Ed iti o n
© Deborah Batt
10
Statistics for the
Behavioral Sciences
Frederick J Gravetter
The College at Brockport, State University of New York
Larry B. WaLLnau
The College at Brockport, State University of New York
Australia ● Brazil ● Mexico ● Singapore ● United Kingdom ● United States
This is an electronic version of the print textbook. Due to electronic rights restrictions, some third party content may be suppressed. Editorial
review has deemed that any suppressed content does not materially affect the overall learning experience. The publisher reserves the right to
remove content from this title at any time if subsequent rights restrictions require it. For valuable information on pricing, previous
editions, changes to current editions, and alternate formats, please visit www.cengage.com/highered to search by
ISBN#, author, title, or keyword for materials in your areas of interest.
Important Notice: Media content referenced within the product description or the product text may not be available in the eBook version.
Statistics for the Behavioral Sciences,
Tenth Edition
Frederick J Gravetter and Larry B. Wallnau
Product director: Jon-david Hague
Product Manager: timothy Matray
Content developer: Stefanie Chase
Product Assistant: Kimiya Hojjat
Marketing Manager: Melissa Larmon
Content Project Manager: Michelle Clark
© 2017, 2013 Cengage Learning
WCn: 02-200-203
ALL RiGHtS RESERVEd. no part of this work covered by the copyright
herein may be reproduced, transmitted, stored, or used in any form
or by any means graphic, electronic, or mechanical, including but not
limited to photocopying, recording, scanning, digitizing, taping, Web
distribution, information networks, or information storage and retrieval
systems, except as permitted under Section 107 or 108 of the 1976
United States Copyright Act, without the prior written permission of
the publisher.
Art director: Vernon Boes
Manufacturing Planner: Karen Hunt
Production Service: Lynn Lustberg,
MPS Limited
text and Photo Researcher:
Lumina datamatics
Copy Editor: Sara Kreisman
For product information and technology assistance, contact us at
Cengage Learning Customer & Sales Support, 1-800-354-9706
For permission to use material from this text or product,
submit all requests online at www.cengage.com/permissions
Further permissions questions can be e-mailed to
permissionrequest@cengage.com
illustrator: MPS Limited
Library of Congress Control number: 2015940372
text and Cover designer: Lisa Henry
Student Edition:
iSBn: 978-1-305-50491-2
Cover image: © deborah Batt
“Community 2-1”
Compositor: MPS Limited
Loose-leaf Edition:
iSBn: 978-1-305-86280-7
Cengage Learning
20 Channel Center Street
Boston, MA 02210
USA
Cengage Learning is a leading provider of customized learning solutions
with employees residing in nearly 40 different countries and sales in
more than 125 countries around the world. Find your local representative
at www.cengage.com
Cengage Learning products are represented in Canada by
nelson Education, Ltd.
to learn more about Cengage Learning Solutions,
visit www.cengage.com
Purchase any of our products at your local college store or at our
preferred online store www.cengagebrain.com
Printed in Canada
Print Number: 01
Print Year: 2015
B RiEF Co n tEn t S
CHAPtER
1
Introduction to Statistics 1
CHAPtER
2
Frequency Distributions
CHAPtER
3
Central Tendency
CHAPtER
4
Variability
CHAPtER
5
z-Scores: Location of Scores and Standardized Distributions 131
CHAPtER
6
Probability
CHAPtER
7
Probability and Samples: The Distribution of Sample Means 193
CHAPtER
8
Introduction to Hypothesis Testing 223
CHAPtER
9
Introduction to the t Statistic
C H A P t E R 10
CHAPtER
11
33
67
99
159
267
The t Test for Two Independent Samples 299
The t Test for Two Related Samples 335
C H A P t E R 12
Introduction to Analysis of Variance 365
C H A P t E R 13
Repeated-Measures Analysis of Variance 413
C H A P t E R 14
Two-Factor Analysis of Variance (Independent Measures) 447
CHAPtER
15
Correlation
485
C H A P t E R 16
Introduction to Regression 529
C H A P t E R 17
The Chi-Square Statistic: Tests for Goodness of Fit and Independence 559
C H A P t E R 18
The Binomial Test
603
iii
Co n tEn t S
CHAPtER
1 Introduction to Statistics
PREVIEW
1
2
1.1 Statistics, Science, and Observations 2
1.2 Data Structures, Research Methods, and Statistics 10
1.3 Variables and Measurement 18
1.4 Statistical Notation 25
Summary
29
Focus on Problem Solving 30
Demonstration 1.1 30
Problems 31
CHAPtER
2 Frequency Distributions
PREVIEW
33
34
2.1 Frequency Distributions and Frequency Distribution Tables 35
2.2 Grouped Frequency Distribution Tables 38
2.3 Frequency Distribution Graphs 42
2.4 Percentiles, Percentile Ranks, and Interpolation 49
2.5 Stem and Leaf Displays 56
Summary
58
Focus on Problem Solving 59
Demonstration 2.1 60
Demonstration 2.2 61
Problems 62
v
vi
CONTENTS
CHAPtER
3 Central Tendency
PREVIEW
67
68
3.1 Overview 68
3.2 The Mean 70
3.3 The Median 79
3.4 The Mode 83
3.5 Selecting a Measure of Central Tendency 86
3.6 Central Tendency and the Shape of the Distribution 92
Summary
94
Focus on Problem Solving 95
Demonstration 3.1 96
Problems 96
CHAPtER
4 Variability
PREVIEW
99
100
4.1 Introduction to Variability 101
4.2 Defining Standard Deviation and Variance 103
4.3 Measuring Variance and Standard Deviation for a Population 108
4.4 Measuring Standard Deviation and Variance for a Sample 111
4.5 Sample Variance as an Unbiased Statistic 117
4.6 More about Variance and Standard Deviation 119
Summary
125
Focus on Problem Solving 127
Demonstration 4.1 128
Problems 128
z-Scores: Location of Scores
C H A P t E R 5 and Standardized Distributions
PREVIEW
132
5.1 Introduction to z-Scores 133
5.2 z-Scores and Locations in a Distribution 135
5.3 Other Relationships Between z, X, 𝛍, and 𝛔 138
131
CONTENTS
vii
5.4 Using z-Scores to Standardize a Distribution 141
5.5 Other Standardized Distributions Based on z-Scores 145
5.6 Computing z-Scores for Samples 148
5.7 Looking Ahead to Inferential Statistics 150
Summary
153
Focus on Problem Solving 154
Demonstration 5.1 155
Demonstration 5.2 155
Problems 156
CHAPtER
6 Probability
PREVIEW
159
160
6.1 Introduction to Probability 160
6.2 Probability and the Normal Distribution 165
6.3 Probabilities and Proportions for Scores
from a Normal Distribution
172
6.4 Probability and the Binomial Distribution 179
6.5 Looking Ahead to Inferential Statistics 184
Summary
186
Focus on Problem Solving 187
Demonstration 6.1 188
Demonstration 6.2 188
Problems 189
Probability and Samples: The Distribution
C H A P t E R 7 of Sample Means
PREVIEW
194
7.1 Samples, Populations, and the Distribution
of Sample Means
194
7.2 The Distribution of Sample Means for any Population
and any Sample Size
199
7.3 Probability and the Distribution of Sample Means 206
7.4 More about Standard Error 210
7.5 Looking Ahead to Inferential Statistics
215
193
viii
CONTENTS
Summary
219
Focus on Problem Solving 219
Demonstration 7.1 220
Problems 221
CHAPtER
8 Introduction to Hypothesis Testing
PREVIEW
223
224
8.1 The Logic of Hypothesis Testing 225
8.2 Uncertainty and Errors in Hypothesis Testing 236
8.3 More about Hypothesis Tests 240
8.4 Directional (One-Tailed) Hypothesis Tests 245
8.5 Concerns about Hypothesis Testing: Measuring Effect Size 250
8.6 Statistical Power 254
Summary
260
Focus on Problem Solving 261
Demonstration 8.1 262
Demonstration 8.2 263
Problems 263
CHAPtER
9 Introduction to the t Statistic
PREVIEW
268
9.1 The t Statistic: An Alternative to z 268
9.2 Hypothesis Tests with the t Statistic 274
9.3 Measuring Effect Size for the t Statistic 279
9.4 Directional Hypotheses and One-Tailed Tests 288
Summary
291
Focus on Problem Solving 293
Demonstration 9.1 293
Demonstration 9.2 294
Problems 295
267
CONTENTS
CHAPtER
10 The t Test for Two Independent Samples
PREVIEW
ix
299
300
10.1 Introduction to the Independent-Measures Design 300
10.2 The Null Hypothesis and the Independent-Measures t Statistic 302
10.3 Hypothesis Tests with the Independent-Measures t Statistic 310
10.4 Effect Size and Confidence Intervals for the
Independent-Measures t
316
10.5 The Role of Sample Variance and Sample Size in the
Independent-Measures t Test
Summary
322
325
Focus on Problem Solving 327
Demonstration 10.1 328
Demonstration 10.2 329
Problems 329
CHAPtER
11 The t Test for Two Related Samples
PREVIEW
335
336
11.1 Introduction to Repeated-Measures Designs 336
11.2 The t Statistic for a Repeated-Measures Research Design 339
11.3 Hypothesis Tests for the Repeated-Measures Design 343
11.4 Effect Size and Confidence Intervals for the Repeated-Measures t 347
11.5 Comparing Repeated- and Independent-Measures Designs 352
Summary
355
Focus on Problem Solving 358
Demonstration 11.1 358
Demonstration 11.2 359
Problems 360
CHAPtER
12 Introduction to Analysis of Variance
PREVIEW
366
12.1 Introduction (An Overview of Analysis of Variance) 366
12.2 The Logic of Analysis of Variance 372
12.3 ANOVA Notation and Formulas 375
365
x
CONTENTS
12.4 Examples of Hypothesis Testing and Effect Size with ANOVA 383
12.5 Post Hoc Tests 393
12.6 More about ANOVA 397
Summary
403
Focus on Problem Solving 406
Demonstration 12.1 406
Demonstration 12.2 408
Problems 408
CHAPtER
13 Repeated-Measures Analysis of Variance
PREVIEW
413
414
13.1 Overview of the Repeated-Measures ANOVA 415
13.2 Hypothesis Testing and Effect Size with the
Repeated-Measures ANOVA
420
13.3 More about the Repeated-Measures Design 429
Summary
436
Focus on Problem Solving 438
Demonstration 13.1 439
Demonstration 13.2 440
Problems 441
Two-Factor Analysis of Variance
C H A P t E R 14 (Independent Measures)
PREVIEW
447
448
14.1 An Overview of the Two-Factor, Independent-Measures, ANOVA: Main
Effects and Interactions
448
14.2 An Example of the Two-Factor ANOVA and Effect Size 458
14.3 More about the Two-Factor ANOVA 467
Summary
473
Focus on Problem Solving 475
Demonstration 14.1 476
Demonstration 14.2 478
Problems 479
CONTENTS
CHAPtER
15 Correlation
PREVIEW
xi
485
486
15.1 Introduction 487
15.2 The Pearson Correlation 489
15.3 Using and Interpreting the Pearson Correlation 495
15.4 Hypothesis Tests with the Pearson Correlation 506
15.5 Alternatives to the Pearson Correlation 510
Summary
520
Focus on Problem Solving 522
Demonstration 15.1 523
Problems 524
CHAPtER
16 Introduction to Regression
PREVIEW
529
530
16.1 Introduction to Linear Equations and Regression 530
16.2 The Standard Error of Estimate and Analysis of Regression:
The Significance of the Regression Equation
538
16.3 Introduction to Multiple Regression with Two Predictor Variables 544
Summary
552
Linear and Multiple Regression
554
Focus on Problem Solving 554
Demonstration 16.1 555
Problems 556
The Chi-Square Statistic: Tests for Goodness
C H A P t E R 17 of Fit and Independence
PREVIEW
560
17.1 Introduction to Chi-Square: The Test for Goodness of Fit 561
17.2 An Example of the Chi-Square Test for Goodness of Fit 567
17.3 The Chi-Square Test for Independence 573
17.4 Effect Size and Assumptions for the Chi-Square Tests
17.5 Special Applications of the Chi-Square Tests 587
582
559
xii
CONTENTS
Summary
591
Focus on Problem Solving 595
Demonstration 17.1 595
Demonstration 17.2 597
Problems 597
CHAPtER
18 The Binomial Test
PREVIEW
603
604
18.1 Introduction to the Binomial Test 604
18.2 An Example of the Binomial Test 608
18.3 More about the Binomial Test: Relationship with Chi-Square
and the Sign Test
Summary
612
617
Focus on Problem Solving 619
Demonstration 18.1 619
Problems 620
A PPE N D IX E S
A Basic Mathematics Review 625
A.1
A.2
A.3
A.4
A.5
Symbols and Notation 627
Proportions: Fractions, Decimals, and Percentages 629
Negative Numbers 635
Basic Algebra: Solving Equations 637
Exponents and Square Roots 640
B Statistical Tables 647
C Solutions for Odd-Numbered Problems in the Text 663
D General Instructions for Using SPSS 683
E Hypothesis Tests for Ordinal Data: Mann-Whitney,
Wilcoxon, Kruskal-Wallis, and Friedman Tests
687
Statistics Organizer: Finding the Right Statistics for Your Data
References
717
Name Index
723
Subject Index
725
701
PREFACE
M
any students in the behavioral sciences view the required statistics course as an
intimidating obstacle that has been placed in the middle of an otherwise interesting curriculum. They want to learn about human behavior—not about math and science.
As a result, the statistics course is seen as irrelevant to their education and career goals.
However, as long as the behavioral sciences are founded in science, knowledge of statistics
will be necessary. Statistical procedures provide researchers with objective and systematic
methods for describing and interpreting their research results. Scientific research is the
system that we use to gather information, and statistics are the tools that we use to distill
the information into sensible and justified conclusions. The goal of this book is not only
to teach the methods of statistics, but also to convey the basic principles of objectivity and
logic that are essential for science and valuable for decision making in everyday life.
Those of you who are familiar with previous editions of Statistics for the Behavioral
Sciences will notice that some changes have been made. These changes are summarized
in the section entitled “To the Instructor.” In revising this text, our students have been
foremost in our minds. Over the years, they have provided honest and useful feedback.
Their hard work and perseverance has made our writing and teaching most rewarding. We
sincerely thank them. Students who are using this edition should please read the section of
the preface entitled “To the Student.”
The book chapters are organized in the sequence that we use for our own statistics
courses. We begin with descriptive statistics, and then examine a variety of statistical procedures focused on sample means and variance before moving on to correlational methods
and nonparametric statistics. Information about modifying this sequence is presented in the
To The Instructor section for individuals who prefer a different organization. Each chapter
contains numerous examples, many based on actual research studies, learning checks, a
summary and list of key terms, and a set of 20–30 problems.
Ancillaries
Ancillaries for this edition include the following.
■■
MindTap® Psychology: MindTap® Psychology for Gravetter/Wallnau’s Statistics
for The Behavioral Sciences, 10th Edition is the digital learning solution that helps
instructors engage and transform today’s students into critical thinkers. Through paths
of dynamic assignments and applications that you can personalize, real-time course
analytics, and an accessible reader, MindTap helps you turn cookie cutter into cutting
edge, apathy into engagement, and memorizers into higher-level thinkers.
As an instructor using MindTap you have at your fingertips the right content and
unique set of tools curated specifically for your course, such as video tutorials that
walk students through various concepts and interactive problem tutorials that provide
students opportunities to practice what they have learned, all in an interface designed
to improve workflow and save time when planning lessons and course structure. The
control to build and personalize your course is all yours, focusing on the most relevant
xiii
xiv
PREFACE
■■
■■
■■
material while also lowering costs for your students. Stay connected and informed in
your course through real time student tracking that provides the opportunity to adjust
the course as needed based on analytics of interactivity in the course.
Online Instructor’s Manual: The manual includes learning objectives, key terms,
a detailed chapter outline, a chapter summary, lesson plans, discussion topics, student
activities, “What If” scenarios, media tools, a sample syllabus and an expanded test
bank. The learning objectives are correlated with the discussion topics, student
activities, and media tools.
Online PowerPoints: Helping you make your lectures more engaging while effectively reaching your visually oriented students, these handy Microsoft PowerPoint®
slides outline the chapters of the main text in a classroom-ready presentation. The
PowerPoint® slides are updated to reflect the content and organization of the new
edition of the text.
Cengage Learning Testing, powered by Cognero®: Cengage Learning Testing,
Powered by Cognero®, is a flexible online system that allows you to author, edit,
and manage test bank content. You can create multiple test versions in an instant and
deliver tests from your LMS in your classroom.
Acknowledgments
It takes a lot of good, hard-working people to produce a book. Our friends at Cengage
have made enormous contributions to this textbook. We thank: Jon-David Hague, Product
Director; Timothy Matray, Product Team Director; Jasmin Tokatlian, Content Development Manager; Kimiya Hojjat, Product Assistant; and Vernon Boes, Art Director. Special
thanks go to Stefanie Chase, our Content Developer and to Lynn Lustberg who led us
through production at MPS.
Reviewers play a very important role in the development of a manuscript. Accordingly,
we offer our appreciation to the following colleagues for their assistance: Patricia Case,
University of Toledo; Kevin David, Northeastern State University; Adia Garrett, University of Maryland, Baltimore County; Carrie E. Hall, Miami University; Deletha Hardin,
University of Tampa; Angela Heads, Prairie View A&M University; Roberto Heredia,
Texas A&M International University; Alisha Janowski, University of Central Florida;
Matthew Mulvaney, The College at Brockport (SUNY); Nicholas Von Glahn, California
State Polytechnic University, Pomona; and Ronald Yockey, Fresno State University.
To the Instructor
Those of you familiar with the previous edition of Statistics for the Behavioral Sciences will
notice a number of changes in the 10th edition. Throughout this book, research examples
have been updated, real world examples have been added, and the end-of-chapter problems
have been extensively revised. Major revisions for this edition include the following:
1. Each section of every chapter begins with a list of Learning Objectives for that
specific section.
2. Each section ends with a Learning Check consisting of multiple-choice questions
with at least one question for each Learning Objective.
PREFACE
xv
3. The former Chapter 19, Choosing the Right Statistics, has been eliminated and
an abridged version is now an Appendix replacing the Statistics Organizer, which
appeared in earlier editions.
Other examples of specific and noteworthy revisions include the following.
Chapter 1 The section on data structures and research methods parallels the new
Appendix, Choosing the Right Statistics.
Chapter 2 The chapter opens with a new Preview to introduce the concept and purpose
of frequency distributions.
Chapter 3
Minor editing clarifies and simplifies the discussion the median.
Chapter 4 The chapter opens with a new Preview to introduce the topic of Central
Tendency. The sections on standard deviation and variance have been edited to increase
emphasis on concepts rather than calculations.
The section discussion relationships between z, X, μ, and σ has been
expanded and includes a new demonstration example.
Chapter 5
Chapter 6
The chapter opens with a new Preview to introduce the topic of Probability.
The section, Looking Ahead to Inferential Statistics, has been substantially shortened and
simplified.
Chapter 7
The former Box explaining difference between standard deviation and
standard error was deleted and the content incorporated into Section 7.4 with editing to
emphasize that the standard error is the primary new element introduced in the chapter.
The final section, Looking Ahead to Inferential Statistics, was simplified and shortened to
be consistent with the changes in Chapter 6.
Chapter 8
A redundant example was deleted which shortened and streamlined the
remaining material so that most of the chapter is focused on the same research example.
Chapter 9 The chapter opens with a new Preview to introduce the t statistic and explain
why a new test statistic is needed. The section introducing Confidence Intervals was edited
to clarify the origin of the confidence interval equation and to emphasize that the interval
is constructed at the sample mean.
Chapter 10
The chapter opens with a new Preview introducing the independent-measures t statistic. The section presenting the estimated standard error of (M1 – M2) has been
simplified and shortened.
Chapter 11
The chapter opens with a new Preview introducing the repeated-measures t
statistic. The section discussing hypothesis testing has been separated from the section on
effect size and confidence intervals to be consistent with the other two chapters on t tests.
The section comparing independent- and repeated-measures designs has been expanded.
Chapter 12 The chapter opens with a new Preview introducing ANOVA and explaining
why a new hypothesis testing procedure is necessary. Sections in the chapter have been
reorganized to allow flow directly from hypothesis tests and effect size to post tests.
xvi
PREFACE
Chapter 13
Substantially expanded the section discussing factors that influence the
outcome of a repeated-measures hypothesis test and associated measures of effect size.
Chapter 14
The chapter opens with a new Preview presenting a two-factor research
example and introducing the associated ANOVA. Sections have been reorganized so that
simple main effects and the idea of using a second factor to reduce variance from individual differences are now presented as extra material related to the two-factor ANOVA.
Chapter 15
The chapter opens with a new Preview presenting a correlational research
study and the concept of a correlation. A new section introduces the t statistic for evaluating the significance of a correlation and the section on partial correlations has been simplified and shortened.
Chapter 16 The chapter opens with a new Preview introducing the concept of regression and
its purpose. A new section demonstrates the equivalence of testing the significance of a correlation and testing the significance of a regression equation with one predictor variable. The section on residuals for the multiple-regression equation has been edited to simplify and shorten.
Chapter 17
A new chapter Preview presents an experimental study with data consisting
of frequencies, which are not compatible with computing means and variances. Chi-square
tests are introduced as a solution to this problem. A new section introduces Cohen’s w as
a means of measuring effect size for both chi-square tests.
Chapter 18
Substantial editing clarifies the section explaining how the real limits for
each score can influence the conclusion from a binomial test.
The former Chapter 19 covering the task of matching statistical methods to specific
types of data has been substantially shortened and converted into an Appendix.
■■Matching the Text to Your Syllabus
The book chapters are organized in the sequence that we use for our own statistics courses.
However, different instructors may prefer different organizations and probably will choose
to omit or deemphasize specific topics. We have tried to make separate chapters, and even
sections of chapters, completely self-contained, so they can be deleted or reorganized to fit
the syllabus for nearly any instructor. Some common examples are as follows.
■■
■■
■■
It is common for instructors to choose between emphasizing analysis of variance
(Chapters 12, 13, and 14) or emphasizing correlation/regression (Chapters 15 and 16).
It is rare for a one-semester course to complete coverage of both topics.
Although we choose to complete all the hypothesis tests for means and mean
differences before introducing correlation (Chapter 15), many instructors prefer to
place correlation much earlier in the sequence of course topics. To accommodate
this, Sections 15.1, 15.2, and 15.3 present the calculation and interpretation of
the Pearson correlation and can be introduced immediately following Chapter 4
(variability). Other sections of Chapter 15 refer to hypothesis testing and should be
delayed until the process of hypothesis testing (Chapter 8) has been introduced.
It is also possible for instructors to present the chi-square tests (Chapter 17) much
earlier in the sequence of course topics. Chapter 17, which presents hypothesis tests
for proportions, can be presented immediately after Chapter 8, which introduces the
process of hypothesis testing. If this is done, we also recommend that the Pearson
correlation (Sections 15.1, 15.2, and 15.3) be presented early to provide a foundation
for the chi-square test for independence.
PREFACE
xvii
To the Student
A primary goal of this book is to make the task of learning statistics as easy and painless
as possible. Among other things, you will notice that the book provides you with a number
of opportunities to practice the techniques you will be learning in the form of Learning
Checks, Examples, Demonstrations, and end-of-chapter problems. We encourage you to
take advantage of these opportunities. Read the text rather than just memorizing the formulas. We have taken care to present each statistical procedure in a conceptual context that
explains why the procedure was developed and when it should be used. If you read this
material and gain an understanding of the basic concepts underlying a statistical formula,
you will find that learning the formula and how to use it will be much easier. In the “Study
Hints,” that follow, we provide advice that we give our own students. Ask your instructor
for advice as well; we are sure that other instructors will have ideas of their own.
Over the years, the students in our classes and other students using our book have given
us valuable feedback. If you have any suggestions or comments about this book, you can
write to either Professor Emeritus Frederick Gravetter or Professor Emeritus Larry Wallnau
at the Department of Psychology, SUNY College at Brockport, 350 New Campus Drive,
Brockport, New York 14420. You can also contact Professor Emeritus Gravetter directly at
fgravett@brockport.edu.
■■Study Hints
You may find some of these tips helpful, as our own students have reported.
■■
■■
■■
■■
■■
The key to success in a statistics course is to keep up with the material. Each new
topic builds on previous topics. If you have learned the previous material, then the
new topic is just one small step forward. Without the proper background, however,
the new topic can be a complete mystery. If you find that you are falling behind, get
help immediately.
You will learn (and remember) much more if you study for short periods several
times per week rather than try to condense all of your studying into one long session.
For example, it is far more effective to study half an hour every night than to have
a single 3½-hour study session once a week. We cannot even work on writing this
book without frequent rest breaks.
Do some work before class. Keep a little ahead of the instructor by reading the appropriate sections before they are presented in class. Although you may not fully understand what you read, you will have a general idea of the topic, which will make the
lecture easier to follow. Also, you can identify material that is particularly confusing
and then be sure the topic is clarified in class.
Pay attention and think during class. Although this advice seems obvious, often it is
not practiced. Many students spend so much time trying to write down every example
presented or every word spoken by the instructor that they do not actually understand
and process what is being said. Check with your instructor—there may not be a need
to copy every example presented in class, especially if there are many examples like
it in the text. Sometimes, we tell our students to put their pens and pencils down for a
moment and just listen.
Test yourself regularly. Do not wait until the end of the chapter or the end of the
week to check your knowledge. After each lecture, work some of the end-of-chapter
problems and do the Learning Checks. Review the Demonstration Problems, and
be sure you can define the Key Terms. If you are having trouble, get your questions
answered immediately—reread the section, go to your instructor, or ask questions in
class. By doing so, you will be able to move ahead to new material.
xviii
PREFACE
■■
■■
Do not kid yourself! Avoid denial. Many students observe their instructor solve
problems in class and think to themselves, “This looks easy, I understand it.” Do
you really understand it? Can you really do the problem on your own without having
to leaf through the pages of a chapter? Although there is nothing wrong with using
examples in the text as models for solving problems, you should try working a problem with your book closed to test your level of mastery.
We realize that many students are embarrassed to ask for help. It is our biggest challenge as instructors. You must find a way to overcome this aversion. Perhaps contacting the instructor directly would be a good starting point, if asking questions in class
is too anxiety-provoking. You could be pleasantly surprised to find that your instructor does not yell, scold, or bite! Also, your instructor might know of another student
who can offer assistance. Peer tutoring can be very helpful.
Frederick J Gravetter
Larry B. Wallnau
A B o U t tH E AU tH o R S
Frederick Gravetter
Frederick J Gravetter is Professor Emeritus of Psychology at the
State University of New York College at Brockport. While teaching at
Brockport, Dr. Gravetter specialized in statistics, experimental design, and
cognitive psychology. He received his bachelor’s degree in mathematics from
M.I.T. and his Ph.D in psychology from Duke University. In addition to publishing this textbook and several research articles, Dr. Gravetter co-authored
Research Methods for the Behavioral Science and Essentials of Statistics for
the Behavioral Sciences.
Larry B. Wallnau
Larry B. WaLLnau is Professor Emeritus of Psychology at the State
University of New York College at Brockport. While teaching at Brockport,
he published numerous research articles in biopsychology. With
Dr. Gravetter, he co-authored Essentials of Statistics for the Behavioral
Sciences. Dr. Wallnau also has provided editorial consulting for numerous
publishers and journals. He has taken up running and has competed in 5K
races in New York and Connecticut. He takes great pleasure in adopting
neglected and rescued dogs.
xix
CH A P T ER
Introduction to Statistics
1
© Deborah Batt
PREVIEW
1.1 Statistics, Science, and Observations
1.2 Data Structures, Research Methods, and Statistics
1.3 Variables and Measurement
1.4 Statistical Notation
Summary
Focus on Problem Solving
Demonstration 1.1
Problems
1
PREVIEW
Before we begin our discussion of statistics, we ask you
to read the following paragraph taken from the philosophy of Wrong Shui (Candappa, 2000).
The Journey to Enlightenment
In Wrong Shui, life is seen as a cosmic journey,
a struggle to overcome unseen and unexpected
obstacles at the end of which the traveler will find
illumination and enlightenment. Replicate this quest
in your home by moving light switches away from
doors and over to the far side of each room.*
Why did we begin a statistics book with a bit of twisted
philosophy? In part, we simply wanted to lighten the
mood with a bit of humor—starting a statistics course is
typically not viewed as one of life’s joyous moments. In
addition, the paragraph is an excellent counterexample for
the purpose of this book. Specifically, our goal is to do
everything possible to prevent you from stumbling around
in the dark by providing lots of help and illumination as
you journey through the world of statistics. To accomplish
this, we begin each section of the book with clearly stated
learning objectives and end each section with a brief quiz
to test your mastery of the new material. We also introduce each new statistical procedure by explaining the purpose it is intended to serve. If you understand why a new
procedure is needed, you will find it much easier to learn.
1.1
The objectives for this first chapter are to provide
an introduction to the topic of statistics and to give you
some background for the rest of the book. We discuss
the role of statistics within the general field of scientific
inquiry, and we introduce some of the vocabulary and
notation that are necessary for the statistical methods
that follow.
As you read through the following chapters, keep
in mind that the general topic of statistics follows a
well-organized, logically developed progression that
leads from basic concepts and definitions to increasingly sophisticated techniques. Thus, each new topic
serves as a foundation for the material that follows. The
content of the first nine chapters, for example, provides
an essential background and context for the statistical
methods presented in Chapter 10. If you turn directly
to Chapter 10 without reading the first nine chapters,
you will find the material confusing and incomprehensible. However, if you learn and use the background
material, you will have a good frame of reference for
understanding and incorporating new concepts as they
are presented.
*Candappa, R. (2000). The little book of wrong shui. Kansas City:
Andrews McMeel Publishing. Reprinted by permission.
Statistics, Science, and Observations
LEARNING OBJECTIVEs
1. Define the terms population, sample, parameter, and statistic, and describe the
relationships between them.
2. Define descriptive and inferential statistics and describe how these two general
categories of statistics are used in a typical research study.
3. Describe the concept of sampling error and explain how this concept creates the
fundamental problem that inferential statistics must address.
■■Definitions of Statistics
By one definition, statistics consist of facts and figures such as the average annual snowfall
in Denver or Derrick Jeter’s lifetime batting average. These statistics are usually informative
and time-saving because they condense large quantities of information into a few simple figures. Later in this chapter we return to the notion of calculating statistics (facts and figures)
but, for now, we concentrate on a much broader definition of statistics. Specifically, we use
the term statistics to refer to a general field of mathematics. In this case, we are using the
term statistics as a shortened version of statistical procedures. For example, you are probably using this book for a statistics course in which you will learn about the statistical techniques that are used to summarize and evaluate research results in the behavioral sciences.
2
SEctIon 1.1 | Statistics, Science, and Observations
3
Research in the behavioral sciences (and other fields) involves gathering information.
To determine, for example, whether college students learn better by reading material on
printed pages or on a computer screen, you would need to gather information about students’ study habits and their academic performance. When researchers finish the task of
gathering information, they typically find themselves with pages and pages of measurements such as preferences, personality scores, opinions, and so on. In this book, we present
the statistics that researchers use to analyze and interpret the information that they gather.
Specifically, statistics serve two general purposes:
1. Statistics are used to organize and summarize the information so that the researcher can
see what happened in the research study and can communicate the results to others.
2. Statistics help the researcher to answer the questions that initiated the research by
determining exactly what general conclusions are justified based on the specific
results that were obtained.
DEFInItIon
The term statistics refers to a set of mathematical procedures for organizing, summarizing, and interpreting information.
Statistical procedures help ensure that the information or observations are presented
and interpreted in an accurate and informative way. In somewhat grandiose terms, statistics
help researchers bring order out of chaos. In addition, statistics provide researchers with a
set of standardized techniques that are recognized and understood throughout the scientific
community. Thus, the statistical methods used by one researcher will be familiar to other
researchers, who can accurately interpret the statistical analyses with a full understanding
of how the analysis was done and what the results signify.
■■Populations and Samples
Research in the behavioral sciences typically begins with a general question about a specific
group (or groups) of individuals. For example, a researcher may want to know what factors
are associated with academic dishonesty among college students. Or a researcher may want
to examine the amount of time spent in the bathroom for men compared to women. In the
first example, the researcher is interested in the group of college students. In the second
example, the researcher wants to compare the group of men with the group of women. In statistical terminology, the entire group that a researcher wishes to study is called a population.
DEFInItIon
A population is the set of all the individuals of interest in a particular study.
As you can well imagine, a population can be quite large—for example, the entire set
of women on the planet Earth. A researcher might be more specific, limiting the population
for study to women who are registered voters in the United States. Perhaps the investigator
would like to study the population consisting of women who are heads of state. Populations
can obviously vary in size from extremely large to very small, depending on how the investigator defines the population. The population being studied should always be identified by
the researcher. In addition, the population need not consist of people—it could be a population of rats, corporations, parts produced in a factory, or anything else an investigator wants
to study. In practice, populations are typically very large, such as the population of college
sophomores in the United States or the population of small businesses.
Because populations tend to be very large, it usually is impossible for a researcher to
examine every individual in the population of interest. Therefore, researchers typically select
4
chaPtER 1 | Introduction to Statistics
a smaller, more manageable group from the population and limit their studies to the individuals in the selected group. In statistical terms, a set of individuals selected from a population
is called a sample. A sample is intended to be representative of its population, and a sample
should always be identified in terms of the population from which it was selected.
A sample is a set of individuals selected from a population, usually intended to
represent the population in a research study.
DEFInItIon
Just as we saw with populations, samples can vary in size. For example, one study might
examine a sample of only 10 students in a graduate program and another study might use a
sample of more than 10,000 people who take a specific cholesterol medication.
So far we have talked about a sample being selected from a population. However, this is
actually only half of the full relationship between a sample and its population. Specifically,
when a researcher finishes examining the sample, the goal is to generalize the results back
to the entire population. Remember that the research started with a general question about
the population. To answer the question, a researcher studies a sample and then generalizes
the results from the sample to the population. The full relationship between a sample and a
population is shown in Figure 1.1.
■■Variables and Data
Typically, researchers are interested in specific characteristics of the individuals in the population (or in the sample), or they are interested in outside factors that may influence the
individuals. For example, a researcher may be interested in the influence of the weather on
people’s moods. As the weather changes, do people’s moods also change? Something that
can change or have different values is called a variable.
DEFInItIon
A variable is a characteristic or condition that changes or has different values for
different individuals.
THE POPULATION
All of the individuals of interest
The results
from the sample
are generalized
to the population
F I G U R E 1.1
The relationship between a
population and a sample.
The sample
is selected from
the population
THE SAMPLE
The individuals selected to
participate in the research study
SEctIon 1.1 | Statistics, Science, and Observations
5
Once again, variables can be characteristics that differ from one individual to another,
such as height, weight, gender, or personality. Also, variables can be environmental conditions that change such as temperature, time of day, or the size of the room in which the
research is being conducted.
To demonstrate changes in variables, it is necessary to make measurements of the variables
being examined. The measurement obtained for each individual is called a datum, or more commonly, a score or raw score. The complete set of scores is called the data set or simply the data.
DEFInItIon
Data (plural) are measurements or observations. A data set is a collection of measurements or observations. A datum (singular) is a single measurement or observation and is commonly called a score or raw score.
Before we move on, we should make one more point about samples, populations, and
data. Earlier, we defined populations and samples in terms of individuals. For example,
we discussed a population of graduate students and a sample of cholesterol patients. Be
forewarned, however, that we will also refer to populations or samples of scores. Because
research typically involves measuring each individual to obtain a score, every sample (or
population) of individuals produces a corresponding sample (or population) of scores.
■■Parameters and Statistics
When describing data it is necessary to distinguish whether the data come from a population or a sample. A characteristic that describes a population—for example, the average
score for the population—is called a parameter. A characteristic that describes a sample is
called a statistic. Thus, the average score for a sample is an example of a statistic. Typically,
the research process begins with a question about a population parameter. However, the
actual data come from a sample and are used to compute sample statistics.
DEFInItIon
A parameter is a value, usually a numerical value, that describes a population. A
parameter is usually derived from measurements of the individuals in the population.
A statistic is a value, usually a numerical value, that describes a sample. A statistic
is usually derived from measurements of the individuals in the sample.
Every population parameter has a corresponding sample statistic, and most research
studies involve using statistics from samples as the basis for answering questions about
population parameters. As a result, much of this book is concerned with the relationship
between sample statistics and the corresponding population parameters. In Chapter 7, for
example, we examine the relationship between the mean obtained for a sample and the
mean for the population from which the sample was obtained.
■■Descriptive and Inferential Statistical Methods
Although researchers have developed a variety of different statistical procedures to organize and interpret data, these different procedures can be classified into two general categories. The first category, descriptive statistics, consists of statistical procedures that are used
to simplify and summarize data.
DEFInItIon
Descriptive statistics are statistical procedures used to summarize, organize, and
simplify data.
6
chaPtER 1 | Introduction to Statistics
Descriptive statistics are techniques that take raw scores and organize or summarize
them in a form that is more manageable. Often the scores are organized in a table or a graph
so that it is possible to see the entire set of scores. Another common technique is to summarize a set of scores by computing an average. Note that even if the data set has hundreds
of scores, the average provides a single descriptive value for the entire set.
The second general category of statistical techniques is called inferential statistics.
Inferential statistics are methods that use sample data to make general statements about a
population.
DEFInItIon
Inferential statistics consist of techniques that allow us to study samples and then
make generalizations about the populations from which they were selected.
Because populations are typically very large, it usually is not possible to measure
everyone in the population. Therefore, a sample is selected to represent the population.
By analyzing the results from the sample, we hope to make general statements about the
population. Typically, researchers use sample statistics as the basis for drawing conclusions
about population parameters. One problem with using samples, however, is that a sample
provides only limited information about the population. Although samples are generally
representative of their populations, a sample is not expected to give a perfectly accurate
picture of the whole population. There usually is some discrepancy between a sample statistic and the corresponding population parameter. This discrepancy is called sampling
error, and it creates the fundamental problem inferential statistics must always address.
DEFInItIon
Sampling error is the naturally occurring discrepancy, or error, that exists between
a sample statistic and the corresponding population parameter.
The concept of sampling error is illustrated in Figure 1.2. The figure shows a population of 1,000 college students and 2 samples, each with 5 students who were selected from
the population. Notice that each sample contains different individuals who have different
characteristics. Because the characteristics of each sample depend on the specific people in
the sample, statistics will vary from one sample to another. For example, the five students
in sample 1 have an average age of 19.8 years and the students in sample 2 have an average
age of 20.4 years.
It is also very unlikely that the statistics obtained for a sample will be identical to the
parameters for the entire population. In Figure 1.2, for example, neither sample has statistics that are exactly the same as the population parameters. You should also realize that
Figure 1.2 shows only two of the hundreds of possible samples. Each sample would contain
different individuals and would produce different statistics. This is the basic concept of
sampling error: sample statistics vary from one sample to another and typically are different from the corresponding population parameters.
One common example of sampling error is the error associated with a sample proportion. For example, in newspaper articles reporting results from political polls, you frequently find statements such as this:
Candidate Brown leads the poll with 51% of the vote. Candidate Jones has 42%
approval, and the remaining 7% are undecided. This poll was taken from a sample of registered voters and has a margin of error of plus-or-minus 4 percentage points.
The “margin of error” is the sampling error. In this case, the percentages that are reported
were obtained from a sample and are being generalized to the whole population. As always,
you do not expect the statistics from a sample to be perfect. There always will be some
“margin of error” when sample statistics are used to represent population parameters.
SEctIon 1.1 | Statistics, Science, and Observations
7
F I G U R E 1. 2
A demonstration of sampling error. Two
samples are selected from the same population.
Notice that the sample statistics are different
from one sample to another and all the sample
statistics are different from the corresponding
population parameters. The natural differences that exist, by chance, between a sample
statistic and population parameter are called
sampling error.
Population
of 1000 college students
Population Parameters
Average Age 5 21.3 years
Average IQ 5 112.5
65% Female, 35% Male
Sample #1
Sample #2
Eric
Jessica
Laura
Karen
Brian
Tom
Kristen
Sara
Andrew
John
Sample Statistics
Average Age 5 19.8
Average IQ 5 104.6
60% Female, 40% Male
Sample Statistics
Average Age 5 20.4
Average IQ 5 114.2
40% Female, 60% Male
As a further demonstration of sampling error, imagine that your statistics class is separated into two groups by drawing a line from front to back through the middle of the room.
Now imagine that you compute the average age (or height, or IQ) for each group. Will the
two groups have exactly the same average? Almost certainly they will not. No matter what
you chose to measure, you will probably find some difference between the two groups.
However, the difference you obtain does not necessarily mean that there is a systematic
difference between the two groups. For example, if the average age for students on the
right-hand side of the room is higher than the average for students on the left, it is unlikely
that some mysterious force has caused the older people to gravitate to the right side of
the room. Instead, the difference is probably the result of random factors such as chance.
The unpredictable, unsystematic differences that exist from one sample to another are an
example of sampling error.
■■Statistics in the Context of Research
The following example shows the general stages of a research study and demonstrates
how descriptive statistics and inferential statistics are used to organize and interpret the
data. At the end of the example, note how sampling error can affect the interpretation of
experimental results, and consider why inferential statistical methods are needed to deal
with this problem.
8
chaPtER 1 | Introduction to Statistics
ExamplE 1.1
Figure 1.3 shows an overview of a general research situation and demonstrates the roles that
descriptive and inferential statistics play. The purpose of the research study is to address a
question that we posed earlier: Do college students learn better by studying text on printed
pages or on a computer screen? Two samples are selected from the population of college
students. The students in sample A are given printed pages of text to study for 30 minutes
and the students in sample B study the same text on a computer screen. Next, all of the
students are given a multiple-choice test to evaluate their knowledge of the material. At this
point, the researcher has two sets of data: the scores for sample A and the scores for sample
B (see the figure). Now is the time to begin using statistics.
First, descriptive statistics are used to simplify the pages of data. For example, the
researcher could draw a graph showing the scores for each sample or compute the average score for each sample. Note that descriptive methods provide a simplified, organized
Step 1
Experiment:
Compare two
studying methods
Data
Test scores for the
students in each
sample
Step 2
Descriptive statistics:
Organize and simplify
Population of
College
Students
Sample A
Read from printed
pages
25
27
30
19
29
26
21
28
23
26
20
25
28
27
24
26
22
30
Average
Score = 26
Step 3
Inferential statistics:
Interpret results
F i g u r E 1. 3
The role of statistics in experimental
research.
Sample B
Read from computer
screen
20
20
23
25
22
18
22
17
28
19
24
25
30
27
23
21
22
19
Average
Score = 22
The sample data show a 4-point difference
between the two methods of studying. However,
there are two ways to interpret the results.
1. There actually is no difference between
the two studying methods, and the sample
difference is due to chance (sampling error).
2. There really is a difference between
the two methods, and the sample data
accurately reflect this difference.
The goal of inferential statistics is to help researchers
decide between the two interpretations.
SEctIon 1.1 | Statistics, Science, and Observations
9
description of the scores. In this example, the students who studied printed pages had an average score of 26 on the test, and the students who studied text on the computer averaged 22.
Once the researcher has described the results, the next step is to interpret the outcome.
This is the role of inferential statistics. In this example, the researcher has found a difference
of 4 points between the two samples (sample A averaged 26 and sample B averaged 22). The
problem for inferential statistics is to differentiate between the following two interpretations:
1. There is no real difference between the printed page and a computer screen, and
the 4-point difference between the samples is just an example of sampling error
(like the samples in Figure 1.2).
2. There really is a difference between the printed page and a computer screen, and
the 4-point difference between the samples was caused by the different methods
of studying.
In simple English, does the 4-point difference between samples provide convincing
evidence of a difference between the two studying methods, or is the 4-point difference just
chance? The purpose of inferential statistics is to answer this question.
■
lE arn in g Ch ECk
1. A researcher is interested in the sleeping habits of American college students.
A group of 50 students is interviewed and the researcher finds that these students
sleep an average of 6.7 hours per day. For this study, the average of 6.7 hours is an
example of a(n)
.
a. parameter
b. statistic
c. population
d. sample
2. A researcher is curious about the average IQ of registered voters in the state of Florida.
The entire group of registered voters in the state is an example of a
.
a. sample
b. statistic
c. population
d. parameter
3. Statistical techniques that summarize, organize, and simplify data are classified
as
.
a. population statistics
b. sample statistics
c. descriptive statistics
d. inferential statistics
4. In general,
statistical techniques are used to summarize the data from
a research study and
statistical techniques are used to determine what
conclusions are justified by the results.
a. inferential, descriptive
b. descriptive, inferential
c. sample, population
d. population, sample
10
chaPtER 1 | Introduction to Statistics
5. IQ tests are standardized so that the average score is 100 for the entire group of
people who take the test each year. However, if you selected a group of 20 people
who took the test and computed their average IQ score you probably would not get
100. What statistical concept explains the difference between your mean and the
mean for the entire group?
a. statistical error
b. inferential error
c. descriptive error
d. sampling error
an s wE r s
1. B, 2. C, 3. C, 4. B, 5. D
1.2 Data Structures, Research Methods, and Statistics
LEARNING OBJECTIVEs
4. Differentiate correlational, experimental, and nonexperimental research and describe
the data structures associated with each.
5. Define independent, dependent, and quasi-independent variables and recognize
examples of each.
■■Individual Variables: Descriptive Research
Some research studies are conducted simply to describe individual variables as they exist
naturally. For example, a college official may conduct a survey to describe the eating, sleeping, and study habits of a group of college students. When the results consist of numerical
scores, such as the number of hours spent studying each day, they are typically described
by the statistical techniques that are presented in Chapters 3 and 4. Non-numerical scores
are typically described by computing the proportion or percentage in each category. For
example, a recent newspaper article reported that 34.9% of Americans are obese, which is
roughly 35 pounds over a healthy weight.
■■Relationships Between Variables
Most research, however, is intended to examine relationships between two or more variables. For example, is there a relationship between the amount of violence in the video
games played by children and the amount of aggressive behavior they display? Is there a
relationship between the quality of breakfast and academic performance for elementary
school children? Is there a relationship between the number of hours of sleep and grade
point average for college students? To establish the existence of a relationship, researchers must make observations—that is, measurements of the two variables. The resulting
measurements can be classified into two distinct data structures that also help to classify
different research methods and different statistical techniques. In the following section we
identify and discuss these two data structures.
I. One Group with Two Variables Measured for Each Individual: The Correlational Method One method for examining the relationship between variables is to
observe the two variables as they exist naturally for a set of individuals. That is, simply
SEctIon 1.2 | Data Structures, Research Methods, and Statistics
11
measure the two variables for each individual. For example, research has demonstrated a
relationship between sleep habits, especially wake-up time, and academic performance
for college students (Trockel, Barnes, and Egget, 2000). The researchers used a survey to
measure wake-up time and school records to measure academic performance for each student. Figure 1.4 shows an example of the kind of data obtained in the study. The researchers then look for consistent patterns in the data to provide evidence for a relationship
between variables. For example, as wake-up time changes from one student to another, is
there also a tendency for academic performance to change?
Consistent patterns in the data are often easier to see if the scores are presented in a
graph. Figure 1.4 also shows the scores for the eight students in a graph called a scatter
plot. In the scatter plot, each individual is represented by a point so that the horizontal
position corresponds to the student’s wake-up time and the vertical position corresponds
to the student’s academic performance score. The scatter plot shows a clear relationship
between wake-up time and academic performance: as wake-up time increases, academic
performance decreases.
A research study that simply measures two different variables for each individual and
produces the kind of data shown in Figure 1.4 is an example of the correlational method,
or the correlational research strategy.
In the correlational method, two different variables are observed to determine
whether there is a relationship between them.
DEFInItIon
■■Statistics for the Correlational Method
When the data from a correlational study consist of numerical scores, the relationship
between the two variables is usually measured and described using a statistic called a
correlation. Correlations and the correlational method are discussed in detail in Chapters 15 and 16. Occasionally, the measurement process used for a correlational study
simply classifies individuals into categories that do not correspond to numerical values.
For example, a researcher could classify a group of college students by gender (male
Student
Wake-up
Time
Academic
Performance
A
B
C
D
E
F
G
H
11
9
9
12
7
10
10
8
2.4
3.6
3.2
2.2
3.8
2.2
3.0
3.0
(b)
Academic performance
(a)
3.8
3.6
3.4
3.2
3.0
2.8
2.6
2.4
2.2
2.0
7
F i g u r E 1. 4
8
9
10
11
12
Wake-up time
One of two data structures for evaluating the relationship between variables. Note that there are two separate measurements for each individual (wake-up time and academic performance). The same scores are shown in a table (a) and in
a graph (b).
12
chaPtER 1 | Introduction to Statistics
or female) and by cell-phone preference (talk or text). Note that the researcher has two
scores for each individual but neither of the scores is a numerical value. This type of data
is typically summarized in a table showing how many individuals are classified into each
of the possible categories. Table 1.1 shows an example of this kind of summary table. The
table shows for example, that 30 of the males in the sample preferred texting to talking.
This type of data can be coded with numbers (for example, male = 0 and female = 1)
so that it is possible to compute a correlation. However, the relationship between variables for non-numerical data, such as the data in Table 1.1, is usually evaluated using
a statistical technique known as a chi-square test. Chi-square tests are presented in
Chapter 17.
Ta b lE 1.1
Correlational data consisting of non-numerical scores. Note that there are two measurements for
each individual: gender and cell phone preference. The numbers indicate how many people are in
each category. For example, out of the 50 males, 30 prefer text over talk.
Cell Phone Preference
Text
Talk
Males
30
20
50
Females
25
25
50
■■Limitations of the Correlational Method
The results from a correlational study can demonstrate the existence of a relationship
between two variables, but they do not provide an explanation for the relationship. In
particular, a correlational study cannot demonstrate a cause-and-effect relationship. For
example, the data in Figure 1.4 show a systematic relationship between wake-up time and
academic performance for a group of college students; those who sleep late tend to have
lower performance scores than those who wake early. However, there are many possible
explanations for the relationship and we do not know exactly what factor (or factors) is
responsible for late sleepers having lower grades. In particular, we cannot conclude that
waking students up earlier would cause their academic performance to improve, or that
studying more would cause students to wake up earlier. To demonstrate a cause-and-effect
relationship between two variables, researchers must use the experimental method, which
is discussed next.
II. Comparing Two (or More) Groups of Scores: Experimental and Nonexperimental Methods The second method for examining the relationship between two
variables involves the comparison of two or more groups of scores. In this situation, the
relationship between variables is examined by using one of the variables to define the
groups, and then measuring the second variable to obtain scores for each group. For example, Polman, de Castro, and van Aken (2008) randomly divided a sample of 10-year-old
boys into two groups. One group then played a violent video game and the second played
a nonviolent game. After the game-playing session, the children went to a free play period
and were monitored for aggressive behaviors (hitting, kicking, pushing, frightening, name
calling, fighting, quarreling, or teasing another child). An example of the resulting data is
shown in Figure 1.5. The researchers then compare the scores for the violent-video group
with the scores for the nonviolent-video group. A systematic difference between the two
groups provides evidence for a relationship between playing violent video games and
aggressive behavior for 10-year-old boys.
SEctIon 1.2 | Data Structures, Research Methods, and Statistics
F i g u r E 1. 5
Evaluating the relationship between
variables by comparing groups of scores.
Note that the values of
one variable are used
to define the groups
and the second variable is measured to
obtain scores within
each group.
One variable (type of video game)
is used to define groups
A second variable (aggressive behavior)
is measured to obtain scores within each group
Violent
Nonviolent
7
8
10
7
9
8
6
10
9
6
8
4
8
3
6
5
3
4
4
5
13
Compare groups
of scores
■■Statistics for Comparing Two (or More) Groups of Scores
Most of the statistical procedures presented in this book are designed for research studies that compare groups of scores like the study in Figure 1.5. Specifically, we examine
descriptive statistics that summarize and describe the scores in each group and we use
inferential statistics to determine whether the differences between the groups can be generalized to the entire population.
When the measurement procedure produces numerical scores, the statistical evaluation typically involves computing the average score for each group and then comparing
the averages. The process of computing averages is presented in Chapter 3, and a variety
of statistical techniques for comparing averages are presented in Chapters 8–14. If the
measurement process simply classifies individuals into non-numerical categories, the statistical evaluation usually consists of computing proportions for each group and then comparing proportions. Previously, in Table 1.1, we presented an example of non-numerical
data examining the relationship between gender and cell-phone preference. The same data
can be used to compare the proportions for males with the proportions for females. For
example, using text is preferred by 60% of the males compared to 50% of the females. As
before, these data are evaluated using a chi-square test, which is presented in Chapter 17.
■■Experimental and Nonexperimental Methods
There are two distinct research methods that both produce groups of scores to be compared:
the experimental and the nonexperimental strategies. These two research methods use
exactly the same statistics and they both demonstrate a relationship between two variables.
The distinction between the two research strategies is how the relationship is interpreted.
The results from an experiment allow a cause-and-effect explanation. For example, we can
conclude that changes in one variable are responsible for causing differences in a second
variable. A nonexperimental study does not permit a cause-and effect explanation. We can
say that changes in one variable are accompanied by changes in a second variable, but we
cannot say why. Each of the two research methods is discussed in the following sections.
■■The Experimental Method
One specific research method that involves comparing groups of scores is known as the
experimental method or the experimental research strategy. The goal of an experimental
study is to demonstrate a cause-and-effect relationship between two variables. Specifically,
14
chaPtER 1 | Introduction to Statistics
an experiment attempts to show that changing the value of one variable causes changes to
occur in the second variable. To accomplish this goal, the experimental method has two
characteristics that differentiate experiments from other types of research studies:
1. Manipulation The researcher manipulates one variable by changing its value from
one level to another. In the Polman et al. (2008) experiment examining the effect
of violence in video games (Figure 1.5), the researchers manipulate the amount of
violence by giving one group of boys a violent game to play and giving the other
group a nonviolent game. A second variable is observed (measured) to determine
whether the manipulation causes changes to occur.
2. Control The researcher must exercise control over the research situation to ensure
that other, extraneous variables do not influence the relationship being examined.
In more complex experiments, a researcher
may systematically
manipulate more than
one variable and may
observe more than one
variable. Here we are
considering the simplest
case, in which only one
variable is manipulated
and only one variable is
observed.
To demonstrate these two characteristics, consider the Polman et al. (2008) study examining the effect of violence in video games (see Figure 1.5). To be able to say that the difference in aggressive behavior is caused by the amount of violence in the game, the researcher
must rule out any other possible explanation for the difference. That is, any other variables
that might affect aggressive behavior must be controlled. There are two general categories
of variables that researchers must consider:
1. Participant Variables These are characteristics such as age, gender, and intelligence that vary from one individual to another. Whenever an experiment compares
different groups of participants (one group in treatment A and a different group
in treatment B), researchers must ensure that participant variables do not differ
from one group to another. For the experiment shown in Figure 1.5, for example,
the researchers would like to conclude that the violence in the video game causes
a change in the participants’ aggressive behavior. In the study, the participants in
both conditions were 10-year-old boys. Suppose, however, that the participants in
the nonviolent condition were primarily female and those in the violent condition
were primarily male. In this case, there is an alternative explanation for the difference in aggression that exists between the two groups. Specifically, the difference
between groups may have been caused by the amount of violence in the game,
but it also is possible that the difference was caused by the participants’ gender
(females are less aggressive than males). Whenever a research study allows more
than one explanation for the results, the study is said to be confounded because it is
impossible to reach an unambiguous conclusion.
2. Environmental Variables These are characteristics of the environment such as
lighting, time of day, and weather conditions. A researcher must ensure that the
individuals in treatment A are tested in the same environment as the individuals
in treatment B. Using the video game violence experiment (see Figure 1.5) as an
example, suppose that the individuals in the nonviolent condition were all tested in
the morning and the individuals in the violent condition were all tested in the evening. Again, this would produce a confounded experiment because the researcher
could not determine whether the differences in aggressive behavior were caused by
the amount of violence or caused by the time of day.
Researchers typically use three basic techniques to control other variables. First, the
researcher could use random assignment, which means that each participant has an equal
chance of being assigned to each of the treatment conditions. The goal of random assignment is to distribute the participant characteristics evenly between the two groups so that
neither group is noticeably smarter (or older, or faster) than the other. Random assignment
can also be used to control environmental variables. For example, participants could be
assigned randomly for testing either in the morning or in the afternoon. A second technique
SEctIon 1.2 | Data Structures, Research Methods, and Statistics
15
for controlling variables is to use matching to ensure equivalent groups or equivalent environments. For example, the researcher could match groups by ensuring that every group
has exactly 60% females and 40% males. Finally, the researcher can control variables by
holding them constant. For example, in the video game violence study discussed earlier
(Polman et al., 2008), the researchers used only 10-year-old boys as participants (holding
age and gender constant). In this case the researchers can be certain that one group is not
noticeably older or has a larger proportion of females than the other.
DEFInItIon
In the experimental method, one variable is manipulated while another variable
is observed and measured. To establish a cause-and-effect relationship between the
two variables, an experiment attempts to control all other variables to prevent them
from influencing the results.
■■Terminology in the Experimental Method
Specific names are used for the two variables that are studied by the experimental method. The
variable that is manipulated by the experimenter is called the independent variable. It can be
identified as the treatment conditions to which participants are assigned. For the example in
Figure 1.5, the amount of violence in the video game is the independent variable. The variable
that is observed and measured to obtain scores within each condition is the dependent variable. For the example in Figure 1.5, the level of aggressive behavior is the dependent variable.
DEFInItIon
The independent variable is the variable that is manipulated by the researcher. In
behavioral research, the independent variable usually consists of the two (or more) treatment conditions to which subjects are exposed. The independent variable consists of the
antecedent conditions that were manipulated prior to observing the dependent variable.
The dependent variable is the one that is observed to assess the effect of the treatment.
Control Conditions in an Experiment
An experimental study evaluates the relationship between two variables by manipulating one variable (the independent variable) and
measuring one variable (the dependent variable). Note that in an experiment only one
variable is actually measured. You should realize that this is different from a correlational
study, in which both variables are measured and the data consist of two separate scores
for each individual.
Often an experiment will include a condition in which the participants do not receive
any treatment. The scores from these individuals are then compared with scores from participants who do receive the treatment. The goal of this type of study is to demonstrate that
the treatment has an effect by showing that the scores in the treatment condition are substantially different from the scores in the no-treatment condition. In this kind of research,
the no-treatment condition is called the control condition, and the treatment condition is
called the experimental condition.
DEFInItIon
Individuals in a control condition do not receive the experimental treatment.
Instead, they either receive no treatment or they receive a neutral, placebo treatment. The purpose of a control condition is to provide a baseline for comparison
with the experimental condition.
Individuals in the experimental condition do receive the experimental treatment.
16
chaPtER 1 | Introduction to Statistics
Note that the independent variable always consists of at least two values. (Something
must have at least two different values before you can say that it is “variable.”) For the
video game violence experiment (see Figure 1.5), the independent variable is the amount
of violence in the video game. For an experiment with an experimental group and a control
group, the independent variable is treatment versus no treatment.
■■Nonexperimental Methods: Nonequivalent Groups
and Pre-Post Studies
In informal conversation, there is a tendency for people to use the term experiment to refer
to any kind of research study. You should realize, however, that the term only applies to
studies that satisfy the specific requirements outlined earlier. In particular, a real experiment must include manipulation of an independent variable and rigorous control of other,
extraneous variables. As a result, there are a number of other research designs that are not
true experiments but still examine the relationship between variables by comparing groups
of scores. Two examples are shown in Figure 1.6 and are discussed in the following paragraphs. This type of research study is classified as nonexperimental.
The top part of Figure 1.6 shows an example of a nonequivalent groups study comparing boys and girls. Notice that this study involves comparing two groups of scores (like an
experiment). However, the researcher has no ability to control which participants go into
F i g u r E 1.6
(a)
Two examples of nonexperimental
studies that involve comparing two
groups of scores. In (a) the study
uses two preexisting groups (boys/
girls) and measures a dependent
variable (verbal scores) in each
group. In (b) the study uses time
(before/after) to define the two
groups and measures a dependent
variable (depression) in each group.
Variable #1: Subject gender
(the quasi-independent variable)
Not manipulated, but used
to create two groups of subjects
Variable #2: Verbal test scores
(the dependent variable)
Measured in each of the
two groups
Boys
Girls
17
19
16
12
17
18
15
16
12
10
14
15
13
12
11
13
Any
difference?
(b)
Variable #1: Time
(the quasi-independent variable)
Not manipulated, but used
to create two groups of scores
Variable #2: Depression scores
(the dependent variable)
Measured at each of the two
different times
Before
Therapy
After
Therapy
17
19
16
12
17
18
15
16
12
10
14
15
13
12
11
13
Any
difference?
SEctIon 1.2 | Data Structures, Research Methods, and Statistics
Correlational studies are
also examples of nonexperimental research. In
this section, however, we
are discussing nonexperimental studies that
compare two or more
groups of scores.
17
which group—all the males must be in the boy group and all the females must be in the
girl group. Because this type of research compares preexisting groups, the researcher cannot control the assignment of participants to groups and cannot ensure equivalent groups.
Other examples of nonequivalent group studies include comparing 8-year-old children and
10-year-old children, people with an eating disorder and those with no disorder, and comparing children from a single-parent home and those from a two-parent home. Because it
is impossible to use techniques like random assignment to control participant variables and
ensure equivalent groups, this type of research is not a true experiment.
The bottom part of Figure 1.6 shows an example of a pre–post study comparing depression scores before therapy and after therapy. The two groups of scores are obtained by
measuring the same variable (depression) twice for each participant; once before therapy
and again after therapy. In a pre-post study, however, the researcher has no control over
the passage of time. The “before” scores are always measured earlier than the “after”
scores. Although a difference between the two groups of scores may be caused by the
treatment, it is always possible that the scores simply change as time goes by. For example, the depression scores may decrease over time in the same way that the symptoms of
a cold disappear over time. In a pre–post study the researcher also has no control over
other variables that change with time. For example, the weather could change from dark
and gloomy before therapy to bright and sunny after therapy. In this case, the depression
scores could improve because of the weather and not because of the therapy. Because the
researcher cannot control the passage of time or other variables related to time, this study
is not a true experiment.
Terminology in Nonexperimental Research
Although the two research studies
shown in Figure 1.6 are not true experiments, you should notice that they produce the
same kind of data that are found in an experiment (see Figure 1.5). In each case, one variable is used to create groups, and a second variable is measured to obtain scores within
each group. In an experiment, the groups are created by manipulation of the independent
variable, and the participants’ scores are the dependent variable. The same terminology is
often used to identify the two variables in nonexperimental studies. That is, the variable
that is used to create groups is the independent variable and the scores are the dependent
variable. For example, the top part of Figure 1.6, gender (boy/girl), is the independent
variable and the verbal test scores are the dependent variable. However, you should realize that gender (boy/girl) is not a true independent variable because it is not manipulated.
For this reason, the “independent variable” in a nonexperimental study is often called a
quasi-independent variable.
DEFInItIon
lE arn in g Ch ECk
In a nonexperimental study, the “independent variable” that is used to create the
different groups of scores is often called the quasi-independent variable.
1. In a correlational study, how many variables are measured for each individual and
how many groups of scores are obtained?
a. 1 variable and 1 group
b. 1 variable and 2 groups
c. 2 variables and 1 group
d. 2 variables and 2 groups
18
chaPtER 1 | Introduction to Statistics
2. A research study comparing alcohol use for college students in the United States
and Canada reports that more Canadian students drink but American students drink
more (Kuo, Adlaf, Lee, Gliksman, Demers, and Wechsler, 2002). What research
design did this study use?
a. correlational
b. experimental
c. nonexperimental
d. noncorrelational
3. Stephens, Atkins, and Kingston (2009) found that participants were able to tolerate
more pain when they shouted their favorite swear words over and over than when
they shouted neutral words. For this study, what is the independent variable?
a. the amount of pain tolerated
b. the participants who shouted swear words
c. the participants who shouted neutral words
d. the kind of word shouted by the participants
an s wE r s
1.3
1. C, 2. C, 3. D
Variables and Measurement
LEARNING OBJECTIVEs
6. Explain why operational definitions are developed for constructs and identify the two
components of an operational definition.
7. Describe discrete and continuous variables and identify examples of each.
8. Differentiate nominal, ordinal, interval, and ratio scales of measurement.
■■Constructs and Operational Definitions
The scores that make up the data from a research study are the result of observing and
measuring variables. For example, a researcher may finish a study with a set of IQ scores,
personality scores, or reaction-time scores. In this section, we take a closer look at the variables that are being measured and the process of measurement.
Some variables, such as height, weight, and eye color are well-defined, concrete entities that can be observed and measured directly. On the other hand, many variables studied
by behavioral scientists are internal characteristics that people use to help describe and
explain behavior. For example, we say that a student does well in school because he or
she is intelligent. Or we say that someone is anxious in social situations, or that someone
seems to be hungry. Variables like intelligence, anxiety, and hunger are called constructs,
and because they are intangible and cannot be directly observed, they are often called
hypothetical constructs.
Although constructs such as intelligence are internal characteristics that cannot be
directly observed, it is possible to observe and measure behaviors that are representative
of the construct. For example, we cannot “see” intelligence but we can see examples of
intelligent behavior. The external behaviors can then be used to create an operational definition for the construct. An operational definition defines a construct in terms of external
SEctIon 1.3 | Variables and Measurement
19
behaviors that can be observed and measured. For example, your intelligence is measured
and defined by your performance on an IQ test, or hunger can be measured and defined by
the number of hours since last eating.
DEFInItIon
Constructs are internal attributes or characteristics that cannot be directly
observed but are useful for describing and explaining behavior.
An operational definition identifies a measurement procedure (a set of operations) for measuring an external behavior and uses the resulting measurements as
a definition and a measurement of a hypothetical construct. Note that an operational definition has two components. First, it describes a set of operations for
measuring a construct. Second, it defines the construct in terms of the resulting
measurements.
■■Discrete and Continuous Variables
The variables in a study can be characterized by the type of values that can be assigned to
them. A discrete variable consists of separate, indivisible categories. For this type of variable, there are no intermediate values between two adjacent categories. Consider the values
displayed when dice are rolled. Between neighboring values—for example, seven dots and
eight dots—no other values can ever be observed.
DEFInItIon
A discrete variable consists of separate, indivisible categories. No values can exist
between two neighboring categories.
Discrete variables are commonly restricted to whole, countable numbers—for
example, the number of children in a family or the number of students attending class.
If you observe class attendance from day to day, you may count 18 students one day
and 19 students the next day. However, it is impossible ever to observe a value between
18 and 19. A discrete variable may also consist of observations that differ qualitatively.
For example, people can be classified by gender (male or female), by occupation
(nurse, teacher, lawyer, etc.), and college students can by classified by academic major
(art, biology, chemistry, etc.). In each case, the variable is discrete because it consists
of separate, indivisible categories.
On the other hand, many variables are not discrete. Variables such as time, height, and
weight are not limited to a fixed set of separate, indivisible categories. You can measure
time, for example, in hours, minutes, seconds, or fractions of seconds. These variables
are called continuous because they can be divided into an infinite number of fractional
parts.
DEFInItIon
For a continuous variable, there are an infinite number of possible values that fall
between any two observed values. A continuous variable is divisible into an infinite
number of fractional parts.
Suppose, for example, that a researcher is measuring weights for a group of individuals
participating in a diet study. Because weight is a continuous variable, it can be pictured as
a continuous line (Figure 1.7). Note that there are an infinite number of possible points on
20
chaPtER 1 | Introduction to Statistics
F i g u r E 1.7
149.6
When measuring weight to
the nearest whole pound,
149.6 and 150.3 are assigned
the value of 150 (top). Any
value in the interval between
149.5 and 150.5 is given the
value of 150.
150.3
149
151
152
151
152
150
149.5
149
148.5
150.5
150
149.5
150.5
151.5
152.5
Real limits
the line without any gaps or separations between neighboring points. For any two different
points on the line, it is always possible to find a third value that is between the two points.
Two other factors apply to continuous variables:
1. When measuring a continuous variable, it should be very rare to obtain identical
measurements for two different individuals. Because a continuous variable has an
infinite number of possible values, it should be almost impossible for two people to
have exactly the same score. If the data show a substantial number of tied scores,
then you should suspect that the measurement procedure is very crude or that the
variable is not really continuous.
2. When measuring a continuous variable, each measurement category is actually an
interval that must be defined by boundaries. For example, two people who both
claim to weigh 150 pounds are probably not exactly the same weight. However,
they are both around 150 pounds. One person may actually weigh 149.6 and the
other 150.3. Thus, a score of 150 is not a specific point on the scale but instead is
an interval (see Figure 1.7). To differentiate a score of 150 from a score of 149 or
151, we must set up boundaries on the scale of measurement. These boundaries are
called real limits and are positioned exactly halfway between adjacent scores. Thus,
a score of X = 150 pounds is actually an interval bounded by a lower real limit
of 149.5 at the bottom and an upper real limit of 150.5 at the top. Any individual
whose weight falls between these real limits will be assigned a score of X = 150.
DEFInItIon
Real limits are the boundaries of intervals for scores that are represented on a continuous number line. The real limit separating two adjacent scores is located exactly
halfway between the scores. Each score has two real limits. The upper real limit is
at the top of the interval, and the lower real limit is at the bottom.
The concept of real limits applies to any measurement of a continuous variable, even
when the score categories are not whole numbers. For example, if you were measuring time
to the nearest tenth of a second, the measurement categories would be 31.0, 31.1, 31.2, and
so on. Each of these categories represents an interval on the scale that is bounded by real
limits. For example, a score of X = 31.1 seconds indicates that the actual measurement
is in an interval bounded by a lower real limit of 31.05 and an upper real limit of 31.15.
Remember that the real limits are always halfway between adjacent categories.
SEctIon 1.3 | Variables and Measurement
Students often ask
whether a value of
exactly 150.5 should
be assigned to the
X = 150 interval or the
X = 151 interval. The
answer is that 150.5 is
the boundary between
the two intervals and is
not necessarily in one
or the other. Instead,
the placement of 150.5
depends on the rule that
you are using for rounding numbers. If you
are rounding up, then
150.5 goes in the higher
interval (X = 151) but if
you are rounding down,
then it goes in the lower
interval (X = 150).
21
Later in this book, real limits are used for constructing graphs and for various calculations with continuous scales. For now, however, you should realize that real limits are a
necessity whenever you make measurements of a continuous variable.
Finally, we should warn you that the terms continuous and discrete apply to the variables that are being measured and not to the scores that are obtained from the measurement.
For example, measuring people’s heights to the nearest inch produces scores of 60, 61, 62,
and so on. Although the scores may appear to be discrete numbers, the underlying variable
is continuous. One key to determining whether a variable is continuous or discrete is that
a continuous variable can be divided into any number of fractional parts. Height can be
measured to the nearest inch, the nearest 0.5 inch, or the nearest 0.1 inch. Similarly, a professor evaluating students’ knowledge could use a pass/fail system that classifies students
into two broad categories. However, the professor could choose to use a 10-point quiz that
divides student knowledge into 11 categories corre...