Please submit your responses to these questions/tasks as a Word document, a PDF file, or through the text entry option.
Write Up:
Firstname Lastname
Professor Lastname
STA2023
Today’s Date
Example Project 1
For this project, I will use descriptive statistics to summarize a sample of 30 houses for sale in Ruskin, FL.
Warning: This is an example project. The topic of your project will be different. Please follow your
project directions.
Population, Sample, Individual
The population of interest is all houses for sale in Ruskin, FL. The sample is the 30 houses that were
randomly selected from those listed for sale in Ruskin, FL on Zillow.com. The individual is a house listed
for sale in Ruskin, FL.
Variables
Price, measured in thousands of dollars. Price is a quantitative variable at the ratio level of
measurement.
Size, measured in square feet. Size is a quantitative variable at the ratio level of measurement.
HOA status (either “HOA” or “No HOA”). HOA status is a qualitative variable at the nominal level of
measurement.
Pool status (either “pool,” “spa,” “community pool only,” or “no pool”). Pool status is a qualitative
variable at the nominal level of measurement.
Sampling Method
There were 142 houses listed for sale in Ruskin, FL on Zillow.com. I numbered these houses 1-142 and
used simple random sampling to select a sample of 30 houses with the random number generator applet
on StatCrunch. See the image below.
LastName 2
After generating 30 numbers, I sorted the list to make it easier to use: 1, 2, 4, 26, 28, 29, 34, 36, 47, 55,
61, 64, 66, 68, 80, 84, 90, 94, 96, 97, 101, 112, 118, 120, 125, 127, 130, 132, 135, 142.
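The same selection step can be sketched in Python. This is an illustration only: the sample above was drawn with the StatCrunch applet, the function name is hypothetical, and the seed is an assumed value, so the numbers it produces will not match the list above.

```python
import random

POPULATION_SIZE = 142  # houses listed for sale, numbered 1-142
SAMPLE_SIZE = 30       # houses to select


def draw_sample(seed):
    """Return a sorted simple random sample of house numbers, drawn without replacement."""
    rng = random.Random(seed)  # the seed is an assumption, used only for repeatability
    return sorted(rng.sample(range(1, POPULATION_SIZE + 1), SAMPLE_SIZE))


chosen = draw_sample(seed=2023)
print(chosen)
```

Sampling without replacement gives every set of 30 houses the same chance of being chosen, which is what makes this a simple random sample.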
Summary Statistics
Column   n    Mean        Std. dev.   Min    Q1     Median   Q3     Max
Price    30   389.56667   114.19544   180    330    375      415    660
Size     30   2087.8333   518.44982   1200   1668   2030.5   2501   3095
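These statistics can be recomputed from the data set at the end of this write-up. The following is a minimal sketch, not the StatCrunch procedure itself: the function names are hypothetical, and the quartile rule (the medians of the lower and upper halves of the sorted data) is an assumption chosen because it reproduces the Q1 and Q3 values reported here.

```python
import statistics

# Values copied from the Data Set section (price in thousands of dollars, size in square feet).
price = [330, 390, 449, 390, 596, 355, 607, 365, 660, 372,
         429, 303, 385, 385, 215, 450, 355, 280, 415, 365,
         659, 378, 405, 275, 330, 387, 180, 297, 315, 365]
size = [1612, 1846, 1525, 2722, 2501, 1828, 2544, 1804, 3066, 1912,
        2210, 1200, 2457, 2366, 2400, 2270, 2091, 1668, 2557, 2355,
        3095, 2614, 1970, 1406, 1629, 2722, 1704, 1303, 1412, 1846]


def quartiles(values):
    """Q1 and Q3 as medians of the lower and upper halves (assumed rule)."""
    ordered = sorted(values)
    half = len(ordered) // 2
    return statistics.median(ordered[:half]), statistics.median(ordered[-half:])


def summarize(values):
    """Return the summary statistics listed in the table as a dictionary."""
    q1, q3 = quartiles(values)
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "std_dev": statistics.stdev(values),  # sample standard deviation (n - 1)
        "min": min(values),
        "q1": q1,
        "median": statistics.median(values),
        "q3": q3,
        "max": max(values),
    }


price_summary = summarize(price)
size_summary = summarize(size)
print("Price:", price_summary)
print("Size:", size_summary)
```

Other software may use a different quartile rule, so Q1 and Q3 can differ slightly between tools even on the same data.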
Graphs for Price Variable
Because the mean (green line on the histogram) is slightly to the right of the median (red line), meaning
the mean is larger than the median, the distribution of price is slightly skewed right. The histogram has a
single clear hump, so the distribution is unimodal. The box plot shows that there are 5 outliers (2 of the
dots are on top of each other): 180, 596, 607, 659, and 660 thousand dollars.
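The outliers can also be checked numerically. The sketch below assumes the box plot flags values more than 1.5 times the interquartile range beyond Q1 or Q3, which is the usual box plot convention; the source does not state the rule StatCrunch applied, and the function name is hypothetical.

```python
import statistics

# Prices in thousands of dollars, copied from the Data Set section.
price = [330, 390, 449, 390, 596, 355, 607, 365, 660, 372,
         429, 303, 385, 385, 215, 450, 355, 280, 415, 365,
         659, 378, 405, 275, 330, 387, 180, 297, 315, 365]


def iqr_outliers(values, k=1.5):
    """Return the lower fence, upper fence, and values outside the fences."""
    ordered = sorted(values)
    half = len(ordered) // 2
    q1 = statistics.median(ordered[:half])   # median of the lower half
    q3 = statistics.median(ordered[-half:])  # median of the upper half
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return low, high, [v for v in ordered if v < low or v > high]


low, high, outliers = iqr_outliers(price)
print(low, high, outliers)  # 202.5 542.5 [180, 596, 607, 659, 660]
```

With Q1 = 330 and Q3 = 415, the IQR is 85, so the fences are 330 - 127.5 = 202.5 and 415 + 127.5 = 542.5. The five prices outside those fences match the outliers read from the box plot.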
Graphs for Size Variable
Because the mean (green line on the histogram) is slightly to the right of the median (red line), meaning
the mean is larger than the median, the distribution of size is slightly skewed right. It also appears
bimodal. There are no outliers for the size variable.
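One way to examine the shape of the size distribution is to count how many houses fall in each histogram class. The sketch below is an assumption-laden illustration: the class width of 250 square feet and the function name are my own choices, not the bins used in the StatCrunch histogram.

```python
from collections import Counter

# Sizes in square feet, copied from the Data Set section.
size = [1612, 1846, 1525, 2722, 2501, 1828, 2544, 1804, 3066, 1912,
        2210, 1200, 2457, 2366, 2400, 2270, 2091, 1668, 2557, 2355,
        3095, 2614, 1970, 1406, 1629, 2722, 1704, 1303, 1412, 1846]

BIN_WIDTH = 250  # assumed class width, in square feet


def bin_counts(values, width):
    """Count values per class [lower, lower + width), keeping empty classes."""
    counts = Counter((v // width) * width for v in values)
    start, stop = min(counts), max(counts)
    return {lower: counts.get(lower, 0) for lower in range(start, stop + width, width)}


counts = bin_counts(size, BIN_WIDTH)
for lower, count in counts.items():
    # Print a simple text histogram, one row per class.
    print(f"{lower:>4}-{lower + BIN_WIDTH - 1:<4} {'*' * count} ({count})")
```

With these classes the counts peak at 1750-1999 and again at 2500-2749, with a dip between them, which is consistent with the bimodal appearance. A different class width could change the picture.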
Pool Status
Pool Status           Frequency   Relative Frequency
Community Pool Only   10          0.33333333
No Pool               11          0.36666667
Pool                  8           0.26666667
Spa                   1           0.033333333
The most common pool status is “no pool”: 11 houses (36.67% of the houses sampled) have no pool
access. Community pool access only follows closely with 10 houses (33.33% of the sample). Eight houses
(26.67% of the sample) have a pool, and one house (3.33% of the sample) has a spa.
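The frequencies and relative frequencies for pool status can be tallied directly from the data set. This is a minimal sketch with hypothetical variable names, not the StatCrunch output itself; a relative frequency is the category count divided by the sample size of 30.

```python
from collections import Counter

# Pool status for each sampled house, copied from the Data Set section.
pool_status = [
    "Pool", "Community Pool Only", "Pool", "No Pool", "No Pool",
    "No Pool", "No Pool", "Pool", "Pool", "Community Pool Only",
    "Community Pool Only", "No Pool", "Pool", "Pool", "Community Pool Only",
    "No Pool", "Pool", "No Pool", "Spa", "Community Pool Only",
    "Community Pool Only", "Community Pool Only", "Community Pool Only",
    "Community Pool Only", "Pool", "No Pool", "No Pool", "No Pool",
    "No Pool", "Community Pool Only",
]

n = len(pool_status)
counts = Counter(pool_status)

# Map each category to (frequency, relative frequency), in alphabetical order.
table = {status: (count, count / n) for status, count in sorted(counts.items())}

for status, (count, rel) in table.items():
    print(f"{status:<20} {count:>2}  {rel:.4f}")
```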
Data Set
Price   Size   HOA Status   Pool Status
330     1612   HOA          Pool
390     1846   HOA          Community Pool Only
449     1525   No HOA       Pool
390     2722   No HOA       No Pool
596     2501   No HOA       No Pool
355     1828   No HOA       No Pool
607     2544   No HOA       No Pool
365     1804   HOA          Pool
660     3066   HOA          Pool
372     1912   HOA          Community Pool Only
429     2210   HOA          Community Pool Only
303     1200   No HOA       No Pool
385     2457   HOA          Pool
385     2366   HOA          Pool
215     2400   No HOA       Community Pool Only
450     2270   HOA          No Pool
355     2091   HOA          Pool
280     1668   No HOA       No Pool
415     2557   HOA          Spa
365     2355   HOA          Community Pool Only
659     3095   HOA          Community Pool Only
378     2614   HOA          Community Pool Only
405     1970   HOA          Community Pool Only
275     1406   HOA          Community Pool Only
330     1629   HOA          Pool
387     2722   HOA          No Pool
180     1704   No HOA       No Pool
297     1303   HOA          No Pool
315     1412   No HOA       No Pool
365     1846   HOA          Community Pool Only