Data - AP Statistics
Card 1 of 552
The following data set represents Mr. Marigold's students' scores on the final. If you are in the 96th percentile, what did you score?


The following data set represents Mr. Marigold's students' scores on the final. If you are in the 96th percentile, what did you score?
Tap to reveal answer
If you scored in the 96th percentile, that means that 96% of your classmates scored the same as or worse than you.
This class has 26 students, and 96% of 26 is about 25.
If 25 of your classmates did the same as or worse than you, that means only one person did better than you - nice work.
You must have gotten a 92, because it's the second-best score.
If you scored in the 96th percentile, that means that 96% of your classmates scored the same as or worse than you.
This class has 26 students, and 96% of 26 is about 25.
If 25 of your classmates did the same as or worse than you, that means only one person did better than you - nice work.
You must have gotten a 92, because it's the second-best score.
← Didn't Know|Knew It →
The following data set represents Mr. Marigold's students' scores on the final. If you got a 74 on the final, what percentile does that place you in?


The following data set represents Mr. Marigold's students' scores on the final. If you got a 74 on the final, what percentile does that place you in?
Tap to reveal answer
If you scored a 74, you can count and discover that eight students did worse than or the same as you.
There are a total of 26 students in the class, so 8 students would be
or around
.
This places you in the 31st percentile.
If you scored a 74, you can count and discover that eight students did worse than or the same as you.
There are a total of 26 students in the class, so 8 students would be
or around
.
This places you in the 31st percentile.
← Didn't Know|Knew It →
There are four suspects in a police line-up, and one of them committed a robbery. The suspect is described as "abnormally tall". In this case, "abnormally" refers to a height at least two standard deviations away from the average height. Their heights are converted into the following z-scores:
Suspect 1: 2.3
Suspect 2: 1.2
Suspect 3: 0.2
Suspect 4: -1.2.
Which of the following suspects committed the crime?
There are four suspects in a police line-up, and one of them committed a robbery. The suspect is described as "abnormally tall". In this case, "abnormally" refers to a height at least two standard deviations away from the average height. Their heights are converted into the following z-scores:
Suspect 1: 2.3
Suspect 2: 1.2
Suspect 3: 0.2
Suspect 4: -1.2.
Which of the following suspects committed the crime?
Tap to reveal answer
Z-scores describe how many standard deviations a given observation is from the mean observation. Suspect 1's z-score is greater than two, which means that his height is at least two standard deviations greater than the average height and thus, based on the description, Suspect 1 is the culprit.
Z-scores describe how many standard deviations a given observation is from the mean observation. Suspect 1's z-score is greater than two, which means that his height is at least two standard deviations greater than the average height and thus, based on the description, Suspect 1 is the culprit.
← Didn't Know|Knew It →
Which of the following correlation coefficients indicates the strongest relationship between variables?
Which of the following correlation coefficients indicates the strongest relationship between variables?
Tap to reveal answer
Correlation coefficients range from 1 to -1. The closer to either extreme, the stronger the relationship. The closer to 0, the weaker the relationship.
Correlation coefficients range from 1 to -1. The closer to either extreme, the stronger the relationship. The closer to 0, the weaker the relationship.
← Didn't Know|Knew It →
In a regression analysis, the y-variable should be the variable, and the x-variable should be the variable.
In a regression analysis, the y-variable should be the variable, and the x-variable should be the variable.
Tap to reveal answer
Regression tests seek to determine one variable's ability to predict another variable. In this analysis, one variable is dependent (the one predicted), and the other is independent (the variable that predicts). Therefore, the dependent variable is the y-variable and the independent variable is the x-variable.
Regression tests seek to determine one variable's ability to predict another variable. In this analysis, one variable is dependent (the one predicted), and the other is independent (the variable that predicts). Therefore, the dependent variable is the y-variable and the independent variable is the x-variable.
← Didn't Know|Knew It →
On a residual plot, the
-axis displays the and the
-axis displays .
On a residual plot, the -axis displays the and the
-axis displays .
Tap to reveal answer
A residual plot shows the difference between the actual and expected value, or residual. This goes on the y-axis. The plot shows these residuals in relation to the independent variable.
A residual plot shows the difference between the actual and expected value, or residual. This goes on the y-axis. The plot shows these residuals in relation to the independent variable.
← Didn't Know|Knew It →
Tap to reveal answer
No explanation available
No explanation available
← Didn't Know|Knew It →
What transformation should be done to the data set, with its residual shown below, to linearize the data?

What transformation should be done to the data set, with its residual shown below, to linearize the data?

Tap to reveal answer
Taking the log of a data set whose residual is nonrandom is effective in increasing the correleation coefficient and results in a more linear relationship.
Taking the log of a data set whose residual is nonrandom is effective in increasing the correleation coefficient and results in a more linear relationship.
← Didn't Know|Knew It →
A national study on cell phone use found the following correlations:
-The correlation between the number of texts sent each day and a person's average credit card debt is
.
-The correlation between the number of texts sent each day and the number of books read each month is
.
Which of the following statements are true?
i. As the number of texts sent each day increases, average credit card debt increases.
ii. Sending more texts causes people to read less.
iii. A person's average credit card debt is related more strongly to the number of texts sent each day than the number of books read each month is related to the number of texts sent each day.
A national study on cell phone use found the following correlations:
-The correlation between the number of texts sent each day and a person's average credit card debt is .
-The correlation between the number of texts sent each day and the number of books read each month is .
Which of the following statements are true?
i. As the number of texts sent each day increases, average credit card debt increases.
ii. Sending more texts causes people to read less.
iii. A person's average credit card debt is related more strongly to the number of texts sent each day than the number of books read each month is related to the number of texts sent each day.
Tap to reveal answer
i is correct because there is a positive correlation between the number of texts sent each day and average credit card debt.
ii is incorrect because the word "cause" was used in the statement. Correlation does not mean causation. There is a relationship between the number of texts sent each day and the number of books that a person reads each month. However, the number of texts sent each day does not cause a person to read a certain number of books each month.
iii is correct because the absolute values of the correlations indicate which correlation is stronger.
is a stronger correlation than
.
i is correct because there is a positive correlation between the number of texts sent each day and average credit card debt.
ii is incorrect because the word "cause" was used in the statement. Correlation does not mean causation. There is a relationship between the number of texts sent each day and the number of books that a person reads each month. However, the number of texts sent each day does not cause a person to read a certain number of books each month.
iii is correct because the absolute values of the correlations indicate which correlation is stronger. is a stronger correlation than
.
← Didn't Know|Knew It →
Which of the following shows the least correlation between two variables?
Which of the following shows the least correlation between two variables?
Tap to reveal answer
The strength of correlation is measured on an absolute value scale of
to
with
being the least correlated and
being the most correlated. The positive or negative in front of the correlation integer simply determines whether or not there is a positive or negative correlation between the variables.
A correlation of
means that there is no correlation at all between two variables.
The strength of correlation is measured on an absolute value scale of to
with
being the least correlated and
being the most correlated. The positive or negative in front of the correlation integer simply determines whether or not there is a positive or negative correlation between the variables.
A correlation of means that there is no correlation at all between two variables.
← Didn't Know|Knew It →
In a medical school, it is found that there is a correlation of
between the amount of coffee consumed by students and the number of hours students sleep each night. Which of the following is true?
i. There is a positive association between the two variables.
ii. There is a strong correlation between the two variables.
iii. Coffee consumption in medical school students causes students to sleep less each night.
In a medical school, it is found that there is a correlation of between the amount of coffee consumed by students and the number of hours students sleep each night. Which of the following is true?
i. There is a positive association between the two variables.
ii. There is a strong correlation between the two variables.
iii. Coffee consumption in medical school students causes students to sleep less each night.
Tap to reveal answer
Since the correlation is negative, there must be a negative association between the two variables (therefore statement i is incorrect). Statement ii is correct since a correlation of
to
on an absolute value scale of
to
is considered to be a strong correlation. Statement iii is incorrect since correlation does not mean causation.
Since the correlation is negative, there must be a negative association between the two variables (therefore statement i is incorrect). Statement ii is correct since a correlation of to
on an absolute value scale of
to
is considered to be a strong correlation. Statement iii is incorrect since correlation does not mean causation.
← Didn't Know|Knew It →
It is found that there is a correlation of exactly
between two variables. Which of the following is incorrect?
It is found that there is a correlation of exactly between two variables. Which of the following is incorrect?
Tap to reveal answer
Under no circumstance will correlation ever equate to causation, regardless of how strong the correlation between two variables is. In this case, all other answer choices are correct.
Under no circumstance will correlation ever equate to causation, regardless of how strong the correlation between two variables is. In this case, all other answer choices are correct.
← Didn't Know|Knew It →
Shawn would like to determine what the most popular television channel is in his town, called Hearne.
Which of the following would be an appropriate target population for the survey?
Shawn would like to determine what the most popular television channel is in his town, called Hearne.
Which of the following would be an appropriate target population for the survey?
Tap to reveal answer
Since Shawn would like to know the most popular television channel in his town, he must include all residents of the town in his target population. The sample must come from the directory and each person in the directory should have an equal chance of being selected. The incorrect options are too narrowly focused and are not representative of the town.
Since Shawn would like to know the most popular television channel in his town, he must include all residents of the town in his target population. The sample must come from the directory and each person in the directory should have an equal chance of being selected. The incorrect options are too narrowly focused and are not representative of the town.
← Didn't Know|Knew It →
Which of the following correlation coefficients implies the strongest relationship between variables:





Which of the following correlation coefficients implies the strongest relationship between variables:
Tap to reveal answer
A high correlation coefficient regardless of sign implies a stronger relationship. In this case
has a stronger negative relationship than the positive relationship described by a value of 
A high correlation coefficient regardless of sign implies a stronger relationship. In this case has a stronger negative relationship than the positive relationship described by a value of
← Didn't Know|Knew It →
A basketball coach wants to determine if a player's height can predict the number of points the player scores in a season. Which statistical test should the coach conduct?
A basketball coach wants to determine if a player's height can predict the number of points the player scores in a season. Which statistical test should the coach conduct?
Tap to reveal answer
Linear regression is the best option for determining whether the value of one variable predicts the value of a second variable. Since that is exactly what the coach is trying to do, he should use linear regression.
Linear regression is the best option for determining whether the value of one variable predicts the value of a second variable. Since that is exactly what the coach is trying to do, he should use linear regression.
← Didn't Know|Knew It →
Use the following five number summary to determine if there are any outliers in the data set:
Minimum: 
Q1: 
Median: 
Q3: 
Maximum: 
Use the following five number summary to determine if there are any outliers in the data set:
Minimum:
Q1:
Median:
Q3:
Maximum:
Tap to reveal answer
An observation is an outlier if it falls more than
above the upper quartile or more than
below the lower quartile.


. The minimum value is
so there are no outliers in the low end of the distribution.
. The maximum value is
so there are no outliers in the high end of the distribution.
An observation is an outlier if it falls more than above the upper quartile or more than
below the lower quartile.
. The minimum value is
so there are no outliers in the low end of the distribution.
. The maximum value is
so there are no outliers in the high end of the distribution.
← Didn't Know|Knew It →
For a data set, the first quartile is
, the third quartile is
and the median is
.
Based on this information, a new observation can be considered an outlier if it is greater than what?
For a data set, the first quartile is , the third quartile is
and the median is
.
Based on this information, a new observation can be considered an outlier if it is greater than what?
Tap to reveal answer
Use the
criteria:
This states that anything less than
or greater than
will be an outlier.
Thus, we want to find
where
.


Therefore, any new observation greater than 115 can be considered an outlier.
Use the criteria:
This states that anything less than or greater than
will be an outlier.
Thus, we want to find
where
.
Therefore, any new observation greater than 115 can be considered an outlier.
← Didn't Know|Knew It →
Which of the following is the best way to ensure a sample is representative and unbiased?
Which of the following is the best way to ensure a sample is representative and unbiased?
Tap to reveal answer
Taking a random sample is the only data-collecting method that ensures that every set of individuals has an equal chance of being selected for the sample and does not introduce any significant bias. Taking either a voluntary response sample or a convenience sample does not ensure these important aspects, as there is some unwanted selectivity involved in the data being collected. Taking a census is inferior to taking a sample of any sort, as it is both costly and time-consuming.
Taking a random sample is the only data-collecting method that ensures that every set of individuals has an equal chance of being selected for the sample and does not introduce any significant bias. Taking either a voluntary response sample or a convenience sample does not ensure these important aspects, as there is some unwanted selectivity involved in the data being collected. Taking a census is inferior to taking a sample of any sort, as it is both costly and time-consuming.
← Didn't Know|Knew It →

Which values in the above data set are outliers?
Which values in the above data set are outliers?
Tap to reveal answer
Step 1: Recall the definition of an outlier as any value in a data set that is greater than
or less than
.
Step 2: Calculate the IQR, which is the third quartile minus the first quartile, or
. To find
and
, first write the data in ascending order.
. Then, find the median, which is
. Next, Find the median of data below
, which is
. Do the same for the data above
to get
. By finding the medians of the lower and upper halves of the data, you are able to find the value,
that is greater than 25% of the data and
, the value greater than 75% of the data.
Step 3:
. No values less than 64.
. In the data set, 105 > 104, so it is an outlier.
Step 1: Recall the definition of an outlier as any value in a data set that is greater than or less than
.
Step 2: Calculate the IQR, which is the third quartile minus the first quartile, or . To find
and
, first write the data in ascending order.
. Then, find the median, which is
. Next, Find the median of data below
, which is
. Do the same for the data above
to get
. By finding the medians of the lower and upper halves of the data, you are able to find the value,
that is greater than 25% of the data and
, the value greater than 75% of the data.
Step 3: . No values less than 64.
. In the data set, 105 > 104, so it is an outlier.
← Didn't Know|Knew It →
You are given the following information regarding a particular data set:
Q1: 
Q3: 
Assume that the numbers
and
are in the data set. How many of these numbers are outliers?
You are given the following information regarding a particular data set:
Q1:
Q3:
Assume that the numbers and
are in the data set. How many of these numbers are outliers?
Tap to reveal answer
In order to find the outliers, we can use the
and
formulas.




Only two numbers are outside of the calculated range and therefore are outliers:
and
.
In order to find the outliers, we can use the and
formulas.
Only two numbers are outside of the calculated range and therefore are outliers: and
.
← Didn't Know|Knew It →