Statistics

Description: This quiz covers various concepts and techniques in Statistics.
Number of Questions: 15
Created by:
Tags: statistics probability data analysis descriptive statistics inferential statistics
Attempted 0/15 Correct 0 Score 0

Which of the following is a measure of central tendency?

  1. Mean

  2. Median

  3. Mode

  4. Range


Correct Option: A
Explanation:

Mean is a measure of central tendency that represents the average of a set of values.

The probability of an event occurring is represented by:

  1. P(A)

  2. P(not A)

  3. P(A or B)

  4. P(A and B)


Correct Option: A
Explanation:

P(A) represents the probability of event A occurring.

In a normal distribution, the area under the curve between two z-scores represents:

  1. The probability of a value falling within that range

  2. The mean of the distribution

  3. The standard deviation of the distribution

  4. The skewness of the distribution


Correct Option: A
Explanation:

The area under the curve between two z-scores in a normal distribution represents the probability of a value falling within that range.

Which of the following is a non-parametric test?

  1. t-test

  2. ANOVA

  3. Chi-square test

  4. Correlation analysis


Correct Option: C
Explanation:

Chi-square test is a non-parametric test that is used to determine whether there is a significant relationship between two categorical variables.

The coefficient of determination (R-squared) in a linear regression model represents:

  1. The proportion of variance in the dependent variable explained by the independent variable

  2. The slope of the regression line

  3. The y-intercept of the regression line

  4. The standard error of the estimate


Correct Option: A
Explanation:

R-squared represents the proportion of variance in the dependent variable that is explained by the independent variable.

Which of the following is a measure of variability?

  1. Mean

  2. Median

  3. Mode

  4. Standard deviation


Correct Option: D
Explanation:

Standard deviation is a measure of variability that represents the spread of data around the mean.

The process of collecting, organizing, and summarizing data is known as:

  1. Data analysis

  2. Data mining

  3. Data visualization

  4. Data collection


Correct Option: D
Explanation:

Data collection is the process of gathering and measuring information on targeted variables in an organized manner so that it can be analyzed, interpreted, and used.

In a hypothesis testing scenario, the null hypothesis (H0) represents:

  1. The hypothesis that is being tested

  2. The hypothesis that is assumed to be true

  3. The hypothesis that is rejected if the p-value is less than the significance level

  4. The hypothesis that is accepted if the p-value is greater than the significance level


Correct Option: B
Explanation:

The null hypothesis (H0) represents the hypothesis that is assumed to be true until proven otherwise.

Which of the following is a type of probability distribution that is used to model the distribution of continuous random variables?

  1. Binomial distribution

  2. Poisson distribution

  3. Normal distribution

  4. Uniform distribution


Correct Option: C
Explanation:

Normal distribution is a type of probability distribution that is used to model the distribution of continuous random variables.

The probability of obtaining at least one success in a sequence of independent trials is calculated using the:

  1. Binomial distribution

  2. Poisson distribution

  3. Normal distribution

  4. Uniform distribution


Correct Option: A
Explanation:

Binomial distribution is used to calculate the probability of obtaining at least one success in a sequence of independent trials.

Which of the following is a graphical representation of the distribution of data?

  1. Histogram

  2. Scatter plot

  3. Bar chart

  4. Pie chart


Correct Option: A
Explanation:

Histogram is a graphical representation of the distribution of data that shows the frequency of occurrence of different values.

The process of using sample data to make inferences about a larger population is known as:

  1. Sampling

  2. Inference

  3. Estimation

  4. Hypothesis testing


Correct Option: B
Explanation:

Inference is the process of using sample data to make inferences about a larger population.

Which of the following is a technique used to reduce the dimensionality of a dataset?

  1. Principal component analysis

  2. Factor analysis

  3. Cluster analysis

  4. Discriminant analysis


Correct Option: A
Explanation:

Principal component analysis is a technique used to reduce the dimensionality of a dataset by identifying the principal components that explain the majority of the variance in the data.

The probability of obtaining exactly k successes in a sequence of n independent trials with probability of success p is given by the:

  1. Binomial distribution

  2. Poisson distribution

  3. Normal distribution

  4. Uniform distribution


Correct Option: A
Explanation:

Binomial distribution is used to calculate the probability of obtaining exactly k successes in a sequence of n independent trials with probability of success p.

Which of the following is a measure of the strength and direction of the linear relationship between two variables?

  1. Correlation coefficient

  2. Regression coefficient

  3. Coefficient of determination

  4. Standard error of the estimate


Correct Option: A
Explanation:

Correlation coefficient is a measure of the strength and direction of the linear relationship between two variables.

- Hide questions