Probability, Distributions, Hypothesis Testing, and Measures
Question
Unit V Homework Assignment
This is the Textbook we are using. We are on Chapters 14, 15 and 16.
Instructions
Probability, Distributions, Hypothesis Testing, and Measures
This is a three-part assignment that will be submitted as one document. You will need a title and reference page for this assignment. You do not need to include the introduction or conclusion sections.
Part I. Based on what you have learned about probability and distributions, hypothesis testing, and measures of association, answer the following:
- What statistical tests will you use to test your null hypothesis (H0)?
- Explain why your chosen statistical tests are appropriate for testing the null hypothesis (H0)?
- How will you determine correlations between variables?
Part II. Go to the U.S. Bureau of Labor Statistics Table A-1: Employment status of the civilian population by sex and age.
Table A-1 presents the employment status of the civilian population by sex and age and is updated monthly. For this assignment, select the data from the latest available month under the Seasonally Adjusted header column, men, 20 years and over employed and unemployed, and women, 20 years and over, employed, and unemployed.
Using any software of your choice and data from Table A-1, develop the following tables and figures, and conduct the following statistical tests.
- Create a contingency table and calculate the frequencies.
- Create an expected frequencies table.
- Calculate the Chi-Square statistic.
- Calculate the t-Statistic.
- Develop a standard scatter plot for the variables.
- Determine if your data level is nominal, ordinal, or continuous. Then calculate the Phi coefficient for nominal level data, Gamma for ordinal level data, or Pearson’s r for continuous level data.
Export the tables and figure(s) into a Word document. Tables and figures are required to follow APA Style.
Part III. As in previous units, revisions to the hypothesis and research questions may be necessary. If you feel revisions are needed, go ahead, and make them. If no revisions are needed, justify your decision. Keep in mind you will be developing a quantitative research proposal.
Your final submitted document must be at least three pages long. The title and reference pages do not apply to the page count. You may use the textbook and at least two peer-reviewed scholarly articles from the CSU Online Library as resources.
Adhere to APA Style when constructing this assignment, including in-text citations and references for all sources that are used. Please note that no abstract is needed.
Solution
Probability, Distributions, Hypothesis Testing, and Measures
PART I
Statistical hypothesis testing refers to the statistical methods employed in gaining inference from a set of data. According to Simar and Wilson (2020), the methods are used to develop a suitable conclusion from two sets of contrasting and possibly conflicting hypotheses. The research project is quantitative, and as such, only statistical tests will be conducted to determine the null hypotheses. The research study has formulated two null hypotheses that it intends to test. Both hypotheses pertain to the relationship between the ADA transition plan among public agencies in Florida and post-secondary school transition into employment and marginalized communities and people living with disabilities. The t-test is used to determine the differences between the two means. The test will be appropriate for this study since it is simple and provides an interesting comparison standard. In other words, t-tests are often simple to interpret while the analysis is robust. Besides, the test is also characterized by the ease of collecting data and computation.
Another important test that will be used for the set of null hypotheses is the analysis of variance. The analysis of variance (ANOVA) is one of the most suitable tests to adopt whenever comparing two sets of means (Zhu & Bradic, 2018). Since only one independent variable, the ADA transition plan, will be used in the study, the one-way ANOVA test will be utilized. The Test is suitable as it can control type 1 error and provides the basis for a general comparison of means. The final test used for the questions and null hypothesis testing is Pearson’s t-test. The test is conducted to establish the strength of the relationship between variables. Many researchers consider Pearson’s r test as the best for determining the association between variables. The test also provides information about the magnitude of the relationship. The correlation between the dependent and independent variables will be determined by testing how their means are closely related. For instance, if the mean for ADA transition and the rate of secondary school transition to the job are not closely related, the results from the three tests will show a value of zero. A negative value will mean a weak correlation while a positive value a strong correlation.
PART II
Employed and Unemployed Men and Women age 20 Years and Over in the U.S
Men 20 Years and Over |
Women 20 Years and Over |
||
Employed |
Unemployed |
Employed |
Unemployed |
80,767 |
3,199 |
70,857 |
2,637 |
Source: U.S Bureau of Labor Statistics.
Contingency Table
Men Employed / Men Unemployed |
Category = 3 |
Category = 199 |
Women Employed / Women Unemployed |
Category = 2 |
Category = 637 |
Category = 80 |
1 |
0 |
Category = 70 |
1 |
0 |
Category = 767 |
0 |
1 |
Category = 857 |
0 |
1 |
Expected Frequencies Table
Frequency table: |
|
|
|
|
element |
frequency |
cumulative frequency |
relative frequency |
cumulative relative frequency |
2637 |
1 |
1 |
0.25 |
0.25 |
3199 |
1 |
2 |
0.25 |
0.5 |
70857 |
1 |
3 |
0.25 |
0.75 |
80767 |
1 |
4 |
0.25 |
1 |
|
|
|
|
|
1. From the tables, the frequency distribution value is 1. Frequency in statistics refers to the number of times a single value appears in a set of values.
2. The Chi-Square statistic at the significance level of 0.5 equals 5.4035, while the p-value is 0 .020097. Chi-square determines the difference between the observed frequencies and expected frequencies in statistics. The p-value refers to the probability that the observed values could occur randomly.
3. The T-Statistic equals 1.982273. T-Statistic, in simple terms, refers to the differences in the means between two groups in statistics. In this case, the differences in means are taken between the total number of employed and unemployed males and females.
Three main data types are used in statistics, including nominal, ordinal, and continuous data. Nominal data is characterized by variables labeled without any quantitative value or order. On the other hand, Ordinal data have natural ordering with values representing some kind of relative position. The type of data occupies a position between quantitative and qualitative data and cannot be subjected to any form of statistical analysis. Finally, continuous data is often expressed in the form of fractions. The data presented above falls under the category of data referred to as discrete data. Discrete data is separate or distinct and contains values with the whole number such as 80,767, 3,199, 70,857, and 2,637, among others. Since the data is discrete, it cannot be subjected to the three types of tests.
Part III
The two null hypotheses formulated for this research study are as follows:
- The percentage of local public agencies in Florida with ADA transition plans have no direct impact on post-secondary school transition into employment.
- The ADA transition plans in Florida have no significant impact on marginalized communities and people living with disabilities.
Since the research study is formulated to be quantitative, the two null hypotheses seem suitable for the study at this point. The reason is that they can easily be tested through the here identified statistical testing methods to provide the required results for analysis. Therefore, the null hypotheses that have been formulated for this quantitative study will not be subjected to any changes.
References
Simar, L., & Wilson, P. W. (2020). Hypothesis testing in nonparametric models of production using multiple sample splits. Journal of Productivity Analysis, 53(3), 287-303.
Zhu, Y., & Bradic, J. (2018). Linear hypothesis testing in dense high-dimensional linear models. Journal of the American Statistical Association, 113(524), 1583-1600.
About Author
Tough Essay Due? Hire Tough Essay Writers!
We have subject matter experts ready 24/7 to tackle your specific tasks and deliver them ON TIME, ready to hand in. Our writers have advanced degrees, and they know exactly what’s required to get you the best possible grade.
Find the right expert among 500+
We hire Gradewriters writers from different fields, thoroughly check their credentials, and put them through trials.
View all writers