Instructions: (Points will be deducted for failure to follow instructions) Use the PennStateData dataset to answer the questions below using Microsoft Excel 2016. Write a professional report of your analysis. 1. You must turn in neatly formatted copies of both the write up and the excel workbook. 2. You should include a title page with your name, course, project name, and data. 3. You should include page numbers in your report. 4. Your results must be written in clear, professional English, with proper grammar, spelling, and punctuation. 5. Your answers should be numbered according to the original question number. 6. You must include proper formatting and descriptive titles, axis labels, and data labels on all graphs and tables. 7. Copy and paste your excel tables and graphs AS PICTURES in your final write document. 8. Graphs and tables should be number sequentially according to the way they appear in your write up (i.e. Figure 1: Hist…, Figure 2: Box…, Table 1: Data…, Table 2: Desc…, etc.). Be sure to reference the graphs and tables in your write up when reporting on the analysis. Summary: Data from n=205 students in a statistics class for students in the social and behavioral sciences. The survey was done in the Spring semester of 2000. (Source: Mind on Statistics (Utts and Heckard)). The dataset has been modified for this homework. Variable Names in order from left to right: Column Name Description A Gender What is your gender orientation? (Male, Female, Non Binary) B Friends What gender do you find it easier to make friends with? (Opposite, NoDiff, or Same) C Tattoo Do you have a tattoo? (Yes or No) D EarPrc How many ear piercings do you have? E CDs How many CDs do you own? F Height What is your height (in inches)? Getting to Know the Dataset: Create a new sheet called “Data Dict”. Create table in this sheet 1. Create a table that lists all the variables in the data set. In this table, also include headings “Label” (a more thorough description of the variable, if needed); “General Type” (either quantitative or categorical); “Specific Type” (either identifier variable, discrete, continuous, nominal, or ordinal). Provide this information for each variable in the data set. Finally, include a heading “Measurement Units” and specify the measurement units for quantitative variables. Use N/A when there are no measurement units. Univariate Quantitative Analysis: Copy and paste the quantitative variables onto a new sheet. Name the sheet “Uni Quant”. Create all tables and graphs in this sheet 2. Provide a table of descriptive statistics for the quantitative variables (including measurements of central tendency and dispersion). Use the Descriptive Statistics analysis tool when possible. 3. Create a histogram and boxplot for each the quantitative variables. 4. For each the quantitative variables, explain why the mean or the median is the best representation of central tendency and the standard deviation or IQR of dispersion. This is decided by examining the histogram and boxplot. Identify the appropriate measures and state the associated values from the descriptive statistics table in your write up. Univariate Categorical Analysis: Copy and paste the variables “Gender” and “Friends” into a new sheet. Name the sheet “Uni Cat”. Create all tables and graphs in this sheet 5. Create univariate frequency tables (frequency and percent) for the variables “Gender” and “Friends”. 6. Create a bar chart and pie chart for the variables “Gender” and “Friends”. 7. For each variables comment on the frequency distribution including the mode. Bivariate Analysis 2 Categorical: Copy and paste the variables “Gender” and “Friends” into a new sheet. Name the sheet “Bi 2 Cat”. Create all tables and graphs in this sheet 8. Using the Pivot Table tool, provide a contingency table of “Gender” by “Friends”. Have “Gender” represented by the rows and have “Friends” represented by the columns. Report the frequency counts and all three percentages – row, column and total. You will have four tables. 9. Briefly explain the differences between these types of percentages displayed in each contingency table you created. 10. Use the frequency counts to create a stacked bar chart. Then create a 100% stacked bar chart. “Gender” should be represented along the x-axis, and “Friends” should be represented by the legend. 11. Describe the differences between these graphics and determine which graph is a better visualization of the relationship of the two variables. Bivariate Analysis 1 Cat 1 Quant: Copy and paste the variables “Gender” and “Friends” into a new sheet. Name the sheet “Bi 1 Cat 1 Quant”. Create all tables and graphs in this sheet 12. Perform a stratified analysis of the mean of the three quantitative variables by “Tattoo” status. Compare and contrast the mean number of piercings, the mean number of CD’s, and the mean height between these two groups. 13. Create side-by-side boxplots of the “EarPrc” variable stratified by the “Tattoo” variable. 14. Comment on the distribution of “EarPrc” between each of the “Tattoo” groups Bivariate Analysis 2 Quant: Copy and paste the variables “Height” and “CDs” into a new sheet. Name the sheet “2 Quant”. Create all tables and graphs in this sheet 15. Create a scatterplot of the variables “Height” and “CDs” (this relationship doesn’t make logical sense, but we will examine it for illustrative purposes and practice with the tools) 16. If there is an outlier remove it, by deleting the observation, and rescale the axes to create a neater display. 17. Calculate the correlation coefficient and briefly describe on the relationship you observe between these two variables, based on shape, strength, and direction. Confidence Interval: Copy and paste the variable “Height” into a new sheet. Name the sheet “Conf Int”. Create the table in this sheet 18. Construct and report 90%, 95% and 99% confidence intervals for the quantitative variable, Height, in a table. 19. Interpret the 95% confidence interval in context. Variable Creation: Copy and paste the variable “CDs” into a new sheet. Name the sheet “Var Create”. Create the new variable and the table in this sheet 20. Create a new categorical variable using the “CDs” variable named “CD_Cat”. Categorize the number music CDs each student owns (BelowAvg = 0 to 30, Average = 31 to 80, AboveAvg = more than 80) 21. Create a frequency table of the newly created categorical variable Sampling: Copy and paste the dataset into a new sheet. Name the sheet “Sample”. Create the random sample and the table in this sheet 22. Using the RAND function, generate a random sample of 30 observations. Display your sample with the random numbers in a formatted table. 23. Generate the descriptive statistics for the quantitative variables only for this sample and display them in a table.

