Let’s integrate all (most) of what we’ve learned together over the past two terms. You may work on your own or collaborate with one (max 2) partner. Please make sure that you engage in a a full, fair and mutually-agreeable collaboration if you do choose to collaborate. If you do collaborate, you should plan, execute and write-up your analyses together, not simply divide the work. Please make sure to indicate clearly when your work is joint and any other individual or resource (outside of class material) you consulted in your responses.
Please upload below two files on Canvas:
Please submit a complete “proto” paper (see below) that includes all figures/tables integrated into the manuscript (can be either in-line or at the end of the manuscript), uses full sentences and does not include any code chunks interspersed (you will be graded on this). Please format your manuscript, tables and figures following APA 7 guidelines.
Calculate and interpret descriptive statistics about categorical and continuous quantitative data
Use the General Linear Model framework to answer relational research questions that are amenable to the use of quantitative methods to respond to these questions
Using tables, figures and words, interpret the results of your analysis for an academic research audience
The data set we’ll be using for this assignment is drawn from the Stanford Education Data Archive (SEDA) version 4.1. SEDA was launched in 2016 to provide nationally comparable, publicly available test score data for U.S. public school districts, allowing scientific inquiries on the relationships between educational conditions, contexts, and outcomes (especially student math/ELA achievements) at the district- (and school-) level across the nation. It contains rich variables including measures of academic achievement and achievement gaps for school districts and counties, as well as district-level measures of racial and socioeconomic composition, racial and socioeconomic segregation patterns, and other features of the schooling system. Some descriptive findings can be found here. Findings from the SEDA data have drawn high-profile media coverage on levels of inequality and differential rates of growth across districts.
Analytic Sample. Our data set is district-level data for all United States school districts, with some restrictions due to enrollment and data availability. The test-score outcome data has been averaged across the 10 years of the data collection window. Observations with missing values on any of the key variables were deleted for simplification reasons. After these restrictions, our dataset includes 12,239 observations.
Key variables. The data set contains 24 variables, detailed below.
Data preparation: Open your RStudio, create a project and save it. Go to the root directory of the project and create folders named: “Code”, “Data”, “Figures” and “Tables.” Download the seda.csv dataset and store it in the folder “Data”. Create an R script (or .Rmd) file in the Code folder. Read the data into your R environment. You do not need to include this part of the response in your memo; only in your code.
Your final assessment in EDUC 643 is to postulate one (or more) relational research questions that are amenable to the use of quantitative methods to respond to these questions. You will pose these research questions in the context of the SEDA data and analyze these data to seek answers to these questions. Then, you will construct an “proto” academic research paper synthesizing the results of your analysis. We call this a “proto” paper because (1) you will not spend any time contextualizing your study within the research literature; and (2) your results will be preliminary.
Your “proto” paper should be structured as follows:
Please review the rubric on Canvas for more details on how the assignment will be assessed.
NOTE: It is possible to spend weeks on this assignment; however, it is intended to be completed in 10 hours. Please limit yourselves to 25 hours maximum on this task. If you find yourself approaching this limit, please conclude your paper by describing how you would complete the analysis tasks and submit your work.