Long time no type, folks! I still live, and am still working at the university, but my old office was sacked during COVID and I've been bounced across four units and counting being a "fixer" to solve various office problems, and once solved get moved to a new failing office to sort things out.The situation:
In recent weeks I've been transferred to a data unit to cover for a supervisor who took on all sorts of tasks the office wasn't equipped for and then fucked off to the private sector, leaving the office (and a large number of stakeholders) in something of a lurch. I've been able to put out most of the fires since being brought over here, but mostly the office is equipped for large set data analysis on a more descriptive statistics side of things. We have never had to do inferential statistics for the basic role. Enter this final problem, from which the former supervisor said "YES, WE CAN DO THIS!" literally on the day he left and left us holding the bag. While all of us ~have~ done statistics in the past, the most recent one of us had grad statistics something like 6 years ago so we are super rusty. We need guidance toward what tests we should be doing with this data to answer the questions asked of us. We know how to run the tests (that's easy enough for us to find out), but currently we are at odds on if ANOVA, Chi Squared, or something else would be the best tools here.The data:
We have a set of data which is essentially as follows:
- Variable 1: Pass/Fail (How many students passed a specific test). This is represented in raw counts.
- Variable 2: Cohort Name: Special Cohort A, Special Cohort B, Comparison Group (Randomly Selected Student Sample who weren't in those cohorts)
- Variable 3: Cohort Hear (2020-21, 2019-2020, 2018-19 etc. For a total of five years for each cohort)
We have been tasked with answering the following questions:
- Are test pass rates different then a Comparison Group for Special Cohort A and Special Cohort B
- Are test pass rates different between Cohort A and Cohort B
- "Superiority" (by which we understand this to mean is one of the cohorts statistically "better" than the other)
Failure to deliver on this isn't the end of the world, but the unit that asked for this information from the supervisor that bailed has been poorly treated in the past and we would like to support them given that this helps a special needs target population and could lead to them getting funding which would help that population a whole lot. Any advice on what direction we should be going here is greatly appreciated!