TY - GEN
T1 - “Medium-n studies" in computing education conferences
AU - Guerzhoy, Michael
N1 - Publisher Copyright:
© 2023 Copyright held by the owner/author(s).
PY - 2023/11/13
Y1 - 2023/11/13
N2 - Good (Frequentist) statistical practice requires that statistical tests be performed in order to determine if the phenomenon being observed could plausibly occur by chance if the null hypothesis is false. Good practice also requires that a test is not performed if the study is underpowered: if the number of observations is not sufficiently large to be able to reliably detect the effect one hypothesizes, even if the effect exists [7]. Running underpowered studies runs the risk of false negative results. This creates tension in the guidelines and expectations for computer science education conferences: while things are clear for studies with a large number of observations, researchers should in fact not compute p-values and perform statistical tests if the number of observations is too small [2]. The issue is particularly live in CSed venues, since class sizes where those issues are salient are common We outline the considerations for when to compute and when not to compute p-values in different settings encountered by computer science education researchers. We survey the author and reviewer guidelines in different computer science education conferences (ICER, SIGCSE TS, ITiCSE, EAAI, CompEd, Koli Calling). We present summary data and make several preliminary observations about reviewer guidelines: guidelines vary from conference to conference; guidelines allow for qualitative studies, and, in some cases, experience reports, but guidelines do not generally explicitly indicate that a paper should have at least one of (1) an appropriately-powered statistical analysis or (2) rich qualitative descriptions. We present preliminary ideas for addressing the tension in the guidelines between small-n and large-n studies.
AB - Good (Frequentist) statistical practice requires that statistical tests be performed in order to determine if the phenomenon being observed could plausibly occur by chance if the null hypothesis is false. Good practice also requires that a test is not performed if the study is underpowered: if the number of observations is not sufficiently large to be able to reliably detect the effect one hypothesizes, even if the effect exists [7]. Running underpowered studies runs the risk of false negative results. This creates tension in the guidelines and expectations for computer science education conferences: while things are clear for studies with a large number of observations, researchers should in fact not compute p-values and perform statistical tests if the number of observations is too small [2]. The issue is particularly live in CSed venues, since class sizes where those issues are salient are common We outline the considerations for when to compute and when not to compute p-values in different settings encountered by computer science education researchers. We survey the author and reviewer guidelines in different computer science education conferences (ICER, SIGCSE TS, ITiCSE, EAAI, CompEd, Koli Calling). We present summary data and make several preliminary observations about reviewer guidelines: guidelines vary from conference to conference; guidelines allow for qualitative studies, and, in some cases, experience reports, but guidelines do not generally explicitly indicate that a paper should have at least one of (1) an appropriately-powered statistical analysis or (2) rich qualitative descriptions. We present preliminary ideas for addressing the tension in the guidelines between small-n and large-n studies.
UR - http://www.scopus.com/inward/record.url?scp=85185531886&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85185531886&partnerID=8YFLogxK
U2 - 10.1145/3631802.3631854
DO - 10.1145/3631802.3631854
M3 - Conference contribution
AN - SCOPUS:85185531886
T3 - ACM International Conference Proceeding Series
BT - Proceedings of 23rd International Conference on Computing Education Research, Koli Calling 2023
PB - Association for Computing Machinery
T2 - 23rd International Conference on Computing Education Research, Koli Calling 2023
Y2 - 13 November 2023 through 19 November 2023
ER -