Author | Affiliation |
---|---|
Michael Darracq, MD, MPH | University of California, San Francisco, Department of Emergency Medicine, Fresno, California |
Fred Wu, MHS, PA-C | University of California, San Francisco, Department of Emergency Medicine, Fresno, California |
Dear Editor:
We wish to express appreciation to the authors for their kind words regarding our research article.1 We also appreciate the opportunity to respond to the methodological concerns raised.
The authors raise the concern that in the data analysis subsection there are no details regarding the statistical methodology that is used to compare group differences. We feel that this is an incorrect statement. Proportions with 95% confidence intervals are an accepted and often recommended method of comparison between groups. We disagree that hypothesis testing, as suggested by the authors, is necessary as hypothesis testing and confidence intervals rely on the same underlying methodology. Statistical significance can be determined by P-values or confidence intervals. These two approaches always agree.2
In 2016, the American Statistical Association released a position paper describing the over reliance on P-values in medical research and considerable misuses and misconceptions regarding P-values. They also described the use of alternative methods that emphasize estimation over testing, such as confidence, prediction, or credibility intervals.3 We also chose to limit the number of statistical tests performed because of issues related to multiple comparisons. As more attributes are compared, the greater the likelihood of the groups being different on the basis of random sampling error alone.4 The few P-values reported are the result of the two-sample z test for proportion, the only statistical test that we, as nonprofessional statisticians, are aware of for comparing proportions. This was requested by the journal editors to support our statements regarding statistically significant differences.
The authors also cite a reference that describes ways to interpret a graph with error bars as it relates to confidence intervals and overlap in determining a P-value or statistical significance.5 No inference was performed on the basis of overlap of confidence intervals. As this was not the methodology that was used in the present study, any description of inability to determine intergroup difference is not relevant.
Thank you for your comments and time on this important topic.
Footnotes
Section Editor: Mark I Langdorf, MD, MHPE
Full text available through open access at http://escholarship.org/uc/uciem_westjem
Address for Correspondence: Fred Wu, MHS, PA-C, UCSF Fresno, Department of Emergency Medicine, 155 N. Fresno Street, Fresno, California 93701. Email: frederick.wu@ucsf.edu. 9 / 2022; 23:793 – 793
Submission history: Revision received August 17, 2022; Accepted August 17, 2022
Conflicts of Interest: By the WestJEM article submission agreement, all authors are required to disclose all affiliations, funding sources and financial or management relationships that could be perceived as potential sources of bias. No author has professional or financial relationships with any companies that are relevant to this study. There are no conflicts of interest or sources of funding to declare.
REFERENCES
1. Wu F, Darracq MA. Comparing physician assistant and nurse practitioner practice in U.S. emergency departments, 2010–2017. West J Emerg Med. 2021;22(5):1150-5.
2. Frost J. Hypothesis testing and confidence intervals. Statistics By Jim. Available at: https://statisticsbyjim.com/hypothesis-testing/hypothesis-tests-confidence-intervals-levels/. Accessed August 5, 2022.
3. Wasserstein RL, Lazar NA. The ASA statement on -values: context, process, and purpose. The American Statistician. 2016;70(2):129-133.
4. Multiple comparisons problem. Wikipedia. Available at: https://en.wikipedia.org/wiki/Multiple_comparisons_problem#References. Accessed August 5, 2022.
5. Cumming G, Finch S. Inference by eye: confidence intervals and how to read pictures of data. Am Psychol. 2005;60(2):170-80.