Medical Citations for Statistical Issues
General: [9, 8, 35, 38, 59, 5]
Graphics: [89, 25, 100, 101, 96, 97, 98, 7, 26, 24, 27, 105, 103, 22, 39, 73]
Confidence Intervals: [87, 14, 41, 15, 13, 75, 81]
Statistical Significance:[76, 15, 3, 86]
Sample Size Determination:[62]
Reporting of Clinical Trials: [11, 23, 43, 93, 92, 64, 86, 21, 31, 33, 50, 28]
Generalizability of Clinical Trials:[33, 50]
Meta-Analysis, Publication Bias: [59, Chapter 11],[58, 80, 85, 31, 63, 51, 50, 40, 84, 94, 55, 70]
Reporting of Survival Analyses: [4, 57, 20, 53]
Diagnostic Tests: [79, 51, 72, 90, 10, 44, 48, 18, 42]
Subgroup Analysis: [31, 71, 106, 63, 104]
Multivariable Models: [30, 16, 45, 37, 60, 90, 57, 17, 46, 20, 53, 56]
Problems from Categorizing a Continuous Variable: [1, 47, 61, 6, 12, 36, 74, 91, 19, 68, 82, 2, 49, 77, 102]
Missing Data: [34, 69, 99]
Observational Studies, Retrospective Studies, Bias, Adjustment: [32, 20, 78, 28]
Assessing Equivalence: [52]
Measuring Change: [54, 95, 65, 88, 29]
Analysis of Serial (Repeated) Measurements: [67, 66, 83]

References

[1]
D. G. Altman. Categorising continuous covariates (letter to the editor). Brit J Cancer, 64:975, 1991.

[2]
D. G. Altman. Suboptimal analysis using `optimal' cutpoints. Brit J Cancer, 78:556–557, 1998.

[3]
D. G. Altman and J. M. Bland. Absence of evidence is not evidence of absence. BMJ, 311:485, 1995.

[4]
D. G. Altman, B. L. De Stavola, S. B. Love, and K. A. Stepniewska. Review of survival analyses published in cancer journals. Brit J Cancer, 72:511–518, 1995.

[5]
D. G. Altman, S. N. Goodman, and S. Schroter. How statistical expertise is used in medical research. JAMA, 287:2817–2820, 2002.

[6]
D. G. Altman, B. Lausen, W. Sauerbrei, and M. Schumacher. Dangers of using `optimal' cutpoints in the evaluation of prognostic factors. J Nat Cancer Inst, 86:829–835, 1994.

[7]
F. J. Anscombe. Graphs in statistical analysis. Am Statistician, 27:17–21, 1973.

[8]
J. C. Bailar III and F. Mosteller. Guidelines for statistical reporting in articles in medical journals. Ann Int Med, 108:266–273, 1988.

[9]
J. C. Bailar III and F. Mosteller. Medical Uses of Statistics. NEJM Books, Boston, second edition, 1995.

[10]
D. Bamber. The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. J Mathe Psych, 12:387–415, 1975.

[11]
C. Begg, M. Cho, S. Eastwook, R. Horton, D. Moher, I. Olkin, and et al. Improving the quality of reporting of randomized controlled trials. The Consort statement. JAMA, 276:637–639, 1996.

[12]
H. Belcher. The concept of residual confounding in regression models and some applications. Stat Med, 11:1747–1758, 1992.

[13]
M. Borenstein. The case for confidence intervals in controlled clinical trials. Controlled Clin Trials, 15:411–428, 1994.

[14]
L. E. Braitman. Confidence intervals extract clinically useful information from data. Ann Int Med, 108:296–298, 1988.

[15]
L. E. Braitman. Confidence intervals assess both clinical significance and statistical significance. Ann Int Med, 114:515–517, 1991.

[16]
L. E. Braitman and F. Davidoff. Predicting clinical states in individual patients. Ann Int Med, 125:406–412, 1996.

[17]
S. R. Brazer, F. S. Pancotto, T. T. Long III, F. E. Harrell, K. L. Lee, M. P. Tyor, and D. B. Pryor. Using ordinal logistic regression to estimate the likelihood of colorectal neoplasia. J Clin Epi, 44:1263–1270, 1991.

[18]
H. Brenner and O. Gefeller. Variation of sensitivity, specificity, likelihood ratios and predictive values with disease prevalence. Stat Med, 16:981–991, 1997.

[19]
P. Buettner, C. Garbe, and I. Guggenmoos-Holzmann. Problems in defining cutoff points of continuous prognostic factors: Example of tumor thickness in primary cutaneous melanoma. J Clin Epi, 50:1201–1210, 1997.

[20]
K. Bull and D. Spiegelhalter. Survival analysis in observational studies. Stat Med, 16:1041–1074, 1997.

[21]
R. M. Califf, K. F. Adams, W. J. McKenna, M. Gheorghiade, B. F. Uretsky, S. E. McNulty, H. Darius, K. Schulman, F. Zannad, E. Handberg-Thurmond, F. E. Harrell, W. Wheeler, J. Soler-Soler, and K. Swedberg. A randomized controlled trial of epoprostenol therapy for severe congestive heart failure: The Flolan International Randomized Survival Trial (FIRST). Am Heart J, pages 44–54, 1997.

[22]
D. B. Carr. Designing linked micromap plots for states with many counties. Stat Med, 20:1331–1339, 2001.

[23]
T. C. Chalmers, H. Smith, B. Blackburn, B. Silverman, B. Schroeder, D. Reitman, and A. Ambroz. A method for assessing the quality of a randomized control trial. Controlled Clin Trials, 2:31–49, 1981.

[24]
W. S. Cleveland. Graphs in scientific publications (c/r: 85v39 p238-239). Am Statistician, 38:261–269, 1984.

[25]
W. S. Cleveland. The Elements of Graphing Data. Hobart Press, Summit, NJ, 1994.

[26]
W. S. Cleveland and R. McGill. A color-caused optical illusion on a statistical graph. Am Statistician, 37:101–105, 1983.

[27]
W. S. Cleveland and R. McGill. Graphical perception: Theory, experimentation, and application to the development of graphical methods. J Am Stat Assoc, 79:531–554, 1984.

[28]
V. Cohn. A perspective from the press: how to help reporters tell the truth. Stat Med, 20:1341–1346, 2001.

[29]
T. J. Cole. Sympercents: symmetric percentage differences on the 100 loge scale simplify the presentation of log transformed data. Stat Med, 19:3109–3125, 2000.

[30]
J. Concato, A. R. Feinstein, and T. R. Holford. The risk of determining risk with multivariable models. Ann Int Med, 118:201–210, 1993.

[31]
C. E. Counsell, M. J. Clarke, J. Slattery, and P. A. G. Sandercock. The miracle of DICE therapy for acute stroke: fact or fictional product of subgroup analysis? BMJ, 309:1677–1681, 1994.

[32]
R. B. D'Agostino and H. Kwan. Measuring effectiveness: What to expect without a randomized control group. Med Care, 33:AS95–AS105, 1995.

[33]
A. L. Dans, L. F. Dans, G. H. Guyatt, S. Richardson, and The Evidence-Based Medicine Working Group. How to decide on the applicability of clinical trial results to your patient. JAMA, 279:545–549, 1998.

[34]
A. R. T. Donders, G. J. M. G. van der Heijden, T. Stijnen, and K. G. M. Moons. Review: A gentle introduction to imputation of missing values. J Clin Epi, 59:1087–1091, 2006.

[35]
B. S. Everitt. The Cambridge Dictionary of Statistics in the Medical Sciences. Cambridge University Press, New York, 1995.

[36]
D. Faraggi and R. Simon. A simulation study of cross-validation for selecting an optimal cutpoint in univariate survival analysis. Stat Med, 15:2203–2213, 1996.

[37]
A. R. Feinstein. Multivariable Analysis. Yale University Press, New Haven, Connecticut, 1996.

[38]
J. L. Garb. Understanding Medical Research: A Practitioner's Guide. Little, Brown, Boston, 1996.

[39]
A. Gelman, C. Pasarica, and R. Dodhia. Let's practice what we preach: Turning tables into graphs. Am Statistician, 56:121–130, 2002.

[40]
Z. B. Gerbarg and R. I. Horwitz. Resolving conflicting clinical trials: Guidelines for meta-analysis. J Clin Epi, 41:503–509, 1988.

[41]
S. N. Goodman and J. A. Berlin. The use of predicted confidence intervals when planning experiments and the misuse of power when interpreting results. Ann Int Med, 121:200–206, 1994.

[42]
I. Guggenmoos-Holzmann and H. C. van Houwelingen. The (in)validity of sensitivity and specificity. Stat Med, 19:1783–1792, 2000.

[43]
D. C. Hadorn, D. Baker, J. S. Hodges, and N. Hicks. Rating the quality of evidence for clinical practice guidelines. J Clin Epi, 49:749–754, 1996.

[44]
J. A. Hanley and B. J. McNeil. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143:29–36, 1982.

[45]
F. E. Harrell, K. L. Lee, and D. B. Mark. Multivariable prognostic models: Issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med, 15:361–387, 1996.

[46]
F. E. Harrell, K. L. Lee, and B. G. Pollock. Regression models in clinical studies: Determining relationships between predictors and response. J Nat Cancer Inst, 80:1198–1202, 1988.

[47]
S. G. Hilsenbeck and G. M. Clark. Practical p-value adjustment for optimally selected cutpoints. Stat Med, 15:103–112, 1996.

[48]
M. A. Hlatky, D. B. Pryor, F. E. Harrell, R. M. Califf, D. B. Mark, and R. A. Rosati. Factors affecting the sensitivity and specificity of the exercise electrocardiography. Multivariable analysis. Am J Med, 77:64–71, 1984.

[49]
N. Hollander, W. Sauerbrei, and M. Schumacher. Confidence intervals for the effect of a prognostic factor after selection of an `optimal' cutpoint. Stat Med, 23:1701–1713, 2004.

[50]
J. P. A. Ioannidis and J. Lau. The impact of high-risk patients on the results of clinical trials. J Clin Epi, 50:1089–1098, 1997.

[51]
L. Irwig, A. Tosteson, C. Gatsonis, J. Lau, G. Colditz, T. C. Chalmers, and F. Mosteller. Guidelines for meta-analyses evaluating diagnostic tests. Ann Int Med, 120:667–676, 1994.

[52]
B. Jones, P. Jarvis, and A. F. Ebbutt. Trials to assess equivalence: The importance of rigorous methods. BMJ, 313:36–39, 1996.

[53]
A. C. Justice, K. E. Covinsky, and J. A. Berlin. Assessing the generalizability of prognostic information. Ann Int Med, 130:515–524, 1999.

[54]
L. Kaiser. Adjusting for baseline: Change or percentage change? Stat Med, 8:1183–1190, 1989.

[55]
B. L. Kasiske, J. D. A. Lakatua, J. Z. Ma, and T. A. Louis. A meta-analysis of the effects of dietary protein restriction on the rate of decline in renal function. Am J Kidney Dis, 31:954–961, 1998.

[56]
M. H. Katz. Multivariable analysis: A primer for readers of medical research. Ann Int Med, 138:644–650, 2003.

[57]
W. A. Knaus, F. E. Harrell, J. Lynn, L. Goldman, R. S. Phillips, A. F. Connors, N. V. Dawson, W. J. Fulkerson, R. M. Califf, N. Desbiens, P. Layde, R. K. Oye, P. E. Bellamy, R. B. Hakim, and D. P. Wagner. The SUPPORT prognostic model: Objective estimates of survival for seriously ill hospitalized adults. Ann Int Med, 122:191–203, 1995.

[58]
L. A. L'Abbe, A. S. Detsky, and K. O'Rourke. Meta-analysis in clinical research. Ann Int Med, 107:224–233, 1987.

[59]
T. A. Lang and M. Secic. How to Report Statistics in Medicine: Annotated Guidelines for Authors, Editors, and Reviewers. American College of Physicians, Philadelphia, 1997.

[60]
A. Laupacis, N. Sekar, and I. G. Stiell. Clinical prediction rules: A review and suggested modifications of methodological standards. JAMA, 277:488–494, 1997.

[61]
B. Lausen and M. Schumacher. Evaluating the effect of optimized cutoff values in the assessment of prognostic factors. Comp Stat Data Analysis, 1996.

[62]
R. V. Lenth. Some practical guidelines for effective sample size determination. Am Statistician, 55:187–193, 2001.

[63]
A. S. Levey, C. H. Schmid, and J. Lau. Antilymphocyte antibodies, renal transplantation, and meta-analysis. Ann Int Med, 128:863–865, 1998.

[64]
C. J. Lynch and P. A. Lachenbruch. Statistical issues in biologics submissions to the FDA. Drug Info J, 30:921–932, 1996.

[65]
J. S. Maritz. Models and the use of signed rank tests. Stat Med, 4:145–153, 1985.

[66]
J. N. S. Matthews. A refinement to the analysis of serial data using summary measures. Stat Med, 12:27–37, 1993.

[67]
J. N. S. Matthews, D. G. Altman, M. J. Campbell, and P. Royston. Analysis of serial measurements in medical research (see letter to editor by S. Senn in same issue concerning this paper). BMJ, 300:230–235, 1990.

[68]
S. E. Maxwell and H. D. Delaney. Bivariate median splits and spurious statistical significance. Psychological Bulletin, 113:181–190, 1993.

[69]
K. G. M. Moons, R. A. R. T. Donders, T. Stijnen, and F. E. Harrell. Using the outcome for imputation of missing predictor values was preferred. J Clin Epi, 59:1092–1101, 2006.

[70]
S.-L. T. Normand. Tutorial in biostatistics: Meta-analysis: Formulating, evaluating, combining, and reporting. Stat Med, 18:321–359, 1999.

[71]
A. D. Oxman and G. H. Guyatt. A consumer's guide to subgroup analysis. Ann Int Med, 116:78–84, 1992.

[72]
D. B. Pryor, L. Shaw, C. B. McCants, K. L. Lee, D. B. Mark, F. E. Harrell, L. H. Muhlbaier, and R. M. Califf. Value of the history and physical examination in identifying patients at increased risk for coronary artery disease. Ann Int Med, 118:81–90, 1993.

[73]
M. A. Puhan, G. ter Riet, K. Eichler, J. Steurer, and L. M. Bachmann. More medical journals should inform their contributors about three key principles of graph construction. J Clin Epi, 59:1017–1022, 2006.

[74]
D. R. Ragland. Dichotomizing continuous outcome variables: Dependence of the magnitude of association and statistical power on the cutpoint. Epidemiology, 3:434–440, 1992.

[75]
K. J. Rothman. A show of confidence (editorial). NEJM, 299:1362–3, 1978.

[76]
K. J. Rothman. Significance questing. Ann Int Med, 105:445–447, 1986.

[77]
P. Royston, D. G. Altman, and W. Sauerbrei. Dichotomizing continuous predictors in multiple regression: a bad idea. Stat Med, 25:127–141, 2006.

[78]
D. B. Rubin. Estimating causal effects from a large data set using the propensity score. Ann Int Med, 127:757–763, 1997.

[79]
D. L. Sackett. A primer on the precision and accuracy of the clinical examination. JAMA, 267:2638–2644, 1992.

[80]
H. S. Sacks, J. Berrier, D. Reitman, V. A. Ancona-Berk, and T. C. Chalmers. Meta-analyses of randomized controlled trials. NEJM, 316:450–455, 1987.

[81]
N. Schenker and J. F. Gentleman. On judging the significance of differences by examining the overlap between confidence intervals. Am Statistician, 55:182–186, 2001.

[82]
G. Schulgen, B. Lausen, J. Olsen, and M. Schumacher. Outcome-oriented cutpoints in quantitative exposure. Am J Epi, 120:172–184, 1994.

[83]
S. Senn, L. Stevens, and N. Chaturvedi. Repeated measures in clinical trials: Simple strategies for analysis using summary measures. Stat Med, 19:861–877, 2000.

[84]
S. Shapiro. Is meta-analysis a valid approach to the evaluation of small effects in observational studies? J Clin Epi, 50:223–229, 1997.

[85]
S. J. Sharp, S. G. Thompson, and D. G. Altman. The relation between treatment benefit and underlying risk in meta-analysis. BMJ, 313:735–738, 1996.

[86]
L. B. Sheiner. The intellectual health of clinical drug evaluation. Clin Pharm Ther, 50:4–9, 1991.

[87]
R. Simon. Confidence intervals for reporting results of clinical trials. Ann Int Med, 105:429–435, 1986.

[88]
J. M. Singer and D. F. Andrade. Regression models for the analysis of pretest/posttest data. Biometrics, 53:729–735, 1997.

[89]
P. A. Singer and A. R. Feinstein. Graphical display of categorical data. J Clin Epi, 46:231–236, 1993.

[90]
A. Spanos, F. E. Harrell, and D. T. Durack. Differential diagnosis of acute meningitis: An analysis of the predictive value of initial observations. JAMA, 262:2700–2707, 1989.

[91]
S. Suissa and L. Blais. Binary regression with continuous outcomes. Stat Med, 14:247–255, 1995.

[92]
The Asilomar Working Group on Recommendations for Reporting of Clinical Trials in the Biomedical Literature. Checklist of information for inclusion in reports of clinical trials. Ann Int Med, 124:741–743, 1996.

[93]
The Standards of Reporting Trials Group. A proposal for structured reporting of randomized controlled trials. JAMA, 272:1926–1931, 1994.

[94]
S. G. Thompson. Why sources of heterogeneity in meta-analysis should be investigated. BMJ, 309:1351–1355, 1994.

[95]
L. Törnqvist, P. Vartia, and Y. O. Vartia. How should relative changes be measured? Am Statistician, 39:43–46, 1985.

[96]
E. R. Tufte. The Visual Display of Quantitative Information. Graphics Press, Cheshire, Connecticut, 1983.

[97]
E. R. Tufte. Envisioning Information. Graphics Press, Cheshire, Connecticut, 1990.

[98]
E. R. Tufte. Visual Explanations. Graphics Press, Cheshire, CT, 1997.

[99]
G. J. M. G. van der Heijden, A. R. T. Donders, T. Stijnen, and K. G. M. Moons. Imputation of missing values is superior to complete case analysis and the missing-indicator method in multivariable diagnostic research: A clinical example. J Clin Epi, 59:1102–1109, 2006.

[100]
H. Wainer. How to display data badly. Am Statistician, 38:137, 1984.

[101]
H. Wainer. Three graphic memorials. Chance, 7:52–55, 1994.

[102]
H. Wainer. Finding what is not there through the unfortunate binning of results: The Mendel effect. Chance, 19(1):49–56, 2006.

[103]
A. Wallgren, B. Wallgren, R. Persson, U. Jorner, and J. Haaland. Graphing Statistics & Data. Sage Publications, Thousand Oaks, 1996.

[104]
H. Wedel, D. DeMets, and et al. Challenges of subgroup analyses in multinational clinical trials: Experiences from the MERIT-HF trial. Am Heart J, 142:502–511, 2001.

[105]
L. Wilkinson. The Grammar of Graphics. Springer, New York, 1999.

[106]
S. Yusef, J. Wittes, J. Probstfield, and H. A. Tyroler. Analysis and interpretation of treatment effects in subgroups of patients in randomized clinical trials. JAMA, 266:93–98, 1991.



Acknowledgements: Richard Goldstein (Qualitas Inc.) and R. Localia (Center for Biostatistics and Epidemiology, Penn State U.) provided several key references.