# Current Notes

## 2017-07-24

### Maureen Saint Georges Chaumet (fellow)

I am starting a project that compares the cosmetic outcomes of 3 different laceration closure methods in kids: sutures, tape and glue. I will also be looking at several secondary outcomes.

### Samuel Younger (Nurse Practitioner)

Interested in determining sample size and best statistical approach. HLM-SEM vs Path analysis?

Research Abstract

Many organizations are looking to their staff to creatively engage in improving the safety of patients. Further, within the Magnet health care environment, transformational leadership is the theory that has been promoted as core to the achievement of patient outcomes, and thus is the core focus of this study. The purpose of this research is to examine the role that leaders play in bringing together elements of a safety culture and a climate of innovation that support and enable staff to engage creatively in improving the quality and safety of patient care. There is little empirical evidence in the nursing literature related to patient safety in an innovative climate, and no study could be found that examines the leadership behaviors of nursing managers that are conducive to an innovation climate and that impact patient safety outcomes in a Magnet-designated Academic Medical Center. Therefore, this study seeks to fill that gap in knowledge and expand the leadership and innovation literature to include patient safety within a Magnet work environment.

This research uses a multi-level, cross-sectional, descriptive correlational design aimed at examining the relationship between nurse manager transformational leadership and front line nurse rated patient safety score, and at investigating how, if at all, communication and feedback about error and the innovation climate influence that relationship. The independent variables in this study are transformational and transactional leadership. The dependent variable is front line nurse rated patient safety score. The innovation climate is proposed to be a mediating variable. Feedback and communication is proposed to be a moderator variable between transformational leadership and patient safety score. The variables will be measured through an online survey based on three validated and reliable survey instruments (54 questions): the MLQ-5x short (MLQ-5x), the Team Climate Inventory-short (TCI), and Feedback and Communication About Error and Patient Safety Grade (subscales of the AHRQ Hospital Survey on Patient Safety Culture), which are all appropriate for collecting data about the perceptions of front line nurses.

If findings confirm these relationships, then in order to impact outcomes, nursing managers may need to be adept at navigating and promoting the complex nature of innovation through communication and establishing an innovation climate. In this context, leadership facilitates communication and an understanding of the innovation climate, which supports creative solutions to patient outcomes and improved quality, in this case, patient safety. On a practical level, this study will contribute to a greater understanding of how to prepare future nursing leaders for the challenges of a changing healthcare landscape through an understanding of what behaviors are necessary to generate innovative and safe care delivery models.

H1a: There is a significant, positive relationship between nurse managers’ transformational leadership as measured by the Multifactor Leadership Questionnaire (MLQ-5X) and nurses’ perception of patient safety as measured by patient safety grade (AHRQ HSOPSC).

H1b: There is a significant, positive relationship between nurse managers’ transactional leadership as measured by the Multifactor Leadership Questionnaire (MLQ-5X) and nurses’ perception of patient safety as measured by patient safety grade (AHRQ HSOPSC).

H1c: There is a significant relationship between nurse managers’ transactional leadership as measured by the Multifactor Leadership Questionnaire (MLQ-5X) and nurses’ perception of patient safety as measured by patient safety grade (AHRQ HSOPSC), but to a lesser degree than transformational leadership. Included per our discussion on transformational leadership predicting quality above and beyond that of transactional leadership.

H2a: There is a significant relationship between nurse managers’ transformational leadership as measured by the Multifactor Leadership Questionnaire (MLQ-5X) and innovation climate as measured by the Team Climate Inventory (TCI-short).

H2b: There is a significant negative relationship between nurse manager’s transactional leadership as measured by the Multifactor Leadership Questionnaire (MLQ-5X) and innovation climate as measured by the Team Climate Inventory (TCI-short).

H3: The relationship between nurse manager transformational leadership as measured by the Multifactor Leadership Questionnaire (MLQ-5X) and nurses’ perception of patient safety as measured by patient safety grade (AHRQ HSOPSC), will be mediated by innovation climate as measured by the Team Climate Inventory (TCI-short).

H4: The relationship between transformational leadership and patient safety grade will be moderated by feedback and communication about error. In terms of this relationship, transformational leadership will have a stronger, positive relationship with patient safety scores when feedback and communication about error is high.

## 2017-07-10

### Ryan Skeens (fellow)

This is a patient activation measure survey conducted on parents/caregivers of NICU patients. The survey will be conducted at NICU enrollment, NICU discharge, and 30 days after discharge. The hypothesis is that the patient activation measure will decrease at NICU discharge but increase over time (30 days after discharge). In addition, characteristics such as socioeconomic status that are linked to a high patient activation measure will be identified.

The measure has been validated and used by the mentor team. This is a fellowship project, and Ryan will apply for an internal grant for the 6-9 month project. Further, CTSA support will be explored.

Recommendations:
• Sample size is fixed based on fellowship time. Power and sample size should be calculated accordingly.
• Keep the measure in its continuous form (0-100) instead of dichotomizing it.
• Consider involving a CTSA statistician early, at the design stage. Given that this involves design, grant writing, data collection, data analysis, and manuscript preparation, about 90 hours of work may be needed.
• As prediction is involved (identifying characteristics that are related to high measures), model validation should be considered.

## 2017-06-12

### Danxia Yu, Epidemiology (faculty)

We will examine the associations of diet quality scores (assessed at baseline) with body weight change (from baseline to following visits) in a prospective cohort study. Generalized estimating equation (GEE) models, with which we are not familiar, have been used in other studies. We need statistical input on this model and on the power estimation. We also would like to find a statistician with whom we may work on this project. Thank you.

Recommendations:
• If dropout is not random, either GLS with a serial correlation structure or a linear mixed-effects model would be more appropriate than GEE.
• Do not collapse the diet variables into quintiles; leave them as continuous variables
• For the power calculation, it may be possible to ask for conditional approval to have access to a subset of the data to get estimates of the quantities needed for a power calculation.
• You can do a simplified power calculation with just one wave of data, and argue that the power will be higher when there are more data points per person.
• Possibly useful R packages: longpower (thank you for bringing this to our attention!), pwr (in particular, the pwr.f2.test function).
• Simulation could also be a useful approach, but it would also require some background information about the standard deviations of the variables
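
As a rough illustration of the simulation approach mentioned above, the sketch below estimates power for a single wave of data using only the Python standard library. Everything numeric in it is an assumption made for illustration: the diet score and weight change are treated as standardized bivariate normal variables, `true_corr` is a hypothetical effect size, and significance is judged with the Fisher z approximation rather than the model that would actually be fit.

```python
import math
import random
from statistics import NormalDist


def pearson_r(x, y):
    """Plain Pearson correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def simulate_power(n, true_corr, n_sims=1000, alpha=0.05, seed=1):
    """Estimate power to detect a linear association by simulation.

    Hypothetical setup: exposure and outcome are standardized and
    bivariate normal with correlation `true_corr`. Each simulated
    dataset is tested with the Fisher z approximation.
    """
    rng = random.Random(seed)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    noise_sd = math.sqrt(1 - true_corr ** 2)
    rejections = 0
    for _ in range(n_sims):
        x = [rng.gauss(0, 1) for _ in range(n)]
        y = [true_corr * xi + noise_sd * rng.gauss(0, 1) for xi in x]
        z = math.atanh(pearson_r(x, y)) * math.sqrt(n - 3)
        if abs(z) > z_crit:
            rejections += 1
    return rejections / n_sims


if __name__ == "__main__":
    # Hypothetical sample sizes and effect size, for illustration only.
    for n in (50, 200, 400):
        print(n, simulate_power(n, true_corr=0.2))
```

Replacing the assumed correlation with estimates from a data subset (standard deviations and the expected slope) would make the result more defensible; the single-wave figure can then be presented as a conservative bound, as suggested above.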

### Joshua Cohn, Urologic Surgery (clinical fellow)

I have two questionnaire-based databases on overactive bladder that I have merged. I would like to use this data to develop a model that predicts bother based on symptoms and comorbidities and prioritizes necessary treatments. I am not sure if cluster analysis is the best way to do this.

Recommendations:

## 2017-06-05

### Paul Yoder, Special Education (faculty)

I'd like evaluation of area under the curve (AUC) as a way to quantify the magnitude of the between treatment-group-difference and its confidence interval for RCT with repeated measures of the dependent variable. A reference for an example is Gallop, R. J., Dimidjian, S., Atkins, D. C., & Muggeo, V. (2011). Quantifying treatment effects when flexibly modeling individual change in a nonlinear mixed effects model. J Data Sci, 9, 221-241.

Recommendations:
• Email Hakmook Kang to talk about the possibility of working through the KC biostatistics core to get an estimate of how many children and timepoints you would need to do the flexible-breakpoint approach discussed in the article
• We also discussed an approach using restricted cubic splines. It's possible that this approach would let you use fewer subjects; it may be useful even though you are expecting a linear relationship

### Bryan Hill, OB/GYN (fellow)

This is a follow-up to the recommendations from 5/15/2017 regarding a logistic regression model with postoperative complications as the outcome variable and clinical and demographic variables as the independent variables. The recommendations, in summary, were:

1) Treating the outcome as an ordinal, rather than binary, variable if there are enough people in the additional groups

2) Look at the cross-tabulation between physician and sling type to see whether it is feasible to include both

3) Leave the continuous variables as is (do not categorize them). May want to consider log-transforming age.

4) Try variable clustering to see which variables may be collinear/redundant

5) Consider combining less important (less interesting) variables into a score

Goal for the session: to discuss results of the model.

Recommendations:

## 2017-05-15

### Bryan Hill, Fellow, Gynecology

Reporting complications after surgery is important for quality improvement. Two methods of finding complications are: 1) administrative data from diagnosis codes and 2) key-word search from a manual chart review. We suspect the administrative reporting method under-reports complications. The primary aim of the study is to determine sensitivity and specificity of the administrative method compared to the manual reporting method. The secondary aim is to determine which risk factors are associated with having a complication.
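
For the primary aim, a minimal sketch of the sensitivity and specificity calculation is shown here, treating the manual chart review as the reference standard as the aim describes. The function name and the example flags are hypothetical and invented for illustration; they are not study data.

```python
def sensitivity_specificity(admin_flags, chart_flags):
    """Sensitivity and specificity of administrative coding.

    `chart_flags` (manual review) is treated as the reference standard.
    Both inputs are equal-length sequences of 0/1 complication flags,
    one entry per patient.
    """
    if len(admin_flags) != len(chart_flags):
        raise ValueError("inputs must have one flag per patient")
    pairs = list(zip(admin_flags, chart_flags))
    tp = sum(1 for a, c in pairs if a and c)          # both methods flag it
    fn = sum(1 for a, c in pairs if not a and c)      # missed by admin data
    tn = sum(1 for a, c in pairs if not a and not c)  # neither flags it
    fp = sum(1 for a, c in pairs if a and not c)      # admin only
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one patient with and one without a complication")
    return tp / (tp + fn), tn / (tn + fp)


if __name__ == "__main__":
    # Invented flags for six patients, for illustration only.
    admin = [1, 0, 0, 1, 0, 0]
    chart = [1, 1, 0, 1, 0, 0]
    sens, spec = sensitivity_specificity(admin, chart)
    print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

If the administrative method under-reports, as suspected, that would show up as sensitivity well below 1 while specificity stays high.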

We think that creating a logistic regression model would help address our secondary aim. Our plan is the following: setting the output as "complication present (1)" and using the variables: asa class, age, body-mass index, setting (outpatient or inpatient), sling type, attending, if a concomitant procedure was done, anesthesia time, operation time, smoking history, diabetes, and prior surgery.

Question #1: We need guidance on how many variables we can include in our model. Some have high numbers, and some are quite low.

#2 Some variables may influence each other. For example, sling type is heavily dependent on attending (they like to choose a particular brand or type). How do we adjust our model for that?

#3 It is known that older patients are more likely to experience complications. How do we determine if age is independently associated with "complication presence" versus just being a confounder influencing other variables?

Files we plan to append: data dictionary, STATA file, table of variables with total numbers of responses.

Recommendations:
• In deciding which categories to collapse, look at the sample overall (not by complication status)
• To increase power, consider treating the outcome as an ordinal, rather than binary, variable if there are enough people in the additional groups
• Look at the cross-tabulation between physician and sling type to see whether it is feasible to include both
• Leave the continuous variables as is (do not categorize them). May want to consider log-transforming age.
• Try variable clustering to see which variables may be collinear/redundant
• Consider combining less important (less interesting) variables into a score
• For binary logistic regression, we generally want to have 10--20 people in the smaller outcome group for every degree of freedom (continuous variable or single category) in the model
• If you apply for VICTR funding, we recommend the larger time amount if you are interested in a publication or presentation. In your application, you can cite these notes as evidence that you have been to a biostatistics clinic.
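
The 10--20 rule of thumb above can be turned into a quick budget check. The sketch below is only arithmetic on that rule; the function names and the counts in the example are hypothetical. It counts one degree of freedom per continuous variable and k - 1 per categorical variable with k levels.

```python
def count_model_df(n_continuous, categorical_levels):
    """Degrees of freedom used by a candidate set of predictors.

    Each continuous variable (entered linearly) uses 1 df; a categorical
    variable with k levels uses k - 1 df.
    """
    return n_continuous + sum(k - 1 for k in categorical_levels)


def df_budget(n_with_outcome, n_without_outcome, low=10, high=20):
    """Range of affordable df under the 10--20 per df rule of thumb.

    Returns (conservative, liberal) using the smaller outcome group.
    """
    smaller = min(n_with_outcome, n_without_outcome)
    return smaller // high, smaller // low


if __name__ == "__main__":
    # Hypothetical counts: 60 patients with a complication, 540 without.
    print(df_budget(60, 540))          # (3, 6)
    # Hypothetical model: 4 continuous variables, one 3-level and two binary factors.
    print(count_model_df(4, [3, 2, 2]))  # 8
```

In this invented example the candidate model would need 8 degrees of freedom against a budget of 3 to 6, which is the kind of gap that variable clustering or a combined score is meant to close.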

### Mike Temple, Biomedical Informatics, faculty

I am comparing the results of 2 surveys and need help calculating p-values and odds ratios to determine significance between the 2 surveys. I am using R

Recommendations:
• Get more information about the survey design (especially number of people surveyed) so that you can compare the response rates in 2012 and 2016. If they are not close to each other, it will be harder to justify comparing the results of the two surveys
• If possible, get info about demographic makeup of the people surveyed in 2012 and 2016 from the organization's records. If, for example, the mean age of respondents is very different from the known mean age of the people surveyed, you will know that in at least that one aspect, the respondents are not representative of the people surveyed.
• Chi-squared tests should be fine if the categories are exhaustive (but this is secondary to the nonresponse issue)
• If possible, get more info about the outcomes and model specifications used for the regressions in Table 3.

## 2017-05-08

### Chirayu Patel, resident physician, radiation oncology

The project is VEEP-C - Visually Enhanced Education for Prostate Cancer, a randomized, controlled trial to assess the impact of a visual presentation on prostate cancer treatment decision-regret, anxiety, satisfaction, and patient-reported symptoms, in the radiation oncology department. The expected accrual was 112 patients, based on 120 prostate cancer patient consultations seen within a 6-month timeframe. Unfortunately, due to a drop in consultations, only ~30 patients have been accrued, and only 1 patient has completed external beam radiation therapy over a 6-month timeframe (others have undergone brachytherapy, surgery, active surveillance, or are still deciding).

1. The sample size is based on an instrument which only 1 patient has completed. As originally written, the study is not feasible. Determination of new outcome and sample size?

2. Role for interim analysis on secondary outcomes?

3. Thoughts on closing the trial due to poor accrual?

## 2017-05-01

### Cara Singer, PhD Student, Speech and Hearing

• This project investigates speech-language imbalances in children. We are interested in the best way to measure imbalances using five standardized tests. Simple range scatter and standard deviation have been discussed. We are also interested in the best way to analyze whether increased synchrony between the five tests is associated with a decrease in stuttering frequency based on two years of development.

### Hatun Zengin-Bolatkale, Faculty, Hearing and Speech

The purpose of the present study was to longitudinally assess sympathetic arousal (i.e., physiological correlate of emotional reactivity) of preschool-age children with persisting stuttering (CWPS), those who recover from stuttering (CWRS), and their normally fluent peers (CWNS) during a stressful picture-naming task. The a priori research questions/hypotheses are as follows:

The first question addressed whether change in SCL in response to stress at initial testing - close to the onset of stuttering - is associated with stuttering chronicity (i.e., persistence vs. recovery). We hypothesized that children whose stuttering persists, compared to those who recover and those who do not stutter, would exhibit increased skin conductance reactivity to a stressful picture naming task at their initial testing (i.e., prior to stuttering resolution for children who recover).

The second question addressed whether change in SCL in response to stress - approximately 18 months after their first testing – is associated with stuttering chronicity (persistent vs. recovered patterns). We hypothesized that children whose stuttering persists, compared to those who recovered and those who do not stutter, would exhibit increased skin conductance reactivity to a stressful picture naming task at 18 months-post-initial testing (i.e., after stuttering resolution for children who recover).

The third question addressed whether changes in SCL in response to stress are associated with changes in stuttering frequency. We hypothesized that for children who persist, compared to children who recover and children who do not stutter, increased skin conductance reactivity would be associated with increases in stuttering frequency.

We would like help from the clinic with the analyses of the hypotheses above, especially for #3.

## 2017-03-27

### Sarah Diehl, Hearing and Speech Sciences, PhD student

* Questions for the clinic:

1. After removing the ratings that have a mean score of 2 or below, there will be ratings that will highly correlate. Should we first do something like a multi-dimensional scaling approach to identify dimensions and then a cluster analysis to see how these dimensions cluster? Or do we throw all ratings (potentially 38 if none receive a mean score of 2 or below – realistically perhaps something like 20 to 25) into a cluster analysis.

2. If we expect at least 2 or 3 clusters, what is a reasonable sample size given the number of items we have on the rating scale?

3. What do we need to put into a proposal that is going to use cluster analysis? What kind of information is critical?

4. Is there another approach that would work better than cluster analysis?

### Gurjeet Birdee, Health Services Research, Faculty.

• The objective of this study was to measure the energy expenditure (oxygen consumption O2/kg/min) of adults practicing common yoga movements. For each individual, participants were asked to do movements in a standing position, lying position, and seated position (body orientation). In addition, each movement was done with different variations serially. In addition, participants were asked to walk at low and moderate intensities to compare energy expenditure of a comparative aerobic exercise to yoga.

The main questions we would like addressed:

What is the best approach to measure if there was significant variation between individuals for mean energy expenditure by body orientation?

What is the best approach to measure if there was significant variation between individuals for each movement?

When considering if variation exists above, should we take into account resting energy expenditure for each individual?

## 2017-03-20

### Cara Singer, Hearing and Speech Sciences, PhD student

• This project investigates differences in skin conductance levels in children who stutter and are persisting, children who stuttered and recovered, and children who do not stutter. All children were followed 3-4 times across a two year period. At each visit, skin conductance levels were measured during a neutral video and speaking task, a positive emotion-inducing video and speaking task, and a negative emotion-inducing video and speaking task. We would like to discuss the best statistical models for our hypotheses.

• Note that at each timepoint, there are 7 skin conductance measures (a "baseline" and 6 other measures)

• Recommendations:
• Keep all possible timepoints from all possible subjects. Do not exclude subjects based on their trajectories or baseline characteristics
• Use continuous versions of the stuttering outcomes if possible; at a minimum, collapse the outcomes into 5 ordinal categories
• Use a longitudinal mixed-effects model. Each subject will contribute 1, 2, or 3 rows depending on how many of the timepoints they have. You can model severity as a function of time-1 severity, age, sex, the seven time-1 conductance measures (or a reduction thereof; try a redundancy analysis first), time in days, and squared time in days, with random effects for subject (and possibly time and squared time). We recommend a continuous-time correlation structure, but this might be tricky with the mixed-effects model; generalized least squares might work better.
• If we can get a clear, simple plan and the analysis is not a multi-step analysis and the dataset is clean (and tall and thin, with the relevant time-1 variables and non-identifying subject ID on each row), we may be able to conduct the analysis during a clinic.
• Starting next month, we will be able to take on longer short-term projects for a charge.
• The Kennedy Center statistics core may also be able to do this. If you come back to a clinic, please remind us to invite Hakmook.

## 2017-02-27

### Kristy Broman, Surgery Resident

Method to compare standardized incidence ratios using SEER data

## 2017-02-20

### Katie McGinnis (MPH candidate)

(followup from last two weeks)

Recommendations:
• For each overall question category, try a scatterplot of a) the means and b) the standard deviations for each item, with staff values on the x-axis and parent values on the y-axis (or vice-versa). Label each point with the question number or a short phrase to identify it
• Do variable clustering within the staff items and the parent items, to see which items tend to be answered similarly by the same person (hcavar in stata)
• Rather than doing several univariate analyses comparing the relationship between the demographic items and each survey item, do a single regression analysis for each survey item, with all the demographic items included in the model at once. Collapse the categorical items into 2 or at most 3 categories, and just assign numeric values (e.g. 1--5) to the levels in the binned continuous items like distance and treat those as continuous variables (so they will have just one term in the model). Actually, though, drop distance altogether and just use travel time. The overall F-statistic from the regression will tell you whether anything in the model matters. The best approach would be a proportional odds model, but ordinary regression will be next best.
• It's ok to take the means of means (across items in a particular category) and talk about those, but there aren't enough data points to warrant a statistical test.

## 2017-02-13

### Katie McGinnis (MPH candidate)

(followup from last week)

Recommendations:
• Instead of doing t-tests, do wilcoxon rank-sum test (only 5 response options)
• Rather than overlaying the parent and staff histograms, show the parent mean as a dot on the staff histograms
• Do the "dot-histograms" by hospital because the hospitals are so different, even if tests comparing hospitals are not significant
• Don't put too much weight on the p-values; this is exploratory research with relatively small sample sizes
• For the two similar staff questions, run a correlation on the responses to help justify using only one of the questions. Use a Spearman rank correlation.
• We don't think it would make sense to take the mean of the responses for the parent "how often" questions
• For any set of questions, it could be interesting to order the means to see which questions had the highest or lowest means, but it wouldn't make sense to do a statistical test comparing the means of the different items.
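
To make the Spearman recommendation above concrete, here is a minimal standard-library sketch: tied responses get their average rank (which matters with only 5 response options), and the Pearson correlation is then computed on the ranks. The function names and example responses are hypothetical.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Plain Pearson correlation; raises if either input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant variable")
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy / math.sqrt(sxx * syy)


def spearman_rho(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson_r(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    # Invented 1-5 responses to two similar staff questions.
    q1 = [5, 4, 4, 3, 2, 5, 1, 3]
    q2 = [4, 4, 5, 3, 2, 5, 2, 3]
    print(round(spearman_rho(q1, q2), 3))
```

A high correlation between the two questions would support reporting only one of them; in line with the caution above, the coefficient is more informative here than its p-value.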

## 2017-02-06

### Antje Mefferd, Hearing and Speech Sciences

I’m an assistant professor in the Hearing and Speech Science department and I’m currently preparing a manuscript. I would like to have someone take a look at the analyses that I completed to make sure they are correct. I’m a bit unsure about some things (assigning fixed and random effects, reporting of degrees of freedom). I have my data in Excel spreadsheets and can share it ahead of time.

The topic is how the tongue and the jaw change in their range of motion during various speech tasks (speaking typical, loud, slow, clear). These speech tasks are used in speech therapy to help people with brain diseases (Parkinson’s disease) to be better understood. In this data set I look at this in just one group of speakers (healthy speakers).

Participants complete 5 repetitions for each task (5 reps x 4 tasks = 20 data points from each participant). There are 11 females and 10 males in this study (sex has a significant main effect due to anatomical differences between males and females, but it is typically not statistically controlled for in our field in repeated measures). There are three measures – tongue movement, jaw movement, and the acoustics. For all three I need to analyze task effects in separate analyses. I also need to look at how changes in tongue movements predict changes in acoustics and how well changes in jaw movements predict changes in acoustics using data of typical to loud speech, typical to clear speech, typical to slow speech -- this time regressions within females and within males.

In the meeting I would like to make sure that I ran these analyses correctly and also would like to verify that I used the correct degrees of freedom in my write-up.

Recommendations:
• For the primary analysis, either ANOVA using each subject's mean or a mixed-effects model with a fixed effect for task and random effects for subject would be fine.
• For the secondary analysis, it would be best to use the same approach (either one mean data point per person per task, or a mixed-effects model). If using a mixed-effects model for the secondary analysis, be careful with the interpretation of R-squared.

### Katie McGinnis (MPH candidate)

I have questions about my MPH Thesis project, specifically related to the best options for comparing some of my variables and running a few other statistical tests

Practicum in Kenya; originally a needs assessment, not designed for research. 16-page staff surveys (n= 94) & parent surveys (n= 69) from 2 children's hospitals, plus demographic data. Hoping to compare parent responses to staff responses in some way. Challenges: 1. parents are responding about 1 child but staff are responding about all children, and 2. for some items, the response scales for parents and staff are slightly or very different. She is comfortable treating the response options as numeric (taking the mean would be meaningful to her). The thesis does not have to contain a formal statistical analysis.

Recommendation for next steps: For survey items where the response scales are the same, continue the exploratory data analysis by plotting histograms for the staff responses, and then marking the mean of the parent responses on the x-axis.

## 2017-01-30

### Frances Anderson, MPH Global Health

I am an MPH Global Health track student and I need some assistance with ANOVA analysis on my thesis project. My project is an evaluation of Minnesota's TB screening of refugees and immigrants across four counties in the state. The data I am looking at for ANOVA includes mean days to initiation (TB testing) and mean days to disposition. There are some outliers in the data that I need to consider dropping. I seek advisement in this, completing the test, and if ANOVA is not appropriate for this dataset finding a new test.

## 2017-01-09

### Joshua Cockroft, MD student

We are looking to design and validate a new psychometric scale that measures a patient/client's trust in new providers. Though psychometric scales currently exist that measure trust in healthcare systems, trust in existing personal providers, and measures of global trust, there is currently no scale described in the literature that specifically measures trust in new providers. The hope is that such a scale would be of use in many underserved populations, particularly those populations with histories of either substance use disorder or severe mental illness, who are not regularly active participants within the healthcare system. We would hope to be able to use such a survey to measure the effect of this specific type of trust on outcomes such as healthcare service utilization. Like other healthcare trust-related scales, this scale would likely be a Likert-scale with questions that would span multiple domains of trust (i.e. competence, dependability). As there is no current gold standard for this type of measurement, advice on important considerations for internal validation would be greatly appreciated. We may consider the validation of this scale in multiple sub-populations if able. Conceptualization of this scale will be derived from the literature and our own qualitative research.

## 2016-12-19

### Angela Maxwell-Horn, MD, Assistant Professor of Developmental Pediatrics, Monroe Carell Jr. Children’s Hospital at Vanderbilt

I am a pediatrician wanting to do a study about the effectiveness of a medication to treat ADHD symptoms in children with autism. I would like to come to a biostats clinic to help me figure out what type of analysis that I should do and how many subjects I need to effectively power my study. I have attached a copy of my study proposal.

• Recommend a randomized cross-over study design with double blinding if possible
• Select a side-effect measurement tool
• Clearly state inclusion/exclusion criteria

### Heather Limper, Center for Clinical Quality and Implementation Research

"I would like to get some help with execution of times series analysis using STATA (ideally)."

## 2016-11-21

### Katie McGinnis, MPH Candidate, Global Health

Perform surveys in three children's hospitals on parents and staff. 69 respondents from parents and 97 from staff. Parents' survey: demographics about parents and children, how the experiences in hospital impact parents and children, patient satisfaction. Staff survey: demographics, education, child's hospitalization needs.

Research questions: what do you think caused the child's illness? The language barrier in receiving proper care? The correlation between child's experience in hospital and staff's education and experience.

Survey matrices are similar in the parents' survey and in the staff's survey (about a dozen Likert-scale questions). Want to check the correspondence between parents' responses and staff's. First check if parents agree with each other. Code the answer to each question as 1,2,3,4,5. Summarize the score of each question across all the patients. A small SD is an indication of better agreement between parents. Second, check the consensus of staff. Third, to evaluate the staff's characteristics, compare staff's responses to parents' consensus; to evaluate the parents' characteristics, compare parents' responses to staff's consensus. Take the difference between a staff member's response and the parents' consensus as the outcome, and fit a regression model on providers' characteristics.

Could generate a summary score over multiple questions in one category (Rockwood's index).

## 2016-10-17

### Samantha Gustafson, Hearing and Speech Sciences

VICTR application for dissertation research. EEG measures for speech sound processing in quiet and in noise. Looking for age effects. How does the effect of noise change with age? Proposed analysis based on linear regression. Expects one EEG measure to be more sensitive than the other. Second question is to look for mediator with EEG response and how well they do behaviorally depending on age. Particularly tricky how to size a study for an exploratory mediation analysis. Have replaced repeated measures ANOVA with a linear model. Each EEG task takes 10 minutes. Two listening conditions, same task. Quiet vs noise order is randomized. Half of participants hear "da" and the other half receive "ga" (randomized). Model: EEG = intercept + age effect + noise/quiet + age x noise/quiet interaction. Can use generalized least squares (correlation structure irrelevant except don't assume the correlation is zero, since only 2 times per subject) or repeated measures ANOVA if very careful to use the correction for correlation (if can handle interaction between group (noise/quiet) and age). But GLS is ideal. Need to check normality assumption of residuals.

Power of a test of interaction is much lower than a test of main effect (difference in slopes vs. slope not being flat). Data not available for making initial guess of sample size required to achieve a given precision or power. Only thought is related to a minimum possible sample size - the size needed to estimate a difference in mean EEG for an adult with very good precision. The SD of the noise-quiet difference is used here. Once the acceptable margin of error (half-width of 0.95 confidence interval for the mean difference) is determined can plug in formulas related to precision - see e.g. http://biostat.mc.vanderbilt.edu/tmp/bbr.pdf . Beware: sample size needed for interaction is easily 4 times as large.
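
The precision argument above can be sketched as follows. This is an illustration under stated assumptions, not the formula from the linked notes: it uses a normal approximation (a t-based calculation would give a slightly larger n), and the SD and margin values are hypothetical.

```python
import math
from statistics import NormalDist


def n_for_margin(sd_diff, margin, confidence=0.95):
    """Smallest n so the half-width of the CI for a mean paired difference
    is at most `margin`, using the normal approximation z * SD / sqrt(n)."""
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    return math.ceil((z * sd_diff / margin) ** 2)


# Hypothetical inputs: SD of the noise-quiet difference = 10 units,
# acceptable margin of error = 2 units.
n_main = n_for_margin(sd_diff=10, margin=2)

# Rule of thumb from the note: an interaction may need about 4 times as many.
n_interaction = 4 * n_main
```

Halving the acceptable margin roughly quadruples the required sample size, because n scales with the inverse square of the margin.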

### Alec Pawlukiewicz, Neuroscience and Psychiatry

Effect of exercise on neurocognitive testing. Database of 20,000 participants - 9,000 after exclusions. Control for covariates sex, age, education level, # prior concussions. Interested in matched analysis, but there are not enough controls. Suggested using full qualifying sample without matching, to maximize power and avoid any arbitrariness in how matches are determined. Non-matched analysis requires careful specification of the statistical model.

Several neuro scores are given by the test. If scales are continuous enough, can use the standard multiple linear regression model, analyzing one score at a time. May need to model age as a smooth nonlinear effect and perhaps likewise for education. Age and education may be collinear. Variable of major interest is exercise (binary). Need to consider whether exercise may interact with age, sex, etc. What about type of exercise? For variables such as # prior concussions a quadratic effect often suffices.

### Dillon Pruett, Hearing and Speech Sciences

Respiratory sinus arrhythmia. Comparing in children who do not stutter, stutter and persist, stutter and stop. They watch a video followed by a task, and this is repeated with different videos/tasks. Baseline re-measured at the end. Question about whether to form groups or to have a continuous-time longitudinal model with stuttering measures as the response variables (without categorization). Answer questions by estimating difference in means over time. Need to interpret the result in a clinically meaningful way. Need to adjust for baseline stuttering measure as a covariate. This might possibly be interacted with the intervention effect. Need to carefully formulate the linear model and account for within-subject correlation using something like GLS or mixed effects models (the latter is mainly used if there are more than 2 or 3 measurements over time within subject).

## 2016-10-10

### Omair Khan, Center for Research on Men's Health

• "I would like to request some time to talk to another statistician about exploratory factor analysis I am doing in R with the psych package. This procedure is fairly new to me and I have some questions that I would like help with."

## 2016-09-19

### Mary Lauren Neel, Neonatal

• Association between ITSP and illness severity score
• Association between parenting style (PSDQ) and infant adaptation.

### Mark Tyson, Urology

• Bladder neck size on incontinence, controlling for BMI, age, preop score, disease status, and stitch.
• Restricted cubic spline examples: MSCI Biostat II STATA
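
For readers who want to see what a restricted cubic spline term looks like outside Stata, here is a small sketch of the truncated-power basis commonly used for restricted (natural) cubic splines. The knot locations are hypothetical, and the division by the squared knot range is only a scaling convention assumed here.

```python
def rcs_basis(x, knots):
    """Restricted cubic spline basis for a single value x.

    With k knots this returns k-1 columns: x itself plus k-2 nonlinear
    terms. The fitted function is linear beyond the outer knots.
    """
    t = sorted(knots)
    k = len(t)
    if k < 3:
        raise ValueError("need at least 3 knots")

    def cube_plus(u):
        # Truncated cubic: u^3 when u > 0, otherwise 0.
        return u ** 3 if u > 0 else 0.0

    scale = (t[-1] - t[0]) ** 2   # puts nonlinear terms on the scale of x
    d = t[-1] - t[-2]             # distance between the last two knots
    cols = [x]
    for j in range(k - 2):
        term = (cube_plus(x - t[j])
                - cube_plus(x - t[-2]) * (t[-1] - t[j]) / d
                + cube_plus(x - t[-1]) * (t[-2] - t[j]) / d)
        cols.append(term / scale)
    return cols


# Hypothetical knots for a covariate such as age.
example_knots = [20, 40, 60, 80]
row = rcs_basis(50, example_knots)
```

Each row of the design matrix for a covariate such as BMI or age would be `rcs_basis(value, knots)`, and the columns enter the regression like any other predictors.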

## 2016-09-12

### Dillon Pruett, PhD student in the dept of hearing and speech sciences working with Dr. Robin Jones

• I'm working on a project involving longitudinal data with children who stutter and persist, children who stutter and recover, and children who do not stutter.

## 2016-08-29

### Scott Karpowicz

• Matched design, 1:1, 1:many, BOOM
• match on socio-economic, clinical factors, etc.
• Change point analysis
• see if readmission rates change at time of policy implementation
• REQUEST FOR VICTR SUPPORT: Clinic statisticians recommend a 90 hour voucher.

## 2016-07-25

### Sam Gannon

#### VICTR

• Developing a randomized controlled clinical trial in mental literacy. Working notion: increase mental literacy and communication, which in turn improve mental health outcomes.
• Submit concept paper to NIMH (National Institute of Mental Health). Has questions to address and wants to get statistical expertise.
• Questions: 4 educational arms and a control group for a total of 5 groups. Setting: community mental health clinics.
• Metric for outcome measure: clinician report notes (self-management, behavioral adherence to protocol, and rate of compliance). These response measures are known to be correlated. Intervention: different educational programs. Control will have standard of care.
• Consider cluster randomization. Figure out how many clinics you will have access to. With five arms, note that each clinic receives one arm.
• How to assess "fidelity"? Record data consistently. Approach with some assessment of inter-rater reliability.
• How do you capture your outcome? If by survey or standard form, it will be much easier to make results consistent. For example, if reporting is done through REDCap, you will have the opportunity to formalize or standardize the process.
• Mediation analysis (Baron & Kenny, structural equation modeling). First you need to show that your intervention has an association with the response variable. The mediator could be communication, for example. What factors mediate the intervention?
• (Y~X) Education is associated with improved mental health.
• (X~M) Education works through health literacy and/or communication (mediators) to improve mental health.
• Will I benefit from cross-over design? We believe that once knowledge is gained it will be difficult to have a "wash out". Cross over design will be more appropriate to a set up such the development of new drug with clear wash out.
• Question from biostatisticians: do you need 4 arms? Can you combine some of these educational programs?
• Transient effect: Is it common in the literacy literature? Look into other clinical studies, such as in diabetes, that require behavioral changes. There are issues of relapse and maintaining adherence.
• Timeline: Extend follow-up time to two years to address the "transient effect", although most studies have short follow-up. Can you follow up subjects on StarPanel to show that you can address long-term effects? Need to sit down with statisticians to address the multiple issues realistically. How many clinics do you think that you could have access to? Recruitment time? How many subjects are needed?
• Consider short-term effects and long-term outcomes. Can you design your study pragmatically without too much effort to collect data? Using the real set up Dr. entries for follow up assessment.
• Recommendation: Follow up with VICTR voucher and statistician for help with proposal.

## 2016-07-18

### Heather Lillimoe

#### General Surgery Resident

I am currently in the process of designing a research study pertaining to resident feedback within the department of surgery. My hope is to utilize REDCap for my primary mode of obtaining data. I was hoping to meet with a biostatistician as I apply for VICTR funding for the study. It involves an educational timeout before an operation. This is a 3rd year rotation in plastic surgery. There is an iPhone app to do a competency rating.
• Survey - baseline assessment - residents and attendings - 85 questions

### Cara Singer

I am a PhD student in the Department of Hearing and Speech Sciences. I would like to attend the biostat clinic today (if possible) to discuss appropriate analyses for a study I am conducting under the mentorship of Robin Jones (Developmental Stuttering Lab). The study is investigating whether a risk factor assessment (a mix of categorical and continuous variables) can predict stuttering persistence. 70-80% spontaneously recover. Would like to identify those likely to persist, in advance, for focusing therapy. Multiple risk factors have been identified. Empirical evidence for supporting predictive ability of the risk factors is sought.
• Children previously seen - diagnostic visit; 4y ago; stuttering up to 18m; English is primary language
• New follow-up for status at one point in time
• Baseline variables that originate from continuous measurements (e.g., age at onset) need to be analyzed as continuous variables
• Include baseline stuttering severity as a predictor
• With a maximum of 150 children the maximum number of candidate predictors might be around 10 if the outcome variable is almost continuous (it's worse if outcome is almost binary)
• Stuttering is multi-dimensional, e.g., some children may reduce amount of speaking because of the problem, so they seem to stutter less
• May consider a compound summary of all the outcome measures, e.g., average rank across children; clinical ranking of scenarios can also be used
• Dependent variable needs to have at least 5 frequently occurring levels and be ordered or continuous
• If there is one standout, popular scale, that one could be used by itself
• Empirical variable selection requires an enormous sample size to reliably find the "right variables" so it's best not to use selection procedures; can find various approximations to the model for clinical non-computerized application
• Data reduction methods (variable clustering, principal components, redundancy analysis) can be useful for effectively reducing the number of predictors to use in the multivariable model

## 2016-06-20

### Chris Brown, Internal Medicine Resident

• To go over analysis produced by VICTR biostatisticians

## 2016-05-09

### Kazeem Oshkoya, Division of Clinical Pharmacology, Dr. Dan Roden's Lab

Data analysis on blood sample storage and drug concentration - look at whether a gel absorbs too much of a drug in the blood to make drug assessment accurate enough. Measured at baseline and 4h. Need to know how to describe the base value. Triplicate measurements available. More interested in relative comparison.
• Best to present all the raw data
• Might use 3 quartiles (25th and 75th percentiles and median) as descriptive stats and use Wilcoxon signed rank test for testing for a difference between baseline and 4h
• There are also two types of samples - same study repeated with different samples, sample drug concentration
• Only have 2 patients; plan to have 5 later
• Better to not average over the 3 replicates - may hide variability
• Bland-Altman plot (mean-difference plot) is a good way to show agreement and whether variation is stable over base levels. If band of variability expands going from left to right, this is an indication that perhaps the analysis should be done on the log concentration scale.
• Other useful ways to summarize data: mean absolute difference between estimated and true concentrations - separately by no gel and gel
• Can also show mean absolute differences between replicates ignoring the true concentrations
• There are problems with lower limit of detection, representing missing values that are not randomly missing; ordinary analysis may be problematic
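
A small sketch of the quantities behind a Bland-Altman (mean-difference) plot, with an option to work on the log concentration scale as suggested above. The concentrations are made-up illustrative values, and the 1.96 SD limits of agreement are the conventional choice rather than something specified in the clinic discussion.

```python
import math
from statistics import mean, stdev


def bland_altman(a, b, log_scale=False):
    """Return the points and summary lines for a mean-difference plot.

    a, b: paired measurements (e.g., no-gel and gel concentrations).
    log_scale: analyze log concentrations, so differences are log ratios.
    """
    if log_scale:
        a = [math.log(v) for v in a]
        b = [math.log(v) for v in b]
    means = [(x + y) / 2 for x, y in zip(a, b)]   # horizontal axis
    diffs = [y - x for x, y in zip(a, b)]         # vertical axis
    bias = mean(diffs)
    sd = stdev(diffs)
    return {
        "means": means,
        "diffs": diffs,
        "bias": bias,
        "limits": (bias - 1.96 * sd, bias + 1.96 * sd),
    }


# Hypothetical pairs where the gel removes a constant 10% of the drug.
no_gel = [10, 20, 40]
gel = [9, 18, 36]
raw = bland_altman(no_gel, gel)
logged = bland_altman(no_gel, gel, log_scale=True)
```

In this made-up example the raw differences grow with the concentration level, while on the log scale they are constant, which is the pattern that would argue for analyzing log concentrations.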

### Jessica Dennis, Lea Davis, Genetic Medicine

Modeling lab values to look for genetic variation; data from the synthetic derivative
• Interested in variation over time within patient
• Variants are summarized into polygenic risk scores
• Difficulty in interpreting results if patients are being treated for the lab abnormality being studied
• How to define time zero?
• May want to ignore records corresponding to post-Rx periods
• Started with HDL
• Side study: confirm that med initiation that is supposed to modify HDL really does
• Simplest longitudinal analyses:
• Compute within-patient Gini's mean difference to correlate with gen. risk score; asks whether gen. risk is correlated with variability
• Similar but summarize with the median to correlate gen. risk with overall height of the longitudinal records
• Summarize entire longitudinal record with slope and intercept; AUC and relate summary measures to gen. risk score
• Would be useful to summarize the data using representative patients after clustering on mean HDL, shape, number of observations, maximum time gap between any two measurements
• Another type of analysis: summarize each patient using the 9 deciles of HDL; use these deciles to predict polygen. risk score
• Does not take time ordering into account
• Might add a slope or shape summary to the deciles
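
The per-patient summaries listed above can be sketched as follows. This is an illustration only: the function names are hypothetical, the slope is an ordinary least-squares slope over time, and the HDL values are made up.

```python
from itertools import combinations
from statistics import mean, median


def gini_mean_difference(values):
    """Mean absolute difference over all pairs of a patient's measurements."""
    pairs = list(combinations(values, 2))
    return sum(abs(a - b) for a, b in pairs) / len(pairs)


def ls_slope(times, values):
    """Ordinary least-squares slope of value on time."""
    tbar, vbar = mean(times), mean(values)
    sxx = sum((t - tbar) ** 2 for t in times)
    sxy = sum((t - tbar) * (v - vbar) for t, v in zip(times, values))
    return sxy / sxx


def summarize_patient(times, values):
    """One row of summary measures to relate to the genetic risk score."""
    return {
        "gmd": gini_mean_difference(values),   # within-patient variability
        "median": median(values),              # overall height of the record
        "slope": ls_slope(times, values),      # trend over time
    }


# Hypothetical patient: HDL measured at years 0, 1, 2.
patient = summarize_patient([0, 1, 2], [50, 52, 54])
```

Each patient then contributes one row, and each summary column can be correlated with the risk score, for example with a rank correlation.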

## 2016-05-02

### Amanda Peltier, Department of Neurology

Discuss Aims and power analysis for R01

## 2016-04-11

### Jake Landes, PT, DPT Vanderbilt Sports Medicine, Rehabilitation Services

• I am a physical therapist in the Sports Medicine outpatient department and we are planning two studies that we would like to discuss. Primarily, though, we would like to discuss a prospective observational study we will be performing this coming school year with overhead athletes – we will be looking at the relationship of core strength to the likelihood of shoulder injury in overhead athletes. We plan to test the athletes’ core strength at start of their season and then collect data on injuries and time lost from playing their sport during the season. Specifically, we have questions about what our number of subjects should be in order to determine a difference and what we will need to do statistically in order to analyze the data.
• Outcomes: number of days (or proportion) lost during the season due to shoulder injuries
• Need information on the proportion of athletes who would get shoulder injury during a season. Sample size needed would be large if the proportion is very low.
• Could use logistic regression to examine association between core strength and incidence of injury
• Consider other factors that could affect shoulder injury such as the type of sport, number of years practicing, etc. These factors can be adjusted for in the regression model.
• To calculate the sample size, need to specify the outcome, type of analysis used, the meaningful difference (effect size: odds ratio of injury upon one unit change in core strength) you want to detect, and some preliminary data on the outcome measurements (rate or variation). A rule of thumb: 20 cases of injury are needed for each factor you'd like to analyze.
• Consider choosing a type of sports with the greatest association between core strength and shoulder injury.
• How to quantify core strength? A single summary score?
• A second study I am wondering about is an Anterior Cruciate Ligament Reconstruction study where we are going to compare a group of patients in a home based program versus standard care (control). We are wanting to do a feasibility study this year in our clinic, and I think it will be a prospective case-control study, or maybe prospective cohort. We also want to know about N size and analysis afterward.
• Enroll 7 patients in one month. Feasibility study.
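
For the shoulder-injury study, the rule of thumb quoted above (20 injury cases per factor analyzed) translates into a required number of athletes once the seasonal injury proportion is known. A minimal sketch, with hypothetical inputs:

```python
import math


def athletes_needed(n_factors, injury_proportion, cases_per_factor=20):
    """Athletes to enroll so the expected number of injuries reaches
    cases_per_factor * n_factors, given the seasonal injury proportion."""
    if not 0 < injury_proportion <= 1:
        raise ValueError("injury_proportion must be in (0, 1]")
    cases_required = cases_per_factor * n_factors
    return math.ceil(cases_required / injury_proportion)


# Hypothetical: 3 factors (core strength, sport, years of practice)
# and 25% of athletes injured in a season.
n_athletes = athletes_needed(n_factors=3, injury_proportion=0.25)
```

The calculation makes the earlier point concrete: the required enrollment grows in inverse proportion to the injury rate, so a rare outcome demands a very large cohort.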

## 2016-03-14

### Katherine McDonell, Neurology

• Parkinson's disease - norepinephrine; VICTR application
• Original intention peripheral blood pressure support
• Interested in a combined medication regimen
• Goal to get nor. into CNS
• Propose to study n=16 patients
• Need dose titration 100mg bid -> 600mg 3/day
• Which dose do patients tend to end up with?
• Is a safety & tolerability study, partly dose-finding
• Patient response that is monitored is blood pressure - minimizing orthostatic symptoms without side effects; target supine BP plus headaches, dizziness, mania; symptoms are of primary emphasis
• Is there an accepted symptom summary scale? If not may need to just count the number of symptoms present
• But dose adjustments are clinical adjustments based on a symptom "gestalt"
• Target for analysis is final dose
• Need SD of dose; best available data will probably come from what doses are used long-term in clinical practice; we'll assume this is a stand-in for the final tolerable dose
• Once a useful SD estimate is found, it can be used to compute the likely margin of error in estimating the population mean required dose when n=16, with say 0.95 confidence. The margin of error is the half-width of the confidence interval.
• Would be good to know what evidence exists for the usefulness of plasma drug concentrations in estimating the final required dose

## 2016-02-22

### Reagan Leverett, MD, MS, Assistant Professor, Department of Radiology, Women's Imaging

• PQI project. Two types of images (new vs. old method) were performed for each patient.
• Examine the agreement between the two methods based on the paired data (kappa stat). Readings are ordinal values.
• Let a few radiologists read the two sets of images in random order to study the agreement.
• May need a couple hundred patients and a few (2 to 6) radiologists. Also want good agreement between radiologists, that is, readings from a given method should not depend heavily on the radiologist's experience.
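
Because the readings are ordinal, a weighted kappa is one way to compute the agreement statistic mentioned above. The sketch below uses quadratic disagreement weights, which is an assumption made here rather than a choice recorded in the clinic; readings are assumed to be coded 0, 1, ..., n_levels-1.

```python
def weighted_kappa(r1, r2, n_levels):
    """Quadratic-weighted Cohen's kappa for paired ordinal readings.

    r1, r2: readings of the same patients by the two methods,
            coded as integers 0 .. n_levels-1.
    """
    n = len(r1)
    counts = [[0] * n_levels for _ in range(n_levels)]
    for a, b in zip(r1, r2):
        counts[a][b] += 1
    row = [sum(counts[i]) / n for i in range(n_levels)]
    col = [sum(counts[i][j] for i in range(n_levels)) / n
           for j in range(n_levels)]

    observed = expected = 0.0
    for i in range(n_levels):
        for j in range(n_levels):
            w = (i - j) ** 2 / (n_levels - 1) ** 2   # disagreement weight
            observed += w * counts[i][j] / n
            expected += w * row[i] * col[j]
    return 1 - observed / expected


# Hypothetical readings on a 3-level ordinal scale.
old_method = [0, 1, 2, 1, 0]
new_method = [0, 1, 2, 1, 0]
kappa_same = weighted_kappa(old_method, new_method, n_levels=3)
```

The same function can be applied to pairs of radiologists reading the same method to look at inter-reader agreement.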

## 2016-02-01

### Akshitkumar Mistry

Reserved spot for consulting with Chris F. about meta-analysis

## 2016-01-25

### Stephen Patrick, Assistant Professor of Pediatrics and Health Policy, Division of Neonatology

• Mary-Margaret Fill, TDH EIS
• Neonatal abstinence syndrome and long term outcomes
• Merge TennCare data with educational data
• Suggest regression model with traditional covariate adjustment unless need to do special matching (family, neighborhood)
• Biggest assumptions: children move away from TN for reasons unrelated to potential educational achievement
• Confounding: women giving birth to infant with NAS may tend to be different from those not having an NAS child; need to adjust for all factors related to this that might be associated with educational outcome
• Also what is the effect of school on test scores?
• Birth records have mother's educational level, zip code, tobacco use
• Matching records may be challenged by mother changing last name
• Might also look at infant and mother utilization of services, diagnosis of ADHD, etc.; cross-correlate with educational achievement

## 2016-01-04

### Lindsey McKernan

Here is the feedback I received on my application: Power analysis never should involve having a power of detecting a previously observed (and probably measured with bias) effect. Power should always be defined as the probability of detecting a minimal clinically meaningful effect. Also, this type of study is more suited for justifying sample size on the basis of precision of an effect of interest (usually a difference or a correlation). Precision is stated as a margin of error e.g. half-width of a confidence interval. Please revise Section E of the proposal and feel free to attend a clinic to discuss.

What was initially written: Power Analyses: Previous researchers have found moderate relationships between trauma severity and pain symptoms (r = .29; Poundja, Fikretoglu, & Brunet, 2006). Power analyses using unadjusted effect size from this study based on their sample size of 130 suggest a necessary sample size of at least 97 for the present study to reveal similar effects. Power analyses of the results of studies of the relationships between trauma severity, pain severity, experiential avoidance, and anxiety sensitivity (Gootzeit, 2014; Ruiz-Párraga & López-Martínez, 2015) suggest that a sample size of 144-158 is necessary to find these associations. The hypotheses outlined above will be tested through bivariate correlation and linear regression analyses. Specifically, relationships among variables of interest (Hypotheses 1A, 1B, 2A, 2B) will be assessed through Pearson product-moment correlation analyses to determine the strength of the association among these constructs in our sample. Tests of moderation (Hypotheses 2C, 3) will be tested using multiple linear regression with cross-products of the variables of interest to assess the interaction between predictors. All analyses will be carried out on either SPSS 22 (IBM, 2013) or the R statistical package (R Development Core Team, 2010)
• See Chapter 8, P. 8-12 of http://biostat.mc.vanderbilt.edu/tmp/bbr.pdf - suggest using the r=0 curve. This approach is using the margin of error based on 0.95 confidence limits. E.g.: "With a sample size of N subjects we can estimate the correlation coefficient between two variables to within a margin of +/- xx with 0.95 confidence (see graph)."
• Important to prioritize the comparisons and to report them in this pre-specified order so that no multiplicity corrections will be needed
• A regression model that allows for interaction between time since trauma and amount of trauma would allow for estimation of the time-decay or enhancement of memories-effect. The time interaction effect may be nonlinear.
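
To make the precision statement concrete, the margin of error for a correlation coefficient can be approximated with Fisher's z transformation. This is a sketch under that assumption and is not necessarily how the graph in the linked notes was produced.

```python
import math
from statistics import NormalDist


def correlation_margin(n, r=0.0, confidence=0.95):
    """Approximate half-width of the confidence interval for a correlation
    of size r with n subjects, via Fisher's z (SE = 1 / sqrt(n - 3))."""
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    half = z / math.sqrt(n - 3)
    centre = math.atanh(r)
    lower = math.tanh(centre - half)
    upper = math.tanh(centre + half)
    return (upper - lower) / 2


# Margin at the r = 0 curve for a few candidate sample sizes.
margins = {n: correlation_margin(n) for n in (50, 100, 150, 400)}
```

The resulting margins can be dropped directly into the suggested sentence, "With a sample size of N subjects we can estimate the correlation coefficient between two variables to within a margin of +/- xx with 0.95 confidence."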

## 2015-12-14

### Sachin Patel, Psychiatry

• Animal model for exposure to stress, looking at differential response to stress
• Interested in susceptibility to stress
• Measure of anxiety is a key measure (high = more anxious)
• Each animal has a baseline measure
• Would be good to do a Tukey mean-difference plot (Bland-Altman plot) to be sure that the delta is an adequate summary of the two measures
• Also watch for floor and ceiling effects
• Using the delta as a continuous stress response measure will optimize power and minimize arbitrariness
• Discussed regression to the mean
• Problem with choice of anxiety measure out of many
• A composite measure may help, e.g., average z-score or average rank; can do Spearman rho rank correlation on the result, against another variable; can describe variability in ranks across anxiety measures
• Otherwise analyses of disparate measures can be hard to reconcile
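
The average-rank composite mentioned above can be sketched as follows, assuming every anxiety measure has already been oriented so that high means more anxious. The function names and values are hypothetical.

```python
def ranks(values):
    """1-based ranks with ties given the average (mid) rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of tied values starting at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result


def average_rank_composite(measures):
    """measures: one list per anxiety measure, each over the same animals.
    Returns each animal's rank averaged across measures."""
    per_measure = [ranks(m) for m in measures]
    n_animals = len(measures[0])
    return [sum(r[a] for r in per_measure) / len(per_measure)
            for a in range(n_animals)]


# Hypothetical: two anxiety measures on three animals.
composite = average_rank_composite([[1, 2, 3], [30, 10, 20]])
```

The spread of an animal's ranks across measures (not computed here) would describe how consistently the different measures order the animals.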

## 2015-12-07

### Pierce Trumbo

• Shade Tree Clinic, where patients who are uninsured or underinsured can get medical services.
• Primary outcomes: number of ER visits, hospital length of stay. Will compare before and after pts visited the clinic.
• N=680 patients and estimate to have ~300 meet inclusion (time span between first visit and last visit greater or equal to 1 year).

### Christopher John Prendergast, Tracy McGregor

• We will specifically be seeking some guidance regarding graphical representation of data related to statin doses in children and adolescents.

### Christopher Lee Brown

• Discussed analysis for reviewer's comments

## 2015-11-23

### Mark A. Clay, Divisions of Cardiology and Critical Care

• The purpose of the study was to evaluate whether patients with single ventricle physiology undergoing the second stage of surgical palliation, whose length to weight ratio was >90% were at higher risk for increased ICU length of stay, ventilator times, and increased non-invasive ventilation when compared to those whose length for weight was <90%. Analyzing the data with the Mann-Whitney U Test there was a statistically significant difference between ICU length of stay and ventilator hours for those with weight for length >90% compared to those <90%. However, I attempted to analyze the data again with Spearman’s to see if there was a correlation between increasing z-score percentile and there was no statistically significant correlation.
• Clinic question: Has the data been analyzed appropriately to answer the question? Should I be concerned that Spearman’s correlation did not show a statistically significant correlation between the variables even though there was a statistically significant difference between the groups? Should I use and how might I best demonstrate association or risk related to weight for length z-score >90% with linear regression?

### Rebekah Griesenauer (Conley), Biomedical Engineering

• I am designing a study for a small group of human subjects to test the feasibility of a new tool that I designed for breast cancer assessment using medical images. I would like some guidance on effective study designs for a small number of patients and for determining the accuracy of a new tool when there is no current clinical equivalent to compare to.
• Need a measurable outcome to calculate the required sample size

## 2015-11-16

### Aaron C. Shaver, M.D., Ph.D. Assistant Professor of Pathology, Microbiology, and Immunology

• The csv consists of sample ID, the covariates I want to test (age as an integer and categorical variable; poor.risk through transcription, which are all categorical variables; and num.muts, which is an integer) and the OS and PFS data (for censoring rows, 0=censored and 1=dead). I would like to include the interaction between age and poor.risk, because I have biological reason to believe that that interaction is relevant. My questions concern: measuring goodness of fit of the model; how to interpret the interaction term; how to estimate power, given the large number of covariates and small sample size

## 2015-11-09

### Fernanda Maruri

• "If possible I would like some help interpreting results of 2 Wilcoxon Rank Sum tests in which one is significant and the other is not."
• Compare

### Jessica Kaitlin Campbell

• The goal of the project is to examine the impact that the palliative care unit has had on the medical intensive care unit in terms of patient length of stay and mortality. I have collected data regarding some parameters pre and post opening of the palliative care unit. I am interested in the best approach in analyzing the data.
• Have data a year before and a year after the unit opened. Want to compare LOS and mortality in MICU. Both groups had palliative consult, only some patients after went to the palliative care unit.

### Kendall Anne Ulbrich, Pediatrics

• I am requesting assistance in figuring out statistical significance. We see a trend in the data with the diagnosis of chronic lung disease leading to increased risk of death after trach placement vs other diagnosis.
• Babies in NICU; outcome is alive/died; want to compare chronic lung disease to other diagnoses.
• There were ~15 diagnoses, among whom 12 had chronic lung disease.
• Total 115 babies (25 died in NICU). Primary outcome is the death in NICU. 8 (or 11) babies who had lung disease and died.
• Plot Kaplan-Meier curve first for description, use log-rank test.
• Can use Cox proportional hazard model to analyze the association between lung disease and survival in NICU.
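
As a descriptive first step, the Kaplan-Meier estimate can be computed directly from follow-up times and death indicators. The sketch below is a bare-bones product-limit estimator with made-up data; it treats babies who leave the NICU alive as censored, which is an assumption that would need clinical review.

```python
def kaplan_meier(times, events):
    """Product-limit survival estimate.

    times:  follow-up time for each baby (e.g., days in NICU).
    events: 1 if the baby died at that time, 0 if censored.
    Returns [(time, survival)] at each distinct death time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = removed = 0
        # Gather everyone whose follow-up ends at time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= removed
    return curve


# Hypothetical follow-up: death at day 1, censored at day 2, death at day 3.
km_curve = kaplan_meier([1, 2, 3], [1, 0, 1])
```

Running this separately for the chronic lung disease group and the other diagnoses gives the two curves to plot before the log-rank test or Cox model.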

### Robert K. Tunney, Jr., Cardiology Resident

• Email: My research is investigating statin dose intensification according to the ACC/AHA 2013 Cholesterol Guidelines in post-ACS patients. I am interested in performing logistic regression analysis on ~300 patients and potentially Spearman rank r correlation coefficient.
• Two groups: historic control and intervention group. Binary outcome. Primary aim is to assess the outcome difference between groups.
• Chi-sq test and multivariable logistic regression can be used to test the primary hypothesis.

## 2015-04-20

### Lexy Morvant, Pediatric

• NICU data analysis
• time trend of gestational age when receiving ECMO (Y2004-2014) for C-section babies. To evaluate the effect of policy change (increase gestational age for C-section baby in 2007) on ECMO.
• Only have the information on birth year available. Fit a linear regression model
• Also have the information on the total number of all ECMO babies. With an assumption that the proportion of C-section babies remains the same, could fit a Poisson regression model.

## 2015-04-13

### Jared

• I have a retrospective dataset of patients who underwent a new cochlear implant programming procedure. The data contain pre- and post-intervention objective performance data, demographic data, and information about the cochlear implant type and location. I am trying to develop model(s) that can answer the following questions: 1) How can we predict whether a patient will be a responder to re-programming? 2) Which variables are most predictive of change in performance from baseline?
• 177 patients.
• Endpoint: measurement performance (0-100)
• Predictors: 15 ~ 20
• Fit a multivariable linear regression model. Predictor importance can be measured based on the model.

## 2015-03-09

### Taylor Leath

• We attended a biostats clinic on February 23rd to develop a statistical plan. Now that we have a dataset completed, we are having difficulty with our regression models and would appreciate your input.

## 2015-02-23

### Katie Rizzone, M.D., Clinical Instructor, Orthopaedics and Rehabilitation

• I would like to request a methods clinic (to review my methods) for a retrospective chart review study on female college athletes and stress fractures I am writing an IRB for.

### Taylor Leath

I would like to reserve a time on Monday, February 23rd to develop an appropriate statistical plan for our study and dataset. I've attached the study protocol which details our specific aims and hypotheses. Our primary questions: 1) Is linear regression the appropriate model to use? Predictors would be sex, age, years of education, participant's current health, trauma exposure and religiosity (all continuous except for sex), and the outcome variable would be each of the individual health states (GOSE 2-8). If so, this would mean six different regression models for the six health states? 2) Alternatively, would it be more appropriate to develop one regression model that includes the health state (GOSE 2-8) as an additional predictor? 3) Do we have sufficient sample size to answer our study questions? Current n=2156 after exclusions. 4) We would also like to show whether the utility values for each of the six health states are significantly different from one another-- would that simply be a within-subjects ANOVA with pairwise comparisons? 5) Should we consider transforming the worse-than-death values?

## 2015-01-12

### Dr. Heidi J. Silver, Ph.D., R.D Research Associate Professor of Medicine

• Study of diet intervention, body composition, insulin resistance, lipo.

### Tomas DaVee, GI

• Patients who underwent liver transplant and had a plastic stent to treat a leak; about 20-30% needed a metal stent later
• Want to predict early whether a patient needs a metal stent so that the patient does not need to suffer pain
• The current data only give the conditional need for a metal stent given that a plastic stent was already placed
• Suggest do descriptive statistics and plan bigger study to develop prediction model
• Use R for internal validation and calibration using bootstrapping method (rms package)

## 2014-11-03

### Monica Ledoux

I am an adjunct at Vanderbilt's Dermatology department, working with Zhengzheng Tang from biostats on microbiome and skin and would like to know the biostat budget for VICTR application(s)
• Want to know the relationship between Cortisone treatment and bacterial change.
• Each subject will be his own control: cortisone on one arm and no cortisone on the other. Each arm will be tested at two sites, one normal skin and one tape stripping skin. Observe bacterial change. Therefore, each subject will have 4 tested samples and each sample measured twice (total 8 per person)

## 2014-06-23

### Neelam Patel, Medical Student

• I am fourth year medical student doing a project for dermatology. We are doing a meta-analysis of pediatric vitiligo patients to assess which populations need thyroid studies performed. I have a spreadsheet of the data. I need help analyzing it.
• Research question: the percentage of thyroid abnormalities in pediatric vitiligo patients.
• Only have aggregated data. Could have an overall estimate of percentage. Also could explore the variability between studies.
• Apply for a \$2000 Voucher.

### Tyler Kendrick, Anesthesiology, Medical Student

• One-year prospective study. Will record the numbers of surgeries in Ethiopia (an African country) and the number of perioperative mortalities.
• Sample size calculation to reach a desirable precision of mortality rate estimate.

## 2014-04-28

### Wei Xie, Computer Science, Brad Malin, DBMI

• we want to find out if the IRLS estimation algorithm is reversible -- e.g., given only the Fisher information matrix and scoring function (and \beta coefficients), can we go back to the original Y or X matrices
• Context is confidentiality with data coming from multiple sites, with each site's data maintained independently, and controlled
• How to do model diagnostics without residuals?
• Does the distributed computing model lead to good statistical modeling practice? E.g.: covariate transformations, Y transformation, normality of residuals (could compute residual vector separately by center and share an ECDF of the residuals)
• How often are practitioners of distributed statistical analysis assuming linearity of covariate effects? Being careful about transforming Y or modeling Y robustly?
• Can't reverse the process to solve for an individual's datum if model is full rank, n > p, no parameter is devoted to only one subject, residual vector is secret
• If a single parameter is devoted to 5 subjects at one site, may possibly be able to solve for a summary statistic for the 5 (e.g., race has 4 levels and one of the levels only applies to 5 subjects at a site)
• May be able to discern that one site has an overall better level of Y than another site
• Not able to get a robust sandwich covariance matrix estimator if residual vector is not provided; sandwich estimation requires U matrix not just U vector
• Even if residuals are available, it may not be possible to work backwards to an individual from a given site because estimates come from a global beta vector over all sites
• We seldom use OLS with health care data; the need for weighted X'X (X'VX) instead of X'X as used in OLS makes the identification problem more difficult in general, because V is a function of the current beta estimate (for all sites combined)
• Worthwhile working out the special case where Y is binary and there is a single X that is binary or polytomous, and there is no special knowledge (e.g., k subjects are of type x and all have the same Y)
• Worth taking another look at data squashing
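
To make the setting concrete, here is a minimal sketch of the special case mentioned above (binary Y, a single binary X) fitted by Fisher scoring when each site shares only its score vector U and weighted information matrix X'VX. The function names and the two-site data are hypothetical, and this is an illustration of the general idea rather than the investigators' actual protocol.

```python
import math

def site_summaries(xs, ys, beta):
    """One site's score vector U and information matrix X'VX at the current global beta."""
    u = [0.0, 0.0]
    info = [[0.0, 0.0], [0.0, 0.0]]
    for x, y in zip(xs, ys):
        row = (1.0, float(x))                    # intercept and the single covariate
        p = 1.0 / (1.0 + math.exp(-(beta[0] + beta[1] * x)))
        v = p * (1.0 - p)                        # diagonal of V; depends on the global beta
        for j in range(2):
            u[j] += row[j] * (y - p)
            for k in range(2):
                info[j][k] += row[j] * v * row[k]
    return u, info

def fit_distributed(sites, n_iter=25):
    """Central node: sum the site summaries and take Newton (Fisher scoring) steps."""
    beta = [0.0, 0.0]
    for _ in range(n_iter):
        u = [0.0, 0.0]
        info = [[0.0, 0.0], [0.0, 0.0]]
        for xs, ys in sites:                     # raw X and Y never leave the site
            u_s, i_s = site_summaries(xs, ys, beta)
            for j in range(2):
                u[j] += u_s[j]
                for k in range(2):
                    info[j][k] += i_s[j][k]
        det = info[0][0] * info[1][1] - info[0][1] * info[1][0]
        step0 = (info[1][1] * u[0] - info[0][1] * u[1]) / det   # solve info * step = u
        step1 = (info[0][0] * u[1] - info[1][0] * u[0]) / det
        beta = [beta[0] + step0, beta[1] + step1]
    return beta

# Hypothetical two-site data
site_a = ([0, 0, 0, 0, 1, 1, 1, 1], [0, 0, 0, 1, 0, 1, 1, 1])
site_b = ([0, 0, 1, 1], [0, 1, 1, 0])
print(fit_distributed([site_a, site_b]))
```

In this saturated special case the fitted coefficients are just functions of the pooled proportions of Y within each level of X, which is a useful starting point for asking what an individual site's summaries reveal.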

### Neil Templeton, Engineering, CHBE

• Metabolic flux analysis
• Rate of metabolite turnover
• Which metabolic phenotypes are produced in high titre-achieving production processes
• Protein therapeutics; cost of production
• 14 conditions (cell lines); correlations between fluxes (80 reactions- flux, mass spec); looking for up-regulation
• 80 Spearman rank correlations x 14; each correlation 10 observations (clones)
• Two controls; secondary controls
• Independent experimental units: clones, manipulations of cell lines
• See if a unified model would be a better approach than pairwise analysis
• Must be able to precisely estimate a quantity such as a correlation coefficient in order to be reliable in picking "winners" across reactions
• Low precision (low number of independent experimental units) implies low probability of selecting the optimum reaction/condition
• Dimensionality is high enough that an "omics" method may be needed
• Recommend continuing discussion at a Tuesday or Friday clinic
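
To illustrate the precision concern above, the sketch below computes an approximate 95% confidence interval for a correlation estimated from 10 clones. It uses Fisher's z transformation, which is derived for Pearson correlations and is only a rough approximation for Spearman's rho; the observed value 0.5 is a hypothetical example.

```python
import math

def corr_ci(r, n, z=1.96):
    """Approximate CI for a correlation coefficient via Fisher's z transformation."""
    half_width = z / math.sqrt(n - 3)     # standard error of atanh(r) is about 1/sqrt(n-3)
    center = math.atanh(r)
    return math.tanh(center - half_width), math.tanh(center + half_width)

# Hypothetical observed correlation of 0.5 from 10 clones
low, high = corr_ci(0.5, 10)
print(round(low, 2), round(high, 2))   # -0.19 0.86
```

An interval this wide covers everything from a weak negative to a strong positive association, which is why ranking 80 reactions on such estimates is unlikely to pick the true "winners" reliably.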

## 14April14

### Elizabeth Morse, RN, MSN, FNP-BC, MPH Vanderbilt University School of Nursing

• My project involves survey data of 220 Spanish and Arabic-speaking patients in the Center for Women's Health. I've completed all of the descriptive statistics but need help with the correlations. For example, I know from having surveyed patients myself that those patients who reported speaking "Arabic only" at home were more likely to self-report speaking English "not very well", but I don't know how to express this statistically.
• To test the association between two variables A and B:
• If A is a continuous variable and B is a categorical variable, use the Kruskal-Wallis test (or the Wilcoxon rank-sum test when B has only two categories)
• If A and B are both categorical variables, use the chi-square test
• If A is an ordinal variable and B is binary, use the chi-square test for trend
• If A and B are both continuous variables, use Spearman's correlation coefficient.
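
As a small worked example of the categorical-by-categorical case, the sketch below computes the Pearson chi-square statistic and p-value for a 2x2 table such as home language ("Arabic only" vs. other) by English ability ("not very well" vs. better). The counts are invented for illustration and are not the survey data.

```python
import math

def chi_square_2x2(table):
    """Pearson chi-square statistic and p-value (1 df) for a 2x2 table of counts."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    # Upper tail of a chi-square distribution with 1 degree of freedom
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value

# Hypothetical counts: rows = Arabic only / other, columns = "not very well" / better
stat, p_value = chi_square_2x2([[30, 10], [20, 40]])
print(round(stat, 2), p_value)
```

Reporting the two row percentages alongside the test (here 75% vs. 33% in the made-up table) usually communicates the association better than the p-value alone.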

## 31Mar14

### Brett Byram, BME

• Clinical image degradation with ultrasound
• What are major factors of degradation? Pulling apart mechanisms.
• Clinical target: liver tumors/biopsy; visualize needle
• What is the best study design?
• Discussed hypothesis testing vs estimation study
• One estimand could be the mean absolute number of levels different
• Can relate an ordinal measure to quantitative measures of image quality
• Can estimate # patients needed if have a reliable estimate of the standard deviation of an absolute difference of interest
• May consider progressively ruining an image to see when it becomes uninterpretable
• One goal is to develop a model to predict expert's quality rating from multiple quantitative physics-based measures
• May consider an ordinal response model / multinomial model

## 10Feb14

### Steve Kahn, General Surgery Fellow

• Can't arrive before 1pm on Wednesdays, so attending Monday clinic
• "I am going to perform an email survey of surgical residents (approx 5500 in the US) and wanted to know what you think an appropriate response rate would be and the best method to do statistical analysis (rough draft of survey attached). Or should the questions be revised to facilitate a better statistical analysis?"
• Make the response variables as continuous as possible, e.g., by using a sliding bar

### Philip Budge, Fellow, Division of Infectious Diseases

• grant proposal relating to the development of new diagnostic technologies for neglected tropical diseases

### LisaMarit Wands, Nursing

• Survey on two cohorts, VA-based cohort and university-based cohort.
• Outcome: global physical and mental health score. Pain is part of global score, and also a barrier to level of reintegration success. Could calculate a global score without pain. Could examine how pain correlates with reintegration and outcome.
• A specific question (meaning of life) appears in two standardized questionnaires. Could include both in the model predicting outcome.

## 27Jan14

### Stephanie Fecteau, Psychiatry Post-Doc

• Cortisol measures 3 per day
• % of increase because times not noted accurately
• Need Bland-Altman plot to check proper transformation: post - pre vs. (post + pre)/2 or log(post) - log(pre) vs. geometric mean of pre and post
• want the transformation that makes the graph flat and random
• 1/2 of families received a service dog after 3 weeks
• Suggest longitudinal analysis using 3 daily x 15 weeks, allowing for correlation; only one day per week
• Correlation structure based on approximate time of measurements in days + fraction of day
• Model smooth time trend, allowing for separate trend in those randomized to service dog; check for shape change between two groups
• Easiest-to-interpret method: generalized least squares with a continuous-time AR(1) correlation structure
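
The Bland-Altman check described above can be sketched as follows: build the plotting points on the raw scale (post - pre vs. the mean) and on the log scale (log(post) - log(pre) vs. the geometric mean), then see which set shows no trend. The least-squares slope is used here only as a crude numeric stand-in for "flat and random", and the cortisol values are hypothetical.

```python
import math

def bland_altman_points(pre, post, log_scale=False):
    """Return (x, y) pairs for a Bland-Altman plot on the raw or log scale."""
    points = []
    for a, b in zip(pre, post):
        if log_scale:
            points.append((math.sqrt(a * b), math.log(b) - math.log(a)))
        else:
            points.append(((a + b) / 2.0, b - a))
    return points

def slope(points):
    """Least-squares slope of y on x; near zero suggests a flat plot."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    return sxy / sxx

# Hypothetical cortisol values where post is 1.5 times pre (a multiplicative change)
pre = [2.0, 4.0, 8.0, 16.0]
post = [3.0, 6.0, 12.0, 24.0]
print(slope(bland_altman_points(pre, post)))                  # clearly positive: raw scale is not flat
print(slope(bland_altman_points(pre, post, log_scale=True)))  # essentially zero: log scale is flat
```

With a multiplicative change like this one, the raw differences grow with the level while the log differences do not, so the log transformation (equivalently, analyzing ratios) would be preferred.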

### David Dantzler and Donald Lynch, Cardiovascular Medicine

• ECMO: what predicts survival to hospital discharge; initiated by cardiac surgeons
• Collecting patients from last 2 years (N=60 so far)
• Discussed margin of error of 0.1 in estimating a single probability with n=96
• Alternate endpoints: LOS, censor on death, i.e. Y=time to successful discharge
• Or: ordinal outcome Y=1, 2, 3, ... longest LOS, dead = longest LOS + 1; effective sample size almost equal to # subjects
• Also have Glasgow coma scale at discharge; could factor into ordinal outcome
• May be possible to use a complex high-information scale to derive a severity of illness-based score that is then used to predict mortality
• Has reduced many variables to one
• What to do with patients who died before ECMO was available?
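
Two of the quantities discussed above can be sketched in a few lines: the worst-case (p = 0.5) margin of error for a single probability, and the ordinal outcome in which death is ranked one step worse than the longest length of stay. The helper names and the patient values are hypothetical, and the Wald approximation is an assumption.

```python
import math

def margin_of_error(n, p=0.5, z=1.96):
    """Half-width of the Wald 95% CI for a single probability."""
    return z * math.sqrt(p * (1 - p) / n)

def ordinal_outcome(los_days, died):
    """Y = length of stay for survivors; deaths get (longest LOS + 1), the worst rank."""
    worst = max(los_days) + 1
    return [worst if dead else los for los, dead in zip(los_days, died)]

print(round(margin_of_error(96), 3))   # 0.1
print(round(margin_of_error(60), 3))   # margin with the 60 patients collected so far

# Hypothetical patients: LOS in days and whether the patient died in hospital
print(ordinal_outcome([3, 10, 7], [False, False, True]))   # [3, 10, 11]
```

Because the ordinal outcome uses every patient's full ranking rather than a yes/no split, it retains far more information than the binary survival endpoint.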

## 13Jan14

### Mitchell Odom, VUMC 3rd year medical student, Department of Neurosurgery.

I am currently helping with a project that requires a survey be employed, and we are creating an original one to send out. I would like to get some expert opinions on the questions that we ask, and to make sure we are honing in on what we're really looking for.
• CTE - Chronic Traumatic Encephalopathy, caused by multiple concussions. The survey is designed to ask questions about awareness of CTE among parents of young athletes (junior high and high school). The plan is to distribute the survey using Vanderbilt connections with local high schools.
• Recommendations:
• Maximize response rate (by giving parents incentives of some sort)
• Ensure that the survey is brief
• Make sure the responses are anonymous
• Use numbers instead of categories
• Simplify the language
• Branch questions
• Incorporate visual analog scale (instead of categories)
• Order questions in a logical way

