Research

Department of Biostatistics & Informatics

Faculty in the Department of Biostatistics & Informatics carry out research to develop, evaluate, and improve statistical methods for designing and analyzing healthcare studies. This research is often motivated by work with co-investigators doing clinical or public health research who encounter design or analysis questions that have not been adequately addressed in existing research literature. Explore B&I faculty research topics below. 

Code on multiple computer screens

'Omics and high-throughput studies


New technologies in the 'omics field have changed the landscape of biomedical research in the last twenty years. Data generated from a variety of these high-throughput assays (microarrays, sequencing, mass-spectrometry, etc.) pose interesting analytical challenges due to the high dimensionality, large volume, and complex structure of the data. Many of our faculty are involved with developing methods from all stages of the omics data life-cycle, including study design, data generation, quality control, pre-processing, modeling, analysis, interpretation, archiving, data dissemination, and software development. Depending on the question, statistical and computational approaches include machine learning, high-dimensional methods, latent variable models, statistical genetics, genetic risk prediction, sub-typing, causal modeling, data integration, and non-parametric methods. Faculty work closely with core facilities generating the data and a variety of biomedical investigators on campus and beyond who provide interesting applications for specific systems and diseases.

line graph on a computer screen

Longitudinal, correlated, and joint models


Observations in health sciences studies are often made sequentially on subjects, resulting in temporal patterns and/or serial correlation. Studies also commonly involve subjects treated at the same hospital or by the same provider, which gives rise to similarities of those subjects. These situations require special statistical methods to maintain validity in situations of dependent observations and to learn about these sources of variation. In many cases, multiple outcomes are also of interest. Faculty members are involved in developing new statistical methods for many such situations, such as where patterns of missing data depend on outcomes; complex temporal patterns in longitudinal data subjects; use of surrogate measures when true covariates are not observable, and jointly modeled longitudinal measures and time-to-event outcomes. 

scientist working with test tubes

Clinical trials, sample size estimation, and diagnostic studies


Estimating the number of subjects needed for a sensitive, efficient, and ethical study is a crucial aspect of healthcare research. Faculty members are developing innovative methods to perform these estimations, especially in complex study designs such as longitudinal measurements over time or subjects clustered within sites or clinicians. Current faculty research is available at SampleSizeShop.org, offering researchers—including those in behavioral and social sciences—free, open-source, peer-reviewed power and sample-size software, tutorials, and educational materials. 
healthcare worker pushing a gurney down a hospital hallway

Survival, recurrent event, and non-normal outcomes


Outcomes in biomedical and public health studies are often more complex than simple numerical measures, and normality assumptions are not appropriate. When time to an event such as death, admission to hospital, or recurrence of disease is of interest, models are available to account for censoring (when only ranges of event times are known). These models have been extended to events that can occur more than once, a situation that is common in clinical medicine. Faculty members are studying situations when only counts of events in intervals are available, and implications of this limitation on required sample sizes. Other examples of statistical methods for non-normal outcomes being studied by faculty include numbers of bacteria of different species in samples of lung microbiome, and cost of health care in multiple cost categories or when cost may be zero.

Person looking at computer lung imaging

Imaging analysis


Imaging analysis explores multiple types of imaging data, data processing algorithms, and new data analysis approaches for imaging data.
two women sitting at computer looking at data

Causal inference


Causal inference addresses various aspects of study design and analysis necessary for moving beyond correlation into causation.
Statistician looking at data and graphs on computers

Non-parametric and semi-parametric methods


Distributions of variables and relationships among variables in health sciences studies are often more complex than can be captured with traditional parametric approaches. Non-parametric and semi-parametric methods relax some of the assumptions of parametric approaches. For non-parametric approaches, a distribution is not implicitly assumed to perform inference. This can be useful when a distribution is not known or is difficult to work with, situations that are especially common with biological data such as microbiome or genomic data. Common non-parametric approaches include permutation, bootstrapping, and other resampling-based methods. In other situations trends or relationships may be complex and nonlinear, in which case semi-parametric methods such as smoothing or splines are useful. Faculty are developing methods for such situations as drop-out patterns that depend in complex ways on patients’ health status in HIV/AIDS studies.

Working Groups


In addition to weekly department seminars, monthly working groups convene to discuss specific biostatistical methods and challenges. For information about these meetings and how to join, please email [email protected].

Application areas


Faculty members in the department collaborate with various on-campus and off-campus groups to inspire and better equip researchers in applying statistical methodologies and interpreting results accurately. Below are some specific areas and groups we collaborate with.

  • General Biostatistics Collaboration.
  • Cancer Prevention and Treatment.
  • Respiratory and Lung Disease.
  • Infectious Diseases and HIV/AIDS.
  • Cardiology and Cardiovascular Disease.
  • Mental and Behavioral Health.
  • Maternal and Child Health, and Pediatrics.
  • Diabetes and Obesity.
  • Endocrinology and Metabolic Disorders.
  • Autoimmune Disorders.
CMS Login