About

I am an Assistant Professor in the Department of Statistics and Actuarial Science at the University of Waterloo. I received my PhD from the Department of Biostatistics at the University of Washington, where I was advised by Daniela Witten.

My research interests are in:

  1. Using statistical machine learning to solve problems in complex data settings, such as multi-view data, network data, compositional data, and spatial data, and
  2. Applying novel optimization methods to the design of experiments.

These diverse directions are unified by the common theme of using optimization to solve problems in statistics.

Research

1. Learning from Multi-View Data

In the multi-view data setting, multiple data sets (views) are available on a single common set of observations. For example, multivariate clinical and genomic data sets may be available on a single set of tissue samples, or we may have two network data sets that describe physical interactions and co-membership in protein complexes among a single set of proteins.

Publications

  • Lucy L. Gao, Jacob Bien and Daniela Witten (2019) Are clusterings of multiple data views independent? To appear in Biostatistics. [pdf] [cran]
    [Received a 2019 ASA Biometrics Section Student Travel Award.]

Preprints

  • Lucy L. Gao, Daniela Witten and Jacob Bien (2020+) Testing for association in multi-view network data. [pdf] [cran]
    [Received a 2020 ASA Statistical Learning and Data Science Section Student Paper Award.]

2. Optimal Experiment Design

The number of replicates in an experiment limits the amount of information that is available, but we can maximize the information gained by carefully choosing the values of the experimental inputs. This is the central problem of optimal experiment design.

Publications

  • Pengqi Liu, Lucy L. Gao and Julie Zhou (2020). R-optimal designs for multi-response regression models with multi-factors. To appear in Communications in Statistics - Theory and Methods. [pdf]
  • Lucy L. Gao and Julie Zhou (2020). Minimax D-optimal designs for multivariate regression models with multi-factors. To appear in Journal of Statistical Planning and Inference. [pdf]
  • Lucy L. Gao and Julie Zhou (2017) D-optimal designs based on the second-order least squares estimator. Statistical Papers, 58(2): 77-94.
  • Lucy L. Gao and Julie Zhou (2014) New optimal design criteria for regression models with asymmetric errors. Journal of Statistical Planning and Inference, 149: 140-151.

3. Collaborative Research

During the first year of my PhD, I collaborated with researchers at the Seattle Children’s Research Institute to characterize liver transplantation offers to pediatric patients.

Publications

  • Evelyn Hsu, Michele Shaffer, Lucy L. Gao, Christopher Sonnenday, Michael Volk, John Bucuvalas and Jennifer Lai (2017) Analysis of liver offers to pediatric candidates on the transplant wait list. Gastroenterology, 153(4): 988-995.

Contact Me

Email: lucy dot gao at uwaterloo dot ca