I am a Canadian Ph.D. candidate in the Department of Biostatistics at the University of Washington, advised by Daniela Witten, with an expected graduation date of June 2020. I completed an Honours B.Sc. in Combined Mathematics and Statistics at the University of Victoria, where my research was advised by Julie Zhou.

My research interests are in:

  1. Using statistical machine learning to solve problems in complex data settings, such as multi-view data, network data, compositional data, and spatial data, and
  2. Applying novel optimization methods to the design of experiments.

These diverse directions are unified by the common theme of using optimization to solve problems in statistics.


1. Learning from Multi-View Data

In the multi-view data setting, multiple data sets (views) are available on a single common set of observations. For example, multivariate clinical and genomic data sets may be available on a single set of tissue samples, or we may have two network data sets, one describing physical interactions and the other describing co-membership in protein complexes, on a single set of proteins.


  • Lucy L. Gao, Jacob Bien and Daniela Witten (2019) Are clusterings of multiple data views independent? To appear in Biostatistics. [pdf] [cran]
    [Received a 2019 ASA Biometrics Section Student Travel Award.]


  • Lucy L. Gao, Daniela Witten and Jacob Bien (2020+) Testing for association in multi-view network data. [pdf] [cran]
    [Received a 2020 ASA Statistical Learning and Data Science Section Student Paper Award.]

2. Optimal Experiment Design

The number of replicates in an experiment limits the amount of information available, but we can maximize the information gained by carefully choosing the values of the experimental inputs. This is the central problem of optimal experiment design.


  • Lucy L. Gao and Julie Zhou (2017) D-optimal designs based on the second-order least squares estimator. Statistical Papers, 58(2): 77-94.
  • Lucy L. Gao and Julie Zhou (2014) New optimal design criteria for regression models with asymmetric errors. Journal of Statistical Planning and Inference, 149: 140-151.


  • Pengqi Liu, Lucy L. Gao and Julie Zhou (2020+) R-optimal designs for multi-response regression models with multi-factors. Submitted. [pdf]
  • Lucy L. Gao and Julie Zhou (2020+) Minimax D-optimal designs for multivariate regression models with multi-factors. Submitted. [pdf]

3. Collaborative Research

During the first year of my Ph.D., I collaborated with researchers at the Seattle Children’s Research Institute to characterize liver transplantation offers to pediatric patients.


  • Evelyn Hsu, Michele Shaffer, Lucy L. Gao, Christopher Sonnenday, Michael Volk, John Bucuvalas and Jennifer Lai (2017) Analysis of liver offers to pediatric candidates on the transplant wait list. Gastroenterology, 153(4): 988-995.

Contact Me

Address: Box 357232, University of Washington, Seattle, WA 98195-7232
Email: lucygao at uw dot edu