2025-0268 - The YODA Project

                    array(40) {
  ["project_status"]=>
  string(7) "ongoing"
  ["project_assoc_trials"]=>
  array(4) {
    [0]=>
    object(WP_Post)#5539 (24) {
      ["ID"]=>
      int(8008)
      ["post_author"]=>
      string(4) "1363"
      ["post_date"]=>
      string(19) "2023-08-05 04:44:39"
      ["post_date_gmt"]=>
      string(19) "2023-08-05 04:44:39"
      ["post_content"]=>
      string(0) ""
      ["post_title"]=>
      string(210) "NCT02489318 - A Phase 3 Randomized, Placebo-controlled, Double-blind Study of Apalutamide Plus Androgen Deprivation Therapy (ADT) Versus ADT in Subjects With Metastatic Hormone-sensitive Prostate Cancer (mHSPC)"
      ["post_excerpt"]=>
      string(0) ""
      ["post_status"]=>
      string(7) "publish"
      ["comment_status"]=>
      string(6) "closed"
      ["ping_status"]=>
      string(6) "closed"
      ["post_password"]=>
      string(0) ""
      ["post_name"]=>
      string(194) "nct02489318-a-phase-3-randomized-placebo-controlled-double-blind-study-of-apalutamide-plus-androgen-deprivation-therapy-adt-versus-adt-in-subjects-with-metastatic-hormone-sensitive-prostate-canc"
      ["to_ping"]=>
      string(0) ""
      ["pinged"]=>
      string(0) ""
      ["post_modified"]=>
      string(19) "2025-04-30 16:20:21"
      ["post_modified_gmt"]=>
      string(19) "2025-04-30 20:20:21"
      ["post_content_filtered"]=>
      string(0) ""
      ["post_parent"]=>
      int(0)
      ["guid"]=>
      string(243) "https://dev-yoda.pantheonsite.io/clinical-trial/nct02489318-a-phase-3-randomized-placebo-controlled-double-blind-study-of-apalutamide-plus-androgen-deprivation-therapy-adt-versus-adt-in-subjects-with-metastatic-hormone-sensitive-prostate-canc/"
      ["menu_order"]=>
      int(0)
      ["post_type"]=>
      string(14) "clinical_trial"
      ["post_mime_type"]=>
      string(0) ""
      ["comment_count"]=>
      string(1) "0"
      ["filter"]=>
      string(3) "raw"
    }
    [1]=>
    object(WP_Post)#5540 (24) {
      ["ID"]=>
      int(1920)
      ["post_author"]=>
      string(4) "1363"
      ["post_date"]=>
      string(19) "2021-11-23 15:30:00"
      ["post_date_gmt"]=>
      string(19) "2021-11-23 15:30:00"
      ["post_content"]=>
      string(0) ""
      ["post_title"]=>
      string(278) "NCT02257736 - A Phase 3 Randomized, Placebo-controlled Double-blind Study of JNJ-56021927 in Combination With Abiraterone Acetate and Prednisone Versus Abiraterone Acetate and Prednisone in Subjects With Chemotherapy-naive Metastatic Castration-resistant Prostate Cancer (mCRPC)"
      ["post_excerpt"]=>
      string(0) ""
      ["post_status"]=>
      string(7) "publish"
      ["comment_status"]=>
      string(6) "closed"
      ["ping_status"]=>
      string(6) "closed"
      ["post_password"]=>
      string(0) ""
      ["post_name"]=>
      string(197) "nct02257736-a-phase-3-randomized-placebo-controlled-double-blind-study-of-jnj-56021927-in-combination-with-abiraterone-acetate-and-prednisone-versus-abiraterone-acetate-and-prednisone-in-subjects-w"
      ["to_ping"]=>
      string(0) ""
      ["pinged"]=>
      string(0) ""
      ["post_modified"]=>
      string(19) "2025-04-30 16:19:18"
      ["post_modified_gmt"]=>
      string(19) "2025-04-30 20:19:18"
      ["post_content_filtered"]=>
      string(0) ""
      ["post_parent"]=>
      int(0)
      ["guid"]=>
      string(246) "https://dev-yoda.pantheonsite.io/clinical-trial/nct02257736-a-phase-3-randomized-placebo-controlled-double-blind-study-of-jnj-56021927-in-combination-with-abiraterone-acetate-and-prednisone-versus-abiraterone-acetate-and-prednisone-in-subjects-w/"
      ["menu_order"]=>
      int(0)
      ["post_type"]=>
      string(14) "clinical_trial"
      ["post_mime_type"]=>
      string(0) ""
      ["comment_count"]=>
      string(1) "0"
      ["filter"]=>
      string(3) "raw"
    }
    [2]=>
    object(WP_Post)#5541 (24) {
      ["ID"]=>
      int(1568)
      ["post_author"]=>
      string(4) "1363"
      ["post_date"]=>
      string(19) "2016-10-31 14:30:00"
      ["post_date_gmt"]=>
      string(19) "2016-10-31 14:30:00"
      ["post_content"]=>
      string(0) ""
      ["post_title"]=>
      string(223) "NCT00887198 - A Phase 3, Randomized, Double-blind, Placebo-Controlled Study of Abiraterone Acetate (CB7630) Plus Prednisone in Asymptomatic or Mildly Symptomatic Patients With Metastatic Castration-Resistant Prostate Cancer"
      ["post_excerpt"]=>
      string(0) ""
      ["post_status"]=>
      string(7) "publish"
      ["comment_status"]=>
      string(6) "closed"
      ["ping_status"]=>
      string(6) "closed"
      ["post_password"]=>
      string(0) ""
      ["post_name"]=>
      string(193) "nct00887198-a-phase-3-randomized-double-blind-placebo-controlled-study-of-abiraterone-acetate-cb7630-plus-prednisone-in-asymptomatic-or-mildly-symptomatic-patients-with-metastatic-castration-re"
      ["to_ping"]=>
      string(0) ""
      ["pinged"]=>
      string(0) ""
      ["post_modified"]=>
      string(19) "2025-04-30 15:32:35"
      ["post_modified_gmt"]=>
      string(19) "2025-04-30 19:32:35"
      ["post_content_filtered"]=>
      string(0) ""
      ["post_parent"]=>
      int(0)
      ["guid"]=>
      string(242) "https://dev-yoda.pantheonsite.io/clinical-trial/nct00887198-a-phase-3-randomized-double-blind-placebo-controlled-study-of-abiraterone-acetate-cb7630-plus-prednisone-in-asymptomatic-or-mildly-symptomatic-patients-with-metastatic-castration-re/"
      ["menu_order"]=>
      int(0)
      ["post_type"]=>
      string(14) "clinical_trial"
      ["post_mime_type"]=>
      string(0) ""
      ["comment_count"]=>
      string(1) "0"
      ["filter"]=>
      string(3) "raw"
    }
    [3]=>
    object(WP_Post)#5542 (24) {
      ["ID"]=>
      int(1249)
      ["post_author"]=>
      string(4) "1363"
      ["post_date"]=>
      string(19) "2014-10-20 14:57:00"
      ["post_date_gmt"]=>
      string(19) "2014-10-20 14:57:00"
      ["post_content"]=>
      string(0) ""
      ["post_title"]=>
      string(233) "NCT00638690 - A Phase 3, Randomized, Double-Blind, Placebo-Controlled Study of Abiraterone Acetate (CB7630) Plus Prednisone in Patients With Metastatic Castration-Resistant Prostate Cancer Who Have Failed Docetaxel-Based Chemotherapy"
      ["post_excerpt"]=>
      string(0) ""
      ["post_status"]=>
      string(7) "publish"
      ["comment_status"]=>
      string(6) "closed"
      ["ping_status"]=>
      string(6) "closed"
      ["post_password"]=>
      string(0) ""
      ["post_name"]=>
      string(193) "nct00638690-a-phase-3-randomized-double-blind-placebo-controlled-study-of-abiraterone-acetate-cb7630-plus-prednisone-in-patients-with-metastatic-castration-resistant-prostate-cancer-who-have-fa"
      ["to_ping"]=>
      string(0) ""
      ["pinged"]=>
      string(0) ""
      ["post_modified"]=>
      string(19) "2025-10-24 14:58:34"
      ["post_modified_gmt"]=>
      string(19) "2025-10-24 18:58:34"
      ["post_content_filtered"]=>
      string(0) ""
      ["post_parent"]=>
      int(0)
      ["guid"]=>
      string(242) "https://dev-yoda.pantheonsite.io/clinical-trial/nct00638690-a-phase-3-randomized-double-blind-placebo-controlled-study-of-abiraterone-acetate-cb7630-plus-prednisone-in-patients-with-metastatic-castration-resistant-prostate-cancer-who-have-fa/"
      ["menu_order"]=>
      int(0)
      ["post_type"]=>
      string(14) "clinical_trial"
      ["post_mime_type"]=>
      string(0) ""
      ["comment_count"]=>
      string(1) "0"
      ["filter"]=>
      string(3) "raw"
    }
  }
  ["project_title"]=>
  string(83) "Detection of Subgroup Treatment Effects with Missing Patient-Reported Outcomes Data"
  ["project_narrative_summary"]=>
  string(659) "Patient-reported outcomes (PROs) track treatment impact but are hard to analyze due to skewed data, missing responses, and repeated measurements over time. Current methods ignore these PRO specific issues or only estimate average effects instead of individualized benefits, which is the core of precision medicine. We develop a new statistical method to estimate personalized treatment effects (ITEs) from longitudinal PROs with missing data, focusing on quantiles (e.g., median) rather than averages. This method will make better identification of patient subgroups that benefit (or don’t) from treatments, improving decision-making for precision medicine."
  ["project_learn_source"]=>
  string(9) "colleague"
  ["principal_investigator"]=>
  array(7) {
    ["first_name"]=>
    string(4) "Yang"
    ["last_name"]=>
    string(2) "Li"
    ["degree"]=>
    string(19) "Ph.D. in Statistics"
    ["primary_affiliation"]=>
    string(26) "Renmin University of China"
    ["email"]=>
    string(18) "yang.li@ruc.edu.cn"
    ["state_or_province"]=>
    string(7) "Beijing"
    ["country"]=>
    string(5) "China"
  }
  ["project_key_personnel"]=>
  array(1) {
    [0]=>
    array(6) {
      ["p_pers_f_name"]=>
      string(7) "Shenran"
      ["p_pers_l_name"]=>
      string(4) "Deng"
      ["p_pers_degree"]=>
      string(58) "Master of Medicine (Epidemiological and health statistics)"
      ["p_pers_pr_affil"]=>
      string(48) "School of Statistics, Renmin University of China"
      ["p_pers_scop_id"]=>
      string(0) ""
      ["requires_data_access"]=>
      string(3) "yes"
    }
  }
  ["project_ext_grants"]=>
  array(2) {
    ["value"]=>
    string(2) "no"
    ["label"]=>
    string(68) "No external grants or funds are being used to support this research."
  }
  ["project_date_type"]=>
  string(18) "full_crs_supp_docs"
  ["property_scientific_abstract"]=>
  string(1686) "1.Background

Patient-reported outcome (PRO) measures are included in clinical trials to provide insights into patients' perspectives on treatment. The treatment effect can vary across subjects, which is defined as the individualized treatment effect (ITE), and are important in the decision making about clinical treatment. However, when it comes to the PRO data, it is challenging due to the skewness of responses and the presence of missing data. Current methods ignore these issues or only estimate average effects, not individualized benefits.

2.Objective

To propose a novel method to estimate the ITEs and detect patients with or without significant treatment effect by longitudinal PRO data with missingness.

3.Study Design

We have examined the efficiency of our proposed method in numerical experiments. We will apply the proposed method on the trial data to show the application of our method.

4.Participants

The study population in each trial.

5.Primary and Secondary Outcome Measure(s)

The Functional Assessment of Cancer Therapy – Prostate (FACT-P) score.

6.Statistical Analysis

We will use the trial data to estimate the ITE of each patient and detect the subgroup of patients with or without significant treatment effect. First, we model the ITE as a non-linear covariate specific treatment effect (CSTE) curve in a semi-parametric quantile model. Then to address the missingness, we use an inverse probability (IPW) method and estimate the missing probability by a logistic model. The within-subject correlation is handled by a weighting method borrowing information from a mean model.

"
  ["project_brief_bg"]=>
  string(2970) "Patient-Reported Outcomes (PROs) are increasingly assessed in clinical trials as primary, secondary,

or exploratory endpoints, providing the patients’ perspective of treatment and the impact of medical conditions (Mercieca-Bebber et al., 2018). The value of PRO data has been highlighted by regulatory bodies, as an important factor when assessing the economic, social, and ethical implications of the approval and use of a treatment.



Traditional analyses mostly considered the average treatment effect and the subgroup analysis examines whether treatment effects vary across specific patient subgroups defined by characteristics such as age, gender, or disease severity. However, the division of patient subgroup may lose information from the data, for the artificial dichotomisation of continuous covariates. It could also reduce statistical power and an increased risk of false-positive findings due to multiple comparisons. Consequently, some individuals may not experience the anticipated benefits from a treatment that appears effective on average.



This limitation underscores the need for methodologies capable of assessing heterogeneity in treatment effects and tailor treatments more effectively to individual patients. Biomedical researches have paid much attention on the topic of "personalized medicine", which aims to recommend individualize treatment regime (ITR) according to patients’ unique characteristics (Kent et al., 2018; Kosorok and Laber, 2019). A popular method to estimate the ITR is to  apply non-parametric and semi-parametric models on the difference in average outcomes between two treatment groups conditionally on predictive characteristics (Ma and Zhou, 2014; Han et al., 2017; Guo et al. 2021). However, existing methods have some drawbacks in analysis on the PRO data. Usually, the range of PRO scores is restricted by the minimum and maximum scores of questions, and thus, in general, the PRO scores are not normal. As an alternative to the mean regression, quantile regression (Koenker, 2005) has been extended to the analysis of data when the distribution of the response is quite different from a normal distribution. Recently, Chatla and Bhattacharya (2025) extended the model proposed in Guo et al. (2021) to a quantile regression model for estimation and inference of individualized quantile treatment effect.



We aim to propose a novel semi-parametric quantile model to estimate the individualized treatment effect (ITE) using PRO data which are longitudinal with monotone missing responses. We adopted the IPW method to reduce the bias arising from missingness in responses and borrow information from a mean model to handle the within-subject correlation induced by the longitudinal structure. The proposed method can estimated the ITE on the PRO data by which the patients can be allocated to the optimal treatment group and get more benefits from different treatments."
  ["project_specific_aims"]=>
  string(507) "Specific hypotheses to be evaluated:

Some characteristics are related to treatment effect, and some patients with some levels of characteristic will benefit from a new treatment while some will not.



The primary objective is to apply our proposed method to estimate the ITEs with longitudinal PRO data with missingness. We would like to emphasize that this analysis is intended to demonstrate the applicability of our method rather than to draw any medical conclusions from the results."
  ["project_study_design"]=>
  array(2) {
    ["value"]=>
    string(8) "meth_res"
    ["label"]=>
    string(23) "Methodological research"
  }
  ["project_purposes"]=>
  array(3) {
    [0]=>
    array(2) {
      ["value"]=>
      string(37) "develop_or_refine_statistical_methods"
      ["label"]=>
      string(37) "Develop or refine statistical methods"
    }
    [1]=>
    array(2) {
      ["value"]=>
      string(34) "research_on_clinical_trial_methods"
      ["label"]=>
      string(34) "Research on clinical trial methods"
    }
    [2]=>
    array(2) {
      ["value"]=>
      string(50) "research_on_clinical_prediction_or_risk_prediction"
      ["label"]=>
      string(50) "Research on clinical prediction or risk prediction"
    }
  }
  ["project_research_methods"]=>
  string(192) "NCT02257736,  NCT02489318, NCT00887198 and NCT00638690 are requested for this study. The IPD data from both trial will be used in its entirety.

There are no exclusion criteria defined. "
  ["project_main_outcome_measure"]=>
  string(221) " The primary outcome measure will be the Functional Assessment of Cancer Therapy – Prostate (FACT-P), with the definition consistent with that used in the trials (NCT02257736,  NCT02489318, NCT00887198 and NCT00638690)."
  ["project_main_predictor_indep"]=>
  string(320) " The main predictor is the treatment arm that patients allocated and those may be related to the severity of diseases, or to the prognosis, which are more likely to be related to the treatment effect (e.g. the ECOG score, the tumor stage and the PSA value). All of these variables will be defined as the original trials."
  ["project_other_variables_interest"]=>
  string(135) "Baseline demographical characteristics like age, gender, weight and BMI. All of these variables will be defined as the original trials."
  ["project_stat_analysis_plan"]=>
  string(1036) "Step 1: First, we will do a descriptive analysis of the trial data, giving a general picture of the missing patterns in the longitudinal observations. Specifically, we will summarize the dropout rate in each group by visits, and the percentage of subjects who have different numbers of missing items at each visit. We will also examine missing data in baseline covariates, since missing covariates is not the focus of this study, we will only include those covariates with very few missing data for later analysis. 

Step 2: Conduct exploratory analysis to find out the potential variables most related to the interested outcome.

Step 3: Fit a logistic regression model to estimate a weight related to missingness per subject per visit.

Step 4: Estimate the empirical likelihood to handle the within-subject correlation.

Step 5, Fit a weighted quantile semi-parametric model to estimate the ITE at different levels of quantile.

Step 6: Identify subgroups of patients who may benefit from the treatment."
  ["project_software_used"]=>
  array(1) {
    [0]=>
    array(2) {
      ["value"]=>
      string(7) "rstudio"
      ["label"]=>
      string(7) "RStudio"
    }
  }
  ["project_timeline"]=>
  string(369) "Our research has already begun and we have completed the design of model and numerical experiments. Manuscript writing aside from the real-data component will be completed by May 2025, and analysis on the trial data as well as the finalization of the manuscript is estimated to be completed by June 2025. the manuscript will be submitted for publication by August 2025."
  ["project_dissemination_plan"]=>
  string(114) "The manuscript will be submitted to journals in the field of statistical methods, such us Statistics in Medicine. "
  ["project_bibliography"]=>
  string(1282) "Chatla, S. B. and Bhattacharya, I. (2025). Estimation and inference of quantile treatment effects in high-dimensional single-index model. Econometrics and Statistics.
Guo, W., Zhou, X.-H., and Ma, S. (2021). Estimation of optimal individualized treatment rules using a covariate-specific treatment effect curve with high-dimensional covariates. Journal of the American Statistical Association, 116(533):309–321.
Han, K., Zhou, X., and Liu, B. (2017). Cste curve for selection the optimal treatment when outcome is binary. Sci China Math, 47:497–514.
Ma, Y. and Zhou, X.-H. (2014). Treatment selection in a randomized clinical trial via covariate-specifictreatment effect curves. Statistical methods in medical research, 26(1):124–141
Mercieca-Bebber, R., King, M. T., Calvert, M. J., Stockler, M. R., and Friedlander, M. (2018). The importance of patient-reported outcomes in clinical trials and strategies for future optimization. Patient related outcome measures, pages 353–367.
Kent, D. M., Steyerberg, E., and Van Klaveren, D. (2018). Personalized evidence based medicine: predictive approaches to heterogeneous treatment effects. Bmj, 363.
Koenker, R. (2005). Quantile regression, volume 38. Cambridge university press.
"
  ["project_suppl_material"]=>
  bool(false)
  ["project_coi"]=>
  array(2) {
    [0]=>
    array(1) {
      ["file_coi"]=>
      array(21) {
        ["ID"]=>
        int(17121)
        ["id"]=>
        int(17121)
        ["title"]=>
        string(19) "COI_Yang-Li-1-1.pdf"
        ["filename"]=>
        string(19) "COI_Yang-Li-1-1.pdf"
        ["filesize"]=>
        int(20304)
        ["url"]=>
        string(68) "https://yoda.yale.edu/wp-content/uploads/2025/04/COI_Yang-Li-1-1.pdf"
        ["link"]=>
        string(65) "https://yoda.yale.edu/data-request/2025-0268/coi_yang-li-1-1-pdf/"
        ["alt"]=>
        string(0) ""
        ["author"]=>
        string(4) "2079"
        ["description"]=>
        string(0) ""
        ["caption"]=>
        string(0) ""
        ["name"]=>
        string(19) "coi_yang-li-1-1-pdf"
        ["status"]=>
        string(7) "inherit"
        ["uploaded_to"]=>
        int(17117)
        ["date"]=>
        string(19) "2025-04-25 10:12:24"
        ["modified"]=>
        string(19) "2025-04-25 10:12:27"
        ["menu_order"]=>
        int(0)
        ["mime_type"]=>
        string(15) "application/pdf"
        ["type"]=>
        string(11) "application"
        ["subtype"]=>
        string(3) "pdf"
        ["icon"]=>
        string(62) "https://yoda.yale.edu/wp/wp-includes/images/media/document.png"
      }
    }
    [1]=>
    array(1) {
      ["file_coi"]=>
      array(21) {
        ["ID"]=>
        int(17122)
        ["id"]=>
        int(17122)
        ["title"]=>
        string(42) "SV_57KskaKADT3U9Aq-R_9NOoPQkS3UFzDMq-1.pdf"
        ["filename"]=>
        string(42) "SV_57KskaKADT3U9Aq-R_9NOoPQkS3UFzDMq-1.pdf"
        ["filesize"]=>
        int(20045)
        ["url"]=>
        string(91) "https://yoda.yale.edu/wp-content/uploads/2025/04/SV_57KskaKADT3U9Aq-R_9NOoPQkS3UFzDMq-1.pdf"
        ["link"]=>
        string(88) "https://yoda.yale.edu/data-request/2025-0268/sv_57kskakadt3u9aq-r_9noopqks3ufzdmq-1-pdf/"
        ["alt"]=>
        string(0) ""
        ["author"]=>
        string(4) "2079"
        ["description"]=>
        string(0) ""
        ["caption"]=>
        string(0) ""
        ["name"]=>
        string(42) "sv_57kskakadt3u9aq-r_9noopqks3ufzdmq-1-pdf"
        ["status"]=>
        string(7) "inherit"
        ["uploaded_to"]=>
        int(17117)
        ["date"]=>
        string(19) "2025-04-25 10:12:26"
        ["modified"]=>
        string(19) "2025-04-25 10:12:27"
        ["menu_order"]=>
        int(0)
        ["mime_type"]=>
        string(15) "application/pdf"
        ["type"]=>
        string(11) "application"
        ["subtype"]=>
        string(3) "pdf"
        ["icon"]=>
        string(62) "https://yoda.yale.edu/wp/wp-includes/images/media/document.png"
      }
    }
  }
  ["data_use_agreement_training"]=>
  bool(true)
  ["human_research_protection_training"]=>
  bool(true)
  ["certification"]=>
  bool(true)
  ["search_order"]=>
  string(1) "0"
  ["project_send_email_updates"]=>
  bool(false)
  ["project_publ_available"]=>
  bool(true)
  ["project_year_access"]=>
  string(4) "2025"
  ["project_rep_publ"]=>
  bool(false)
  ["project_assoc_data"]=>
  array(0) {
  }
  ["project_due_dil_assessment"]=>
  array(21) {
    ["ID"]=>
    int(17753)
    ["id"]=>
    int(17753)
    ["title"]=>
    string(47) "YODA Project Due Diligence Assessment 2025-0268"
    ["filename"]=>
    string(51) "YODA-Project-Due-Diligence-Assessment-2025-0268.pdf"
    ["filesize"]=>
    int(147969)
    ["url"]=>
    string(100) "https://yoda.yale.edu/wp-content/uploads/2025/04/YODA-Project-Due-Diligence-Assessment-2025-0268.pdf"
    ["link"]=>
    string(93) "https://yoda.yale.edu/data-request/2025-0268/yoda-project-due-diligence-assessment-2025-0268/"
    ["alt"]=>
    string(0) ""
    ["author"]=>
    string(4) "1885"
    ["description"]=>
    string(0) ""
    ["caption"]=>
    string(0) ""
    ["name"]=>
    string(47) "yoda-project-due-diligence-assessment-2025-0268"
    ["status"]=>
    string(7) "inherit"
    ["uploaded_to"]=>
    int(17117)
    ["date"]=>
    string(19) "2025-08-08 14:40:50"
    ["modified"]=>
    string(19) "2025-08-08 14:40:50"
    ["menu_order"]=>
    int(0)
    ["mime_type"]=>
    string(15) "application/pdf"
    ["type"]=>
    string(11) "application"
    ["subtype"]=>
    string(3) "pdf"
    ["icon"]=>
    string(62) "https://yoda.yale.edu/wp/wp-includes/images/media/document.png"
  }
  ["project_title_link"]=>
  array(21) {
    ["ID"]=>
    int(17754)
    ["id"]=>
    int(17754)
    ["title"]=>
    string(46) "YODA Project Protocol - 2025-0268 - 2025-04-30"
    ["filename"]=>
    string(46) "YODA-Project-Protocol-2025-0268-2025-04-30.pdf"
    ["filesize"]=>
    int(128232)
    ["url"]=>
    string(95) "https://yoda.yale.edu/wp-content/uploads/2025/04/YODA-Project-Protocol-2025-0268-2025-04-30.pdf"
    ["link"]=>
    string(88) "https://yoda.yale.edu/data-request/2025-0268/yoda-project-protocol-2025-0268-2025-04-30/"
    ["alt"]=>
    string(0) ""
    ["author"]=>
    string(4) "1885"
    ["description"]=>
    string(0) ""
    ["caption"]=>
    string(0) ""
    ["name"]=>
    string(42) "yoda-project-protocol-2025-0268-2025-04-30"
    ["status"]=>
    string(7) "inherit"
    ["uploaded_to"]=>
    int(17117)
    ["date"]=>
    string(19) "2025-08-08 14:41:07"
    ["modified"]=>
    string(19) "2025-08-08 14:41:07"
    ["menu_order"]=>
    int(0)
    ["mime_type"]=>
    string(15) "application/pdf"
    ["type"]=>
    string(11) "application"
    ["subtype"]=>
    string(3) "pdf"
    ["icon"]=>
    string(62) "https://yoda.yale.edu/wp/wp-includes/images/media/document.png"
  }
  ["project_review_link"]=>
  array(21) {
    ["ID"]=>
    int(17755)
    ["id"]=>
    int(17755)
    ["title"]=>
    string(36) "YODA Project Review - 2025-0268_site"
    ["filename"]=>
    string(38) "YODA-Project-Review-2025-0268_site.pdf"
    ["filesize"]=>
    int(1315677)
    ["url"]=>
    string(87) "https://yoda.yale.edu/wp-content/uploads/2025/04/YODA-Project-Review-2025-0268_site.pdf"
    ["link"]=>
    string(80) "https://yoda.yale.edu/data-request/2025-0268/yoda-project-review-2025-0268_site/"
    ["alt"]=>
    string(0) ""
    ["author"]=>
    string(4) "1885"
    ["description"]=>
    string(0) ""
    ["caption"]=>
    string(0) ""
    ["name"]=>
    string(34) "yoda-project-review-2025-0268_site"
    ["status"]=>
    string(7) "inherit"
    ["uploaded_to"]=>
    int(17117)
    ["date"]=>
    string(19) "2025-08-08 14:41:31"
    ["modified"]=>
    string(19) "2025-08-08 14:41:31"
    ["menu_order"]=>
    int(0)
    ["mime_type"]=>
    string(15) "application/pdf"
    ["type"]=>
    string(11) "application"
    ["subtype"]=>
    string(3) "pdf"
    ["icon"]=>
    string(62) "https://yoda.yale.edu/wp/wp-includes/images/media/document.png"
  }
  ["project_highlight_button"]=>
  string(0) ""
  ["request_data_partner"]=>
  string(15) "johnson-johnson"
  ["request_overridden_res"]=>
  string(1) "3"
}
data partner
array(1) {
  [0]=>
  string(15) "johnson-johnson"
}


pi country
array(0) {
}


pi affil
array(0) {
}


products
array(2) {
  [0]=>
  string(7) "erleada"
  [1]=>
  string(6) "zytiga"
}


num of trials
array(1) {
  [0]=>
  string(1) "4"
}


res
array(1) {
  [0]=>
  string(1) "3"
}

General Information

How did you learn about the YODA Project?: Colleague

Conflict of Interest

Request Clinical Trials

Associated Trial(s):

What type of data are you looking for?: Individual Participant-Level Data, which includes Full CSR and all supporting documentation

Request Clinical Trials

Data Request Status

Status: Ongoing

Research Proposal

Project Title: Detection of Subgroup Treatment Effects with Missing Patient-Reported Outcomes Data

Scientific Abstract: 1.Background
Patient-reported outcome (PRO) measures are included in clinical trials to provide insights into patients' perspectives on treatment. The treatment effect can vary across subjects, which is defined as the individualized treatment effect (ITE), and are important in the decision making about clinical treatment. However, when it comes to the PRO data, it is challenging due to the skewness of responses and the presence of missing data. Current methods ignore these issues or only estimate average effects, not individualized benefits.
2.Objective
To propose a novel method to estimate the ITEs and detect patients with or without significant treatment effect by longitudinal PRO data with missingness.
3.Study Design
We have examined the efficiency of our proposed method in numerical experiments. We will apply the proposed method on the trial data to show the application of our method.
4.Participants
The study population in each trial.
5.Primary and Secondary Outcome Measure(s)
The Functional Assessment of Cancer Therapy -- Prostate (FACT-P) score.
6.Statistical Analysis
We will use the trial data to estimate the ITE of each patient and detect the subgroup of patients with or without significant treatment effect. First, we model the ITE as a non-linear covariate specific treatment effect (CSTE) curve in a semi-parametric quantile model. Then to address the missingness, we use an inverse probability (IPW) method and estimate the missing probability by a logistic model. The within-subject correlation is handled by a weighting method borrowing information from a mean model.

Brief Project Background and Statement of Project Significance: Patient-Reported Outcomes (PROs) are increasingly assessed in clinical trials as primary, secondary,
or exploratory endpoints, providing the patients' perspective of treatment and the impact of medical conditions (Mercieca-Bebber et al., 2018). The value of PRO data has been highlighted by regulatory bodies, as an important factor when assessing the economic, social, and ethical implications of the approval and use of a treatment.

Traditional analyses mostly considered the average treatment effect and the subgroup analysis examines whether treatment effects vary across specific patient subgroups defined by characteristics such as age, gender, or disease severity. However, the division of patient subgroup may lose information from the data, for the artificial dichotomisation of continuous covariates. It could also reduce statistical power and an increased risk of false-positive findings due to multiple comparisons. Consequently, some individuals may not experience the anticipated benefits from a treatment that appears effective on average.

This limitation underscores the need for methodologies capable of assessing heterogeneity in treatment effects and tailor treatments more effectively to individual patients. Biomedical researches have paid much attention on the topic of "personalized medicine", which aims to recommend individualize treatment regime (ITR) according to patients' unique characteristics (Kent et al., 2018; Kosorok and Laber, 2019). A popular method to estimate the ITR is to apply non-parametric and semi-parametric models on the difference in average outcomes between two treatment groups conditionally on predictive characteristics (Ma and Zhou, 2014; Han et al., 2017; Guo et al. 2021). However, existing methods have some drawbacks in analysis on the PRO data. Usually, the range of PRO scores is restricted by the minimum and maximum scores of questions, and thus, in general, the PRO scores are not normal. As an alternative to the mean regression, quantile regression (Koenker, 2005) has been extended to the analysis of data when the distribution of the response is quite different from a normal distribution. Recently, Chatla and Bhattacharya (2025) extended the model proposed in Guo et al. (2021) to a quantile regression model for estimation and inference of individualized quantile treatment effect.

We aim to propose a novel semi-parametric quantile model to estimate the individualized treatment effect (ITE) using PRO data which are longitudinal with monotone missing responses. We adopted the IPW method to reduce the bias arising from missingness in responses and borrow information from a mean model to handle the within-subject correlation induced by the longitudinal structure. The proposed method can estimated the ITE on the PRO data by which the patients can be allocated to the optimal treatment group and get more benefits from different treatments.

Specific Aims of the Project: Specific hypotheses to be evaluated:
Some characteristics are related to treatment effect, and some patients with some levels of characteristic will benefit from a new treatment while some will not.

The primary objective is to apply our proposed method to estimate the ITEs with longitudinal PRO data with missingness. We would like to emphasize that this analysis is intended to demonstrate the applicability of our method rather than to draw any medical conclusions from the results.

Study Design: Methodological research

What is the purpose of the analysis being proposed? Please select all that apply.: Develop or refine statistical methods Research on clinical trial methods Research on clinical prediction or risk prediction

Software Used: RStudio

Data Source and Inclusion/Exclusion Criteria to be used to define the patient sample for your study: NCT02257736, NCT02489318, NCT00887198 and NCT00638690 are requested for this study. The IPD data from both trial will be used in its entirety.
There are no exclusion criteria defined.

Primary and Secondary Outcome Measure(s) and how they will be categorized/defined for your study: The primary outcome measure will be the Functional Assessment of Cancer Therapy -- Prostate (FACT-P), with the definition consistent with that used in the trials (NCT02257736, NCT02489318, NCT00887198 and NCT00638690).

Main Predictor/Independent Variable and how it will be categorized/defined for your study: The main predictor is the treatment arm that patients allocated and those may be related to the severity of diseases, or to the prognosis, which are more likely to be related to the treatment effect (e.g. the ECOG score, the tumor stage and the PSA value). All of these variables will be defined as the original trials.

Other Variables of Interest that will be used in your analysis and how they will be categorized/defined for your study: Baseline demographical characteristics like age, gender, weight and BMI. All of these variables will be defined as the original trials.

Statistical Analysis Plan: Step 1: First, we will do a descriptive analysis of the trial data, giving a general picture of the missing patterns in the longitudinal observations. Specifically, we will summarize the dropout rate in each group by visits, and the percentage of subjects who have different numbers of missing items at each visit. We will also examine missing data in baseline covariates, since missing covariates is not the focus of this study, we will only include those covariates with very few missing data for later analysis.
Step 2: Conduct exploratory analysis to find out the potential variables most related to the interested outcome.
Step 3: Fit a logistic regression model to estimate a weight related to missingness per subject per visit.
Step 4: Estimate the empirical likelihood to handle the within-subject correlation.
Step 5, Fit a weighted quantile semi-parametric model to estimate the ITE at different levels of quantile.
Step 6: Identify subgroups of patients who may benefit from the treatment.

Narrative Summary: Patient-reported outcomes (PROs) track treatment impact but are hard to analyze due to skewed data, missing responses, and repeated measurements over time. Current methods ignore these PRO specific issues or only estimate average effects instead of individualized benefits, which is the core of precision medicine. We develop a new statistical method to estimate personalized treatment effects (ITEs) from longitudinal PROs with missing data, focusing on quantiles (e.g., median) rather than averages. This method will make better identification of patient subgroups that benefit (or don't) from treatments, improving decision-making for precision medicine.

Project Timeline: Our research has already begun and we have completed the design of model and numerical experiments. Manuscript writing aside from the real-data component will be completed by May 2025, and analysis on the trial data as well as the finalization of the manuscript is estimated to be completed by June 2025. the manuscript will be submitted for publication by August 2025.

Dissemination Plan: The manuscript will be submitted to journals in the field of statistical methods, such us Statistics in Medicine.

Bibliography:

Chatla, S. B. and Bhattacharya, I. (2025). Estimation and inference of quantile treatment effects in high-dimensional single-index model. Econometrics and Statistics.

Guo, W., Zhou, X.-H., and Ma, S. (2021). Estimation of optimal individualized treatment rules using a covariate-specific treatment effect curve with high-dimensional covariates. Journal of the American Statistical Association, 116(533):309--321.

Han, K., Zhou, X., and Liu, B. (2017). Cste curve for selection the optimal treatment when outcome is binary. Sci China Math, 47:497--514.

Ma, Y. and Zhou, X.-H. (2014). Treatment selection in a randomized clinical trial via covariate-specifictreatment effect curves. Statistical methods in medical research, 26(1):124--141

Mercieca-Bebber, R., King, M. T., Calvert, M. J., Stockler, M. R., and Friedlander, M. (2018). The importance of patient-reported outcomes in clinical trials and strategies for future optimization. Patient related outcome measures, pages 353--367.

Kent, D. M., Steyerberg, E., and Van Klaveren, D. (2018). Personalized evidence based medicine: predictive approaches to heterogeneous treatment effects. Bmj, 363.

Koenker, R. (2005). Quantile regression, volume 38. Cambridge university press.