array(43) {
  ["project_title"]=>
  string(125) "Can Machine Learning Algorithms using General Labs Predict Biologic Remission for Patients on Thiopurines in the SONIC trial?"
  ["project_narrative_summary"]=>
  string(349) "We have applied machine learning to common lab values to predict biologic remission in patients with inflammatory bowel disease in our local cohort. Now, we want to externally validate this by applying our algorithm to general labs from week 6 and week 10 in SONIC to predict which patients will achieve Biologic Remission at week 26 and at week 50."
  ["project_learn_source"]=>
  string(11) "data_holder"
  ["project_learn_source_exp"]=>
  string(0) ""
  ["project_key_personnel"]=>
  array(6) {
    [0]=>
    array(6) {
      ["p_pers_f_name"]=>
      string(3) "Kay"
      ["p_pers_l_name"]=>
      string(6) "Sauder"
      ["p_pers_degree"]=>
      string(4) "B.S."
      ["p_pers_pr_affil"]=>
      string(22) "University of Michigan"
      ["p_pers_scop_id"]=>
      string(0) ""
      ["requires_data_access"]=>
      string(0) ""
    }
    [1]=>
    array(6) {
      ["p_pers_f_name"]=>
      string(5) "Akbar"
      ["p_pers_l_name"]=>
      string(6) "Waljee"
      ["p_pers_degree"]=>
      string(11) "M.D., M.Sc."
      ["p_pers_pr_affil"]=>
      string(22) "University of Michigan"
      ["p_pers_scop_id"]=>
      string(0) ""
      ["requires_data_access"]=>
      string(0) ""
    }
    [2]=>
    array(6) {
      ["p_pers_f_name"]=>
      string(5) "Yiwei"
      ["p_pers_l_name"]=>
      string(5) "Zhang"
      ["p_pers_degree"]=>
      string(4) "M.S."
      ["p_pers_pr_affil"]=>
      string(22) "University of Michigan"
      ["p_pers_scop_id"]=>
      string(0) ""
      ["requires_data_access"]=>
      string(0) ""
    }
    [3]=>
    array(6) {
      ["p_pers_f_name"]=>
      string(2) "Ji"
      ["p_pers_l_name"]=>
      string(3) "Zhu"
      ["p_pers_degree"]=>
      string(5) "Ph.D."
      ["p_pers_pr_affil"]=>
      string(22) "University of Michigan"
      ["p_pers_scop_id"]=>
      string(0) ""
      ["requires_data_access"]=>
      string(0) ""
    }
    [4]=>
    array(6) {
      ["p_pers_f_name"]=>
      string(5) "Boang"
      ["p_pers_l_name"]=>
      string(3) "Liu"
      ["p_pers_degree"]=>
      string(2) "BS"
      ["p_pers_pr_affil"]=>
      string(22) "University of Michigan"
      ["p_pers_scop_id"]=>
      string(0) ""
      ["requires_data_access"]=>
      string(0) ""
    }
    [5]=>
    array(6) {
      ["p_pers_f_name"]=>
      string(7) "Zachary"
      ["p_pers_l_name"]=>
      string(9) "Bearinger"
      ["p_pers_degree"]=>
      string(0) ""
      ["p_pers_pr_affil"]=>
      string(22) "University of Michigan"
      ["p_pers_scop_id"]=>
      string(0) ""
      ["requires_data_access"]=>
      string(0) ""
    }
  }
  ["project_ext_grants"]=>
  array(2) {
    ["value"]=>
    string(2) "no"
    ["label"]=>
    string(68) "No external grants or funds are being used to support this research."
  }
  ["project_funding_source"]=>
  string(16) "NIH R01 GM097117"
  ["project_assoc_trials"]=>
  array(1) {
    [0]=>
    object(WP_Post)#3686 (24) {
      ["ID"]=>
      int(1123)
      ["post_author"]=>
      string(4) "1363"
      ["post_date"]=>
      string(19) "2014-09-22 10:47:00"
      ["post_date_gmt"]=>
      string(19) "2014-09-22 10:47:00"
      ["post_content"]=>
      string(0) ""
      ["post_title"]=>
      string(337) "NCT00094458 - Multicenter, Randomized, Double-Blind, Active Controlled Trial Comparing REMICADE® (infliximab) and REMICADE plus Azathioprine to Azathioprine in the Treatment of Patients with Crohn’s Disease Naive to both Immunomodulators and Biologic Therapy (Study of Biologic and Immunomodulator Naive Patients in Crohn’s Disease)"
      ["post_excerpt"]=>
      string(0) ""
      ["post_status"]=>
      string(7) "publish"
      ["comment_status"]=>
      string(4) "open"
      ["ping_status"]=>
      string(4) "open"
      ["post_password"]=>
      string(0) ""
      ["post_name"]=>
      string(187) "nct00094458-multicenter-randomized-double-blind-active-controlled-trial-comparing-remicade-infliximab-and-remicade-plus-azathioprine-to-azathioprine-in-the-treatment-of-patients-with-croh"
      ["to_ping"]=>
      string(0) ""
      ["pinged"]=>
      string(0) ""
      ["post_modified"]=>
      string(19) "2023-02-06 13:12:16"
      ["post_modified_gmt"]=>
      string(19) "2023-02-06 13:12:16"
      ["post_content_filtered"]=>
      string(0) ""
      ["post_parent"]=>
      int(0)
      ["guid"]=>
      string(236) "https://dev-yoda.pantheonsite.io/clinical-trial/nct00094458-multicenter-randomized-double-blind-active-controlled-trial-comparing-remicade-infliximab-and-remicade-plus-azathioprine-to-azathioprine-in-the-treatment-of-patients-with-croh/"
      ["menu_order"]=>
      int(0)
      ["post_type"]=>
      string(14) "clinical_trial"
      ["post_mime_type"]=>
      string(0) ""
      ["comment_count"]=>
      string(1) "0"
      ["filter"]=>
      string(3) "raw"
    }
  }
  ["project_date_type"]=>
  string(18) "full_crs_supp_docs"
  ["property_scientific_abstract"]=>
  string(1671) "Background:
Machine learning applied to common lab results, including the CBC with differential and the comprehensive chemistry panel (plus age in days at time of blood draw), can predict biologic remission in patients with inflammatory bowel disease (ThioMon 2.0).
Objective:
Our primary objective is to determine if the ThioMon algorithm can predict which patients will reach biologic remission as determined by CRP and endoscopy scores by week 26.
Study Design:
For each subject, we will determine immunosuppression scores from week 6 and week 10 lab values. We will see if these scores can accurately predict which patients reached either biologic remission or clinical remission at week 26 and week 50.
Participants:
Subjects randomized in the SONIC trial who received thiopurines.
Main Outcome Measure(s):
1) The ability of ThioMon to predict biologic remission (based on CRP and mucosal healing) at week 26.
2) The ability of ThioMon to predict biologic remission (based on CRP) at week 50.
3) The ability of ThioMon to predict clinical remission (based on CDAI) at weeks 26 and 50.
Statistical Analysis:
We will use a Student?s T test or Mann-Whitney U test, to compare the following:
1.) Immunosuppression scores between biologic remission (BR) and non-BR groups at week 26 and week 50
2.) Immunosuppression scores between clinical remission (CR) and non-CR groups at week 26 and week 50
We will also report the AuROC, sensitivity, specificity, NPV, and PPV for each of the comparisons above, using an Immunosuppression score of 100 as the cut point." ["project_brief_bg"]=> string(2083) "Machine Learning Methods
Machine learning is a group of methods that optimize splits in datasets to predict important outcomes(1). Applications of machine leaning have improved the analysis of gene microarrays(2), proteomics results from mass spectrophotometry(3), predictions in financial markets(4), and algorithms to optimize signal and reduce noise in images(5). Machine learning is often applied by businesses to identify customers for a product based on their purchasing history, as in Amazon.com recommendations for books, or Google optimized searches. More recently, machine learning has been applied to clinical problems in which large complex datasets are available.
Machine Learning Predicts Thiopurine Treatment Success in IBD Patients
Numerous patients with inflammatory bowel disease (IBD) require treatment with thiopurines. Physicians often monitor the efficacy and safety of this low cost medication by following blood counts and blood chemistry. Alternatively, a commercially available test measures the amount of 6TGN metabolite present in a patient?s blood.
Dr. Higgins? research group used the machine learning approach to build an algorithm that predicted a patient?s response to thiopurine treatment (6). This algorithm was optimized to predict clinical response.
In this study of ThioMon 1.0, the machine learning algorithm based on blood metabolites was 86% accurate in predicting clinical response to thiopurine treatment, while commercially available blood metabolite measurements were only 59% accurate.
Since clinical symptoms can be subjective and not always indicative of inflammation present, the Machine learning algorithm has been improved by using objective biological evidence of inflammation to determine patient response to thiopurine treatment.
The ThioMon 2.0 algorithm (for biologic response) was developed using 3,269 patient cases, and is significantly more accurate than the 6-TGN metabolites for predicting biologic response to thiopurines. This algorithm is significantly more accurate (P" ["project_specific_aims"]=> string(763) "Our primary objective is to determine if the ThioMon 2.0 algorithm can use lab values from week 10 to predict which patients will reach biologic remission by week 26.
Our secondary objectives are to determine if the ThioMon 2.0 algorithm can use lab values from week 6 and also week 10 to predict which patients will reach biologic remission by week 50 or reach clinical remission at weeks 26 and 50. Also, to determine if we can correlate subjects' Immunosuppression Scores to their IFX (infliximab) levels and HACA (human anti-chimeric antibody) levels for subjects in cohort 3.
We hypothesize that our ThioMon 2.0 predictive algorithm will accurately predict which patients achieve Biologic Remission at week 26 based on lab values from week 10." ["project_study_design"]=> string(0) "" ["project_study_design_exp"]=> string(0) "" ["project_purposes"]=> array(0) { } ["project_purposes_exp"]=> string(0) "" ["project_software_used"]=> string(0) "" ["project_software_used_exp"]=> string(0) "" ["project_research_methods"]=> string(1229) "We would like subjects randomized into cohort 1 (azathioprine only) and cohort 3 (combination therapy with both infliximab and azathioprine). We will look at these aims in both cohorts. We expect that could be a larger effect in cohort 1, as any effect in cohort 3 could be confounded by the efficacy of infliximab. We will exclude participants who did not have active ulceration or an elevated CRP at baseline.
We will obtain the following data:
1.) A complete blood count with differential and platelet count (CBCPD) and a comprehensive metabolic panel (COMP CHEM) from week 6 and week 10 for each subject.
2.) IFX (infliximab) levels and HACA (human anti-chimeric antibody) levels from each subject in cohort 3.
3.) Age (in days) of each subject at week 6.
4.) Age (in days) of each subject at week 10.
5.) CRP results from baseline, week 26, and week 50 for each subject
6.) Endoscopy scores (or reports) from baseline and week 26 for each subject (Biologic remission [BR] = normal CRP and no ulceration on endoscopy)
7.) Clinical Remission status (including CDAI) for each subject at week 26 and week 50
8.) Gender, race, and medication doses for each subject" ["project_main_outcome_measure"]=> string(78) "Biologic Remission at week 26 will be our main outcome. It is defined as a CRP" ["project_main_predictor_indep"]=> string(378) "We will calculate an "Immunosuppression Score" for each subject using the following data:
1.) A complete blood count with differential and platelet count (CBCPD) from week 6 and week 10.
2.) A comprehensive metabolic panel (COMP CHEM) from week 6 and week 10.
3.) Age (in days) of each subject at week 6.
4.) Age (in days) of each subject at week 10." ["project_other_variables_interest"]=> string(206) "We will also look at IFX (infliximab) levels and HACA (human anti-chimeric antibody) levels from each subject in cohort 3 to see if we can correlate these levels to each subjects' "Immunosuppression Score."" ["project_stat_analysis_plan"]=> string(885) "We will use a student?s T test or Mann-Whitney U test, as appropriate to compare the following:
1.) Immunosuppression scores between biologic remission (BR) and non-BR groups at week 26 and week 50
2.) Immunosuppression scores between clinical remission (CR) and non-CR groups at week 26 and week 50
3.) Immunosuppression scores between subjects with an elevated IFX level and subjects without
4.) Immunosuppression scores between subjects with an elevated HACA level and subjects without
We will also report the AuROC, sensitivity, specificity, NPV, and PPV for each of the comparisons above, using an Immunosuppression score of 100 as the cut point.
We will also explore multivariate models using thiopurine dose, immunosuppression score, infliximab dose, age, gender, and other demographics to predict infliximab levels and HACA levels." ["project_timeline"]=> string(280) "Project start date: January 1, 2015 (or when data received)
Analysis completion date: March 1, 2015
First manuscript draft: April 1, 2015
Date of expected manuscript submission: May 1, 2015
Date of results reported back to the YODA Project: May 1, 2015" ["project_dissemination_plan"]=> string(258) "Study manuscript, target audience: gastroenterologists.
Likely journals: NEJM, Gut, Gastroenterology, American Journal of Gastroenterology.
Oral presentations at UEGW 2015 and ECCO 2016, or we might make the late-breaking deadline for DDW 2015." ["project_bibliography"]=> string(1100) "

1. Breiman L. Classification and regression trees. Belmont, Calif.: Wadsworth International Group; 1984
2. Zhu J, Hastie T. Classification of gene microarrays by penalized logistic regression. Biostatistics. 2004;5:427-443
3. Ulintz PJ, Zhu J, Qin ZS, et al. Improved classification of mass spectrometry database search results using newer machine learning approaches. Mol Cell Proteomics. 2006;5:497-509
4. Wang L, Zhu J. Financial Market Forecasting Using a Two-Step Kernel Learning Method for Support Vector Regression. Annals of Operations Research. 2008;174:103-120
5. Wang L, Zhu J. Image Denoising via Solution Paths. Annals of Operations Research. 2008;174:3-17
6. Waljee AK, Joyce JC, Wang S, et al. Algorithms outperform metabolite tests in predicting response of patients with inflammatory bowel disease to thiopurines. Clin Gastroenterol Hepatol. 2010;8:143-150
7. Bonafede MM, Gandra SR, Watson C, et al. Cost per treated patient for etanercept, adalimumab, and infliximab across adult indications: a claims analysis. Adv Ther. 2012;29:234?248

" ["project_suppl_material"]=> bool(false) ["project_coi"]=> array(3) { [0]=> array(1) { ["file_coi"]=> array(21) { ["ID"]=> int(9466) ["id"]=> int(9466) ["title"]=> string(20) "skmbt_22314121710230" ["filename"]=> string(24) "skmbt_22314121710230.pdf" ["filesize"]=> int(4253310) ["url"]=> string(73) "https://yoda.yale.edu/wp-content/uploads/2019/08/skmbt_22314121710230.pdf" ["link"]=> string(66) "https://yoda.yale.edu/data-request/2014-0401/skmbt_22314121710230/" ["alt"]=> string(0) "" ["author"]=> string(4) "1363" ["description"]=> string(0) "" ["caption"]=> string(0) "" ["name"]=> string(20) "skmbt_22314121710230" ["status"]=> string(7) "inherit" ["uploaded_to"]=> int(3832) ["date"]=> string(19) "2023-07-31 15:43:52" ["modified"]=> string(19) "2023-08-01 01:13:51" ["menu_order"]=> int(0) ["mime_type"]=> string(15) "application/pdf" ["type"]=> string(11) "application" ["subtype"]=> string(3) "pdf" ["icon"]=> string(62) "https://yoda.yale.edu/wp/wp-includes/images/media/document.png" } } [1]=> array(1) { ["file_coi"]=> array(21) { ["ID"]=> int(9772) ["id"]=> int(9772) ["title"]=> string(31) "yoda_project_coi_form_boang_liu" ["filename"]=> string(35) "yoda_project_coi_form_boang_liu.pdf" ["filesize"]=> int(89043) ["url"]=> string(84) "https://yoda.yale.edu/wp-content/uploads/2020/03/yoda_project_coi_form_boang_liu.pdf" ["link"]=> string(77) "https://yoda.yale.edu/data-request/2014-0401/yoda_project_coi_form_boang_liu/" ["alt"]=> string(0) "" ["author"]=> string(4) "1363" ["description"]=> string(0) "" ["caption"]=> string(0) "" ["name"]=> string(31) "yoda_project_coi_form_boang_liu" ["status"]=> string(7) "inherit" ["uploaded_to"]=> int(3832) ["date"]=> string(19) "2023-07-31 15:57:28" ["modified"]=> string(19) "2023-08-01 01:13:51" ["menu_order"]=> int(0) ["mime_type"]=> string(15) "application/pdf" ["type"]=> string(11) "application" ["subtype"]=> string(3) "pdf" ["icon"]=> string(62) "https://yoda.yale.edu/wp/wp-includes/images/media/document.png" } } [2]=> array(1) { ["file_coi"]=> array(21) { ["ID"]=> int(9771) ["id"]=> int(9771) ["title"]=> string(31) "yoda_project_coi_form_bearinger" ["filename"]=> string(35) "yoda_project_coi_form_bearinger.pdf" ["filesize"]=> int(759267) ["url"]=> string(84) "https://yoda.yale.edu/wp-content/uploads/2020/04/yoda_project_coi_form_bearinger.pdf" ["link"]=> string(77) "https://yoda.yale.edu/data-request/2014-0401/yoda_project_coi_form_bearinger/" ["alt"]=> string(0) "" ["author"]=> string(4) "1363" ["description"]=> string(0) "" ["caption"]=> string(0) "" ["name"]=> string(31) "yoda_project_coi_form_bearinger" ["status"]=> string(7) "inherit" ["uploaded_to"]=> int(3832) ["date"]=> string(19) "2023-07-31 15:57:26" ["modified"]=> string(19) "2023-08-01 01:13:51" ["menu_order"]=> int(0) ["mime_type"]=> string(15) "application/pdf" ["type"]=> string(11) "application" ["subtype"]=> string(3) "pdf" ["icon"]=> string(62) "https://yoda.yale.edu/wp/wp-includes/images/media/document.png" } } } ["data_use_agreement_training"]=> bool(true) ["certification"]=> bool(true) ["project_send_email_updates"]=> bool(true) ["project_status"]=> string(9) "published" ["project_publ_available"]=> bool(true) ["project_year_access"]=> string(4) "2015" ["project_rep_publ"]=> array(1) { [0]=> array(1) { ["publication_link"]=> array(3) { ["title"]=> string(32) "Clin Gastroenterol Hepatol. 2018" ["url"]=> string(44) "https://www.ncbi.nlm.nih.gov/pubmed/28838785" ["target"]=> string(6) "_blank" } } } ["project_assoc_data"]=> array(1) { [0]=> string(8) "data_res" } ["project_due_dil_assessment"]=> array(21) { ["ID"]=> int(11395) ["id"]=> int(11395) ["title"]=> string(47) "yoda_project_due_diligence_assessment_2014-0401" ["filename"]=> string(51) "yoda_project_due_diligence_assessment_2014-0401.pdf" ["filesize"]=> int(195724) ["url"]=> string(100) "https://yoda.yale.edu/wp-content/uploads/2023/08/yoda_project_due_diligence_assessment_2014-0401.pdf" ["link"]=> string(93) "https://yoda.yale.edu/data-request/2014-0401/yoda_project_due_diligence_assessment_2014-0401/" ["alt"]=> string(0) "" ["author"]=> string(4) "1363" ["description"]=> string(0) "" ["caption"]=> string(0) "" ["name"]=> string(47) "yoda_project_due_diligence_assessment_2014-0401" ["status"]=> string(7) "inherit" ["uploaded_to"]=> int(3832) ["date"]=> string(19) "2023-08-09 17:39:41" ["modified"]=> string(19) "2023-08-09 19:25:12" ["menu_order"]=> int(0) ["mime_type"]=> string(15) "application/pdf" ["type"]=> string(11) "application" ["subtype"]=> string(3) "pdf" ["icon"]=> string(62) "https://yoda.yale.edu/wp/wp-includes/images/media/document.png" } ["project_title_link"]=> array(21) { ["ID"]=> int(11125) ["id"]=> int(11125) ["title"]=> string(31) "yoda_project_proposal_2014-0401" ["filename"]=> string(35) "yoda_project_proposal_2014-0401.pdf" ["filesize"]=> int(26192) ["url"]=> string(84) "https://yoda.yale.edu/wp-content/uploads/2023/08/yoda_project_proposal_2014-0401.pdf" ["link"]=> string(77) "https://yoda.yale.edu/data-request/2014-0401/yoda_project_proposal_2014-0401/" ["alt"]=> string(0) "" ["author"]=> string(4) "1363" ["description"]=> string(0) "" ["caption"]=> string(0) "" ["name"]=> string(31) "yoda_project_proposal_2014-0401" ["status"]=> string(7) "inherit" ["uploaded_to"]=> int(3832) ["date"]=> string(19) "2023-08-09 17:26:31" ["modified"]=> string(19) "2023-08-09 19:25:12" ["menu_order"]=> int(0) ["mime_type"]=> string(15) "application/pdf" ["type"]=> string(11) "application" ["subtype"]=> string(3) "pdf" ["icon"]=> string(62) "https://yoda.yale.edu/wp/wp-includes/images/media/document.png" } ["project_review_link"]=> array(21) { ["ID"]=> int(10805) ["id"]=> int(10805) ["title"]=> string(36) "yoda_project_review_-_2014-0401_site" ["filename"]=> string(40) "yoda_project_review_-_2014-0401_site.pdf" ["filesize"]=> int(122514) ["url"]=> string(89) "https://yoda.yale.edu/wp-content/uploads/2023/08/yoda_project_review_-_2014-0401_site.pdf" ["link"]=> string(82) "https://yoda.yale.edu/data-request/2014-0401/yoda_project_review_-_2014-0401_site/" ["alt"]=> string(0) "" ["author"]=> string(4) "1363" ["description"]=> string(0) "" ["caption"]=> string(0) "" ["name"]=> string(36) "yoda_project_review_-_2014-0401_site" ["status"]=> string(7) "inherit" ["uploaded_to"]=> int(3832) ["date"]=> string(19) "2023-08-09 17:12:45" ["modified"]=> string(19) "2023-08-09 19:25:12" ["menu_order"]=> int(0) ["mime_type"]=> string(15) "application/pdf" ["type"]=> string(11) "application" ["subtype"]=> string(3) "pdf" ["icon"]=> string(62) "https://yoda.yale.edu/wp/wp-includes/images/media/document.png" } ["project_highlight_button"]=> array(3) { ["title"]=> string(22) "Publication Available!" ["url"]=> string(44) "https://www.ncbi.nlm.nih.gov/pubmed/28838785" ["target"]=> string(6) "_blank" } ["search_order"]=> string(5) "-9250" ["request_data_partner"]=> string(15) "johnson-johnson" ["request_overridden_res"]=> string(1) "1" } data partner
array(1) { [0]=> string(15) "johnson-johnson" }

pi country
array(1) { [0]=> string(13) "United States" }

pi affil
array(1) { [0]=> string(8) "Academia" }

products
array(1) { [0]=> string(8) "remicade" }

num of trials
array(1) { [0]=> string(1) "1" }

res
array(1) { [0]=> string(1) "1" }

2014-0401

General Information

How did you learn about the YODA Project?: Data Holder (Company)

Conflict of Interest

Request Clinical Trials

Associated Trial(s):
  1. NCT00094458 - Multicenter, Randomized, Double-Blind, Active Controlled Trial Comparing REMICADE® (infliximab) and REMICADE plus Azathioprine to Azathioprine in the Treatment of Patients with Crohn’s Disease Naive to both Immunomodulators and Biologic Therapy (Study of Biologic and Immunomodulator Naive Patients in Crohn’s Disease)
What type of data are you looking for?: Individual Participant-Level Data, which includes Full CSR and all supporting documentation

Request Clinical Trials

Data Request Status

Status: Published

Research Proposal

Project Title: Can Machine Learning Algorithms using General Labs Predict Biologic Remission for Patients on Thiopurines in the SONIC trial?

Scientific Abstract: Background:
Machine learning applied to common lab results, including the CBC with differential and the comprehensive chemistry panel (plus age in days at time of blood draw), can predict biologic remission in patients with inflammatory bowel disease (ThioMon 2.0).
Objective:
Our primary objective is to determine if the ThioMon algorithm can predict which patients will reach biologic remission as determined by CRP and endoscopy scores by week 26.
Study Design:
For each subject, we will determine immunosuppression scores from week 6 and week 10 lab values. We will see if these scores can accurately predict which patients reached either biologic remission or clinical remission at week 26 and week 50.
Participants:
Subjects randomized in the SONIC trial who received thiopurines.
Main Outcome Measure(s):
1) The ability of ThioMon to predict biologic remission (based on CRP and mucosal healing) at week 26.
2) The ability of ThioMon to predict biologic remission (based on CRP) at week 50.
3) The ability of ThioMon to predict clinical remission (based on CDAI) at weeks 26 and 50.
Statistical Analysis:
We will use a Student?s T test or Mann-Whitney U test, to compare the following:
1.) Immunosuppression scores between biologic remission (BR) and non-BR groups at week 26 and week 50
2.) Immunosuppression scores between clinical remission (CR) and non-CR groups at week 26 and week 50
We will also report the AuROC, sensitivity, specificity, NPV, and PPV for each of the comparisons above, using an Immunosuppression score of 100 as the cut point.

Brief Project Background and Statement of Project Significance: Machine Learning Methods
Machine learning is a group of methods that optimize splits in datasets to predict important outcomes(1). Applications of machine leaning have improved the analysis of gene microarrays(2), proteomics results from mass spectrophotometry(3), predictions in financial markets(4), and algorithms to optimize signal and reduce noise in images(5). Machine learning is often applied by businesses to identify customers for a product based on their purchasing history, as in Amazon.com recommendations for books, or Google optimized searches. More recently, machine learning has been applied to clinical problems in which large complex datasets are available.
Machine Learning Predicts Thiopurine Treatment Success in IBD Patients
Numerous patients with inflammatory bowel disease (IBD) require treatment with thiopurines. Physicians often monitor the efficacy and safety of this low cost medication by following blood counts and blood chemistry. Alternatively, a commercially available test measures the amount of 6TGN metabolite present in a patient?s blood.
Dr. Higgins? research group used the machine learning approach to build an algorithm that predicted a patient?s response to thiopurine treatment (6). This algorithm was optimized to predict clinical response.
In this study of ThioMon 1.0, the machine learning algorithm based on blood metabolites was 86% accurate in predicting clinical response to thiopurine treatment, while commercially available blood metabolite measurements were only 59% accurate.
Since clinical symptoms can be subjective and not always indicative of inflammation present, the Machine learning algorithm has been improved by using objective biological evidence of inflammation to determine patient response to thiopurine treatment.
The ThioMon 2.0 algorithm (for biologic response) was developed using 3,269 patient cases, and is significantly more accurate than the 6-TGN metabolites for predicting biologic response to thiopurines. This algorithm is significantly more accurate (P

Specific Aims of the Project: Our primary objective is to determine if the ThioMon 2.0 algorithm can use lab values from week 10 to predict which patients will reach biologic remission by week 26.
Our secondary objectives are to determine if the ThioMon 2.0 algorithm can use lab values from week 6 and also week 10 to predict which patients will reach biologic remission by week 50 or reach clinical remission at weeks 26 and 50. Also, to determine if we can correlate subjects' Immunosuppression Scores to their IFX (infliximab) levels and HACA (human anti-chimeric antibody) levels for subjects in cohort 3.
We hypothesize that our ThioMon 2.0 predictive algorithm will accurately predict which patients achieve Biologic Remission at week 26 based on lab values from week 10.

Study Design:

What is the purpose of the analysis being proposed? Please select all that apply.:

Software Used:

Data Source and Inclusion/Exclusion Criteria to be used to define the patient sample for your study: We would like subjects randomized into cohort 1 (azathioprine only) and cohort 3 (combination therapy with both infliximab and azathioprine). We will look at these aims in both cohorts. We expect that could be a larger effect in cohort 1, as any effect in cohort 3 could be confounded by the efficacy of infliximab. We will exclude participants who did not have active ulceration or an elevated CRP at baseline.
We will obtain the following data:
1.) A complete blood count with differential and platelet count (CBCPD) and a comprehensive metabolic panel (COMP CHEM) from week 6 and week 10 for each subject.
2.) IFX (infliximab) levels and HACA (human anti-chimeric antibody) levels from each subject in cohort 3.
3.) Age (in days) of each subject at week 6.
4.) Age (in days) of each subject at week 10.
5.) CRP results from baseline, week 26, and week 50 for each subject
6.) Endoscopy scores (or reports) from baseline and week 26 for each subject (Biologic remission [BR] = normal CRP and no ulceration on endoscopy)
7.) Clinical Remission status (including CDAI) for each subject at week 26 and week 50
8.) Gender, race, and medication doses for each subject

Primary and Secondary Outcome Measure(s) and how they will be categorized/defined for your study: Biologic Remission at week 26 will be our main outcome. It is defined as a CRP

Main Predictor/Independent Variable and how it will be categorized/defined for your study: We will calculate an "Immunosuppression Score" for each subject using the following data:
1.) A complete blood count with differential and platelet count (CBCPD) from week 6 and week 10.
2.) A comprehensive metabolic panel (COMP CHEM) from week 6 and week 10.
3.) Age (in days) of each subject at week 6.
4.) Age (in days) of each subject at week 10.

Other Variables of Interest that will be used in your analysis and how they will be categorized/defined for your study: We will also look at IFX (infliximab) levels and HACA (human anti-chimeric antibody) levels from each subject in cohort 3 to see if we can correlate these levels to each subjects' "Immunosuppression Score."

Statistical Analysis Plan: We will use a student?s T test or Mann-Whitney U test, as appropriate to compare the following:
1.) Immunosuppression scores between biologic remission (BR) and non-BR groups at week 26 and week 50
2.) Immunosuppression scores between clinical remission (CR) and non-CR groups at week 26 and week 50
3.) Immunosuppression scores between subjects with an elevated IFX level and subjects without
4.) Immunosuppression scores between subjects with an elevated HACA level and subjects without
We will also report the AuROC, sensitivity, specificity, NPV, and PPV for each of the comparisons above, using an Immunosuppression score of 100 as the cut point.
We will also explore multivariate models using thiopurine dose, immunosuppression score, infliximab dose, age, gender, and other demographics to predict infliximab levels and HACA levels.

Narrative Summary: We have applied machine learning to common lab values to predict biologic remission in patients with inflammatory bowel disease in our local cohort. Now, we want to externally validate this by applying our algorithm to general labs from week 6 and week 10 in SONIC to predict which patients will achieve Biologic Remission at week 26 and at week 50.

Project Timeline: Project start date: January 1, 2015 (or when data received)
Analysis completion date: March 1, 2015
First manuscript draft: April 1, 2015
Date of expected manuscript submission: May 1, 2015
Date of results reported back to the YODA Project: May 1, 2015

Dissemination Plan: Study manuscript, target audience: gastroenterologists.
Likely journals: NEJM, Gut, Gastroenterology, American Journal of Gastroenterology.
Oral presentations at UEGW 2015 and ECCO 2016, or we might make the late-breaking deadline for DDW 2015.

Bibliography:

1. Breiman L. Classification and regression trees. Belmont, Calif.: Wadsworth International Group; 1984
2. Zhu J, Hastie T. Classification of gene microarrays by penalized logistic regression. Biostatistics. 2004;5:427-443
3. Ulintz PJ, Zhu J, Qin ZS, et al. Improved classification of mass spectrometry database search results using newer machine learning approaches. Mol Cell Proteomics. 2006;5:497-509
4. Wang L, Zhu J. Financial Market Forecasting Using a Two-Step Kernel Learning Method for Support Vector Regression. Annals of Operations Research. 2008;174:103-120
5. Wang L, Zhu J. Image Denoising via Solution Paths. Annals of Operations Research. 2008;174:3-17
6. Waljee AK, Joyce JC, Wang S, et al. Algorithms outperform metabolite tests in predicting response of patients with inflammatory bowel disease to thiopurines. Clin Gastroenterol Hepatol. 2010;8:143-150
7. Bonafede MM, Gandra SR, Watson C, et al. Cost per treated patient for etanercept, adalimumab, and infliximab across adult indications: a claims analysis. Adv Ther. 2012;29:234?248