**3.2 Feature selection for the cohort selection**

The significant features from Section 3.1, which were specific to the target cohort, seemed promising in defining the drivers of the endometriosis condition onset, and hence, were selected to identify the patient base list for scorning. Therapeutics as well as medical and surgical procedure codes specific to endometriosis treatment such as Orilissa, Marilissa, and Lupron Depot, were excluded from the analysis to avoid introducing any biases into the next phase of the study. Around 9.5 million female patients age 18 and above qualified for the scoring process.
