Background

JMIR Cancer

2369-1999

JMIR Publications

Toronto, Canada

v7i2e23161

33843600

10.2196/23161

Original Paper

Data Integration to Improve Real-world Health Outcomes Research for Non–Small Cell Lung Cancer in the United States: Descriptive and Qualitative Exploration

Vollmer Dahlke

Deborah

Burke

Thomas

Hesam-Shariati

Negin

Grabner

Michael

PhD 1

HealthCore Inc

123 Justison Street

Wilmington, DE, 19801

United States 1 3022302000 mgrabner@healthcore.com

https://orcid.org/0000-0002-6521-1007

Molife

Cliff

MPH, PhD 2

https://orcid.org/0000-0001-8993-7024

Wang

Liya

PhD 1

https://orcid.org/0000-0003-0317-5685

Winfree

Katherine B

PhD 2

https://orcid.org/0000-0003-2088-1004

Cui

Zhanglin Lin

PhD 2

https://orcid.org/0000-0002-7034-0946

Cuyun Carter

Gebra

PhD 2

https://orcid.org/0000-0002-6549-0069

Hess

Lisa M

PhD 2

https://orcid.org/0000-0003-3631-3941

1 HealthCore Inc

Wilmington, DE

United States 2 Eli Lilly and Company

Indianapolis, IN

United States

Corresponding Author: Michael Grabner mgrabner@healthcore.com

Apr-Jun 2021

12 4 2021

7 2

e23161

4 8 2020 19 10 2020 29 1 2021 1 2 2021

©Michael Grabner, Cliff Molife, Liya Wang, Katherine B Winfree, Zhanglin Lin Cui, Gebra Cuyun Carter, Lisa M Hess. Originally published in JMIR Cancer (http://cancer.jmir.org), 12.04.2021.

2021

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Cancer, is properly cited. The complete bibliographic information, a link to the original publication on http://cancer.jmir.org/, as well as this copyright and license information must be included.

Background

The integration of data from disparate sources could help alleviate data insufficiency in real-world studies and compensate for the inadequacies of single data sources and short-duration, small sample size studies while improving the utility of data for research.

Objective

This study aims to describe and evaluate a process of integrating data from several complementary sources to conduct health outcomes research in patients with non–small cell lung cancer (NSCLC). The integrated data set is also used to describe patient demographics, clinical characteristics, treatment patterns, and mortality rates.

Methods

This retrospective cohort study integrated data from 4 sources: administrative claims from the HealthCore Integrated Research Database, clinical data from a Cancer Care Quality Program (CCQP), clinical data from abstracted medical records (MRs), and mortality data from the US Social Security Administration. Patients with lung cancer who initiated second-line (2L) therapy between November 01, 2015, and April 13, 2018, were identified in the claims and CCQP data. Eligible patients were 18 years or older and received atezolizumab, docetaxel, erlotinib, nivolumab, pembrolizumab, pemetrexed, or ramucirumab in the 2L setting. The main analysis cohort included patients with claims data and data from at least one additional data source (CCQP or MR). Patients without integrated data (claims only) were reported separately. Descriptive and univariate statistics were reported.

Results

Data integration resulted in a main analysis cohort of 2195 patients with NSCLC; 2106 patients had CCQP and 407 patients had MR data. The claims-only cohort included 931 eligible patients. For the main analysis cohort, the mean age was 62.1 (SD 9.27) years, 48.56% (1066/2195) were female, the median length of follow-up was 6.8 months, and for 37.77% (829/2195), death was observed. For the claims-only cohort, the mean age was 66.6 (SD 12.69) years, 52.1% (485/931) were female, the median length of follow-up was 8.6 months, and for 29.3% (273/931), death was observed. The most frequent 2L treatment was immunotherapy (1094/2195, 49.84%), followed by platinum-based regimens (472/2195, 21.50%) and single-agent chemotherapy (441/2195, 20.09%); mean duration of 2L therapy was 5.6 (SD 4.9, median 4) months. We describe challenges and learnings from the data integration process, and the benefits of the integrated data set, which includes a richer set of clinical and outcome data to supplement the utilization metrics available in administrative claims.

Conclusions

The management of patients with NSCLC requires care from a multidisciplinary team, leading to a lack of a single aggregated data source in real-world settings. The availability of integrated clinical data from MRs, health plan claims, and other sources of clinical care may improve the ability to assess emerging treatments.

non–small cell lung cancer cancer data aggregation real-world data administrative claims data medical records electronic health record retrospective study population health health services research

Introduction Background

Real-world health outcomes research is often challenged by data insufficiency resulting from studies using a single data source and/or short durations [1-3]. For example, medical records (MRs) generally do not contain details of care outside of the point of service of the single health care provider, claims data contain few variables related to clinical outcomes, and registries often do not contain complete longitudinal data [4-7]. The integration of clinical data from different sources such as MRs [8], disease registries, or quality initiatives with large administrative claims repositories has been shown to increase the volume and quality of available data [9-12]. For example, integrated data allow the inclusion of important clinical factors when analyzing health care utilization and costs, as recorded in claims [13]. Such integrated observational data sets have also been used to generate predictive algorithms to better identify patients with cancer [14-17] and their disease characteristics [18-20].

Lung cancer is the second most common cancer in the United States, with approximately 230,000 new diagnoses in 2020 [21]. It is the leading cause of cancer-related deaths in the United States, projected at 136,000 in 2020 [22]. Non–small cell lung cancer (NSCLC) accounts for approximately 85% of all lung cancer cases [23]. Treatment modalities for advanced and/or metastatic NSCLC include radiotherapy, chemotherapy, targeted therapy, or a combination therapy [24]. Over the last few years, second-line (2L) treatment options have expanded rapidly with the introduction of immune checkpoint and epidermal growth factor receptor inhibitors and associated predictive biomarkers [25].

Treatment sequencing in the setting of NSCLC is not well characterized, largely because of the sparseness of applicable studies, which tend to be limited by inadequate data. This study was designed based on the rationale that a combination of retrospective data from multiple sources, such as MRs, administrative claims, and care quality initiatives, would provide a solid foundation for observing and characterizing real-world treatment outcomes at a lower cost than a traditional site-based prospective approach.

Objectives

The central objective of this study is to create an integrated database from several complementary sources and to assess the feasibility and effectiveness of these integrated observational data for health outcomes research. Patient characteristics and outcomes were described to evaluate the enrichment attained through integration. This analysis presents a descriptive summary of the final study cohort that was obtained for the study.

Methods Study Design

RESOUNDS (Real-World Treatment Sequences and Outcomes Among Patients With Non-Small Cell Lung Cancer) was a retrospective, observational cohort study that integrated data from 4 sources: administrative claims from the HealthCore Integrated Research Database (HIRD), clinical data from a quality initiative called the Cancer Care Quality Program (CCQP), clinical data extracted from patients’ MRs obtained from treating providers, and all-cause mortality data from the Death Master File of the US Social Security Administration. Details of the RESOUNDS study design and each of these data sources have been published previously [26]. The study protocol was approved by the New England Institutional Review Board before the commencement of data collection activities. This study was conducted in full compliance with the relevant provisions of the Health Insurance Portability and Accountability Act.

Patient Identification

Patients diagnosed with lung cancer who initiated 2L therapy between November 01, 2015, and April 13, 2018, were identified in the HIRD and CCQP data. Patients were required to receive 1 of the following 2L therapies alone or in combination: atezolizumab, docetaxel, erlotinib, nivolumab, pembrolizumab, pemetrexed, or ramucirumab. This subset of the original set of therapies listed in the protocol [26] was selected based on treatment guidelines and observed frequency of use during the study period, to ensure sufficient sample sizes to evaluate treatment patterns. Patients aged under 18 years at the start of 2L therapy were excluded. Due to the absence of specific International Classification of Diseases, Ninth and Tenth Revision, Clinical Modification codes for NSCLC, cancer type was confirmed via CCQP or MR data. Follow-up for all-cause death events was conducted through March 31, 2019.

Integrated Database Development

Patients were first identified in the CCQP data, where information on the type of lung cancer (NSCLC or not) was available, and information for patients with a record of 2L therapies of interest was retained. All cancer stages were included in the analyses. Second, lung cancer diagnosis and treatment claims were used to identify patients with 2L treatment in the HIRD. Patients who also had claims for other primary cancers were retained. All patients identified in the CCQP data were also included in the HIRD sample; patients who appeared in the HIRD but not the CCQP were retained. Third, copies of MRs were obtained from selected patients’ 2L prescribers (focusing on oncologists, as identified in the HIRD) and screened for qualification (presence of evidence for NSCLC and that the index treatment was used as therapy for NSCLC). Regulatory and operational requirements for inclusion in this process consisted of patients having a fully insured status (vs administrative services only) and presence of complete contact information for the 2L prescriber. Once obtained and screened, clinical information was abstracted from each record by trained health information management technicians using a standardized form. The target sample size for MR abstraction was 398 patients, based on the expected feasible accrual over the 2.5-year patient identification period.

Data from each source were accumulated in 3 consecutive waves to continuously build the database. After each MR abstraction wave was complete, the claims and CCQP data were refreshed to the most current date at that point to obtain additional follow-up outcomes. The integrated data were used to establish the main analysis cohort, consisting of patients with both claims and either CCQP or MR data (or both). Eligible patients from the HIRD who did not appear in the CCQP and for whom no MRs were obtained were included in the claims-only cohort (these patients could have any type and stage of lung cancer).

Patient Characteristics and Outcomes

Demographic and clinical characteristics, treatment patterns, and survival outcomes were recorded. Baseline was defined as the 6 months before the index date (start of 2L therapy). The Quan-Charlson Comorbidity Index (QCI) was calculated, excluding lung cancer and metastatic carcinoma [27]. A patient was considered to be on the same line of therapy until new agents were added (except for maintenance and platinum agent switching), a gap of >90 days between treatments, end of follow-up, or (for 2L and higher) discontinuation. The percentage of patients flagged as deceased (for all causes) was calculated using a combination of the Death Master File, a hospitalization discharge code of deceased from claims, and mortality recorded from the health plan enrollment files.

Statistical Analysis

Univariate statistics including means, SDs, and medians for continuous variables and relative frequencies and percentages for categorical variables were reported. No hypothesis testing was performed. Statistical analysis was performed using SAS version 9.3 (SAS Institute Inc).

Results Data Integration and Patient Selection

Following data integration, the main analysis cohort consisted of 2195 patients. All patients had claims data, 2106 patients had CCQP data, and 407 patients had MR data (Table 1).

Approximately 47.14% (997/2115) of patients fulfilled regulatory and operational requirements for their MRs to be requested from their 2L-prescribing providers; for 54.5% (543/997) of those, the records were obtained. A large number of MRs were not obtained as outreach was stopped after the planned sample size (n=398) was achieved; others could not be obtained because the provider did not have a record of the particular patient or because of inability to contact the provider. Among the obtained records, the most frequent reason for exclusion was the absence of confirmation of NSCLC (43/543, 7.9% of the obtained records). The claims-only cohort comprised 931 patients. Table 2 details what variables were obtained from which source.

Table 1

Patient selection.

Criteria		First wave sample (patients, n)	Second wave sample^a (patients, n)	Third wave sample (patients, n)	Final sample^b (patients, n)
Step A: Patients identified from CCQP^c
	Step 1: Patients with non–small cell lung cancer	295	760	1428	—^d
	Step 2: From step A1, patients with 2L^e therapy^f	174	469	863	—
Step B: Patients identified from claims
	Step 1: Patients with lung cancer claim before start of first-line therapy	640	1058	2187	—
	Step 2: From step B1, patients with 2L therapy	368	600	1127	—
Step C: Combined patients from CCQP and claims
	Step 1: From A2 and B2, unique patients with 2L therapy	423	756	1732	2115
Step D: Patients considered for MR^g review
	Step 1: Patients used for MR outreach	149	279	718	997
	Step 2: Number of patient MRs obtained	102	194	349	543
	Step 3: Number of failed MRs^h	15	20	45	65
	Step 4: Not used (target had been met previously)	—	—	62	62
	Step 5: Final MRs used	87	174	242	416
Step E: Main analysis cohort (patients with claims and either CCQP or MR data)		272	791	1446	2195ⁱ
	Step 1: Patients with CCQP data	223	748	1399	2106
	Step 2: Patients with MR data	85	168	239	407
Step F: Claims-only cohort (patients with claims data only, no CCQP or MR data)		377	243	659	931ⁱ

^aSecond wave included all patients from the first wave.

^bThe final sample removed duplicates that were included in >1 wave. For those patients, information from the most recent wave was used for analysis.

^cCCQP: Cancer Care Quality Program.

^dNot available.

^e2L: second-line therapy.

^f2L medications of interest included atezolizumab, docetaxel, erlotinib, nivolumab, pembrolizumab, pemetrexed, or ramucirumab.

^gMR: medical record.

^hMedical records excluded due to one or more of the following: no documentation of lung cancer, no documentation of non–small cell lung cancer, and patient mismatch (missing or unmatched name, sex, or date of birth; wrong timeframe; inconsistent clinical information).

ⁱThese are the final sample sizes for the 2 cohorts of interest.

Table 2

Variable sourcing by database type.

Variable	HealthCore Integrated Research Database (claims)	Cancer Care Quality Program	Medical record
Length of follow-up	✓^a	—^b	—
Age	✓	—	✓
Gender	✓	—	✓
Health plan type	✓	—	—
Geographic region of patient residence	✓	—	—
Race/ethnicity	—	—	✓
Weight, height, and BMI	—	—	✓
Histology	—	✓	✓
Staging	Y^c	✓	✓
Treating physician specialty	✓	—	—
Smoking status	—	—	✓
Performance status (Eastern Cooperative Oncology Group)	—	✓	✓
Comorbidities	✓ (Quan-Charlson Comorbidity Index, secondary cancers)	—	—
Mortality	Z^d	—	—

^aIndicates variable was sourced from the data set listed in the column header.

^bVariable was not sourced from the data set listed in the column header.

^cIndicates the presence of claims for metastatic disease.

^dThis was based on the Death Master File data from the US Social Security Administration.

Demographic Characteristics at Baseline

In the main analysis cohort, mean age was 62.1 (SD 9.27) years and 48.56% (1066/2195) were female (Table 3), whereas in the claims-only cohort, mean age was 66.6 (SD 12.69) years and 52.1% (485/931) were female. More than two-thirds (1498/2195, 68.25%) of the main analysis cohort were from the Midwest and South, and 23.01% (505/2195) had Medicare Advantage or Supplemental and Part D coverage. In the claims-only cohort, patients were almost equally distributed across the West, Midwest, and South, with a smaller proportion (164/931, 17.6%) from the Northeast; almost half (457/931, 49.1%) had Medicare Advantage coverage. Treating physician specialty based on claims listed oncologists for 67.52% (1482/2195) of the main analysis population and for 30.7% (286/931) of the claims-only sample; this difference is by design as only patients whose 2L-prescribing providers were listed as oncologists were included in the MR phase. Among the 407 patients with MR data, 45.7% (186/407) were White, 3.7% (15/407) were Black, 3.2% (13/407) were other races, and 47.4% (193/407) had no race information. Race was not available in patients without MRs.

Table 3

Demographic characteristics at baseline (on or close to second-line therapy initiation date).

Variables		Main analysis cohort (n=2195)	Claims-only cohort (n=931)
Age at second-line therapy initiation (years), mean (SD)		62.1 (9.27)	66.6 (12.69)
Age categories (years), n (%)
	18-39	22 (1.0)	33 (3.5)
	40-64	1509 (68.7)	343 (36.8)
	65-74	412 (18.8)	278 (29.9)
	≥75	252 (11.5)	277 (29.8)
Female, n (%)		1066 (48.6)	485 (52.1)
Health plan type, n (%)
	Health maintenance organization	769 (35.0)	225 (24.2)
	Preferred provider organization	1126 (51.3)	628 (67.5)
	Consumer-driven health plan	300 (13.7)	78 (8.4)
Medicare Advantage^a, n (%)		505 (23.0)	457 (49.1)
Affordable Care Act exchange plan, n (%)		550 (25.1)	106 (11.4)
Geographic region of patient, n (%)
	Northeast	344 (15.7)	164 (17.6)
	Midwest	815 (37.1)	262 (28.1)
	South	683 (31.1)	274 (29.4)
	West	353 (16.1)	231 (24.8)
Treating physician specialty, n (%)
	Oncology	1482 (67.5)	286 (30.7)
	Pulmonary medicine	34 (1.5)	18 (1.9)
	Primary care provider	77 (3.5)	36 (3.9)
	Other	481 (21.9)	133 (14.3)
	Missing	121 (5.5)	458 (49.2)

^aIncludes Supplemental and Part D plans.

Clinical Characteristics at Baseline

In the main analysis cohort, the mean QCI was 1.6 (SD 1.59). The most frequent comorbidities were dyspnea (1417/2195, 64.56%), chronic pulmonary disease (1125/2195, 51.25%), hypertension (1073/2195, 48.88%), anemia (880/2195, 40.09%), and dyslipidemia (792/2195, 36.08%; Table 4). More than half of the main analysis cohort (1224/2195, 55.76%) had claims for additional or secondary malignancies and 79.41% (1743/2195) had claims for metastatic disease. In the claims-only cohort, the mean QCI was 1.8 (SD 1.69). The most frequently occurring comorbidities were hypertension (565/931, 60.7%), dyspnea (542/931, 58.2%), and dyslipidemia (403/931, 43.3%). Almost three-quarters (681/931, 73.1%) had codes for other malignancies and 67.9% (632/931) had codes for metastatic disease.

In the main analysis cohort, additional clinical information was available via CCQP and/or MRs (Table 5). Among the 407 patients with MR data, 59.2% (241/407) were former smokers, 16.5% (67/407) were current smokers, 14.3% (58/407) were never smokers, and 10.1% (41/407) had no documentation. Height and weight were available for the majority (341/407, 83.8% height; 371/407, 91.2% weight) of patients; mean BMI was 26.1 (SD 6.36). The most common cancer histology was adenocarcinoma (271/407, 66.6%); for most of the remainder, histology was not documented. Metastasis was noted in MRs for 95.1% (387/407) of the patients, most commonly to the lymph nodes (289/407, 71.0%). Eastern Cooperative Oncology Group (ECOG) performance status was available for 96.26% (2113/2195) of the sample, and an ECOG score ≥2 was observed in 21.20% (448/2113) of patients.

Table 4

Clinical characteristics from claims at baseline (over 6 months before second-line therapy initiation date).

Variables			Main analysis cohort (n=2195)		Claims-only cohort (n=931)
QCI^a, mean (SD)			1.6 (1.59)		1.8 (1.69)
QCI categories, n (%)
	0	570 (26.0)		230 (24.7)
	1	705 (32.1)		271 (29.1)
	2	414 (18.9)		185 (19.9)
	3-5	444 (20.2)		212 (22.8)
	6+	62 (2.8)		33 (3.5)
QCI comorbidities, n (%)
	Myocardial infarction	112 (5.1)		46 (4.9)
	Congestive heart failure	195 (8.9)		111 (11.9)
	Peripheral vascular disease	357 (16.3)		186 (20.0)
	Cerebrovascular disease	255 (11.6)		100 (10.7)
	Dementia	18 (0.8)		10 (1.1)
	Chronic pulmonary disease	1125 (51.2)		390 (41.9)
	Connective tissue/rheumatic disease	57 (2.6)		32 (3.4)
	Peptic ulcer disease	31 (1.4)		13 (1.4)
	Mild liver disease	421 (19.2)		162 (17.4)
	Moderate or severe liver disease	10 (0.5)		<10^b
	Paraplegia and hemiplegia	50 (2.3)		<10^b
	Renal disease	172 (7.8)		127 (13.6)
	Diabetes with chronic complications	96 (4.4)		75 (8.1)
	Diabetes without chronic complications	380 (17.3)		211 (22.7)
	Malignancy (excluding lung cancer)	1224 (55.8)		681 (73.1)
	Metastatic carcinoma	1743 (79.4)		632 (67.9)
	AIDS/HIV	<10^b		<10^b
Other comorbidities of interest, n (%)
	Anemia (any)	880 (40.1)		376 (40.4)
	Anemia due to chemotherapy	323 (14.7)		92 (9.9)
	Asthma	166 (7.6)		88 (9.5)
	Cardiac dysrhythmias	375 (17.1)		199 (21.4)
	Coronary heart disease	410 (18.7)		209 (22.4)
	Depression	338 (15.4)		139 (14.9)
	Dyslipidemia	792 (36.1)		402 (43.2)
	Dyspnea	1417 (64.6)		542 (58.2)
	Hypertension	1073 (48.9)		565 (60.7)
	Idiopathic fibrosis of the lung	15 (0.7)		<10^b
	Interstitial lung disease	29 (1.3)		<10^b
	Peripheral vascular disease	361 (16.4)		187 (20.1)
	Pneumonia	508 (23.1)		151 (16.2)
	Pneumonitis	29 (1.3)		16 (1.7)
	Pulmonary fibrosis	112 (5.1)		<10^b
	Stroke	255 (11.6)		100 (10.7)
	Thyroid disease	272 (12.4)		165 (17.7)
	Tuberculosis	<10^b		<10^b

^aQCI: Quan-Charlson Comorbidity Index.

^bValues <10 have not been reported for patient confidentiality.

Table 5

Clinical characteristics from Cancer Care Quality Program and/or medical records at baseline (on or close to second-line therapy initiation date).

Variables				Main analysis cohort
Information from MRs^a; valid N=407
	Smoking status, n (%)
		Current smoker	67 (16.5)
		Former smoker	241 (59.2)
		Never smoker	58 (14.3)
		Not documented	41 (10.1)
	Presence of number of years smoked, n (%)		201 (49.4)
		Number of years smoked, mean (SD)	36.1 (13.48)
	Presence of weight, n (%)		371 (91.2)
		Weight (pounds), mean (SD)	165.0 (44.48)
	Presence of height, n (%)		341 (83.8)
		Height (inches), mean (SD)	66.5 (3.88)
	Presence of BMI, n (%)		339 (83.3)
		BMI, mean (SD)	26.1 (6.36)
	Histology, n (%)
		Adenocarcinoma	271 (66.6)
		Large-cell carcinoma	9 (2.2)
		Bronchioloalveolar carcinoma	2 (0.5)
		Mixed	3 (0.7)
		Unspecified nonsquamous	2 (0.5)
		Other	4 (1.0)
		Unknown/not documented	116 (28.5)
	Presence of metastasis, n (%)		387 (95.1)
		Lymph nodes (thoracic region)	289 (71.0)
		Supraclavicular nodes	87 (21.4)
		Superior mediastinal nodes	201 (49.4)
		Aortic nodes	64 (15.7)
		Inferior mediastinal nodes	132 (32.4)
		Hilar, lobar, and/or (sub)segmental nodes	199 (48.9)
		Bone	190 (46.7)
		Other respiratory systems (not trachea)	163 (40.0)
		Brain	121 (29.7)
		Liver	72 (17.7)
		Adrenal gland	59 (14.5)
	Number of metastases sites, mean (SD)		3.2 (1.90)
Information from Cancer Care Quality Program and/or MRs; valid N=2195
	Eastern Cooperative Oncology Group performance status, n (%)		2113 (96.26)
		0	464 (21.96)
		1	1201 (56.84)
		2	364 (17.23)
		3	74 (3.50)
		4	10 (0.47)
		5	0 (0)
	TNM^b stage classification, n (%)		2146 (97.77)
		0	0 (0)
		1	<10
		2	32 (1.49)
		3	167 (7.78)
		4	1935 (90.17)
		Unknown or not documented	<10

^aMR: medical record.

^bTNM: tumor/lymph nodes/metastasis cancer staging system.

Length of Follow-Up and Mortality

The mean length of follow-up in months was 7.9 (SD 5.77) for the main analysis cohort (median 6.8) and 9.1 (SD 6.06) for the claims-only cohort (median 8.6). Death (for all causes) was observed in 37.77% (829/2195) of the main analysis cohort and 29.3% (273/931) of the claims-only cohort.

Treatment Patterns

Among the 1974 patients with first-line (1L) treatment information, 69.50% (1372/1974) used platinum-based regimens, 37.69% (744/1974) used pemetrexed-containing regimens, and 16.51% (326/1974) used single-agent chemotherapy (treatment groups are not mutually exclusive; Table 6). The mean duration of 1L therapy was 128 (median 90) days; 56.84% (1122/1974) switched to 2L therapy with a gap ≤90 days and 43.16% (852/1974) had a gap of >90 days before initiating 2L. The most frequent 2L treatment was immunotherapy (1094/2195, 49.84%), followed by platinum-based regimens (472/2195, 21.50%). The mean duration of 2L therapy was 169 (median 121) days; this variable was right-censored due to loss of follow-up. For patients with third- and/or fourth-line therapy (n=731 and 265, respectively), platinum-based regimens were used most frequently (418/731, 57.2% of third-line patients and 139/265, 52.5% of fourth-line patients), and 21.6% (158/731) of third-line patients and 20.4% (54/265) of fourth-line patients also used immunotherapy. Among the 269 patients who received radiation therapy after the initial diagnosis of NSCLC, 46.1% (124/269) patients received radiation therapy as a palliative treatment.

Table 6

Treatment patterns from Cancer Care Quality Program and claims, measured from the initiation of first-line treatment to the end of follow-up.

Therapy				Main analysis cohort (N=2195)
1L^a therapy, n (%)				1974 (89.9)
	Chemotherapy, n (%)
		Platinum-based regimen	1372 (69.5)
		Nonplatinum-based regimen	90 (4.6)
		Pemetrexed-containing regimen	744 (37.7)
		Single-agent chemotherapy	326 (16.5)
	Immunotherapy, n (%)
		PD-1/PD-(L)1^b inhibitor–containing regimen	241 (12.2)
	Targeted therapy, n (%)
		EGFR^c TKIs^d-containing regimen	98 (5.0)
		EGFR mAb^e-containing regimen	11 (0.6)
		VEGF^f mAb-containing regimen	308 (15.6)
		ALK^g inhibitor	21 (1.1)
	Duration of time (days) between initial lung cancer diagnosis and 1L treatment, mean (SD)			134.6 (380.98)
	Duration (days) of 1L therapy, mean (SD)^h			127.7 (142.75)
	Treatment change, n (%)
		Gap of ≤90 days before 2Lⁱ	1122 (56.8)
		Gap of >90 days before 2L	852 (43.2)
2L therapy, n (%)				2195 (100.0)
	Chemotherapy
		Platinum-based regimen	472 (21.5)
		Nonplatinum-based regimen	221 (10.1)
		Pemetrexed-containing regimen	344 (15.7)
		Single-agent chemotherapy	441 (20.1)
	Immunotherapy
		PD-1/PD-L1 inhibitor–containing regimen	1094 (49.8)
	Targeted therapy
		EGFR TKIs-containing regimen	36 (1.6)
		EGFR mAb-containing regimen	10 (0.5)
		VEGF mAb-containing regimen	141 (6.4)
		ALK inhibitor	<10^j
	Duration (days) of 2L therapy, mean (SD)^k			168.6 (148.4)
Radiation therapy following initial diagnosis of non-small cell lung cancer, n (%)				269 (12.3)
	Intent of radiation therapy, n (%)
		Curative	21 (7.8)
		Palliative	124 (46.1)
		Both curative and palliative (separate instances)	15 (5.6)
		Unknown	109 (40.5)

^a1L: first-line therapy.

^bPD-(L)1: programmed death-(ligand) 1.

^cEGFR: epidermal growth factor receptor.

^dTKI: tyrosine kinase inhibitor.

^emAb: monoclonal antibodies.

^fVEGF: vascular endothelial growth factor.

^gALK: anaplastic lymphoma kinase.

^hMedian 90.0.

ⁱ2L: second-line therapy.

^jValues <10 have not been reported for patient confidentiality.

^kMedian 121.0.

Discussion Principal Findings

This study combined 3 data sources for the analysis of real-world outcomes in patients with NSCLC, conducting data integration on a large scale across disparate but complementary sources. It was designed to simulate a prospective observational study by identifying patients upfront within large preexisting databases and then following them within the data set to examine outcomes. One of the potential strengths of this approach is the development of a database that includes demographic, clinical, and health care resource utilization data that can more accurately assess health outcomes.

The use of big data from multiple sources, such as health plan enrollment, disease registries, and scanned image repositories, among others, is becoming more important for the accurate determination of patient outcomes, particularly in the setting of NSCLC [28-31]. With the current availability of a wide range of newer, more effective systemic therapies, including several novel biologic agents, the use of diverse provider, institutional, and registry databases is increasingly necessary to evaluate outcomes due to the gaps in administrative claims data alone [32-35]. As treatments in oncology have improved, patients with lung cancer are living longer with the ability to personalize care with novel targeted therapies. This approach, coupled with more effective treatment, means that treatment strategies are increasingly complex, and factors influencing these strategies and their resultant outcomes are not fully identifiable in administrative claims data. As a result, the effective evaluation of treatment outcomes increasingly draws on data from multiple sources across lines of treatment, providers, and institutions.

Real-world evidence (RWE), which is largely derived from big health care data, has increasingly been driven by important technological advances, including machine learning, natural language processing improvements in electronic medical systems, and the ability to link clinical and health claims data in private and public systems [9]. As RWE grows and gains value, especially for pragmatic clinical trials (PCTs), the traditional gold standard of a randomized clinical trial (RCT) is facing major hurdles: low recruitment rates, small patient populations, long durations, and high costs. This evolving environment, along with growing interest in PCTs, is increasing the importance of big data and RWE as a complement to RCTs [36,37].

Furthermore, a bigger role for RWE is developing in decision making across the health care system, including regulators, payers, providers, and patients. Part of the reason is that although RCTs have internal validity, which is essential for safety and efficacy determinations, results from clinical studies may have limited external validity. At the same time, RWE studies using big data are able to explore key clinical questions that are outside the scope of RCTs. Such studies are well suited for investigations seeking safety and effectiveness outcomes data for broader target populations. This is especially valuable for the evaluation of fast-tracked medical products, which typically gain regulatory approval based on limited data. In addition, large RWE studies are invaluable in detecting the side effects of treatments over longer periods. Other circumstances in which RWE is valuable include exploration of rare diseases, assessing the impact of treatment adherence, when rapid retrospective results are needed, comparing multiple treatments that have not been explored in trials, and focusing on population subsets of interest, given more heterogeneity and larger population sizes in real-world data compared with clinical trials [36-38].

Due to the frequency of onset of NSCLC later in life, our study sample included patients with an average age greater than 60 years, with females constituting about half of the study population, which is consistent with other real-world US outcomes studies that examined patients with NSCLC [39-48]. All prior studies, to our knowledge, that focused on the United States used 1 or 2 of the following data sources: administrative claims, registry data, or MR. Limitations of these studies fall into 2 categories: (1) missing data on potential confounders and/or outcomes of interest (eg, claims data can assess utilization outcomes but lack disease characteristics; MR data have a rich set of clinical characteristics but lack longitudinality and utilization or cost data) and (2) limited generalizability (eg, the SEER-Medicare linked data in the United States capture claims and cancer registry data only for patients aged 65 years or older).

The ability of our study to integrate data across 3 sources to create a cohort of NSCLC patients with rich clinical and economic data offers an important addition to the comparatively small body of data on the performance of data integration methods and the determination of health outcomes based on these data for patients with NSCLC. To the extent that our study sample reflects the larger national population affected by lung cancer and with commercial insurance, these data could be instructive for a range of decisions made by multiple health care stakeholders including providers and patients requiring insights into the allocation of resources and overall disease management that cannot be completely ascertained from a single data source alone. One example would be the interaction of biomarker testing, treatment choice, and health outcomes. Integrated data sets such as RESOUNDS that can be refreshed regularly also offer many opportunities for future research, such as treatment sequencing, disease progression, and health care resource utilization and costs.

Data Integration Challenges

Our study also highlighted some challenges in the creation, maintenance, and analysis of large integrated data sets. Integration of data sets in the midst of a rapid shift in the treatment landscape (such as the introduction of immune checkpoint inhibitors for oncology) may impact the value of data sets that are large and deep, but that include periods of time that are no longer relevant to current standards of care. The maintenance of these data sets requires constant refresh and update, so that the periods of interest to the investigator can be current and available for analysis. The wealth of data available in MRs presents challenges in identifying the trade-offs between generating a limited set of relevant but reasonably quickly available data versus a broader set of data that is potentially available but more difficult to obtain and prepare for analysis. Methods of data integration and data extraction may be improved with machine learning or natural language processing to reduce the manual extraction via data collection forms that was used in this study. Patient sample sizes available for analysis diminish when multiple data sources are required. Finally, there were specific data integration challenges in our study that resulted in additional effort needed by the project team to understand and address (eg, the estimated 2L therapy start date for a given patient sometimes differed between the data sources, plan enrollment changes entailed patients leaving or entering the data set multiple times, and conflicts between data sources for a given variable had to be resolved).

Study Limitations

Results based on integrated data must also be viewed with some limitations. The data quality and content will depend on the underlying data selected for integration. Specific to the data used for this project, limitations include the following: CCQP data were collected at the time of the prior authorization request, not at diagnosis. CCQP offers incentives to physicians for treating according to evidence-based guidelines created by the health plan, which could have influenced treatment choices. MR data may be underreported or missing due to vague, incomplete, or illegible entries; the inability to locate the required information; or missing patient files. ECOG performance status, a standard data item in cancer trials, is not always assessed in real-world patient care settings (in our study, this variable was available for 96.26% (2113/2195) of the sample, mostly from the CCQP), and information on race/ethnicity is often missing in claims data. Similarly, tumor growth and progression information is collected in various formats and levels of detail outside of a clinical trial setting. As a result, some of our research questions of interest had underpopulated data. Efforts by payers to tie provider reimbursement to the collection of key data points, for example, through quality improvement initiatives, may over time alleviate some of the missing data issues. Data collected during MR abstraction may have measurement errors linked to inconsistent coding, transcription, and data transfer errors. The typical limitations of claims data also apply. For example, a diagnosis code on a medical claim (eg, for secondary malignancies) does not guarantee the presence of a disease. Similarly, a claim for a prescription fill does not indicate that the medication was consumed or taken as prescribed. The generalizability of claims-based results is confined to similarly insured populations (eg, commercial, US-based in this study).

Conclusions

The care of patients with NSCLC requires a range of resources in a variety of settings in the real world. NSCLC and other forms of cancer are increasingly being managed like chronic diseases with a broad range of increasingly effective treatments. The assessment of real-world data to evaluate outcomes among patients with NSCLC will require the integration of a broad range of clinical data with health plan claims data. Overcoming data integration and completeness challenges will allow better informed decision making by all stakeholders of the health care system.

Abbreviations

first-line

second-line

CCQP

Cancer Care Quality Program

ECOG

Eastern Cooperative Oncology Group

HIRD

HealthCore Integrated Research Database

medical record

NSCLC

non–small cell lung cancer

QCI

Quan-Charlson Comorbidity Index

RCT

randomized clinical trial

RESOUNDS

Real-World Treatment Sequences and Outcomes Among Patients With Non-Small Cell Lung Cancer

RWE

real-world evidence

Funding for the study was provided to HealthCore, Inc by Eli Lilly and Company. Bernard Tulsi, an employee of HealthCore, Inc at the time of the study, provided writing and editorial support for this manuscript.

MG is an employee of HealthCore, Inc, an independent research organization that received funding from Eli Lilly and Company for the conduct of this study. CM, KW, ZC, and LH are employees and stockholders of Eli Lilly and Company. GC was an employee of Eli Lilly and Company at the time the study was conducted. LW was an employee of HealthCore at the time the study was conducted.

Garrison

Neumann

Erickson

Marshall

Mullins

Using real-world data for coverage and payment decisions: the ISPOR Real-World Data Task Force report

Value Health 2007 10 5 326 35

10.1111/j.1524-4733.2007.00186.x

17888097

S1098-3015(10)60470-6

Mahajan

Real world data: additional source for making clinical decisions

Int J Appl Basic Med Res 2015 5 2 82

10.4103/2229-516X.157148

26097811

IJABMR-5-82

PMC4456898

National Academies of Sciences, Engineering, and Medicine Health and Medicine Division Board on Health Sciences Policy Forum on Drug Discovery, Development, and Translation

Examining the impact of real-world evidence on medical product development

Proceedings published by the National Academies of Sciences, Engineering, and Medicine chronicle the presentations and discussions at a workshop, symposium, or other event convened by the National Academies 2019

10.17226/25352

30964617

Brooks

Bergquist

Landrum

Rose

Keating

Classifying Stage IV lung cancer from health care claims: a comparison of multiple analytic approaches

JCO Clin Cancer Inform 2019 05 3 1 19

10.1200/CCI.18.00156

31070985

PMC6873980

Hess

Winfree

Muehlenbein

Zhu

Oton

Princic

Aggarwal

Debunking myths while understanding limitations

Am J Public Health 2020 05 110 5 e2

10.2105/AJPH.2020.305603

32267743

PMC7144428

Jairam

Park

Strengths and limitations of large databases in lung cancer radiation oncology research

Transl Lung Cancer Res 2019 09 8 Suppl 2 172 83

10.21037/tlcr.2019.05.06

31673522

tlcr-08-S2-S172

PMC6795574

Takahashi

Nishida

Asai

Utilization of health care databases for pharmacoepidemiology

Eur J Clin Pharmacol 2012 02 68 2 123 9

10.1007/s00228-011-1088-2

21808989

Berner

Detmer

Simborg

Will the wave finally break? A brief view of the adoption of electronic medical records in the United States

J Am Med Inform Assoc 2005 12 1 3 7

10.1197/jamia.M1664

15492029

M1664

PMC543824

Agiro

Chen

Eshete

Sutphen

Clark

Burroughs

Nowell

Curtis

Loud

McBurney

Merkel

Sreih

Young

Haynes

Data linkages between patient-powered research networks and health plans: a foundation for collaborative research

J Am Med Inform Assoc 2019 07 01 26 7 594 602

10.1093/jamia/ocz012

30938759

5426086

PMC7647185

Chung

Shambhu

Roe

Cziraky

Jones

Haynes

Administrative claims data to support pragmatic clinical trial outcome ascertainment on cardiovascular health

Clin Trials 2019 08 16 4 419 30

10.1177/1740774519846853

31081367

Pine

Jordan

Elixhauser

Fry

Hoaglin

Jones

Meimban

Warner

Gonzales

Enhancement of claims data to improve risk adjustment of hospital mortality

J Am Med Assoc 2007 01 03 297 1 71 6

10.1001/jama.297.1.71

17200477

297/1/71

Wilson

Bock

The benefit of using both claims data and electronic medical record data in health care analysis

Optum White Paper 2012

2021-03-03

https://www.optum.com/content/dam/optum/resources/whitePapers/Benefits-of-using-both-claims-and-EMR-data-in-HC-analysis-WhitePaper-ACS.pdf

Navaratnam

Sasane

Lawrence

DFE

Friedman

Tulsi

Vollmer

Determinants of high cost in multiple sclerosis patients: a claims and chart review study

Curr Med Res Opin 2016 09 32 9 1589 97

10.1080/03007995.2016.1192529

27207562

Beachler

de Luise

Yin

Gangemi

Cochetti

Lanes

Predictive model algorithms identifying early and advanced stage ER+/HER2- breast cancer in claims data

Pharmacoepidemiol Drug Saf 2019 02 28 2 171 8

10.1002/pds.4681

30411431

Bronson

Kapadia

Austin

Wang

Feskanich

Bynum

JPW

Grodstein

Tosteson

ANA

Leveraging linkage of cohort studies with administrative claims data to identify individuals with cancer

Med Care 2018 12 56 12 83 9

10.1097/MLR.0000000000000875

29334524

PMC6043405

Turner

Chen

Fernandes

Validation of a case-finding algorithm for identifying patients with Non-small Cell Lung Cancer (NSCLC) in administrative claims databases

Front Pharmacol 2017 8 883

10.3389/fphar.2017.00883

29249970

PMC5714924

Parlett

Beachler

Lanes

Hoover

Cook

Validation of an algorithm for claims-based incidence of prostate cancer

Epidemiology 2019 05 30 3 466 71

10.1097/EDE.0000000000001007

30829831

PMC6456424

Clarke

Feigelson

Developing an algorithm to identify history of cancer using electronic medical records

EGEMS (Wash DC) 2016 4 1 1209

10.13063/2327-9214.1209

27195308

egems1209

PMC4862761

Sathiakumar

Delzell

Yun

Jooste

Godby

Falkson

Yong

Kilgore

Accuracy of medicare claim-based algorithm to detect breast, prostate, or lung cancer bone metastases

Med Care 2017 12 55 12 144 9

10.1097/MLR.0000000000000539

29135778

00005650-201712000-00022

Uno

Ritzwoller

Cronin

Carroll

Hornbrook

Hassett

Determining the time of cancer recurrence using claims or electronic medical record data

JCO Clin Cancer Inform 2018 12 2 1 10

10.1200/CCI.17.00163

30652573

PMC6338474

Siegel

Miller

Jemal

Cancer statistics, 2020

CA Cancer J Clin 2020 01 70 1 7 30

10.3322/caac.21590

31912902

American Cancer Society 2021-03-03

https://www.cancer.org/research/cancer-facts-statistics/all-cancer-facts-figures/cancer-facts-figures-2020.html

Houston

Henley

White

Richards

Patterns in lung cancer incidence rates and trends by histologic type in the United States, 2004-2009

Lung Cancer 2014 10 86 1 22 8

10.1016/j.lungcan.2014.08.001

25172266

S0169-5002(14)00323-7

PMC5823254

Kalemkerian

Loo

Akerley

Attia

Bassetti

Boumber

Decker

Dobelbower

Dowlati

Downey

Florsheim

Ganti

AKP

Grecula

Gubens

Hann

Hayman

Heist

Koczywas

Merritt

Mohindra

Molina

Moran

Morgensztern

Pokharel

Portnoy

Rhodes

Rusthoven

Sands

Santana-Davila

Williams

Hoffmann

Hughes

NCCN guidelines insights: small cell lung cancer, version 2.2018

J Natl Compr Canc Netw 2018 10 16 10 1171 82

10.6004/jnccn.2018.0079

30323087

jnccn_16_10_006

Morabito

Second-line treatment for advanced NSCLC without actionable mutations: is immunotherapy the 'panacea' for all patients?

BMC Med 2018 02 16 16 1 24

10.1186/s12916-018-1011-0

29448944

10.1186/s12916-018-1011-0

PMC5815183

Hess

Kern

Carter

Winfree

Wang

Sontag

Oton

Real-world treatment sequences and outcomes among patients with non-small cell lung cancer (RESOUNDS) in the United States: study protocol

JMIR Res Protoc 2017 10 11 6 10 e195

10.2196/resprot.7750

29021129

v6i10e195

PMC5658639

Quan

Couris

Fushimi

Graham

Hider

Januel

Sundararajan

Updating and validating the Charlson comorbidity index and score for risk adjustment in hospital discharge abstracts using data from 6 countries

Am J Epidemiol 2011 03 15 173 6 676 82

10.1093/aje/kwq433

21330339

kwq433

Asan

Nattinger

Gurses

Tyszka

Yen

TWF

Oncologists' views regarding the role of electronic health records in care coordination

JCO Clin Cancer Inform 2018 12 2 1 12

10.1200/CCI.17.00118

30652555

Cortinovis

Abbate

Bidoli

Pelizzoni

Canova

Interpretation of lung cancer study outcomes

J Thorac Dis 2015 11 7 11 E541 7

10.3978/j.issn.2072-1439.2015.11.26

26716052

jtd-07-11-E541

PMC4669287

Tevaarwerk

Wisinski

Buhr

Njiaju

Tun

Donohue

Sekhon

Yen

Wiegmann

Sesto

Leveraging electronic health record systems to create and provide electronic cancer survivorship care plans: a pilot study

J Oncol Pract 2014 05 10 3 e150 9

10.1200/JOP.2013.001115

24520142

JOP.2013.001115

PMC4018457

Tan

Chen

Zhao

Decision based on big data research for non-small cell lung cancer in medical artificial system in developing country

Comput Methods Programs Biomed 2018 06 159 87 101

10.1016/j.cmpb.2018.03.004

29650322

S0169-2607(17)30830-1

Borghaei

Paz-Ares

Horn

Spigel

Steins

Ready

Chow

Vokes

Felip

Holgado

Barlesi

Kohlhäufl

Arrieta

Burgio

Fayette

Lena

Poddubskaya

Gerber

Gettinger

Rudin

Rizvi

Crinò

Blumenschein

Antonia

Dorange

Harbison

Graf Finckenstein

Brahmer

Nivolumab versus docetaxel in advanced nonsquamous non-small-cell lung cancer

N Engl J Med 2015 10 22 373 17 1627 39

10.1056/NEJMoa1507643

26412456

PMC5705936

Brahmer

Reckamp

Baas

Crinò

Eberhardt

Poddubskaya

Antonia

Pluzanski

Vokes

Holgado

Waterhouse

Ready

Gainor

Arén Frontera

Havel

Steins

Garassino

Aerts

Domine

Paz-Ares

Reck

Baudelet

Harbison

Lestini

Spigel

Nivolumab versus docetaxel in advanced squamous-cell non–small-cell lung cancer

N Engl J Med 2015 07 09 373 2 123 35

10.1056/nejmoa1504627

Garon

Ciuleanu

Arrieta

Prabhash

Syrigos

Goksel

Park

Gorbunova

Kowalyszyn

Pikiel

Czyzewicz

Orlov

Lewanski

Thomas

Bidoli

Dakhil

Gans

Kim

Grigorescu

Karaseva

Reck

Cappuzzo

Alexandris

Sashegyi

Yurasov

Pérol

Ramucirumab plus docetaxel versus placebo plus docetaxel for second-line treatment of stage IV non-small-cell lung cancer after disease progression on platinum-based therapy (REVEL): a multicentre, double-blind, randomised phase 3 trial

Lancet 2014 08 23 384 9944 665 73

10.1016/S0140-6736(14)60845-X

24933332

S0140-6736(14)60845-X

Herbst

Baas

Kim

Felip

Pérez-Gracia

Han

Molina

Kim

Arvis

Ahn

Majem

Fidler

de Castro

Garrido

Lubiniecki

Shentu

Dolled-Filhart

Garon

Pembrolizumab versus docetaxel for previously treated, PD-L1-positive, advanced non-small-cell lung cancer (KEYNOTE-010): a randomised controlled trial

Lancet 2016 04 09 387 10027 1540 50

10.1016/S0140-6736(15)01281-7

26712084

S0140-6736(15)01281-7

Hampson

Towse

Dreitlein

Henshall

Pearson

Real-world evidence for coverage decisions: opportunities and challenges

J Comp Eff Res 2018 12 7 12 1133 43

10.2217/cer-2018-0066

30411972

Katkade

Sanders

Zou

Real world data: an opportunity to supplement existing evidence for the use of long-established medicines in health care decision making

J Multidiscip Healthc 2018 11 295 304

10.2147/JMDH.S160029

29997436

jmdh-11-295

PMC6033114

Berger

Daniel

Frank

Hernandez

McClellan

Okun

Overhage

Platt

Romine

Tunis

Wilson

A framework for regulatory use of real-world evidence

Duke Margolis Center for Health Policy 2017 9 13

2021-03-03

https://healthpolicy.duke.edu/sites/default/files/2020-08/rwe_white_paper_2017.09.06.pdf

Abernethy

Arunachalam

Burke

McKay

Cao

Sorg

Carbone

Real-world first-line treatment and overall survival in non-small cell lung cancer without known EGFR mutations or ALK rearrangements in US community oncology setting

PLoS One 2017 12 6

10.1371/journal.pone.0178420

28644837

PONE-D-16-35800

PMC5482433

Bittoni

Arunachalam

Camacho

Zhong

Lubiniecki

Carbone

Real-world treatment patterns, overall survival, and occurrence and costs of adverse events associated with first-line therapies for medicare patients 65 years and older with advanced non-small-cell lung cancer: a retrospective study

Clin Lung Cancer 2018 09 19 5 629 45

10.1016/j.cllc.2018.04.017

29885945

S1525-7304(18)30099-8

Davies

Patel

Gridelli

de Marinis

Waterkamp

McCusker

Real-world treatment patterns for patients receiving second-line and third-line treatment for advanced non-small cell lung cancer: a systematic review of recently published studies

PLoS One 2017 12 4

10.1371/journal.pone.0175679

28410405

PONE-D-16-50323

PMC5391942

Foster

Sher

Rusthoven

Verma

Spiotto

Weichselbaum

Koshy

Overall survival according to immunotherapy and radiation treatment for metastatic non-small-cell lung cancer: a National Cancer Database analysis

Radiat Oncol 2019 01 28 14 1 18

10.1186/s13014-019-1222-3

30691492

10.1186/s13014-019-1222-3

PMC6348608

Hess

Louder

Winfree

Zhu

Oton

Nair

Factors associated with adherence to and treatment duration of erlotinib among patients with non-small cell lung cancer

J Manag Care Spec Pharm 2017 06 23 6 643 52

10.18553/jmcp.2017.16389

28530522

Horn

Bauml

Forde

Davis

Myall

Sasane

Dalal

Culver

Wozniak

Baik

Mutebi

Zhang

Wakelee

Johnson

Real-world treatment patterns and survival of patients with BRAF V600-mutated metastatic non-small cell lung cancer

Lung Cancer 2019 02 128 74 90

10.1016/j.lungcan.2018.12.003

30642457

S0169-5002(18)30684-6

Appius

Pattipaka

Feyereislova

Cassidy

Ganti

Real-world management of patients with epidermal growth factor receptor (EGFR) mutation-positive non-small-cell lung cancer in the USA

PLoS One 2019 14 1

10.1371/journal.pone.0209709

30608948

PONE-D-18-28027

PMC6319739

Nadler

Espirito

Pavilack

Boyd

Vergara-Silva

Fernandes

Treatment patterns and clinical outcomes among metastatic non-small-cell lung cancer patients treated in the community practice setting

Clin Lung Cancer 2018 07 19 4 360 70

10.1016/j.cllc.2018.02.002

29576407

S1525-7304(18)30021-4

Ryan

Skinner

Fernandes

Punekar

Pavilack

Walker

VanderWalde

Real-world treatment patterns among patients with unresected stage III non-small-cell lung cancer

Future Oncol 2019 09 15 25 2943 53

10.2217/fon-2018-0939

31037966

Simeone

Nordstrom

Patel

Klein

Treatment patterns and overall survival in metastatic non-small-cell lung cancer in a real-world, US setting

Future Oncol 2019 10 15 30 3491 502

10.2217/fon-2019-0348

31497994