Deep Learning-Based Assessment of Built Environment from Satellite Images and Cardiometabolic Disease Prevalence

Zhuo Chen, Jean Eudes Dazard, Yassin Khalifa, Issam Motairek, Catherine Kreatsoulas, Sanjay Rajagopalan, Sadeer Al-Kindi

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Importance: Built environment plays an important role in development of cardiovascular disease. Large scale, pragmatic evaluation of built environment has been limited owing to scarce data and inconsistent data quality. Objective: To investigate the association between image-based built environment and the prevalence of cardiometabolic disease in urban cities. Design, Setting, and Participants: This cross-sectional study used features extracted from Google satellite images (GSI) to measure the built environment and link them with prevalence of cardiometabolic disease. Convolutional neural networks, light gradient-boosting machines, and activation maps were used to assess the association with health outcomes and identify feature associations with coronary heart disease (CHD), stroke, and chronic kidney disease (CKD). The study obtained aerial images from GSI covering census tracts in 7 cities (Cleveland, Ohio; Fremont, California; Kansas City, Missouri; Detroit, Michigan; Bellevue, Washington; Brownsville, Texas; and Denver, Colorado). The study used census tract-level data from the US Centers for Disease Control and Prevention's 500 Cities project. The data were originally collected from the Behavioral Risk Factor Surveillance System that surveyed people 18 years and older across the country. Analyses were conducted from February to December 2022. Exposures: GSI images of built environment and cardiometabolic disease prevalence. Main Outcomes and Measures: Census tract-level estimated prevalence of CHD, stroke, and CKD based on image-based built environment features. Results: The study obtained 31786 aerial images from GSI covering 789 census tracts. Built environment features extracted from GSI using machine learning were associated with prevalence of CHD (R2= 0.60), stroke (R2= 0.65), and CKD (R2= 0.64). The model performed better at distinguishing differences between cardiometabolic prevalence between cities than within cities (eg, highest within-city R2= 0.39 vs between-city R2= 0.64 for CKD). Addition of GSI features both outperformed and improved the model that only included age, sex, race, income, education, and composite indices for social determinants of health (R2 = 0.83 vs R2= 0.76 for CHD; P <.001). Activation maps from the features revealed certain health-related built environment such as roads, highways, and railroads and recreational facilities such as amusement parks, arenas, and baseball parks. Conclusions and Relevance: In this cross-sectional study, a significant portion of cardiometabolic disease prevalence was associated with GSI-based built environment using convolutional neural networks..

Original languageEnglish (US)
Pages (from-to)556-564
Number of pages9
JournalJAMA Cardiology
Volume9
Issue number6
DOIs
StatePublished - Jun 12 2024

ASJC Scopus subject areas

  • Cardiology and Cardiovascular Medicine

Fingerprint

Dive into the research topics of 'Deep Learning-Based Assessment of Built Environment from Satellite Images and Cardiometabolic Disease Prevalence'. Together they form a unique fingerprint.

Cite this