Abstract Identifying the main environmental drivers of SARS‐CoV‐2 transmissibility in the population is crucial for understanding current and potential future outbursts of COVID‐19 and other infectious diseases. To address this problem, we concentrate on the basic reproduction number R 0 , which is not sensitive to testing coverage and represents transmissibility in an absence of social distancing and in a completely susceptible population. While many variables may potentially influence R 0 , a high correlation between these variables may obscure the result interpretation. Consequently, we combine Principal Component Analysis with feature selection methods from several regression‐based approaches to identify the main demographic and meteorological drivers behind R 0 . We robustly obtain that country’s wealth/development (GDP per capita or Human Development Index) is the most important R 0 predictor at the global level, probably being a good proxy for the overall contact frequency in a population. This main effect is modulated by built‐up area per capita (crowdedness in indoor space), onset of infection (likely related to increased awareness of infection risks), net migration, unhealthy living lifestyle/conditions including pollution, seasonality, and possibly BCG vaccination prevalence. Also, we argue that several variables that significantly correlate with transmissibility do not directly influence R 0 or affect it differently than suggested by naïve analysis. Key Points Machine learning techniques are utilized to select the most influential environmental factors behind COVID‐19 transmissibility The country’s wealth/development level is identified as the main global predictor of SARS‐CoV‐2 spread in a population Other important factors are indoor space per person, unhealthy living determinants, spontaneous behavior change, and weather seasonality
【저자키워드】 principal component analysis, basic reproduction number, Feature selection, regression analysis, COVID‐19 environmental dependence, disease spread risk factors, 【초록키워드】 Diseases, social distancing, Infection, COVID‐19, SARS‐CoV‐2, Spread, Prevalence, Coverage, Migration, Transmissibility, Interpretation, determinants, Other, Frequency, Analysis, Contact, Human Development Index, BCG vaccination, Factor, high correlation, naïve, infection risks, machine, component, while, identifying, variable, Affect, GDP, approach, environmental factor, susceptible, principal, country, identify, significantly, absence, suggested, modulated, driver, Point, 【제목키워드】 SARS‐CoV‐2, Selection,