Risk factors for SARS-CoV-2 infection: a test-negative case-control study with additional population controls in Norway

STRENGTHS AND LIMITATIONS OF THIS STUDY

  • The test-negative design can reduce confounding from healthcare-seeking bias because PCR-negative (PCR−) controls are likely to have similar healthcare-seeking attitudes as PCR-positive (PCR+) cases.

  • This study mostly included non-hospitalised patients, and the findings can be generalisable to the general population.

  • The use of an additional control group from the general population for comparison with the PCR+ and PCR− participants (triangulation) strengthens the study inferences by adding two more dimensions of comparison.

  • In the subgroup analyses, PCR+ cases and PCR− controls were compared with the population controls to assess the risk factors for those aged 18–55 years. Hence, the results may not be generalisable to patients older than 55 years.

  • PCR test results, rather than symptoms, were used to categorise the participants into cases or controls, and therefore risk factors for SARS-CoV-2 infection and not COVID-19 disease were assessed.

Introduction

Understanding the risk factors for SARS-COV-2 infection is essential for the prevention of new waves of COVID-19, developing new vaccination strategies and in preparation for future pandemics. The clinical spectrum of COVID-19 varies from asymptomatic SARS-CoV-2 infection to mild pneumonia and may lead to serious respiratory illness and death. Various studies have explored the risk of COVID-19 severity and mortality, but only a few studies have assessed the risk of SARS-CoV-2 infection, which can be asymptomatic and mild. Although some risk factors for SARS-CoV-2 infection and COVID-19 have been identified, findings have been conflicting.1–4 Particularly, the association between smoking status, obstructive lung diseases, including chronic obstructive pulmonary disease (COPD) and asthma, and the risk of SARS-CoV-2 infection and development of severe COVID-19, has shown varying results in the studies.3 5–12 Possible explanations for the conflicting results can be the different study designs, varying selection methods for cases and controls, different risk factors for severe disease and asymptomatic and mild infection, small sample size and geographical location for the studies.

Diabetes mellitus is an important risk factor for both disease severity and hospital mortality.1 3 13 In a meta-analysis of adults hospitalised in 11 countries, overweight and patients with diabetes were more likely to require respiratory support.13 Another meta-analysis showed that obesity was associated with both COVID-19 susceptibility and severity.14 Among the 50 most affected countries, obesity increased both susceptibility for SARS-CoV-2 infection and mortality.4 Hypertension has also been linked with COVID-19 severity, but a meta-analysis early in the pandemic showed no association of hypertension and susceptibility for SARS-CoV-2 infection.2 In addition, older age and male sex were associated with increased mortality.2 6 Air pollution can be a risk factor for upper and lower respiratory tract diseases. There are few previous studies that report association of concentrations of particulate pollutants in cities and COVID-19 incidence.15 To our knowledge, no studies have investigated environmental factor, such as air pollution from wood-fired heating, as potential risk factor for SARS-CoV-2 infection.15

Most SARS-CoV-2-infected individuals have mild symptoms and are not hospitalised, but few studies assess these patients.2 4 16 To date, most studies have been retrospective, designed as traditional case–control studies and cohort studies involving hospitalised patients with severe COVID-19, and have demonstrated substantial heterogeneity among findings.1 5–7 10 17 18

Therefore, this study aimed to determine the association between risk factors for SARS-CoV-2 PCR test positivity, by combination of three case–control study designs. We compared individuals with PCR positive (PCR+) and PCR negative (PCR−) tests as part of a test-negative design (TND) case–control study and each of them with additional population controls.19–21

Methods

Study design and setting

We designed a TND case–control study with additional population controls. TND differs from the classical case–control study in that the controls are defined by a negative test result and not sampled from a wider source population.19–21 The additional population control group makes it possible to assess risk factors for both the PCR test positivity and the PCR test negativity by a triangulation approach. PCR-negative participants have other infections than SARS-CoV-2. With this design, it is possible to distinguish between exposures that are combined risk factors for both SARS-CoV-2 infection and other respiratory infections, and risk factors that are specific for SARS-CoV-2 infection.21 This study design can also reduce potential bias resulting from differences in health care-seeking attitude between cases and controls.19 Participants were defined as ‘cases’ or ‘controls’ based on their PCR+ and PCR- test results, respectively. We used the first PCR test result of each participant. The participants were recruited from the counties of Agder and Telemark in South-Eastern Norway from February to December 2020 during the first and second waves of COVID-19 pandemic, and when the SARS-CoV-2 Alpha variant was dominant.

SARS-CoV-2 PCR+ and PCR– participants in our geographical area were first identified from results lists at the test centres and hospital laboratories. Eligible participants were then contacted by telephone by test centre or hospital, and invited to participate in the research project. On verbal agreement to participate, patients were invited to the hospital laboratory by the research team 3–5 months after PCR tests. At this appointment, written information about the project was provided. After the consent form was signed, the questionnaire was filled in. The population control group data were obtained from the pre-existing Telemark study dataset which included a random sample of 14 509 participants, aged 21–55 years, residing in Telemark, Norway in 2018.22

The inclusion criteria were as follows: (1) adults aged ≥18, (2) SARS-CoV-2 RT-PCR test result and (3) resident of South-Eastern Norway, specifically Agder and Telemark counties, during the inclusion period. Participants who were unable to answer the questionnaire, which was conducted in Norwegian, were excluded. PCR tests were used for inclusion as they are the gold standard for detection of SARS-CoV-2.23

Our first study is a TND study, including mainly non-hospitalised patients. Our second and third studies are case–control studies using additional population controls as a control group.

In the first study (hereafter study I), we compared risk factors for 400 SARS-CoV-2 PCR+ individuals (cases) with risk factors for 719 SARS-CoV-2 PCR– individuals (controls) in a classic TND. The controls in this design are ‘other patient’ controls who undergo the same PCR tests for same reasons and at the same healthcare facility as cases, but test negative.20

In the second traditional case–control study (hereafter study II), we compared risk factors for a subgroup of 286 SARS-CoV-2 PCR+ individuals (aged 18–55 years) with risk factors for the population control group (N=14 509, aged 21–55 years). Given that Telemark study dataset was collected more than 1 year before the pandemic, it was assumed to be PCR-negative for SARS-CoV-2.

In the third case–control study (hereafter study III), we compared risk factors for other infections than SARS-CoV-2 for a subgroup of 502 PCR- participants (aged 18–55 years) with the risk factors for the population study group from the Telemark study (N=14 509, aged 21–55 years). Study II and III were restricted to these age groups because the Telemark study had participants up to 55 years.

The extensive restrictions for risk groups with non-pharmaceutical interventions during lockdowns in Norway did not differ from the restrictions for the general population in 2020. Norway locked down on 12 March 2020 until summer 2020. These restrictions were partly eased during the summer months, but reinstated for all residents from autumn 2020. Risk groups for more serious COVID-19 were defined as people aged >65 years, age <65 with comorbidities such as diabetes, overweight and heart disease. The official Norwegian testing criteria for SARS-CoV-2 changed over time but were the same for the PCR+ and PCR– participants in the study period. In the first wave of the pandemic, PCR testing was restricted to symptomatic patients. In the second wave, PCR testing was additionally applied to close contacts and asymptomatic individuals during the outbreaks.

Participants were included regardless of their symptoms. Only 58 participants (5%) in the PCR+ cases and PCR– controls were asymptomatic. Most PCR+ cases had mild symptoms, with only 22 (6%) participants hospitalised.24 At the inclusion period, it was not possible to know how the pandemic waves would develop. We aimed to include 400 eligible PCR+ cases, and two PCR– controls per case matched for test time and geographical location to increase the power of the study.

We used strengthening the reporting of observational studies in epidemiology (STROBE) case–control reporting guidelines for our study.25

Questionnaire design

We used questions from the Norwegian Health Institute COVID-19 questionnaire and the questionnaire data from the Telemark study questionnaire,22 26 27 in addition, a few questions were provided by the study group. The questionnaire consisted of questions related to (1) education status, (2) smoking habits, (3) respiratory symptoms and/or diseases, such as asthma or COPD, (4) comorbidities, (5) exercise and (6) environmental exposure to air pollution from traffic or wood-fired heating. Questions are shown in online supplemental table S1.

Supplemental material

Statistical analysis

The mean, SD and median were reported for continuous variables, as appropriate. Categorical data were reported as frequencies and percentages. We used categorical variables for the adjustment. We followed the strategy proposed by Greenland et al28 and adjusted for all potential confounders, while checking for multicollinearity. Since there was no evidence that this was occurring, all potential confounders were retained in the final model. Wood heating and diabetes status were considered to be potential confounders; wood heating is potentially associated with respiratory symptoms, and diabetes is potentially associated with healthcare seeking behaviour. Both education and income were included in the regression models, but income did not impact the estimation. Hence, we used only education as a predictor for the socioeconomic status. Logistic regression models were used to assess the possible risk factors. We used a one-step regression analysis for each variable. All the variables considered had some a priori evidence that they could be potential risk factors for COVID-19 infection. Thus, we considered that the problem of multiple comparisons did not apply. Therefore, we did not adjust for multiple comparisons.

To determine the association between risk factors and PCR test positivity, ORs were reported with 95% CIs. Statistical analyses were performed using R (V.4.2; R Core Team, Vienna, Austria).

Our questionnaire had a low rate of missing data, ranging from 0% to 6.2% for each question, except for those related to smoking habits, which had 11.2% (N=45) and 15.3% (N=110) missing data for PCR+ and PCR− participants, respectively. Due to the subsequent follow-up questions, the questionnaire used in the population control group had no missing data for questions related to smoking habits, asthma, COPD, diabetes, hypertension and wood heating. There were no missing for the question about wood heating in our study due to the subsequent follow-up questions. The remaining questions among the population controls had a low rate of missing data, ranging from 1.8% to 6.2%, except for exercise where 13.5% of data was missing. A sensitivity analysis was performed excluding participants with missing values for smoking and the results were not impacted. We did not perform data imputation.

Patient and public involvement

According to the Norwegian National Guidelines for User Involvement in Health Research in May 2018, two user representatives of SARS-CoV-2 PCR-positive patients were involved. They played an active role in all project phases, including the development and testing of questionnaires. The user representatives helped us understand the patient’s perspective, gave feedback on our study protocol, study methods, information and consent forms, and questionnaires, and participated actively in the dissemination of results achieved until now. All study results are also communicated via www.sthf.no/helsefaglig/forskning-og- innovasjon/forskningsprosjekter/covita and www.sshf.no/helsefaglig/forskning-og-innovasjon/covita-studien.

Results

Of 656 eligible PCR+ participants and 1812 eligible PCR− participants, 400 PCR+ cases and 719 PCR− controls were included. The study flow chart is shown in figure 1. The characteristics and comorbidities of the PCR+, PCR− and population controls are shown in table 1.

Table 1

Characteristics of the PCR+ cases and PCR− controls 3–5 months after PCR test and the population control group data from the pre-existing Telemark study dataset

Figure 1
Figure 1

Flow chart for study inclusion.

The PCR+ cases and PCR− controls had a mean age of 48±15 years and 47±14 years, respectively. Male participants represented 49% PCR+, 34% PCR− and 42% of the population controls. Asthma was present in 64 (16.0%) of the PCR+ group, 135 (18.8%) of the PCR− group and 1760 (12.1%) of the population controls.

In study I, risk factors for SARS-CoV-2 PCR-positive individuals (cases) were compared with risk factors for SARS-CoV-2 PCR-negative individuals (controls) in a classic TND. The results are presented in online supplemental table S2. Male sex was significantly associated with the risk for SARS-CoV-2 infection when comparing PCR+ cases and PCR− controls (OR 1.92, 95% CI 1.43 to 2.57). Age, body mass index (BMI), education level and comorbidities were not associated with SARS-CoV-2 infection. Daily or occasional smoking was negatively associated with SARS-CoV-2 infection (OR 0.50, 95% CI 0.31 to 0.81).

Supplemental material

Characteristics and comorbidities for the subgroup of the PCR+ cases (18–55 years) and PCR− controls (18–55 years) and population control group (21–55 years) are shown in online supplemental table S3. In study II, risk factors for SARS-CoV-2 PCR+individuals (aged 18–55 years) were compared first with risk factors for PCR− controls (aged 18–55 years) in a TND and then with the population control group (aged 21–55 years) in a traditional case−control study. The results are shown in online supplemental table S4. Comparison of PCR+ cases with population controls in study II revealed that exercising once a week (OR 2.02, 95% CI 1.35 to 3.05), 2–3 times a week (OR 1.47, 95% CI 1.01 to 2.19) and 4–7 days a week (OR 1.85, 95% CI 1.23 to 2.83), having asthma (OR 1.56, 95% CI 1.12 to 2.14) and using wood heating seldom (OR 4.25, 95% CI 3.07 to 5.86), 2‒3 times a week (OR 1.65, 95% CI 1.17 to 2.30) and daily during the winter season (OR 2.13, 95% CI 1.50 to 2.99) were associated with SARS-CoV-2 infection. Comparison of PCR+ cases with PCR− controls or with the population controls revealed that daily or occasional smoking (OR 0.48, 95% CI 0.28 to 0.79) and (OR 0.55, 95% CI 0.35 to 0.82), respectively, was negatively associated with SARS-CoV-2 infection. Hypertension was negatively associated with SARS-CoV-2 infection when PCR+ cases were compared with population controls (OR 0.37, 95% CI 0.19 to 0.65). Age, BMI and comorbidities were not associated with SARS-CoV-2 infection when comparing PCR+and PCR− controls.

Supplemental material

Supplemental material

In study III, risk factors for other infections than SARS-CoV-2 for PCR− participants (aged 18–55 years) were compared with risk factors for the population study group from the Telemark study (aged 21–55 years). The outcome of interest was non-SARS-CoV-2 infections. More than 95% of the PCR-negative participants had symptoms similar to SARS-CoV-2 infection. The results are shown in online supplemental table S5. BMI>30 (OR 1.54, 95% CI 1.17 to 1.99), past smoking (OR 1.39, 95% CI 1.13 to 1.73) and asthma (OR 1.70, 95% CI 1.34 to 2.16) were associated with PCR negativity in the study III when comparing PCR-negative participants with the population control group. Daily wood heating in the winter season was also associated with PCR negativity (OR 3.42, 95% CI 2.66 to 4.44).

Supplemental material

ORs from the three different case−control studies are summarised in table 2.

Table 2

Risk factors for SARS-CoV-2 infection and non-SARS-CoV-2 infection, adjusted OR* (ORadj) (95% CI) from three case–control studies: study I (PCR+ cases vs PCR− controls), study II (PCR+ cases vs population controls), study III (PCR− vs population controls)

Male sex was associated with the risk of SARS-CoV-2 infection in study I (OR 1.92, 95% CI 1.43 to 2.57). Smoking was negatively associated with SARS-CoV-2 infection in study I (OR 0.50, 95% CI 0.31 to 0.81) and in study II (OR 0.55, 95% CI 0.35 to 0.82), respectively. BMI>30 (OR 1.54, 95% CI 1.17 to 1.99), past smoking (OR 1.39, 95% CI 1.13 to 1.73) and asthma (OR 1.70, 95% CI 1.34 to 2.16) were associated with PCR negativity in study III. Wood heating was associated with SARS-CoV-2 infection in study II (OR 2.13, CI 95% 1.50 to 2.99) for daily use in the winter season and it was associated with PCR negativity in study III (OR 3.42, 95% CI 2.66 to 4.44).

Discussion

In study I (TND), we identified the male sex as a risk factor for SARS-CoV-2 infection. In addition, smoking was negatively associated with SARS-CoV-2 infection in both studies I and II (PCR+ vs population controls) analyses. No evidence of association was found between asthma and SARS-CoV-2 infection in study I, but there was a positive association in study II. COPD showed no association with SARS-CoV-2 infection in both studies I and II. While exercising and wood heating during the winter were highlighted as possible risk factors for SARS-CoV-2 infection in study II, this was not the case in study I.

Male sex was associated with SARS-CoV-2 infection, which is in line with previous studies.29 In a systematic review and meta-analysis, a higher ratio of SARS-COV-2 infection in males than females (100:82.5) was reported.16 29 A meta-analysis also showed a higher risk for SARS-CoV-2 infection in men than that in women, with a relative risk of 1.08.16

In our study I (TND) and study II (PCR+ cases vs population controls), current smoking status was negatively associated with SARS-CoV-2 infection. This paradoxical finding is reflected in the literature, with many studies reporting discordant results depending on disease severity and other comorbidities associated with smoking.5 12 30 31 The lack of severe COVID-19 among our participants can have contributed to this result, with only 6% of our participants hospitalised. Moreover, the PCR– controls in our study had other common respiratory infections, which may be associated with smoking.12 Previous studies have shown that when infected with COVID-19, current smokers have worse outcomes than non-smokers.8 30 31 Given that we did not obtain data related to the pack-years or duration of smoking, our results should be interpreted with caution.

Asthma and COPD were not associated with SARS-CoV-2 infection in study I, which is comparable to studies from the early phase of the pandemic.5 9 This may be due to the willingness of individuals with asthma to be tested whenever they develop respiratory symptoms that could indicate COVID-19. Additionally, in study I-III, the PCR– participants had signs and symptoms of respiratory tract infections other than SARS-CoV-2 infection. Thus, asthma may still be associated with COVID-19, as well as with other respiratory tract infections. Interestingly, asthma was associated with SARS-CoV-2 infection in our study II with subgroup analysis comparing PCR+ cases with population controls. The reason for this finding is not clear; however, we observed a relatively high prevalence of asthma (16%) among the PCR+ cases in our study, but not COPD (1.5%). In many COVID-19 studies, a low prevalence of asthma (1%) and varying prevalence of COPD (2%–14%) for SARS-CoV-2 infected patients have been reported.5–7 9 Patients with chronic diseases may have been isolated more than others during lockdowns in some countries or regions. The potential protective immunity provided by therapies used to treat chronic respiratory diseases may also explain the low prevalence of COVID-19 among adults with asthma or COPD in some studies.5 7 9 In a study by Lacedonia et al, the prevalence of COPD and current smokers was low for SARS-CoV-2 infection, but when infected, these groups had the highest all-cause mortality.5 A nationwide Korean study showed that COPD was associated with an increased risk of COVID-19 susceptibility; however, the prevalence of COPD among severe COVID-19 patients or COVID-19 mortality did not increase, but smoking influenced COPD outcomes.10 The heterogeneity of these findings may be attributed to differences in disease severity, study design, such as the selection of controls, sample size and geographical location.

Our study demonstrated no association between age, BMI, diabetes, having a bedroom window close to a trafficked road and SARS-CoV-2 infection. Hypertension was inversely associated with SARS-CoV-2 infection in study II, which contradicts other studies.7 Obesity has been associated with COVID-19 susceptibility and severity,17 32 and is thought to be an important prognostic factor.4 14 17 32 33 Diabetes has also been proposed as a risk factor for developing severe COVID-19 and mortality.1 3 13 Given that our study mostly included patients with mild COVID-19 symptoms and few hospitalised participants, this may have contributed to the finding of no associations between these factors and SARS-CoV-2 infection. In the subgroup analysis (study II) comparing PCR+ cases and population controls, asthma, exercise and wood heating were possible risk factors for SARS-CoV-2 infection. However, given the possibility of selection bias due to differences in healthcare-seeking attitudes, these findings should be interpreted with caution. Analysing PCR+ cases with PCR− participants as controls in study I (TND) may have reduced this bias.

We obtained different results for asthma, exercise and wood heating in study I than in the study II, although findings for sex, age, smoking and COPD showed similar directions of association. However, the interpretation of how smoking habits affect the risk of SARS-CoV-2 infection requires further assessment owing to the limited study size.

This study had some limitations. First, the questionnaire was conducted in South-Eastern Norway during a period when the SARS-CoV-2 Alpha variant was dominant; therefore, these results may not be entirely representative of other countries or virus strains. However, Telemark and Agder have both rural and urban areas and are considered to represent Nordic populations. Second, we compared PCR+ cases and PCR− controls with population controls in study II and III to assess the risk factors for individuals aged 18–55 years; hence, the results of our subgroup analyses may not be generalisable among those >55 years. The data from the population controls were collected 2 years prior to the beginning of the pandemic. We cannot rule out the possibility that this has affected our results, although the time period is relatively short. Third, there is a possibility of recall bias due to the use of a self-reported questionnaire; however, questions included were comparable to other studies,22 34 including studies assessing COVID-19.26 27 Fourth, the study might not be generalisable to all migrant groups; still we included Norwegian-speaking migrants in the study. The Telemark study questionnaire from 2018 was also restricted to Norwegian-speaking participants in the same way. Fifth, confounding unknown factors are possible in all epidemiological studies. Theoretically, misclassification of controls in TND may be more likely than in classical case–control studies.21 Misclassification of cases was considered less likely due to the high sensitivity of PCR tests.35 36 We also confirmed a high specificity of the PCR tests with only few PCR– controls with positive antibodies in our previous study.24 Due to time constraints during the pandemic, the study protocol was not published.

With the TND, which is often used for vaccine studies, it is possible to identify risk factors that are specific for COVID-19 by adding population controls.21 37 38 Furthermore, in the traditional TND design, participants are included before the test results. In our study, all individuals who matched our inclusion criteria were recruited and defined as cases or controls regardless of symptoms and depending only on their PCR test results.39 40 However, we did not consider this as a limitation because the majority of the participants in our study had symptoms.

Healthcare-seeking attitude as a possible source of selection bias may be reduced with TND, as both groups have the same reason for testing.41 42 In contrast to traditional case–control studies, controls are tested for the disease under study and are those with negative test results without exception. Although the criteria for PCR testing changed, the differences in the groups due to the variation of testing strategies during the two pandemic phases were reduced because the PCR tests were matched for time and place. Population controls are useful to strengthen the study inferences by adding two more dimensions of comparison. As demonstrated in our study, the choice of test-negative controls or population controls can affect outcomes regarding risk factors for SARS-CoV-2 infection.

Overall, selecting appropriate study designs and combining all relevant information from studies assessing risk factors for SARS-CoV-2 infection and COVID-19 are vital for the prevention of new waves of COVID-19 and other pandemics in the future. In particular, findings from TND studies assessing risk factors may also contribute to the development of new vaccination strategies. Combined design with TND and additional population controls can be applied to future pandemics. However, further research is needed to address the evolution of virus variants, uptake of vaccination and differences in humoral and cellular protective immunity among risk groups.

Conclusion

Male sex was associated with the risk of SARS-CoV-2 infection only in the TND study, but not when PCR-positive cases were compared with population controls. Smoking was negatively associated with SARS-CoV-2 infection in both the TND study and when comparing PCR-positive cases to population controls. Several factors were associated with SARS-CoV-2 infection when PCR-positive cases were compared with population controls, but not in the TND study, highlighting the strength of combining different case–control study designs during the pandemic.

Data availability statement

Data are available on reasonable request. There are legal and ethical restrictions on sharing our dataset. Our dataset is not fully anonymised and has a relatively small sample size making identification possible. The potentially identifying patient information is age, birthdate, location and dates for PCR testes. However, data requests for the minimal dataset, which includes only the main variables of the final analyses, can be made to the Research department at the Telemark Hospital trust, Ulefossvegen 55, 3710 Skien, Norway email: [email protected].

Ethics statements

Patient consent for publication

Ethics approval

All participants provided written informed consent before inclusion. The Regional Committee for Medical and Health Research Ethics of Southeast Norway A (ID 146469), Norwegian Centre for Research Data (ID 533954), and data protection officers in the participating hospitals approved the study (ID 20-02553 and ID 20-06971).

Acknowledgments

The authors would like to thank Trude Belseth Sanden, Astrid Bjørkeid, June Bakstevold, Gølin Finkenhagen Gundersen, Emile van Gelderen, Elin Skjørvold Christensen, Louise Myrland, Signe Seljåsen, Mona Brekke, Siv Stigen, Anne Cecilie Tveiten, Oda Eikeland Myrnes and Siri Cathrine Rølland for their assistance with data collection and analysis. The authors would like to thank Magnus Tarangen for the assistance in editing the tables. The authors would also like to express their gratitude to all participants and user representatives involved in this study. We would like to thank Editage (www.editage.com) for English language editing.

This post was originally published on https://bmjopen.bmj.com