Spatiotemporal Analysis of Lung Cancer Incidence and Case Fatality in Villa Clara Province , Cuba

INTRODUCTION
Cancer has historically been a main cause of death in Cuba, with lung cancer the number one cause of cancer death in both sexes. Cancer morbidity and mortality rates are the basic measures of cancer impact in the community. Cancer mortality has been one of the major applications of geographic analysis and has made important progress in recent decades thanks to access to mortality statistics and to development and availability of geographic information systems. Cuba does not have a strong tradition of etiologic research using spatial analysis. High levels of lung cancer morbidity and mortality in Villa Clara and growing interest in spatial analysis as an epidemiologic tool motivated this study.


OBJECTIVE
To identify spatial and/or spatiotemporal clusters of lung cancer morbidity and case fatality in the province of Villa Clara, and to demonstrate the value of cluster analysis as an epidemiologic tool.


METHODS
Descriptive observational study based on administrative data, using the technique of space-time scan statistics. The study focused on new cases diagnosed in 2004 and case-fatality for those cases through 2009. Variables used were: cases diagnosed, deaths, date of diagnosis, date of death, municipality and Cartesian geocoding for each municipality.


RESULTS
The study identified significant spatial and spatiotemporal clusters of greater than expected lung cancer incidence (municipalities of Encrucijada, Camajuaní, Cifuentes, Sagua la Grande, Caibarién and Santa Clara) and case fatality (Encrucijada, Camajuaní, Cifuentes, Sagua la Grande, Caibarién, Santa Clara, Placetas and Manicaragua).


CONCLUSIONS
Although the results are not explanatory, the spatial and spatiotemporal patterns of excess lung cancer risk and case-fatality can support hypothesis generation for research and eventual interventions for targeted prevention and management.


INTRODUCTION
Cancer is a leading cause of death globally; each year 9 million people are diagnosed with cancer and approximately 5 million die.[1] This high morbidity and mortality, and frequently unclear etiology, make cancer a major health problem.Among the main factors associated with rising cancer rates are smoking, in both men and women, and population aging.[1,2] The latter also makes cancer an urgent social and economic challenge, with more people reaching the ages of highest cancer risk, due to economic and social development and health system improvements.[3,4] In 2008, WHO reported Cuba's cancer mortality rate (age adjusted to the WHO world standard population) at 129 per 100,000 population, compared to 132 per 100,000 globally.Cuba ranked 103rd (highest to lowest) among the 193 countries included in the report, 6th in the Caribbean and 15th in the Americas (after Bolivia, Grenada, Peru, Uruguay, Jamaica, Antigua and Barbuda, Argentina, Paraguay, Honduras, Canada, Chile, Barbados, the United States and the Dominican Republic, in that order).[1] For over three decades, while cancer mortality trends varied, cancer was consistently the second cause of death for all ages in the Cuban population-surpassing cardiovascular disease to become fi rst in 2011-and the fi rst cause of years of potential life lost.[5,6] In 2010, the central Cuban province of Villa Clara had a population of 800,300 with a population density of 95.1 inhabitants per km 2 .The main industries were agriculture (sugar and tobacco), chemicals and metallurgy.The provincial cancer mortality rate (age adjusted to the 1981 Cuban census population) in 2009 was 201.3 per 100,000 population; this was above the national average (189.7 per 100,000 population), and surpassed the mortality rate from cardiovascular diseases (194.6 per 100,000 population), making Villa Clara one of the provinces with the highest cancer burden.[7] Lung cancer is the most frequent cancer in Cuban men and the third most frequent in Cuban women; it is the fi rst cause of cancer death in both sexes, both nationally and in Villa Clara where mortality rates in 2009 were 54.7 and 21.6 per 100,000 population for men and women, respectively.[7] Air pollution is a serious environmental problem worldwide, regardless of level of economic development.It differs from other types of pollution since once contaminants are in the air, exposure cannot be easily avoided.Indoor air pollution (in houses, work centers, public places) presents many challenges; it is sometimes worse than outdoor air pollution.[8,9] Household air pollution is a major concern of researchers around the world, due to the prolonged and continuous exposure; a typical individual spends at least 50% of his/her time in the home.It has become the main source of exposure for children.[10,11] Air contaminants are more common in the city than in the countryside, due to industrial waste emissions; motor vehicle exhaust; sulphur dioxide, one of the most important carcinogenic reducing agents; pavement tar particles; and radioactive particles from naturally occurring radon gas decay (the main cause of lung cancer in nonsmokers).[12] Some 75.8% of Villa Clara's population reside in urban settings.[7] Although most authors acknowledge the existence of an urban factor in lung cancer incidence, the main culprit by far is smoking.[12][13][14][15] Tobacco use is responsible for high cancer incidence and mortality in Cuba, contributing to 15% of all deaths in 1995 and 18% in 2007, surpassing attributable mortality reported in the adult population globally (12%) and in countries such as Canada (16.6%), Paraguay (13.4%) and France (12%).Tobacco use caused 86% of the 1209 lung cancer deaths in Cuba in 2007.Seven out of every ten tobacco-associated cancer deaths were in

Original Research
men, 64-65% of these from tracheobronchial-pulmonary cancer; in women, 65% of smoking-related deaths were also from this type of cancer.[16] Environmental tobacco smoke (ETS), also known as secondhand smoke, has been classifi ed by the International Agency for Research on Cancer as a Group 1 carcinogen, that is, there is convincing evidence that it produces cancer in humans.[17] A non-smoker's risk of contracting lung cancer has been shown to increase 20%-30% when exposed to ETS.ETS also produces an increase in all-cause death rates, as well as changes in the immune system.[17][18][19][20] Among the 41 countries that reported data on population exposure to ETS in 2002, Cuba ranked in the top fi ve, and in the top 24 in a 2006 report on 82 countries.[21,22] In 2001 in Villa Clara, 45%-49.9% of families were exposed to ETS.In Cuba, research indicates that persons with higher educational levels may tend to underestimate ETS risk, constituting failed responsibility for protection of their own and others' health.The extent of this problem in Cuba denotes a knowledge gap that could compromise future population health outcomes in the country.[23] Cancer morbidity and mortality rates are indicators most frequently used in surveillance of cancer and lung cancer in Cuba.Variables considered risk factors (smoking, ETS exposure, work in certain industries, family history, etc.) may cluster in certain population groups located in certain places and at certain times.Thus, a population group's lifestyle and behaviors can determine exposure patterns, concentrating or excluding risk factors.Spatial aggregation or clustering can provide evidence of shared risk factors, and identify regions where strong patterns predominate of common ethnicity, eating habits, soil type, agricultural activities, climate, social organization, inter alia.[24] For population health problems, characterizing their geographic and spatial differentiation reveals patterns that can provide important clues to causation and identifi cation of systemic problems that may require intervention.Integrating such information into the public health surveillance system can offer important evidence for health service management and decisionmaking.Thus, consistent study of health problems' geographic distribution and assessment of inequalities in environmental and population living conditions is an additional tool for rational resource allocation and for achieving greater equity in health and all public policies.[25] Viewing geography and its spatial heterogeneity as context also helps link the conceptual framework of the health-illness process to how it plays out in reality.
The importance of identifying high-risk areas for intervention explains the current keen interest among epidemiologists in spatiotemporal distribution of diseases, and in particular, cluster analysis.A cluster is defi ned as the occurrence of a signifi cantly greater number of cases than expected in a specifi c geographic area or specifi c time period, or both.Epidemiological demonstration of a cluster does not necessarily imply causal association between the possible risk factor and the disease.An etiologic study is required to confi rm association when evidence indicates ill persons have been exposed to a common risk factor.[26] Analyzing complex spatial and temporal patterns related to risk factors and disease has been used by several researchers in the Americas, with encouraging results.[26][27][28][29][30][31][32][33][34][35][36] It has been infrequently applied to case fatality; and Cuba does not have a strong tradition of etiologic cancer research using spatial analysis.
We used spatial analysis to examine patterns of lung cancer morbidity and case fatality in Villa Clara Province to identify spatial and spatiotemporal clusters and demonstrate utility of this technique as an epidemiologic tool.

METHODS
A descriptive observational study was conducted in Villa Clara Province using a statistical scanning software program to explore spatial and spatiotemporal distributions of lung cancer.
The study reviewed records of patients residing in the province of Villa Clara aged >15 years who were diagnosed with lung cancer (387) during the period from January 1 to December 31, 2004; and among these, deaths occurring through 2009 (311).
Data source and analysis Data on lung cancer incidence in 2004 and deaths in these cases from 2004 through 2009 obtained from databases of the Provincial Health Statistics Bureau, Villa Clara.Incidence rates were calculated per 10,000 population using the number of lung cancer cases diagnosed in 2004 and the census population reported for that year.Case fatality was calculated as percentage of patients diagnosed in 2004 who died before the close of 2009.
To generate records required by the program on geographic coordinates, populations and cases, various databases were used: • Information on lung cancer cases diagnosed in 2004, and the outcome of these through study termination on December 31, 2009, from Villa Clara's Provincial Health Statistics Bureau; • Information on total population from the National Statistics Bureau; [7] • Records on geocoding (Cartesian) of the province's municipalities from MapInfo v. 8.5.
To determine the existence of clusters by spatial analysis unit, Kulldorff's space-time scan statistic (SaTScan Program v. 7.01) was used.[37] The main advantages of this statistical tool include: it prevents selection bias; for each cluster a unique value of p is obtained, which is adjusted by the multiple tests derived from the large quantity of potential clusters evaluated; if the null hypotheses is rejected, the methodology indicates the approximate cluster site responsible.
Each type of analysis has a different purpose: spatial analysis disregards time and detects anomalous concentrations in space, while spatiotemporal analysis detects clusters coinciding in space and time.The space-time scan statistical method explained in detail by Kulldorff creates a type of circular window (space) or cylindrical window (space-time) with a circular base of varying radius, corresponding to a geographical area; the cylinder's height corresponds to time.These are centered

Peer Reviewed
Original Research around one of many possible centroids in the study, with each circle or cylinder representing a possible cluster.Each cylinder has a specifi c number of observed cases and the program calculates the number of expected cases under the null hypothesis for the total risk population.[38] We set α at 0.05 for rejecting the null hypothesis of 'no cluster.' Relative risk (RR) estimates reported denote risk of incidence of cancer or mortality from cancer in the cluster, compared with risk in the rest of the province.Tables were created to display clusters with a statistically signifi cant (p <0.05) excess of observed over expected cases, under the hypothesis of a Poisson distribution and using the likelihood ratio for parameter estimation.Cartographic representations were made with MapInfo v. 8.5, importing statistical results into the geographic information system.Municipalities forming part of signifi cant spatiotemporal clusters were colored and those in spatial clusters identifi ed by a star.
Spatial analysis Municipalities at highest risk are displayed in Figure 1.A maximum-likelihood spatial cluster was identifi ed that included the municipalities of Encrucijada, Camajuaní, Cifuentes, Sagua and Caibarién, with 133 cases observed, compared to 106.5 expected (p = 0.044).
Spatiotemporal analysis A signifi cant spatiotemporal cluster was detected that included the municipalities of Cifuentes, Encrucijada, Sagua la Grande and Santa Clara.During December 2004, there were 32 cases observed versus 14.5 expected (p = 0.030), that is, more than twice the risk observed in the rest of the province (Figure 1).
Spatial analysis A single spatial cluster was found that included the municipalities of Encrucijada, Camajuaní, Cifuentes, Sagua la Grande and Caibarién, with a relative risk of 1.42 (Figure 2).

Spatiotemporal analysis
The same municipalities as in the spatial cluster (Encrucijada, Camajuaní, Cifuentes, Sagua la Grande and Caibarién) formed a spatiotemporal cluster for the period of March  2).

DISCUSSION
One constraint in this investigation was the lack of similar studies in the national literature for comparison purposes.However, several international studies have applied this method to lung cancer.One was conducted by Fukuda, who identifi ed the principal incidence clusters in Japan's metropolitan regions, related primarily to habits and socioeconomic conditions.[39] Another study conducted in Costa Rica identifi ed high-incidence clusters of bronchopulmonary cancer in the country's northeast and Atlantic regions, implying a relationship with major risk factors such as smoking and environmental pollution.[26] Incidence The lung cancer incidence clusters identifi ed in this study indicate the need for further research on risk factors-mainly tobacco use, which constitutes the most signifi cant etiological agent.A study conducted by Varona reported that some 23% of all deaths from cancer in Cuba in 2010 were due to lung cancer, of which 85% were attributed to smoking (92% in men and 75% in women).[40] An analytical cross-sectional epidemiological study in Matanzas Province, Cuba found that patients who smoked were four times more likely to die from lung cancer.[41] Population aging is one of the most important contributors to rising numbers of cancer cases, and Villa Clara Province has the oldest population in the country (18.2% aged ≥60 years).[42] All of the province's municipalities are among the 40 oldest in the country (of a total of 169 municipalities).[7] Placetas, Cifuentes, Camajuaní and Encrucijada have the oldest populations in Villa Clara and are also where higher incidence and case fatality clusters were identifi ed in this study.
Along with exposure to risk factors, genetic mutations predisposing to lung cancer may increase the probability of developing the disease.There is a hypotheses on varying susceptibility to carcinogens, based on an individual's ability to form genotoxic intermediates, detoxify these intermediates and repair the damage to DNA, all inheritable genetic features that would affect potential for suffering lung cancer and other cancers as well.[43] Family aggregations can also be explained by the fact that individuals in the same family tend to have similar habits, such as smoking, drinking, diet and occupation, hence the value of studying clusters.[44] The spatiotemporal incidence cluster in December 2004 could be due to year-end reporting procedures or other factors that warrant further examination in future studies.With a disease of such long latency, a biological explanation is implausible.
Case fatality Spatial analysis revealed close geographic overlap of incidence and case fatality (see Figures 1 and 2).However, case fatality patterns do not depend on incidence but are infl uenced more by stage at diagnosis and by treatment applied, both of which can affect patient survival.
Both spatiotemporal case fatality clusters were within two years of diagnosis, consistent with lung cancer's established poor prognosis, one of the worst for any cancer.It is asymptomatic over most of its evolution and, in most patients, by time of diagnosis, the disease is already well advanced and nonresponsive to most treatments.[40,45] Clearly screening and close examination of atrisk groups in primary health care are essential.This can overcome obstacles at various levels: individual (such as awareness), primary care (identifi cation of at-risk individuals, streamlining of diagnosis and prompt referral to specialized care), and secondary care (prompt application of clinical protocol for diagnosed cases).[46] A study in Villa Clara Province found a direct association between patient survival and specifi c aspects of treatment: early tumor stage at diagnosis, morphologic and histologic diagnosis, and diagnostic methods used.[47] The fi nding that Placetas Municipality did not form part of any incidence clusters but did appear in a spatiotemporal case fatality cluster merits further study, as it could suggest health system factors at play, such as delays in diagnosis or errors in treatment protocols.
The importance of this type of study resides in its ability to raise questions and provide clues that may lead to causal explanations further down the road.Use of this tool in the surveillance system can help increase health equity and system effi ciency by focusing medical services where they are most needed, and of course can help set priorities for research.

CONCLUSIONS
Application of the SaTScan analytical method revealed signifi cant spatial and spatiotemporal lung cancer clusters, in both incidence and case fatality.While our fi ndings are not explanatory, they do identify areas at higher risk and thus can support generation of hypotheses for research and eventual intervention.
Efforts are needed to determine factors contributing to the observed excess risk in areas of Villa Clara, to prevent the disease, initiate earlier detection and treat it more effectively, as well as to identify systemic factors that could account for differences in case fatality.Bringing about a major change in lung cancer burden will require improvements across the entire spectrum of prevention, early diagnosis and treatment.
It would also be valuable to extend application of this tool to epidemiological surveillance systems in other health jurisdictions in Cuba, to deepen epidemiological exploration of population health status, identify possible contextual determinants to facilitate better risk factor control, and seek solutions in health systems and services to problems identifi ed through cluster detection.Such an investment would strengthen health and disease surveillance and provide a valuable addition to Cuba's universal health care system.