## EAAMO'24 Data Set Specifications


The **EAAMO'24** Social Hackathon dataset is organized into five sections:

- **Geostatistical Cartography:** Located in the **geometry** folder.
- **Industrial Emissions:** Located in the **emissions** folder.
- **Stationary Air Pollution Sources:** Located in the **stationary pollution sources** folder.
- **Stationary Fuel Emission Sources:** Located in the **stationary fuel pollution sources** folder.
- **Breast Cancer Data:** Located in the **bc_data** folder.

Here, we describe the contents of each section.

### 1. Geostatistical Cartography


The files in this folder contain México's cartographic data collated by the [Instituto Nacional de Estadística, Geografía e Informática](https://www.inegi.org.mx/). This data is encoded using the [National Geostatistical Framework](https://www.inegi.org.mx/temas/mg/), which is designed to correctly reference statistical information with the corresponding geographical locations; this is understood as the delimitation of the Mexican Republic into three levels of disaggregation, called Geostatistical Areas: AGEE, AGEM, and AGEB.

- **AGEE** stands for Área Geoestadística Estatal, which translates to State Geo-Statistical Area. These units are used to organize data collection and analysis at the state level. AGEEs are designed to facilitate collecting and analyzing data related to demographic, economic, social, and geographic characteristics at the state level. Each AGEE corresponds to one of the 32 states of Mexico, which include 31 states and Mexico City (the capital). AGEEs are assigned a consecutive code of two digits (01 to 32).

- **AGEM** stands for Área Geoestadística Municipal, which translates to Municipal Geo-Statistical Area. These units are used to organize data collection and analysis at the municipal level, providing a detailed view of each municipality. AGEMs are designed to facilitate the collection and analysis of data related to demographic, economic, social, and geographic characteristics at the municipal level. Each AGEM corresponds to a municipality, which is the second-level administrative division in Mexico, after states. Mexico is divided into 32 states, which are further subdivided into 2,457 municipalities. Each AGEM is assigned a three-digit code that identifies the municipality precisely within its state; the codes are not always consecutive and do not necessarily follow the alphabetical order of the municipalities in the state.

- **AGEB** stands for Área Geoestadística Básica, which translates to Basic Geo-Statistical Area. These are the smallest geographic units used for census and statistical purposes in urban and rural areas. Each AGEB is assigned a code made up of three digits, a hyphen, and a verification character, which is a digit from 0 to 9 or the letter A. There are two types of AGEBs. Urban AGEBs are found within cities and towns, often defined by blocks or clusters of blocks, and are used to provide detailed urban statistics. Rural AGEBs cover larger areas in rural regions, which can include multiple communities or settlements.
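
The coding rules above can be sketched in a few lines of Python. This is a minimal illustration, not part of the dataset tooling: the helper names `municipality_key` and `is_valid_ageb` are hypothetical, and joining the state and municipality codes into a single five-character key is an assumption about how one might reference a municipality nationally.

```python
import re

# Assumed shape of an AGEB code as described above: three digits,
# a hyphen, and a verification character (a digit or the letter A).
AGEB_PATTERN = re.compile(r"\d{3}-[0-9A]")


def municipality_key(cve_ent: int, cve_mun: int) -> str:
    """Join a two-digit state code and a three-digit municipality code."""
    if not 1 <= cve_ent <= 32:
        raise ValueError(f"state code out of range: {cve_ent}")
    if not 0 <= cve_mun <= 999:
        raise ValueError(f"municipality code out of range: {cve_mun}")
    # Zero-pad both parts so the key always has five characters.
    return f"{cve_ent:02d}{cve_mun:03d}"


def is_valid_ageb(code: str) -> bool:
    """Check that a string has the AGEB shape described in the text."""
    return AGEB_PATTERN.fullmatch(code) is not None
```

For example, `municipality_key(1, 5)` pads state `1` and municipality `5` to `"01005"`, while `is_valid_ageb("12-3")` is rejected because only two digits precede the hyphen.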

#### 1.1 Application Examples

The following notebooks demonstrate how to load and display geographic data from the files in the **geometry** folder using GeoPandas. GeoPandas is an open-source project that simplifies working with geospatial data in Python (https://geopandas.org/).

- `plot_mexico.ipynb`: Demonstrates how to plot the states of Mexico.
- `plot_slp.ipynb`: Shows how to plot the state of San Luis Potosí and its municipalities.
- `plot_slp_metropolitan_area.ipynb`: Illustrates how to plot the urban AGEBs in the San Luis Potosí Metropolitan Area, which includes the municipalities of San Luis Potosí and Soledad de Graciano Sánchez.



### 2. Industrial Air Pollution Emissions

The file in `dataset/Mexico/emissions/` contains a database listing the establishments that produce air pollution emissions, classified according to the Registry of Emissions and Transfers of Pollutants (RETC) published by the Ministry of Environment and Natural Resources [(Secretaría de Medio Ambiente y Recursos Naturales, SEMARNAT)](https://www.gob.mx/semarnat). Some of these emissions may relate to breast cancer.

A description of the variables stored in the emissions file is shown in the following table.


| Variable                 | Full Variable Name                                            | Variable Description                                                                                                                                                                                                                                                                   |
|--------------------------|---------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| **id**                       | Identifier                                                    | Emission identifier                                                                                                                                                                                                                                                                     |
| **Año**                      | Year                                                          | Year of RETC (Registry of Emissions and Pollutant Transfers) report                                                                                                                                                                                                                     |
| **cve_ent**                  | Entity Key                                                    | Key of the entity assigned by INEGI (National Institute of Statistics and Geography)                                                                                                                                                                                                    |
| **nom_ent**                  | Entity Name                                                   | Name of the entity where the establishment reporting the emission is located                                                                                                                                                                                                            |
| **cve_mun**                  | Municipality Key                                              | Municipal key assigned by INEGI                                                                                                                                                                                                                                                         |
| **nom_mun**                  | Municipality Name                                             | Name of the municipality where the establishment reporting the emission is located                                                                                                                                                                                                      |
| **sector**                   | Industrial Sector Name                                        | Name of the industrial sector according to SEMARNAT (Secretary of Environment and Natural Resources) classification                                                                                                                                                                     |
| **NRA**                      | Environmental Registration Number                             | Identifier generated by SEMARNAT for the “establishments” of the Individual or Legal Entity (applicant) obliged to carry out any procedure in environmental matters, which relates it to the applicant's RFC and the municipality where it is located, unique and non-transferable.       |
| unidad                   | Unit of Measurement                                           | Unit of measurement used to measure the amount of the substance emitted (Kg/year)                                                                                                                                                                                                       |
| **aire**                     | Air Emissions                                                 | Substance in any physical state released, directly or indirectly, into the air                                                                                                                                                                                                          |
| **agua**                     | Water Emissions                                               | Substance in any physical state released, directly or indirectly, into the water                                                                                                                                                                                                        |
| **suelo**                    | Soil Emissions                                                | Substance in any physical state released, directly or indirectly, into the soil                                                                                                                                                                                                         |
| **Nombre de la sustancia**   | Substance Name                                                | Name of the substance emitted by the establishment subject to reporting. This substance must be within the list of 200 substances of interest for NOM-165-SEMARNAT-2013.                                                                                                                |
| **gpo_IARC**                 | IARC Group                                                    | Group assigned by the database creators. Based on the classification of the substance according to the IARC (International Agency for Research on Cancer)                                                                                                                                                   |
| **X**                        | Longitude                                                     | Geographic coordinate                                                                                                                                                                                                                                                                   |
| **Y**                        | Latitude                                                      | Geographic coordinate                                                                                                                                                                                                                                                                   |
| **descrip_coordenadas**      | Coordinates Description                                       | Description of the method of obtaining the establishment's geographic coordinates                                                                                                                                                                                                       |
| **canc_mamario_SSI**         | *Considered Breast Carcinogen* by SSI (Silent Spring Institute) | Class label assigned to identify those substances related to breast cancer by SSI                                                                                                                                                                                                       |
| **cat_SSI**                  | Category Assigned to the Substance by SSI                     | Name of the category assigned to the substance by SSI based on the CAS number                                                                                                                                                                            
              


The emissions file can be loaded and displayed using the Pandas library, as shown below.

```python
import pandas as pd
```

`import pandas as pd` imports the pandas library and assigns it the alias `pd`. Pandas is a powerful data manipulation and analysis library for Python.

```python
# Load the emissions table from the dataset folder.
df_emissions = pd.read_csv("../dataset/Mexico/emissions/emissions.csv")

# Render the table in the notebook.
display(df_emissions)
```

| Unnamed: 0 | id | Año | cve_ent | nom_ent | cve_mun | nom_mun | sector | NRA | unidad | aire | agua | suelo | Nombre de la sustancia | gpo_IARC | X | Y | descrip_coordenadas | canc_mamario_SSI | cat_SSI |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2004 | 1 | Aguascalientes | 5 | Jesús María | Metalúrgica (incluye la siderúrgica) | AAGND0100511 | Kg/año | 0.000 | 0.067500 | 0.0 | Cianuro inorgánico/orgánico | NoConsiderada | -102.326944 | 21.963611 | 3 | | |
| 1 | 2 | 2004 | 1 | Aguascalientes | 5 | Jesús María | Metalúrgica (incluye la siderúrgica) | AAGND0100511 | Kg/año | 0.000 | 0.003132 | 0.0 | Arsénico (polvos, respirables, vapores o humos) | 1 | -102.326944 | 21.963611 | 3 | | |
| 2 | 3 | 2004 | 1 | Aguascalientes | 5 | Jesús María | Metalúrgica (incluye la siderúrgica) | AAGND0100511 | Kg/año | 0.000 | 0.000297 | 0.0 | Mercurio (compuestos) | 3 | -102.326944 | 21.963611 | 3 | | |
| 3 | 4 | 2004 | 1 | Aguascalientes | 11 | San Francisco de los Romo | Automotriz | CAS9M0101111 | Kg/año | 0.000 | 0.003692 | 0.0 | Mercurio (compuestos) | 3 | -102.270365 | 22.072369 | 1 | | |
| 4 | 5 | 2004 | 1 | Aguascalientes | 11 | San Francisco de los Romo | Automotriz | CAS9M0101111 | Kg/año | 0.000 | 0.095667 | 0.0 | Arsénico (polvos, respirables, vapores o humos) | 1 | -102.270365 | 22.072369 | 1 | | |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
| 134902 | 134903 | 2022 | 32 | Zacatecas | 55 | Villanueva | Otros | SMA3205500012 | Kg/año | 0.000 | 40.996800 | 0.0 | Plomo (Compuestos solubles) | 2B | -102.883333 | 22.357500 | 3 | | |
| 134903 | 134904 | 2022 | 32 | Zacatecas | 29 | Miguel Auza | Metalúrgica (incluye la siderúrgica) | SPRMJ3202911 | Kg/año | 132226.100 | 0.000000 | 0.0 | Bióxido de carbono | NoConsiderada | -103.459167 | 24.291667 | 3 | | |
| 134904 | 134905 | 2022 | 32 | Zacatecas | 29 | Miguel Auza | Metalúrgica (incluye la siderúrgica) | SPRMJ3202911 | Kg/año | 347.500 | 0.000000 | 0.0 | Plomo (polvos respirables, humos o vapores) | 2B | -103.459167 | 24.291667 | 3 | | |
| 134905 | 134906 | 2022 | 32 | Zacatecas | 17 | Guadalupe | Automotriz | YAM9M3201711 | Kg/año | 0.000 | 14.650000 | 0.0 | Níquel (Compuestos solubles) | 2B | -102.444167 | 22.761667 | 3 | | |


The `display(df_emissions)` call shows the data frame *df_emissions*. In a Jupyter notebook, this renders the DataFrame as a table, showing its contents and structure, including columns such as cve_ent, cve_mun, longitude (X), and latitude (Y).
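
Once loaded, the table can be filtered and aggregated with ordinary pandas operations. The sketch below is illustrative only: it rebuilds a four-row sample from the output above (a subset of columns) instead of reading the full file, and the choice of filtering on IARC group `1` and summing water emissions per municipality is an assumed example analysis, not one prescribed by the dataset authors.

```python
import io

import pandas as pd

# Four rows copied from the sample output above (subset of columns).
SAMPLE = io.StringIO(
    "id,nom_mun,agua,Nombre de la sustancia,gpo_IARC\n"
    "1,Jesús María,0.067500,Cianuro inorgánico/orgánico,NoConsiderada\n"
    '2,Jesús María,0.003132,"Arsénico (polvos, respirables, vapores o humos)",1\n'
    "3,Jesús María,0.000297,Mercurio (compuestos),3\n"
    '5,San Francisco de los Romo,0.095667,"Arsénico (polvos, respirables, vapores o humos)",1\n'
)

# Read gpo_IARC as text because it mixes labels such as "1", "2B",
# and "NoConsiderada".
df_sample = pd.read_csv(SAMPLE, dtype={"gpo_IARC": str})

# Keep only substances labeled as IARC group 1.
group_1 = df_sample[df_sample["gpo_IARC"] == "1"]

# Total water emissions (Kg/año) of those substances per municipality.
water_by_mun = group_1.groupby("nom_mun")["agua"].sum()
```

The same two steps should apply unchanged to `df_emissions`, provided `gpo_IARC` is read as text there as well.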

#### 2.1 Application Example

The following notebook demonstrates how to load and display geographic data from the files in the **geometry** and **emissions** folders using GeoPandas. GeoPandas is an open-source project that simplifies working with geospatial data in Python (https://geopandas.org/).

- `plot_emissions_slp_metropolitan_area.ipynb`: Demonstrates how to plot the emission sources located in the San Luis Potosí Metropolitan Area.

### 3. Stationary Air Pollution Sources 

The file in `dataset/Mexico/stationary_pollution_sources/` contains a database listing the establishments considered air pollution sources, such as gas stations, metal shops, grilled food stores, etc. This data, crucial for understanding and addressing air pollution, was collated from the [Directorio Estadístico Nacional de Unidades Económicas (DENUE)](https://www.inegi.org.mx/app/mapa/denue/default.aspx). DENUE, the cloud infrastructure of the National Economic Information Subsystem (SNIE), is a key resource for both specialized and non-specialized users. It provides identification, location, and contact data of the active economic units in the national territory, supporting the development and evaluation of public policies and economic development programs at all levels of government.


#### 3.1 Application Examples



The following notebooks demonstrate how to load and display geographic data from the files in the **geometry** and **stationary_pollution_sources** folders using GeoPandas. GeoPandas is an open-source project that simplifies working with geospatial data in Python (https://geopandas.org/).

- `plot_stationary_pullution_sources.ipynb`: Demonstrates how to plot the location of stationary pollution sources on the map.
- `plot_stationary_pollution_sources_slp_metropolitan_area.ipynb`: Illustrates how to plot the location of stationary pollution sources in the San Luis Potosí Metropolitan Area, which includes the municipalities of San Luis Potosí and Soledad de Graciano Sánchez.
- `plot_fuels.ipynb`: Demonstrates how to plot the location of fuel sources on the map. Data are available for the states of San Luis Potosí and Zacatecas.

### 4. Water Sources


The file in the folder `dataset/Mexico/water_sources/` contains a database of spatial variables related to water contamination in some Mexican states. The data were collated from a dataset published by the [Comisión Nacional del Agua (CONAGUA)](https://www.gob.mx/conagua/).


A description of the variables stored in the `water_sources.csv` file is shown in the following table.


| Variable            | Full Name of the Variable     | Description of the Variable                                                        |
|---------------------|-------------------------------|-------------------------------------------------------------------------------------|
| id                  | Identifier                    | Identifier of the well or spring assigned by the data-generating institution         |
| cve_ent             | Entity Key                    | Key of the entity assigned by INEGI                                                  |
| nom_ent             | Entity Name                   | Name of the entity where the well or spring is located                               |
| actividad           | Activity                      | Type of water source: well, spring, or domestic intake                               |
| F                   | Fluorides                     | Concentration of fluoride in mg/L                                                    |
| C_F                 | Fluoride Quality              | Water quality by fluoride content                                                    |
| N                   | NO3                           | Concentration of NO3 in mg/L                                                         |
| C_N                 | NO3 Quality                   | Water quality by NO3 content                                                         |
| As                  | Arsenic                       | Concentration of arsenic in mg/L                                                     |
| C_As                | Arsenic Quality               | Water quality by arsenic content                                                     |
| Cd                  | Cadmium                       | Concentration of cadmium in mg/L                                                     |
| C_Cd                | Cadmium Quality               | Water quality by cadmium content                                                     |
| Cr                  | Chromium                      | Concentration of chromium in mg/L                                                    |
| C_Cr                | Chromium Quality              | Water quality by chromium content                                                    |
| Hg                  | Mercury                       | Concentration of mercury in mg/L                                                     |
| C_Hg                | Mercury Quality               | Water quality by mercury content                                                     |
| Pb                  | Lead                          | Concentration of lead in mg/L                                                        |
| C_Pb                | Lead Quality                  | Water quality by lead content                                                        |
| Mn                  | Manganese                     | Concentration of manganese in mg/L                                                   |
| C_Mn                | Manganese Quality             | Water quality by manganese content                                                   |
| X                   | Longitude                     | Geographic coordinates                                                              |
| Y                   | Latitude                      | Geographic coordinates                                                              |
| fecha_registro      | Registration Date             | Date of registration of the concentrations                                          |
| fuente              | Information Source            | Source of information from where the data comes                                      |


#### 4.1 Application Example

```python
# Load the water sources table (pandas imported as pd).
df_ws = pd.read_csv("../dataset/Mexico/water_sources/water_sources.csv")
```


```python
# Render the table in the notebook.
display(df_ws)
```

| Unnamed: 0 | id | cve_ent | nom_ent | actividad | F | C_F | N | C_N | As | C_As | ... | Hg | C_Hg | Pb | C_Pb | Mn | C_Mn | X | Y | fecha_registro | fuente |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | DLCHI297 | 8 | Chihuahua | POZO | 1.83 | Alta | 0.781878 | Potable - Excelente | <0.01 | Potable - Excelente | ... | <0.0005 | Potable - Excelente | <0.005 | Potable - Excelente | 0.510564 | Puede afectar la salud | -106.112050 | 31.341430 | 2012-2022 | RENAMECA |
| 1 | DLCHI300 | 8 | Chihuahua | POZO | 0.62 | Media | 0.11345 | Potable - Excelente | <0.01 | Potable - Excelente | ... | <0.0005 | Potable - Excelente | <0.005 | Potable - Excelente | 2.04455625 | Puede afectar la salud | -105.881300 | 31.271970 | 2012-2022 | RENAMECA |
| 2 | DLCHI302 | 8 | Chihuahua | POZO | 2.334 | Alta | 5.479 | Potable - Buena calidad | <0.01 | Potable - Excelente | ... | 0.000593333 | Potable - Excelente | <0.005 | Potable - Excelente | 0.002057143 | Potable - Excelente | -106.847550 | 30.214110 | 2012-2022 | RENAMECA |
| 3 | DLCHI303 | 8 | Chihuahua | POZO | 2.8176 | Alta | 0.8117 | Potable - Excelente | <0.01 | Potable - Excelente | ... | <0.0005 | Potable - Excelente | <0.005 | Potable - Excelente | <0.0015 | Potable - Excelente | -106.792700 | 29.920680 | 2012-2022 | RENAMECA |
| 4 | DLCHI304 | 8 | Chihuahua | POZO | 2.275347 | Alta | 1.166143 | Potable - Excelente | <0.01 | Potable - Excelente | ... | <0.0005 | Potable - Excelente | <0.005 | Potable - Excelente | 0.001607143 | Potable - Excelente | -107.217710 | 29.990360 | 2012-2022 | RENAMECA |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
| 211 | OCRBR5710 | 5 | Coahuila | POZO | 0.3045 | Baja | 26.451248 | No apta como FAAP | <0.01 | Potable - Excelente | ... | <0.0005 | Potable - Excelente | <0.005 | Potable - Excelente | <0.0015 | Potable - Excelente | -100.935561 | 25.608067 | 2012-2022 | RENAMECA |
| 212 | OCRBR5711 | 5 | Coahuila | POZO | 0.9368 | Potable - Optima | 19.065063 | No apta como FAAP | <0.01 | Potable - Excelente | ... | <0.0005 | Potable - Excelente | <0.005 | Potable - Excelente | 0.0044 | Potable - Excelente | -101.549652 | 25.621133 | 2012-2022 | RENAMECA |
| 213 | P19C-6+350 | 5 | Coahuila | POZO | 1.32 | Potable - Optima | 2.036 | Potable - Excelente | <0.01 | Potable - Excelente | ... | <0.0005 | Potable - Excelente | 0.072 | No apta como FAAP | 0.074 | Potable - Excelente | -101.071290 | 29.449410 | 2012-2022 | RENAMECA |
| 214 | PLA16M-6+950 | 5 | Coahuila | POZO | 1.15 | Potable - Optima | 0.194 | Potable - Excelente | <0.01 | Potable - Excelente | ... | <0.0005 | Potable - Excelente | <0.005 | Potable - Excelente | 0.178 | Sin efectos en la salud - Puede dar color al agua | -101.077450 | 29.449320 | 2012-2022 | RENAMECA |


The `display(df_ws)` call shows the data frame `df_ws`. In a Jupyter notebook, this renders the DataFrame as a table, showing its contents and structure.
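
Several concentration cells in the sample output are written as `<0.01` or `<0.0005` rather than as plain numbers, so those columns cannot be converted to floats directly. The sketch below shows one possible way to handle them. The helper name `parse_concentration` is hypothetical, and reading `<x` as "below the limit x" (keeping `x` as an upper bound plus a flag) is an assumption about what the notation means, not something the dataset documentation states.

```python
from typing import Optional, Tuple


def parse_concentration(raw: str) -> Tuple[Optional[float], bool]:
    """Return (value in mg/L, below_limit flag) for one table cell."""
    text = raw.strip()
    if not text:
        # Empty cell: no measurement available.
        return None, False
    if text.startswith("<"):
        # Assumption: "<x" means the measurement was below the limit x,
        # so x is kept as an upper bound and the flag is set.
        return float(text[1:]), True
    return float(text), False
```

Keeping the flag separate leaves the choice of substitution (for example the limit itself, half the limit, or zero) to the analysis that consumes the values.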

### 5. Breast cancer mortality


The file in this folder contains a database listing breast cancer cases at the national level. The collated data were obtained from the [Instituto Nacional de Estadística y Geografía (INEGI)](https://www.inegi.org.mx/) and the [Consejo Nacional de Población (CONAPO)](https://www.gob.mx/conapo).

A description of the variables stored in the breast cancer file is shown in the following table. Note that locality information (`loc_resid`) is the only source of geographical information in the data set. Localities are identified by a four-digit number within a given entity (`cve_ent`) and municipality (`cve_mun`).
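
Because a locality number is only meaningful together with its entity and municipality, it can help to combine the three identifiers into one key before joining with other tables. The following is a minimal sketch under stated assumptions: the helper name `locality_key` and the nine-character concatenated format are illustrative choices, and the two sentinel values are the ones listed for `loc_resid` in the table.

```python
from typing import Optional

# Sentinel values listed for loc_resid in the variable table.
CONFIDENTIAL = 7777  # confidential figure
UNSPECIFIED = 9999   # unspecified locality key


def locality_key(cve_ent: int, cve_mun: int, loc_resid: int) -> Optional[str]:
    """Build an entity + municipality + locality key, or None if unusable."""
    if loc_resid in (CONFIDENTIAL, UNSPECIFIED):
        # No real locality is recorded for these rows.
        return None
    # Zero-pad to 2 + 3 + 4 characters so keys sort and compare consistently.
    return f"{cve_ent:02d}{cve_mun:03d}{loc_resid:04d}"
```

Rows for which the helper returns `None` can still be analyzed at the municipality level through `cve_ent` and `cve_mun`.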



| Variable      | Full Name                          | Description                                      | Range                                 |
|---------------|------------------------------------|--------------------------------------------------|---------------------------------------|
| cve_ent       | Entity key                         | Entity residence key assigned by INEGI           | na                                    |
| nom_ent       | Entity name                        | Entity residence name assigned by INEGI           | na                                    |
| cve_mun       | Municipality key                   | Municipality residence key assigned by INEGI      | na                                    |
| nom_mun       | Municipality name                  | Municipality residence name assigned by INEGI     | na                                    |
| t_loc         | Size of usual residence locality   | 1 to 999 inhabitants                              | 1                                     |
|               |                                    | 1,000 to 2,499                                    | 2.5                                   |
|               |                                    | 2,500 to 4,999                                    | 4                                     |
|               |                                    | 5,000 to 9,999                                    | 5                                     |
|               |                                    | 10,000 to 14,999                                  | 6                                     |
|               |                                    | 15,000 to 29,999                                  | 7.5                                   |
|               |                                    | 30,000 to 49,999                                  | 9.5                                   |
|               |                                    | 50,000 to 99,999                                  | 11.5                                  |
|               |                                    | 100,000 to 249,999                                | 13                                    |
|               |                                    | 250,000 to 499,999                                | 14                                    |
|               |                                    | 500,000 to 999,999                                | 15                                    |
|               |                                    | 1,000,000 and more                                | 16.5                                  |
| loc_resid     | Usual residence locality           | Locality according to Municipality.               | 0001… 6999                            |
|               |                                    | Confidential figure. Established in the SNIEG Law.| 7777                                  |
|               |                                    | Unspecified locality key                          | 9999                                  |
| gpo_quinq     | Five-year age group                | Five-year age group                               |                                       |
| poblacion     | Population                         | Population according to the size of the locality and the five-year age group in the municipality and the entity, 2010 and 2020 censuses |                                       |
| Censo         | Census                             | Census from which the population was extracted    |                                       |
| causa_def     | Cause of death (Detailed list)     | Malignant tumor of the nipple and areola          | C500                                  |
|               |                                    | Malignant tumor of the central portion of the breast| C501                                 |
|               |                                    | Malignant tumor of the inner lower quadrant of the breast| C503                             |
|               |                                    | Malignant tumor of the outer upper quadrant of the breast| C504                             |
|               |                                    | Malignant tumor of the axillary extension of the breast| C506                               |
|               |                                    | Contiguous sites lesion of the breast             | C508                                  |
|               |                                    | Malignant tumor of the breast, unspecified part   | C509                                  |
| sexo          | Sex of the deceased                | Female                                            | 1                                     |
| nacionalidad  | Nationality of the deceased        | Mexican                                           | 1                                     |
|               |                                    | Foreign                                           | 2                                     |
|               |                                    | Unspecified nationality                           | 9                                     |
| dia_ocurr     | Day of occurrence                  | Day                                               | 01… 31                                |
|               |                                    | Unspecified day                                   | 99                                    |
| mes_ocurr     | Month of occurrence                | January … December                                | 01… 12                                |
|               |                                    | Unspecified month                                 | 99                                    |
| anio_ocurr    | Year of occurrence                 | 1900… Statistical year                            |                                       |
|               |                                    | Unspecified year                                  | 9999                                  |
| dia_nacim     | Birth day of the deceased          | Day                                               | 01… 31                                |
|               |                                    | Unspecified day                                   | 99                                    |
| mes_nacim     | Birth month of the deceased        | January … December                                | 01… 12                                |
|               |                                    | Unspecified month                                 | 99                                    |
| anio_nacim    | Birth year of the deceased         | 1900… Statistical year                            |                                       |
|               |                                    | Unspecified year                                  | 9999                                  |
| ocupacion     | Occupation 2009-2012 of the deceased | Unemployed                                       | 2                                     |
|               |                                    | Professionals                                     | 11                                    |
|               |                                    | Technicians                                       | 12                                    |
|               |                                    | Education workers                                 | 13                                    |
|               |                                    | Arts, sports, and entertainment workers           | 14                                    |
|               |                                    | Officials and executives                          | 21                                    |
|               |                                    | Agricultural, livestock, hunting and fishing workers| 41                                  |
|               |                                    | Control personnel in the industrial production process| 51                                |
|               |                                    | Transformation industry workers                   | 52                                    |
|               |                                    | Fixed machinery operators                         | 53                                    |
|               |                                    | Assistants in the industrial and artisanal production process| 54                            |
|               |                                    | Mobile machinery and transport operators          | 55                                    |
|               |                                    | Intermediate level administrative workers         | 61                                    |
|               |                                    | Lower level administrative workers                | 62                                    |
|               |                                    | Traders, commerce employees, and sales agents     | 71                                    |
|               |                                    | Street vendors                                    | 72                                    |
|               |                                    | Personal services workers in establishments       | 81                                    |
|               |                                    | Domestic workers                                  | 82                                    |
|               |                                    | Armed forces, protection, and surveillance workers| 83                                    |
|               |                                    | Not applicable to minors under 12 years           | 97                                    |
|               |                                    | Insufficiently specified occupations              | 98                                    |
|               |                                    | Unspecified occupation                            | 99                                    |
| ocupacion     | Occupation 2013-2021 of the deceased | Officials, directors, and heads                  | 1                                     |
|               |                                    | Professionals and technicians                     | 2                                     |
|               |                                    | Auxiliary workers in administrative activities    | 3                                     |
|               |                                    | Traders, sales employees, and sales agents        | 4                                     |
|               |                                    | Personal services and surveillance workers        | 5                                     |
|               |                                    | Agricultural, livestock, forestry, hunting, and fishing workers| 6                             |
|               |                                    | Artisanal workers                                 | 7                                     |
|               |                                    | Industrial machinery operators, assemblers, drivers, and transport conductors| 8                  |
|               |                                    | Workers in elementary and support activities      | 9                                     |
|               |                                    | Job seekers                                       | 10                                    |
|               |                                    | Unemployed                                        | 11                                    |
|               |                                    | Not applicable to minors under 5 years            | 97                                    |
|               |                                    | Insufficiently specified occupations              | 98                                    |
|               |                                    | Unspecified occupation                            | 99                                    |
| ocupacion     | Occupation 2022 of the deceased    | See Occupation Catalog 2022                       | 011-099, 100, 110, 997, 998, 999      |
| escolarida    | Schooling 2009-2011 of the deceased| No schooling                                      | 1                                     |
|               |                                    | Incomplete primary (one to five years)            | 2                                     |
|               |                                    | Complete primary                                  | 3                                     |
|               |                                    | Incomplete secondary                              | 4                                     |
|               |                                    | Complete secondary                                | 5                                     |
|               |                                    | High school or preparatory                        | 6                                     |
|               |                                    | Professional                                      | 7                                     |
|               |                                    | Not applicable to minors under 6 years            | 8                                     |
|               |                                    | Unspecified                                       | 9                                     |
| escolarida    | Schooling 2012 onwards of the deceased | No schooling                                     | 1                                     |
|               |                                    | Preschool                                         | 2                                     |
|               |                                    | Incomplete primary                                | 3                                     |
|               |                                    | Complete primary                                  | 4                                     |
|               |                                    | Incomplete secondary                              | 5                                     |
|               |                                    | Complete secondary                                | 6                                     |
|               |                                    | Incomplete high school or preparatory             | 7                                     |
|               |                                    | Complete high school or preparatory               | 8                                     |
|               |                                    | Professional                                      | 9                                     |
|               |                                    | Postgraduate                                      | 10                                    |
|               |                                    | Not applicable to minors under 3 years            | 88                                    |
|               |                                    | Unspecified                                       | 99                                    |
| edo_civil     | Marital status 2009-2011 of the deceased | Single                                           | 1                                     |
|               |                                    | Widowed                                           | 2                                     |
|               |                                    | Divorced                                          | 3                                     |
|               |                                    | Common-law union                                  | 4                                     |
|               |                                    | Married                                           | 5                                     |
|               |                                    | Not applicable to minors under 12 years           | 8                                     |
|               |                                    | Unspecified                                       | 9                                     |
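
The date variables in the table above use `99` (day, month) and `9999` (year) to mark unspecified values, so they should not be treated as real calendar components. A minimal sketch of one way to handle this follows, using only the standard library. The helper name `parse_partial_date` is hypothetical, and returning `None` for any incomplete date is an assumption about how missing components should be treated.

```python
from datetime import date
from typing import Optional

# Sentinel codes taken from the variable table above.
UNSPECIFIED_DAY_OR_MONTH = 99
UNSPECIFIED_YEAR = 9999


def parse_partial_date(day: int, month: int, year: int) -> Optional[date]:
    """Return a date, or None when a component is unspecified or invalid.

    Hypothetical helper: discarding the whole date when any component is
    unspecified is an assumption, not part of the dataset specification.
    """
    if day == UNSPECIFIED_DAY_OR_MONTH or month == UNSPECIFIED_DAY_OR_MONTH:
        return None
    if year == UNSPECIFIED_YEAR:
        return None
    try:
        return date(year, month, day)
    except ValueError:
        # Impossible combinations, such as day 31 in a 30-day month.
        return None


print(parse_partial_date(26, 7, 1965))   # 1965-07-26
print(parse_partial_date(99, 99, 1962))  # None
```

The same function could be applied to the `dia_nacim`, `mes_nacim`, `anio_nacim` or `dia_ocurr`, `mes_ocurr`, `anio_ocurr` triplets of each record, for instance row by row once the file is loaded.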

#### 5.1 Application Example

In [5]:
df_bc = pd.read_csv("../dataset/Mexico/bc_data/BDmortalidadCAMAbase.csv")

The line above loads the breast cancer mortality file into the pandas DataFrame *df_bc*. The next line displays it. In a Jupyter notebook, this renders the DataFrame as a table, showing its contents and structure.

In [21]:
display(df_bc)

| Unnamed: 0 | cve_ent | nom_ent | cve_mun | nom_mun | t_loc | loc_resid | gpo_quinq | poblacion | Censo | causa_def | ... | dia_nacim | mes_nacim | anio_nacim | ocupacion | escolarida | edo_civil | asist_med | derechohab | area_ur | anio_cert |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | Aguascalientes | 1 | 001 Aguascalientes | 1.0 | 7777 | 45-49 años | 662 | 2010 | C509 | ... | 26 | 7 | 1965 | 71 | 6.0 | 2 | 1 | 2 | 2 | 2012 |
| 1 | 1 | Aguascalientes | 1 | 001 Aguascalientes | 1.0 | 285 | 45-49 años | 662 | 2010 | C509 | ... | 14 | 2 | 1969 | 11 | 4.0 | 3 | 9 | 2 | 2 | 2015 |
| 2 | 1 | Aguascalientes | 1 | 001 Aguascalientes | 1.0 | 125 | 45-49 años | 662 | 2010 | C509 | ... | 9 | 10 | 1966 | 11 | 3.0 | 5 | 1 | 2 | 2 | 2015 |
| 3 | 1 | Aguascalientes | 1 | 001 Aguascalientes | 1.0 | 7777 | 50-54 años | 553 | 2010 | C509 | ... | 99 | 99 | 1962 | 11 | 9.0 | 5 | 1 | 7 | 2 | 2012 |
| 4 | 1 | Aguascalientes | 1 | 001 Aguascalientes | 1.0 | 2180 | 50-54 años | 553 | 2010 | C509 | ... | 99 | 99 | 1961 | 2 | 4.0 | 3 | 1 | 2 | 2 | 2012 |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
| 89478 | 32 | Zacatecas | 56 | 056 Zacatecas | 13.0 | 1 | 80-84 años | 724 | 2020 | C509 | ... | 4 | 4 | 1935 | 11 | 6.0 | 5 | 1 | 3 | 1 | 2019 |
| 89479 | 32 | Zacatecas | 56 | 056 Zacatecas | 13.0 | 1 | 80-84 años | 724 | 2020 | C509 | ... | 8 | 2 | 1936 | 11 | 4.0 | 1 | 1 | 3 | 1 | 2020 |
| 89480 | 32 | Zacatecas | 56 | 056 Zacatecas | 13.0 | 1 | 85-89 años | 371 | 2020 | C509 | ... | 18 | 7 | 1931 | 11 | 8.0 | 3 | 1 | 3 | 1 | 2021 |
| 89481 | 32 | Zacatecas | 58 | 058 Santa María de la Paz | 2.5 | 1 | 40-44 años | 58 | 2020 | C509 | ... | 5 | 5 | 1976 | 11 | 4.0 | 5 | 1 | 1 | 2 | 2020 |
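
The `escolarida` values in the output above are numeric codes whose meaning depends on the catalog period (2009-2011 versus 2012 onwards), so the same code can stand for different schooling levels. The sketch below shows one possible way to turn a code into its label. The function `decode_escolarida` and its `catalog_year` parameter are hypothetical, and which column of the file determines the catalog period is an assumption the reader should verify before relying on it.

```python
import math

# Catalogs copied from the variable table; keys are the numeric codes.
ESCOLARIDA_2009_2011 = {
    1: "No schooling",
    2: "Incomplete primary (one to five years)",
    3: "Complete primary",
    4: "Incomplete secondary",
    5: "Complete secondary",
    6: "High school or preparatory",
    7: "Professional",
    8: "Not applicable to minors under 6 years",
    9: "Unspecified",
}
ESCOLARIDA_2012_ONWARDS = {
    1: "No schooling",
    2: "Preschool",
    3: "Incomplete primary",
    4: "Complete primary",
    5: "Incomplete secondary",
    6: "Complete secondary",
    7: "Incomplete high school or preparatory",
    8: "Complete high school or preparatory",
    9: "Professional",
    10: "Postgraduate",
    88: "Not applicable to minors under 3 years",
    99: "Unspecified",
}


def decode_escolarida(code: float, catalog_year: int) -> str:
    """Map a schooling code to its label for the given catalog year.

    Hypothetical helper: `catalog_year` is whichever year the caller
    decides governs the catalog (an assumption, not confirmed here).
    """
    if isinstance(code, float) and math.isnan(code):
        return "Missing"
    catalog = ESCOLARIDA_2009_2011 if catalog_year <= 2011 else ESCOLARIDA_2012_ONWARDS
    return catalog.get(int(code), f"Unknown code {code}")


# Code 6.0 reads differently depending on the period.
print(decode_escolarida(6.0, 2011))  # High school or preparatory
print(decode_escolarida(6.0, 2012))  # Complete secondary
```

The codes appear as floats (for example `6.0`) in the displayed table, which is why the helper converts them to integers before the lookup. The same pattern would extend to `ocupacion` and `edo_civil`, whose catalogs also change between periods.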


### 5. Breast cancer mortality