# LANGUAGE COMPETENCE PART 2: Dataset + Data Collection

## Project Aims

### Problem Statement

Applications for degree courses linked to European languages have fallen by almost a quarter in the past five years, and applications to other language courses have dropped by almost a fifth (source: Press Association analysis).

- Would it make a difference if we could predict competence in a foreign language?

### Target Audience

- Schools and boards of education that need to justify funding in targeted areas
- Students who need evidence of the factors that lead to success in order to continue studying languages

### Project Goal

A model that can predict the achievement of competence in a foreign language on the basis of variable factors, i.e. factors that can be changed.

### Data Source

European Survey on Language Competences (ESLC) conducted by the Centre for Research on Education and Lifelong Learning: https://crell.jrc.ec.europa.eu/?q=article/eslc-database

The European Survey on Language Competences (ESLC) was carried out in 2012 to provide the Commission and the participating countries with comparable data, across skills and languages, on the foreign language competences of secondary school students. The data collected in the 16 participating educational systems include the results of language tests scored according to the Common European Framework of Reference (CEFR) and the responses to the questionnaires that were administered.

The test results correspond to the first and second most commonly taught languages in the participating educational systems, and the questionnaire data include student, teacher and school/principal responses to questions about teaching and learning. For this project I will concentrate on the student-specific data and refer to the teacher/school-specific data only for context.

### Data Format

Scoring:
- Language tests have been scored according to the Common European Framework of Reference for Languages (https://en.wikipedia.org/wiki/Common_European_Framework_of_Reference_for_Languages). I need to work out whether it is more beneficial to model each level separately or to group the levels into larger categories.
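
As a rough illustration of the grouping option, the sketch below maps individual CEFR levels onto the framework's broader bands (A = basic user, B = independent user). The level labels are the ones that appear in the `PL*` columns of this dataset; the band names, the `CEFR_BANDS` dictionary and the `group_cefr` helper are hypothetical, and treating `-A1` (below A1) as a band of its own is an assumption rather than a decision the project has made.

```python
import pandas as pd

# CEFR levels in ascending order, as they appear in the survey's level columns
CEFR_ORDER = ["-A1", "A1", "A2", "B1", "B2"]

# Hypothetical grouping into broader bands (an assumption, not the final scheme)
CEFR_BANDS = {
    "-A1": "pre-A1",
    "A1": "basic",
    "A2": "basic",
    "B1": "independent",
    "B2": "independent",
}


def group_cefr(levels: pd.Series) -> pd.Series:
    """Map each CEFR level to its broader band; unknown or missing values become NaN."""
    return levels.map(CEFR_BANDS)


# Made-up sample values for illustration only
sample = pd.Series(["B2", "A1", "-A1", "A2", None])

# Option 1: keep every level, but as an ordered categorical
ordered = pd.Categorical(sample, categories=CEFR_ORDER, ordered=True)

# Option 2: collapse the levels into larger categories
grouped = group_cefr(sample)

print(ordered.codes.tolist())
print(grouped.tolist())
```

Keeping the levels as an ordered categorical preserves the most detail, while the grouped version gives fewer, larger classes, which may help if some levels turn out to be rare.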

### Risks and Challenges

- The best predictors may turn out to be non-variable features
- The most common language in the survey is English, so for there to be a benefit in the UK, the model should be able to predict performance across all languages, not only English
- The expected benefit of the project is based on the assumption that a higher level of competence would lead to continued study

## Data Cleaning

### Data Import

In [3]:
import pandas as pd
pd.set_option('display.max_rows',None)

In [4]:
# set column widths for data (info provided with data)
colwidths=[6,10,5,9,2,1,2,2,2,2,10,2,2,2,2,2,2,2,2,2,2,2,2,2,2,4,4,2,2,\
           2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,\
           2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,\
           2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,\
           2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,\
           2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,\
           2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,7,\
           4,7,4,7,4,7,4,7,4,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,\
           2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,\
           2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,\
           2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,\
           2,2,2,2,2,2,2,2,4,4,2,2,2,4,4,4,6,2,2,2,2,2,2,3,1,1,1,2,2,1,\
           1,1,1,1,1,1,1,1,3,1,4,4,3,1,1,1,4,1,4,4,1,2,1,14,1,1,3,4,4,5,\
           14,5,10,2,1,1,1,1,14,14,14,14,15,14,14,14,14,14,15,14,15,15,\
           15,15,15,17,14,15,15,14,15,15,15,3,3,3,3,3,3,3,3,3,3,3,3,3,3,\
           3,3,3,3,3,3,3,3,3,3,3,10,10,11,11,11,11,11,11,1,1,1,1,1,1,1,\
           1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,\
           1,1,1,1,2,1,10]
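
Because a fixed-width import pairs each width with a column name purely by position, a quick consistency check on the two lists can catch a miscounted entry before the file is read. The sketch below is a minimal example of such a check; `check_fwf_spec` is a hypothetical helper, and the short demo lists reuse only the first three widths and the first three column names supplied with the data.

```python
def check_fwf_spec(names, widths):
    """Validate a fixed-width layout and return the total record width."""
    if len(names) != len(widths):
        raise ValueError(f"{len(names)} names but {len(widths)} widths")

    # Duplicate names would make columns ambiguous after import
    seen, duplicates = set(), set()
    for name in names:
        if name in seen:
            duplicates.add(name)
        seen.add(name)
    if duplicates:
        raise ValueError(f"duplicate column names: {sorted(duplicates)}")

    # Every field must occupy at least one character
    if any(width <= 0 for width in widths):
        raise ValueError("all widths must be positive")

    return sum(widths)


# First three fields of the layout, used here purely as a demo
demo_names = ["country_id", "school_id", "main_study_sample"]
demo_widths = [6, 10, 5]

record_width = check_fwf_spec(demo_names, demo_widths)
print(record_width)
```

Calling the same helper on the full width and name lists would confirm that they line up, and the returned total can be compared with the length of a line in the raw file.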

In [5]:
# set column names for data (info provided with data)
colnames=["country_id","school_id","main_study_sample","respondent_id",\
          "targetLanguage_id","TL","questLanguage_id","TESTING_grade",\
          "test_mode","SQt01i01","SQt02i01C","SQt03i01","SQt04i00","SQt04i01",\
          "SQt04i02","SQt04i03","SQt04i04","SQt04i05","SQt04i06","SQt04i07",\
          "SQt04i08","SQt04i09","SQt04i10","SQt04i11","SQt04i12","SQt05i01",\
          "SQt05i01TC","SQt06i01","SQt09i01","SQt12i01","SQt13i01","SQt14i01",\
          "SQt15i01","SQt16i01","SQt17i01","SQt18i01","SQt19i01","SQt19i02",\
          "SQt19i03","SQt19i04","SQt19i05","SQt19i06","SQt19i07","SQt19i08",\
          "SQt20i01","SQt20i02","SQt20i03","SQt20i04","SQt20i05","SQt20i06",\
          "SQt20i07","SQt20i08","SQt21i01","SQt22i01","SQt22i02","SQt22i03",\
          "SQt22i04","SQt22i05","SQt23i01","SQt23i02","SQt23i03","SQt23i04",\
          "SQt23i05","SQt23i06","SQt23i07","SQt23i08","SQt23i09","SQt24i01",\
          "SQt24i02","SQt24i03","SQt24i04","SQt24i05","SQt24i06","SQt25i00",\
          "SQt25i01","SQt25i02","SQt25i03","SQt25i04","SQt25i05","SQt25i06",\
          "SQt25i07","SQt25i08","SQt25i09","SQt25i10","SQt25i11","SQt25i12",\
          "SQt26i00","SQt26i01","SQt26i02","SQt26i03","SQt26i04","SQt26i05",\
          "SQt26i06","SQt26i07","SQt26i08","SQt26i09","SQt26i10","SQt26i11",\
          "SQt26i12","SQt27i01","SQt28i01","SQt28i02","SQt29i01","SQt29i02",\
          "SQt29i03","SQt29i04","SQt29i05","SQt29i06","SQt29i07","SQt30i01",\
          "SQt30i02","SQt30i03","SQt30i04","SQt30i05","SQt30i06","SQt31i01",\
          "SQt31i02","SQt31i03","SQt31i04","SQt31i05","SQt31i06","SQt31i07",\
          "SQt31i08","SQt31i09","SQt32i01","SQt32i02","SQt32i03","SQt32i04",\
          "SQt33i01","SQt33i02","SQt33i03","SQt33i04","SQt33i05","SQt33i06",\
          "SQt33i07","SQt33i08","SQt33i09","SQt33i10","SQt34i01","SQt34i02",\
          "SQt34i03","SQt34i04","SQt34i05","SQt34i06","SQt34i07","SQt34i08",\
          "SQt34i09","SQt35i01","SQt35i02","SQt35i03","SQt35i04","SQt35i05",\
          "SQt35i06","SQt35i07","SQt35i08","SQt35i09","SQt36i01","SQt36i02",\
          "SQt36i03","SQt36i04","SQt36i05","SQt36i06","SQt36i07","SQt36i08",\
          "SQt36i09","SQt37i00","SQt37i01","SQt37i02","SQt37i03","SQt37i04",\
          "SQt37i05","SQt37i06","SQt37i07","SQt37i08","SQt37i09","SQt37i10",\
          "SQt38i01","SQt39i00","SQt39i01","SQt39i02","SQt39i03","SQt39i04",\
          "SQt39i05","SQt39i06","SQt39i07","SQt39i08","SQt39i09","SQt39i10",\
          "SQt39i11","SQt39i12","SQt39i13","SQt39i14","SQt40i00","SQt40i01",\
          "SQt40i02","SQt40i03","SQt40i04","SQt40i05","SQt40i06","SQt40i07",\
          "SQt40i08","SQt40i09","SQt40i10","SQt40i11","SQt40i12","SQt40i13",\
          "SQt40i14","SQt41i01","SQt42i01","SQt42i01C","SQt43i01","SQt43i01C",\
          "SQt44i01","SQt44i01C","SQt44i02","SQt44i02C","SQt44i03","SQt44i03C",\
          "SQt45i01","SQt45i02","SQt45i03","SQt45i04","SQt45i05","SQt45i06",\
          "SQt46i01","SQt46i02","SQt46i03","SQt46i04","SQt46i05","SQt46i06",\
          "SQt46i07","SQt47i01","SQt48i01","SQt48i02","SQt48i03","SQt48i04",\
          "SQt48i05","SQt48i06","SQt48i07","SQt49i01","SQt49i02","SQt50i01",\
          "SQt50i02","SQt50i03","SQt51i01","SQt51i02","SQt51i03","SQt51i04",\
          "SQt51i05","SQt51i06","SQt51i07","SQt51i08","SQt51i09","SQt52i01",\
          "SQt52i02","SQt52i03","SQt52i04","SQt52i05","SQt52i06","SQt52i07",\
          "SQt53i01","SQt53i02","SQt53i03","SQt53i04","SQt53i05","SQt53i06",\
          "SQt54i01","SQt54i02","SQt54i03","SQt54i04","SQt54i05","SQt54i06",\
          "SQt55i01","SQt55i02","SQt55i03","SQt55i04","SQt55i05","SQt55i06",\
          "SQt56i01","SQt56i02","SQt57i01","SQt57i02","SQt57i03","SQt57i04",\
          "SQt57i05","SQt57i06","SQt57i07","SQt58i01","SQt58i02","SQt58i03",\
          "SQt58i04","SQt58i05","SQt58i06","SQt58i07","SQt59i01","SQt59i02",\
          "SQt60i01","SQt61i01","SQt61i02","SQt61i03","SQt61i04","SQt61i05",\
          "SQt61i06","SQt61i07","SQt62i01","SQt62i02","SQt62i03","SQt62i04",\
          "SQt62i05","SQt62i06","SQt62i07","SQt62i08","SQt62i09","SQt63i01",\
          "SQt63i02","SQt64i01","SQt64i02","SQt64i03","SQt64i04","SQt64i05",\
          "SQt64i06","SQtA1i01","SQtA1i02","SQtA1i03","SQtA1i04","SQtA2i01",\
          "SQtA2i02","SQtA2i03","SQtA2i04","SQtA3i01","SQtA3i02","SQtA3i03",\
          "SQtA3i04","SQtA4i01","SQtA4i02","SQtA4i03","SQtA4i04","ISCO_M",\
          "ISCO_F","ISEI_M","ISEI_F","HISEI","MPARED","FPARED","PARED",\
          "HOMEPOS","I01_ST_M_S39A","I01_ST_M_S39B","I01_ST_M_S40A",\
          "I01_ST_M_S40B","I01_ST_M_S44A","I01_ST_M_S44B","I01_ST_M_S59A",\
          "I01_ST_M_S63A","I01_ST_M_S63B","I02_ST_M_S37A","I02_ST_M_S37B",\
          "I02_ST_M_S38A","I02_ST_M_S41A","I03_ST_A_S03A","I03_ST_A_S04A",\
          "I03_ST_A_S04B","I03_ST_A_S25A","I03_ST_A_S25B","I03_ST_A_S26A",\
          "I03_ST_A_S26B","I03_ST_A_S27B","I03_ST_A_S28A","I03_ST_A_S29A",\
          "I03_ST_A_S30A","I03_ST_A_S31A","I03_ST_A_S45A","I04_ST_M_S64A",\
          "I04_ST_M_S64B","I05_ST_A_S23A","I05_ST_A_S24A","I05_ST_M_S62A",\
          "I06_ST_M_S45A","I06_ST_M_S46A","I08_ST_A_S01A","I08_ST_A_S02A",\
          "I08_ST_A_S15A","I08_ST_A_S19B","I08_ST_M_S64A","I08_ST_M_S64B",\
          "I09_IN_M_S49A","I09_IN_M_S50A","I09_IN_M_S51A","I09_IN_M_S57A",\
          "I09_ST_M_S33B","I09_ST_M_S48A","I09_ST_M_S52B","I14_IN_A_S42A",\
          "I14_ST_A_S06A","I14_ST_A_S06B","I14_ST_A_S06C","I14_ST_M_S47A",\
          "PV1_LIST","PV2_LIST","PV3_LIST","PV4_LIST","PV5_LIST","PV1_READ",\
          "PV2_READ","PV3_READ","PV4_READ","PV5_READ","PV1_WRIT1","PV2_WRIT1",\
          "PV3_WRIT1","PV4_WRIT1","PV5_WRIT1","PV1_WRIT2","PV2_WRIT2",\
          "PV3_WRIT2","PV4_WRIT2","PV5_WRIT2","PV1_WRIT_C","PV2_WRIT_C",\
          "PV3_WRIT_C","PV4_WRIT_C","PV5_WRIT_C","PL1_READ","PL2_READ",\
          "PL3_READ","PL4_READ","PL5_READ","PL1_LIST","PL2_LIST","PL3_LIST",\
          "PL4_LIST","PL5_LIST","PL1_WRIT1","PL2_WRIT1","PL3_WRIT1",\
          "PL4_WRIT1","PL5_WRIT1","PL1_WRIT2","PL2_WRIT2","PL3_WRIT2",\
          "PL4_WRIT2","PL5_WRIT2","PL1_WRIT_C","PL2_WRIT_C","PL3_WRIT_C",\
          "PL4_WRIT_C","PL5_WRIT_C","FSW_LIST","FSW_LIST_TR","FSW_READ",\
          "FSW_READ_TR","FSW_WRIT","FSW_WRIT_TR","FSW_QUES","FSW_QUES_TR",\
          "RPW1","RPW2","RPW3","RPW4","RPW5","RPW6","RPW7","RPW8","RPW9",\
          "RPW10","RPW11","RPW12","RPW13","RPW14","RPW15","RPW16","RPW17",\
          "RPW18","RPW19","RPW20","RPW21","RPW22","RPW23","RPW24","RPW25",\
          "RPW26","RPW27","RPW28","RPW29","RPW30","RPW31","RPW32","RPW33",\
          "RPW34","RPW35","RPW36","RPW37","RPW38","RPW39","RPW40","RPW41",\
          "JKzone","JKrep","version_stu"]

NOTE: Throughout the dataset, there are specific response codes that have been used to indicate invalid or missing data:

- 77 Not applicable
- 88 Invalid
- 98 Missing
- 99 Missing
- 8887 Coded as invalid
- 8888 Invalid
- 9998 Question missing

On import, I will replace these codes with NaN so that it is clear how much data is missing.

In [6]:
na_list = ['77', '88', '98', '99', '8887', '8888', '9998']

In [7]:
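# read in data to df
# NOTE: the path is built from adjacent string literals; a backslash
# continuation inside one string would embed the indentation in the path
lang = pd.read_fwf("/Users/lizspiking/Documents/Data Science/"
                   "Capstone Project/Language/INT_stu.txt",
                   names=colnames, widths=colwidths,
                   delimiter='\t', keep_default_na=True,
                   decimal=',', na_values=na_list)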
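
To make the effect of `widths`, `names` and `na_values` easier to see, here is a minimal sketch that reads a tiny in-memory fixed-width sample. The three records, the column names `country`, `grade` and `flag`, and the reduced code list are made up for illustration; only `pd.read_fwf` and its parameters come from the import above.

```python
import io

import pandas as pd

# Three made-up records: country (2 chars), grade (3 chars), flag (2 chars)
sample_text = "PT  9 1\nEL 77 0\nES 1099\n"

demo = pd.read_fwf(
    io.StringIO(sample_text),
    widths=[2, 3, 2],                  # one width per column, in file order
    names=["country", "grade", "flag"],
    na_values=["77", "99"],            # survey codes to treat as missing
)

print(demo)
print(demo.isna().sum())
```

Two caveats may be worth checking against the real file. First, a plain list of `na_values` applies to every column, so a genuine value of 77 or 99 in a score or count column would also become NaN; pandas also accepts a dict of per-column `na_values` if that turns out to matter. Second, pandas documents `delimiter` for `read_fwf` as the set of filler characters stripped from each field, so passing `delimiter='\t'` may leave space padding in place, in which case padded values would not match the listed codes.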

In [8]:
# check number of rows/features
lang.shape

(52434, 499)

In [9]:
# check that data has been separated appropriately into columns
lang.head(10)

Unnamed: 0,country_id,school_id,main_study_sample,respondent_id,targetLanguage_id,TL,questLanguage_id,TESTING_grade,test_mode,SQt01i01,SQt02i01C,SQt03i01,SQt04i00,SQt04i01,SQt04i02,SQt04i03,SQt04i04,SQt04i05,SQt04i06,SQt04i07,SQt04i08,SQt04i09,SQt04i10,SQt04i11,SQt04i12,SQt05i01,SQt05i01TC,SQt06i01,SQt09i01,SQt12i01,SQt13i01,SQt14i01,SQt15i01,SQt16i01,SQt17i01,SQt18i01,SQt19i01,SQt19i02,SQt19i03,SQt19i04,SQt19i05,SQt19i06,SQt19i07,SQt19i08,SQt20i01,SQt20i02,SQt20i03,SQt20i04,SQt20i05,SQt20i06,SQt20i07,SQt20i08,SQt21i01,SQt22i01,SQt22i02,SQt22i03,SQt22i04,SQt22i05,SQt23i01,SQt23i02,SQt23i03,SQt23i04,SQt23i05,SQt23i06,SQt23i07,SQt23i08,SQt23i09,SQt24i01,SQt24i02,SQt24i03,SQt24i04,SQt24i05,SQt24i06,SQt25i00,SQt25i01,SQt25i02,SQt25i03,SQt25i04,SQt25i05,SQt25i06,SQt25i07,SQt25i08,SQt25i09,SQt25i10,SQt25i11,SQt25i12,SQt26i00,SQt26i01,SQt26i02,SQt26i03,SQt26i04,SQt26i05,SQt26i06,SQt26i07,SQt26i08,SQt26i09,SQt26i10,SQt26i11,SQt26i12,SQt27i01,SQt28i01,SQt28i02,SQt29i01,SQt29i02,SQt29i03,SQt29i04,SQt29i05,SQt29i06,SQt29i07,SQt30i01,SQt30i02,SQt30i03,SQt30i04,SQt30i05,SQt30i06,SQt31i01,SQt31i02,SQt31i03,SQt31i04,SQt31i05,SQt31i06,SQt31i07,SQt31i08,SQt31i09,SQt32i01,SQt32i02,SQt32i03,SQt32i04,SQt33i01,SQt33i02,SQt33i03,SQt33i04,SQt33i05,SQt33i06,SQt33i07,SQt33i08,SQt33i09,SQt33i10,SQt34i01,SQt34i02,SQt34i03,SQt34i04,SQt34i05,SQt34i06,SQt34i07,SQt34i08,SQt34i09,SQt35i01,SQt35i02,SQt35i03,SQt35i04,SQt35i05,SQt35i06,SQt35i07,SQt35i08,SQt35i09,SQt36i01,SQt36i02,SQt36i03,SQt36i04,SQt36i05,SQt36i06,SQt36i07,SQt36i08,SQt36i09,SQt37i00,SQt37i01,SQt37i02,SQt37i03,SQt37i04,SQt37i05,SQt37i06,SQt37i07,SQt37i08,SQt37i09,SQt37i10,SQt38i01,SQt39i00,SQt39i01,SQt39i02,SQt39i03,SQt39i04,SQt39i05,SQt39i06,SQt39i07,SQt39i08,SQt39i09,SQt39i10,SQt39i11,SQt39i12,SQt39i13,SQt39i14,SQt40i00,SQt40i01,SQt40i02,SQt40i03,SQt40i04,SQt40i05,SQt40i06,SQt40i07,SQt40i08,SQt40i09,SQt40i10,SQt40i11,SQt40i12,SQt40i13,SQt40i14,SQt41i01,SQt42i01,SQt42i01C,SQt43i01,SQt43i01C,SQt44i01,SQt44i01C,SQt44i02,SQt44i02C,SQt4
4i03,SQt44i03C,SQt45i01,SQt45i02,SQt45i03,SQt45i04,SQt45i05,SQt45i06,SQt46i01,SQt46i02,SQt46i03,SQt46i04,SQt46i05,SQt46i06,SQt46i07,SQt47i01,SQt48i01,SQt48i02,SQt48i03,SQt48i04,SQt48i05,SQt48i06,SQt48i07,SQt49i01,SQt49i02,SQt50i01,SQt50i02,SQt50i03,SQt51i01,SQt51i02,SQt51i03,SQt51i04,SQt51i05,SQt51i06,SQt51i07,SQt51i08,SQt51i09,SQt52i01,SQt52i02,SQt52i03,SQt52i04,SQt52i05,SQt52i06,SQt52i07,SQt53i01,SQt53i02,SQt53i03,SQt53i04,SQt53i05,SQt53i06,SQt54i01,SQt54i02,SQt54i03,SQt54i04,SQt54i05,SQt54i06,SQt55i01,SQt55i02,SQt55i03,SQt55i04,SQt55i05,SQt55i06,SQt56i01,SQt56i02,SQt57i01,SQt57i02,SQt57i03,SQt57i04,SQt57i05,SQt57i06,SQt57i07,SQt58i01,SQt58i02,SQt58i03,SQt58i04,SQt58i05,SQt58i06,SQt58i07,SQt59i01,SQt59i02,SQt60i01,SQt61i01,SQt61i02,SQt61i03,SQt61i04,SQt61i05,SQt61i06,SQt61i07,SQt62i01,SQt62i02,SQt62i03,SQt62i04,SQt62i05,SQt62i06,SQt62i07,SQt62i08,SQt62i09,SQt63i01,SQt63i02,SQt64i01,SQt64i02,SQt64i03,SQt64i04,SQt64i05,SQt64i06,SQtA1i01,SQtA1i02,SQtA1i03,SQtA1i04,SQtA2i01,SQtA2i02,SQtA2i03,SQtA2i04,SQtA3i01,SQtA3i02,SQtA3i03,SQtA3i04,SQtA4i01,SQtA4i02,SQtA4i03,SQtA4i04,ISCO_M,ISCO_F,ISEI_M,ISEI_F,HISEI,MPARED,FPARED,PARED,HOMEPOS,I01_ST_M_S39A,I01_ST_M_S39B,I01_ST_M_S40A,I01_ST_M_S40B,I01_ST_M_S44A,I01_ST_M_S44B,I01_ST_M_S59A,I01_ST_M_S63A,I01_ST_M_S63B,I02_ST_M_S37A,I02_ST_M_S37B,I02_ST_M_S38A,I02_ST_M_S41A,I03_ST_A_S03A,I03_ST_A_S04A,I03_ST_A_S04B,I03_ST_A_S25A,I03_ST_A_S25B,I03_ST_A_S26A,I03_ST_A_S26B,I03_ST_A_S27B,I03_ST_A_S28A,I03_ST_A_S29A,I03_ST_A_S30A,I03_ST_A_S31A,I03_ST_A_S45A,I04_ST_M_S64A,I04_ST_M_S64B,I05_ST_A_S23A,I05_ST_A_S24A,I05_ST_M_S62A,I06_ST_M_S45A,I06_ST_M_S46A,I08_ST_A_S01A,I08_ST_A_S02A,I08_ST_A_S15A,I08_ST_A_S19B,I08_ST_M_S64A,I08_ST_M_S64B,I09_IN_M_S49A,I09_IN_M_S50A,I09_IN_M_S51A,I09_IN_M_S57A,I09_ST_M_S33B,I09_ST_M_S48A,I09_ST_M_S52B,I14_IN_A_S42A,I14_ST_A_S06A,I14_ST_A_S06B,I14_ST_A_S06C,I14_ST_M_S47A,PV1_LIST,PV2_LIST,PV3_LIST,PV4_LIST,PV5_LIST,PV1_READ,PV2_READ,PV3_READ,PV4_READ,PV5_READ,PV1_WRIT1,PV2_WRIT1,PV3_WRIT1,PV4_WRIT1,PV5_WRIT
1,PV1_WRIT2,PV2_WRIT2,PV3_WRIT2,PV4_WRIT2,PV5_WRIT2,PV1_WRIT_C,PV2_WRIT_C,PV3_WRIT_C,PV4_WRIT_C,PV5_WRIT_C,PL1_READ,PL2_READ,PL3_READ,PL4_READ,PL5_READ,PL1_LIST,PL2_LIST,PL3_LIST,PL4_LIST,PL5_LIST,PL1_WRIT1,PL2_WRIT1,PL3_WRIT1,PL4_WRIT1,PL5_WRIT1,PL1_WRIT2,PL2_WRIT2,PL3_WRIT2,PL4_WRIT2,PL5_WRIT2,PL1_WRIT_C,PL2_WRIT_C,PL3_WRIT_C,PL4_WRIT_C,PL5_WRIT_C,FSW_LIST,FSW_LIST_TR,FSW_READ,FSW_READ_TR,FSW_WRIT,FSW_WRIT_TR,FSW_QUES,FSW_QUES_TR,RPW1,RPW2,RPW3,RPW4,RPW5,RPW6,RPW7,RPW8,RPW9,RPW10,RPW11,RPW12,RPW13,RPW14,RPW15,RPW16,RPW17,RPW18,RPW19,RPW20,RPW21,RPW22,RPW23,RPW24,RPW25,RPW26,RPW27,RPW28,RPW29,RPW30,RPW31,RPW32,RPW33,RPW34,RPW35,RPW36,RPW37,RPW38,RPW39,RPW40,RPW41,JKzone,JKrep,version_stu
0,PT,SC00102205,MS,100000852,EN,1,PT,9,CB,1,2/24/1996,1,1,,,,,0,0,0,,0,,0,0,9,9,0,0,0,3,3,0,0,0,6,1,1,1,1,1,0,1.0,1,1,1.0,1,1,1,1,0,1,2,3,3,3,2,2,1,1,1,1,1,1,1,1,1,1.0,1.0,3.0,3.0,4.0,4.0,1,,,,,0,0,0,,0,,0,0,1,,,,,0,0,0,,0,,0,0,0.0,1,2,0,0,0,0,0,1.0,0,3,0,1,0,0,3,4,1,3,1,3,4,1,1,4,2,2,2,2,3,3,2,2,3,3,2,2,2,3,2,1,1,2,1,2,1,,3,3,2,1,2,3,3,1,,2,2,2,2,0,1,1,0,,2,1,1,0,0,,,,,,,0,0,,,,,,1.0,1,1,1,1,0,0,0,0,0,,,,,,1.0,1,1,1,1,0,0,0,0,0,0,23.0,25,90.0,90,2.0,2,3.0,3,24.0,24,0,0,0,1,0,0,0,0,0,1,0,0,0,0,1,1,1,2,1,2,1,4,2,2,2,3,3,1,1,0,3,0,4,2,3,2,2,3,2,3,3,2,2,3,2,2,4,3,3,2,3,2,2,3,2,2,2,2,1,2,2,1,1,1,1,2,1,2,1,2,3,3,3,3,3.0,3,2,2,1,3,2,3,3,2,2,3,2,2,2,2,2,2,2.0,2,4,1,1,0,0,0,0,0,0,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,4144,3118,39,51.0,51.0,12.0,12.0,12.0,0.685,5,5,5,5,3,5,1.0,1,1,0,2,5,0,1,1,0,1,0,1,0,0.0,1.5,1,1.19,2.42,0.5,0,0,9,2.72,2,0.0,0.14,1,15,0,0.1634351045,0,0,3.0,2.31,1.87,1.749,1.98309424665,1.63,3.44572086,25,2,1,1,0,,,,,,2.91380626498,2.03677279366,2.33531965797,2.30667417994,2.746295772,-0.02376953755,-0.3889235603,-0.7126770257,0.86042431243,0.45129878992,0.3183184342,1.89961226655,-0.2251897454474,0.6434822729,0.30906504384,0.25016975618,0.7808953443,-0.51195754065,1.0421322256,0.58370280421,B2,B2,B2,B2,B2,,,,,,A2,A2,A2,B1,B1,A2,B1,A2,B1,A2,A2,B1,A2,B1,B1,68.052632,68.052632,0.0,0.0,86.2,86.2,92.357143,92.357143,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,13.0,2.0,2012-20-07
1,EL,SC00070798,MS,100005616,EN,1,EL,9,PB,1,7/10/1996,4,1,,,,,0,0,0,0.0,0,0.0,0,0,3,9,0,0,0,3,3,0,0,0,6,1,1,1,1,1,1,1.0,1,1,1.0,1,0,1,1,1,0,4,3,3,2,2,2,1,1,1,1,0,1,1,1,1,,,,,,,1,,,,,0,0,0,0.0,0,0.0,0,0,1,,,,,0,0,0,0.0,0,0.0,0,0,0.0,1,3,1,0,1,0,0,1.0,1,3,0,0,0,0,4,4,4,4,0,4,4,0,3,4,3,2,3,3,3,2,3,3,3,3,3,3,0,3,0,2,1,1,2,3,2,2.0,3,3,2,1,0,3,3,1,3.0,3,1,1,1,1,1,3,1,1.0,3,1,1,0,1,1.0,1.0,1.0,0.0,0.0,,0,1,,,,,,1.0,1,1,1,1,1,1,0,0,0,,,,,,1.0,1,1,1,1,1,1,0,0,0,0,10.0,10,45.0,45,2.0,2,7.0,7,35.0,35,0,0,0,0,0,0,1,0,0,0,0,3,0,2,0,0,0,0,0,0,0,4,3,3,0,2,1,3,2,1,4,4,3,2,4,3,3,3,3,3,3,3,3,3,2,1,1,4,3,3,3,2,2,3,2,3,3,3,3,3,0,0,3,2,1,2,2,1,1,3,2,3,4,4,2.0,2,2,1,2,2,2,2,3,3,1,1,1,2,3,3,2,4,4.0,3,2,1,3,1,1,1,1,1,1,1,1,0,1,1,1,0,1,1,1,0,1,0,0,1,1,4121,8322,51,30.0,51.0,12.0,12.0,12.0,0.739,7,3,7,3,2,5,1.0,1,3,1,5,5,0,4,1,0,1,0,1,0,0.0,2.0,4,1.19,2.97,0.0,1,1,8,,3,0.0,0.56,1,14,0,0.2262391595,0,0,3.5,1.65,2.64,1.749,2.43163118135,0.0,4.25356499,10,2,1,1,2,,,,,,0.27686016243,-0.59128171772,0.31468377116,1.13327571209,-0.418990327,0.4873374268,0.6495980895,0.32806587715,-0.23476758103,0.28163965598,-0.13277967931,1.03188531309,-0.2175094586431,0.2222478701,0.18145288888,0.3884332077,1.0862649297,0.23269678131,0.05194335408,0.39913117397,A1,A1,A1,B1,A1,,,,,,B1,B1,B1,A2,B1,A2,B1,A2,A2,A2,A2,B1,A2,A2,A2,75.959069,75.959069,0.0,0.0,102.456419,102.456419,118.158552,118.158552,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,10.0,2.0,2012-20-07
2,EL,SC00071784,MS,100014232,FR,2,EL,9,PB,0,4/11/1996,0,1,,,,,0,0,0,0.0,0,0.0,0,1,3,9,0,3,2,4,0,0,0,0,6,1,1,1,1,1,0,1.0,1,1,0.0,1,1,1,0,0,1,3,3,2,2,3,2,1,1,1,0,0,1,0,1,1,3.0,3.0,4.0,4.0,4.0,4.0,1,,,,,1,0,0,0.0,0,0.0,0,0,1,,,,,1,0,0,0.0,0,0.0,0,0,0.0,1,2,1,1,0,0,0,1.0,1,4,1,2,0,1,3,4,0,3,0,1,1,1,0,2,3,2,3,3,3,1,3,3,3,3,2,3,1,2,0,1,3,2,3,1,2,0.0,1,2,2,2,1,3,3,1,1.0,1,1,2,2,1,3,1,0,0.0,1,1,1,0,1,0.0,0.0,0.0,0.0,0.0,,0,1,,,,,,1.0,1,1,1,1,1,1,0,0,0,,,,,,1.0,1,1,1,1,0,0,0,0,0,0,20.0,20,45.0,45,2.0,2,7.0,7,35.0,35,0,2,1,3,0,2,0,1,2,0,0,1,1,0,1,0,2,2,0,0,0,4,2,0,0,2,1,1,2,3,3,1,4,1,4,3,3,2,2,2,2,2,4,3,1,2,4,2,2,3,2,2,2,0,0,0,2,3,1,0,2,0,2,0,0,1,0,2,3,4,4,4,2,2,4.0,2,1,1,2,1,2,2,0,2,2,2,2,2,0,0,0,0,3.0,0,3,1,4,1,0,0,0,0,0,1,1,1,0,1,1,0,0,1,1,1,0,1,1,1,0,9501,6131,0,23.0,23.0,11.5,17.0,17.0,0.613,7,3,5,5,2,5,1.0,1,4,1,2,5,0,0,2,0,2,1,2,1,0.0,1.5,4,1.87,1.32,2.0,1,0,6,3.74,1,1.0,0.7,0,14,0,0.08314165774,0,0,3.0,0.66,2.2,1.166,1.72758685514,0.978,2.71078911,20,2,1,1,0,,,,,,-0.00509599795,-0.02523455693,-0.18920972925,0.35722124275,-0.08715048432,-6.14682481161,-4.054852226,-6.99740003527,-3.78404292208,-5.56430119516,-5.96154633398,-5.87409150936,-7.0664089500784,-5.9733223969,-5.09120546803,-6.85769707474,-5.49944354,-7.94460161552,-5.37889140762,-6.0564881401,A1,A1,A1,A1,A1,,,,,,-A1,-A1,-A1,-A1,-A1,-A1,-A1,-A1,-A1,-A1,-A1,-A1,-A1,-A1,-A1,25.181902,25.181902,0.0,0.0,37.22542,37.22542,38.917485,38.917485,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,27.0,2.0,2012-20-07
3,BE nl,SC00066783,MS,100014947,EN,2,BV,10,CB,0,7/15/1994,0,1,0.0,0.0,,,0,0,0,0.0,0,,0,0,4,10,5,1,0,4,4,0,0,0,6,1,1,1,1,1,1,1.0,1,0,0.0,0,1,1,0,0,0,1,3,2,2,2,1,1,1,1,1,1,1,0,1,1,1.0,1.0,1.0,2.0,3.0,3.0,1,0.0,0.0,,,0,0,0,0.0,0,,0,0,1,0.0,0.0,,,0,0,0,0.0,0,,0,0,0.0,1,0,0,0,0,0,0,0.0,0,0,0,0,0,0,0,4,0,3,0,3,0,0,0,0,2,1,1,1,3,2,3,2,2,3,2,3,2,3,1,1,2,3,2,1,0,3.0,2,3,2,2,2,3,3,1,3.0,2,2,0,0,1,1,2,0,1.0,1,1,0,0,0,0.0,0.0,0.0,,,,0,0,1.0,1.0,,,,,1,1,1,1,1,0,0,0,0,1.0,1.0,,,,,0,0,0,0,0,0,0,0,0,1,8.0,10,50.0,50,1.0,1,0.0,1,35.0,35,0,1,0,0,0,0,1,0,0,0,0,1,1,0,3,2,2,2,2,2,2,3,2,2,1,2,3,0,0,0,0,0,0,0,2,0,0,0,0,0,0,0,2,2,2,2,4,2,3,3,3,3,3,1,3,2,2,3,1,2,2,1,2,2,2,2,2,2,2,3,3,3,3,3,,3,2,1,2,3,3,3,3,3,3,3,0,0,0,0,0,0,0.0,0,0,1,2,0,0,1,1,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,5133,8324,25,34.0,34.0,12.0,12.0,12.0,0.16,7,4,2,9,1,1,1.0,1,2,0,2,7,1,0,1,0,1,0,1,0,0.0,0.5,0,0.0,1.1,0.0,1,1,8,1.87,0,0.25,0.42,0,16,0,-0.8878089209,0,0,2.5,1.65,0.55,2.332,1.40544433445,2.934,2.64711589,10,3,3,3,0,0.19524929944,0.29637822608,-0.15864132185,0.21874920506,0.350384288948,,,,,,-0.62202257193,-0.9637936662,-2.33729371964,-1.65962501803,-1.91985150722,-4.69059367001,-3.34803941896,-3.3948215254531,-3.166216676,-4.27662647318,-2.68013049724,-2.2521304306,-3.25319648028,-2.65652076132,-3.3969582205,,,,,,A1,A1,A1,A1,A1,A2,A2,A1,A1,A1,-A1,A1,A1,A1,A1,A1,A1,A1,A1,A1,26.609372,26.609372,39.914058,39.914058,39.914058,39.914058,0.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,14.0,2.0,2012-20-07
4,ES,SC00103699,EXTRA,100025313,FR,2,ES,10,PB,0,8/11/1995,1,1,0.0,0.0,0.0,0.0,0,0,0,0.0,0,0.0,0,0,4,10,0,1,0,3,6,6,6,6,4,1,1,1,1,1,0,1.0,1,1,1.0,1,0,1,0,0,1,2,2,3,2,1,1,1,1,0,1,0,0,1,1,1,2.0,1.0,4.0,1.0,4.0,4.0,1,0.0,0.0,0.0,0.0,0,0,0,0.0,0,0.0,0,0,1,0.0,0.0,0.0,0.0,0,0,0,0.0,0,0.0,0,0,0.0,0,0,1,0,1,1,0,0.0,1,1,1,1,0,0,0,3,0,1,0,0,1,0,1,1,1,1,1,2,2,2,2,2,2,2,2,1,2,1,0,0,3,1,2,1,2,0.0,3,2,2,3,0,3,2,3,1.0,3,2,2,3,1,3,3,2,1.0,2,1,1,0,0,0.0,0.0,0.0,,,,0,0,,,,,1.0,1.0,1,1,1,1,1,1,1,0,0,,,,,1.0,1.0,1,1,0,0,0,0,0,0,0,1,20.0,20,60.0,60,3.0,3,6.0,6,30.0,30,0,0,0,1,0,3,0,0,0,0,0,1,1,2,1,2,3,3,1,2,1,3,2,2,1,3,3,2,2,0,0,0,4,1,3,3,3,1,3,3,1,3,0,4,1,2,4,2,3,3,2,1,1,3,2,1,2,3,1,2,2,2,1,1,1,2,1,1,2,4,4,4,4,4,4.0,4,3,3,3,2,1,1,2,3,2,3,2,1,1,0,0,0,0.0,1,1,2,3,0,0,0,0,0,0,1,1,1,1,1,1,0,0,1,1,1,0,1,1,1,0,9131,8286,16,30.0,30.0,12.0,5.0,12.0,0.309,9,2,4,7,3,6,3.0,2,3,0,2,5,1,1,1,0,1,0,1,0,0.0,0.0,4,0.51,0.77,0.5,0,0,6,2.72,1,0.75,0.28,0,15,2,-0.81819837915,0,0,2.5,1.98,1.65,1.749,0.79454985711,2.282,3.2375537,20,2,1,1,2,0.28482102892,0.53587085007,0.32951165283,-0.17647625858,0.837402614448,,,,,,-1.28111128435,-1.4504948008,-1.69959548035,-2.63163831766,-1.32337526188,-3.55215397099,-1.94420964939,-1.2470718072466,-1.7731911386,-3.71878317707,-2.5955480471,-1.8980108648,-1.70596854096,-2.55469298319,-2.70541993053,,,,,,A1,A1,A1,-A1,A2,A1,A1,A1,A1,A1,A1,A1,A1,A1,A1,A1,A1,A1,A1,A1,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,2012-20-07
5,BE fr,SC00082564,MS,100059752,DE,2,BF,10,CB,1,4/16/1995,1,1,0.0,0.0,,0.0,0,0,0,0.0,0,1.0,0,0,4,10,3,0,0,0,0,8,0,8,5,1,1,1,1,1,0,1.0,1,1,0.0,1,1,1,1,1,1,4,3,2,3,2,3,1,1,1,1,1,1,1,1,1,4.0,4.0,3.0,4.0,4.0,4.0,1,0.0,0.0,,0.0,0,0,0,0.0,0,1.0,0,0,1,0.0,0.0,,0.0,0,0,0,0.0,0,1.0,0,0,10.0,1,2,0,1,0,0,0,,0,1,3,1,0,0,1,0,0,1,0,0,0,0,0,1,1,3,2,3,1,1,3,3,3,2,2,0,0,0,0,0,2,2,3,3,3,0.0,3,1,1,3,1,3,3,3,0.0,2,1,1,2,2,1,3,3,0.0,0,0,1,0,0,0.0,0.0,0.0,0.0,,,0,1,1.0,1.0,,,,,1,1,1,1,1,0,0,0,0,1.0,1.0,,,,,0,0,0,0,0,0,0,0,0,1,16.0,20,50.0,50,4.0,4,8.0,8,33.0,33,0,2,2,3,0,3,0,0,0,0,0,3,3,2,2,2,1,3,3,2,1,4,4,3,3,4,3,2,2,0,0,0,4,4,4,2,1,0,3,3,0,2,2,3,1,2,4,2,3,3,3,3,3,1,3,3,3,3,1,3,2,2,2,2,2,2,1,3,3,4,4,4,4,4,4.0,4,3,3,2,2,2,2,2,2,2,2,3,2,2,0,1,2,0.0,2,3,3,3,0,0,0,0,0,0,1,1,1,0,1,1,0,0,1,1,0,0,1,1,0,0,3231,2221,38,,,17.0,17.0,17.0,0.937,7,4,2,9,3,7,2.0,3,3,0,2,5,1,1,2,0,2,0,2,0,0.0,1.5,1,1.02,0.22,2.5,0,0,9,3.91,2,1.25,0.84,1,15,0,2.1448005763,0,0,4.0,3.3,2.09,2.332,1.93737793771,2.608,3.80173519,20,3,1,1,2,1.55480758272,1.84771749057,2.4259021146,1.21141766288,1.338381578685,,,,,,0.07208990997,-0.7888181624,-1.5393482422,-1.13374732059,0.87674244764,-0.72853962949,1.4823606203,-0.8423769631195,-0.4122801237,2.12359612955,-0.34021010414,0.2155973332,-1.42594383872,-0.9519414287,1.59958438286,,,,,,B1,B1,B2,A2,B1,A2,A1,A1,A1,B1,A2,B1,A1,A2,B1,A2,A2,A1,A1,B1,1.094906,1.094906,1.692128,1.692128,1.551117,1.551117,0.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,1.0,36.0,2.0,2012-20-07
6,ES,SC00106150,MS,100061524,EN,1,EC,10,PB,1,6/1/1995,2,1,1.0,0.0,0.0,0.0,0,0,0,,0,0.0,0,0,4,10,0,0,3,1,0,0,0,0,6,1,1,1,1,1,0,1.0,1,0,0.0,1,1,1,1,0,1,2,3,2,2,1,1,1,1,1,1,1,1,1,1,1,2.0,2.0,3.0,4.0,4.0,4.0,1,1.0,0.0,0.0,0.0,0,0,0,,0,0.0,0,0,1,1.0,0.0,0.0,0.0,0,0,0,,0,0.0,0,0,0.0,1,0,1,0,1,0,0,1.0,1,2,1,1,0,2,4,4,0,1,1,0,3,0,0,3,1,1,1,2,1,1,3,2,3,3,2,2,0,3,1,0,0,0,1,2,0,2.0,3,3,2,3,2,3,3,3,2.0,3,1,0,1,1,2,1,0,1.0,1,1,0,0,0,0.0,0.0,0.0,,,,0,0,,,,,1.0,1.0,1,1,1,1,1,1,1,1,0,,,,,1.0,1.0,1,1,1,1,1,1,1,1,0,0,27.0,30,60.0,60,3.0,3,0.0,3,30.0,30,3,0,0,3,0,3,2,0,2,0,0,0,0,0,2,1,1,1,1,1,2,2,1,0,0,0,3,1,1,1,1,0,4,4,3,1,1,2,2,2,1,2,0,4,0,1,4,2,1,3,2,2,1,1,0,0,0,0,2,1,1,1,1,1,1,0,0,0,0,2,3,2,4,3,2.0,4,2,1,2,3,2,2,2,2,2,3,3,3,1,1,1,0,1.0,2,2,3,0,0,0,0,0,0,0,1,1,1,0,1,1,0,0,1,1,0,0,1,1,1,0,3231,5161,38,42.0,42.0,13.0,16.5,16.5,0.293,10,1,10,1,3,3,1.0,3,0,0,1,5,0,2,2,0,2,0,2,0,0.0,0.5,4,1.7,1.32,1.5,0,0,9,3.23,2,1.5,0.56,1,15,0,0.13053757117,0,0,1.5,0.0,1.98,0.583,1.73251149294,1.63,1.74991681,30,2,1,1,0,,,,,,-1.26123973293,-0.95157078762,-1.13872792949,-1.39243593233,-0.78217250625,-1.41264683399,-0.4960035437,1.01150318611,-0.68242904384,-1.18220420763,-1.63032896911,-1.48819488169,-3.4411054230493,-2.3849066203,-3.66548426211,-1.71277466118,-0.9892291387,-0.89262173776,-1.5702850757,-2.56632028356,-A1,-A1,-A1,-A1,-A1,,,,,,A1,A2,B1,A2,A1,A1,A1,A1,A1,A1,A1,A2,A2,A1,A1,141.373485,141.373485,0.0,0.0,222.158333,222.158333,207.347778,207.347778,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,5.0,1.0,2012-20-07
7,BE nl,SC00066554,MS,100067470,FR,1,BV,8,CB,1,10/1/1997,1,1,,0.0,,,0,0,0,0.0,0,0.0,0,0,2,8,0,1,0,5,5,0,0,0,6,1,1,1,1,1,1,,1,0,,0,0,0,0,1,0,5,3,3,3,2,2,1,1,1,1,1,1,1,1,1,4.0,3.0,3.0,4.0,4.0,4.0,1,,0.0,,,0,0,0,0.0,0,0.0,0,0,1,,0.0,,,0,0,0,0.0,0,1.0,0,0,0.0,2,1,0,0,0,0,0,0.0,1,0,0,0,0,1,0,1,2,2,1,1,0,0,1,1,1,1,1,1,0,0,0,0,0,0,0,0,0,0,1,1,2,1,1,1,1,2.0,3,3,1,1,1,2,2,0,3.0,3,1,1,2,1,1,2,0,2.0,2,1,1,0,0,0.0,0.0,0.0,,,,0,0,,,,,,,1,1,1,1,0,0,0,0,0,,,,,,,1,1,1,1,0,0,0,0,0,0,18.0,20,50.0,50,3.0,3,5.0,5,34.0,34,3,1,3,3,2,1,1,0,0,0,0,0,1,0,2,2,3,2,3,2,3,4,2,2,1,3,2,0,1,0,0,0,2,0,2,1,1,2,1,2,1,1,1,3,1,2,3,2,2,2,2,2,2,1,2,2,2,1,2,1,2,2,2,2,2,2,2,2,2,3,3,3,3,3,3.0,3,3,3,2,2,2,2,2,2,2,2,2,2,2,2,2,2,,2,2,3,3,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,9504,8270,94,29.0,29.0,9.0,9.0,9.0,0.41,4,5,4,5,3,4,2.0,3,3,0,2,7,0,1,1,0,1,0,2,0,0.0,1.5,1,0.17,0.99,3.0,1,1,9,3.74,2,1.75,0.28,1,13,0,-1.16418418343,0,0,3.0,1.98,0.77,2.332,-0.09403351311,3.26,2.47663867,20,2,1,1,0,,,,,,0.37261667299,0.22211633552,0.98177082809,0.74792220629,0.49282391241,-3.51208361077,-3.1904069348,-3.15068375213,-2.19509940195,-2.99541946774,-7.97111216684,-8.88334498202,-9.3517830394747,-6.2775552824,-7.10133500644,-6.20689609002,-6.4608815536,-6.6701398531,-4.53256849287,-5.44735291481,A1,A1,A2,A1,A1,,,,,,A1,A1,A1,A1,A1,-A1,-A1,-A1,-A1,-A1,-A1,-A1,-A1,-A1,-A1,42.067441,42.067441,0.0,0.0,60.471946,60.471946,56.914773,56.914773,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,9.0,2.0,2012-20-07
8,EL,SC00071831,MS,100070049,FR,2,EL,9,PB,0,1/3/1996,2,1,,,,,0,0,0,0.0,0,0.0,0,0,3,9,0,0,0,0,0,0,0,0,6,1,1,1,1,1,1,1.0,1,1,1.0,1,1,1,0,0,0,5,3,2,2,1,2,1,0,1,1,1,1,1,1,1,4.0,2.0,2.0,0.0,4.0,4.0,1,,,,,0,0,0,0.0,0,0.0,0,0,1,,,,,1,0,0,0.0,0,0.0,0,0,,1,1,0,1,1,1,0,1.0,0,1,0,3,0,2,3,4,0,4,0,4,3,4,2,3,2,2,3,2,3,2,3,3,3,2,3,2,2,3,3,3,0,3,1,1,0,2.0,2,3,3,2,3,3,3,0,3.0,3,2,2,3,3,3,1,2,0.0,0,1,1,0,1,0.0,0.0,0.0,0.0,0.0,,0,1,,,,,,1.0,0,0,0,0,0,0,0,0,0,,,,,,0.0,0,0,0,0,0,0,1,0,0,1,26.0,30,45.0,45,2.0,2,9.0,9,35.0,35,0,0,0,3,0,0,0,0,1,0,0,0,0,0,2,3,3,2,2,3,2,4,4,3,1,0,0,0,2,0,0,1,3,3,2,1,1,0,1,1,0,2,1,4,2,3,3,3,3,3,3,2,1,3,1,0,3,0,0,1,3,2,3,3,3,3,3,3,3,3,4,3,3,4,4.0,4,2,3,2,3,3,3,3,3,3,3,2,2,3,3,4,2,4.0,4,4,3,4,1,0,0,0,0,0,1,1,1,1,1,1,0,0,1,1,1,0,1,1,1,1,2332,4100,43,45.0,45.0,17.0,17.0,17.0,0.673,1,9,1,2,2,7,2.0,3,4,1,2,5,1,2,1,0,1,0,2,1,,1.0,4,1.53,2.64,1.5,1,0,8,2.72,3,0.0,0.14,0,15,0,0.72317564797,0,0,4.0,1.32,1.21,3.498,1.36363104688,3.26,2.10704306,30,2,1,1,0,,,,,,0.34431860816,1.04669986135,-0.22651717206,-0.13479773608,0.74574779663,-1.03945608952,-1.748868386,-3.25273126668,-4.64187069578,-0.51088276741,-2.71020010039,-3.30386415215,-2.7427392983085,-8.0791312561,-1.97340581352,-2.02272300996,-2.7653261715,-3.42974125283,-6.97082633165,-1.32218793876,A1,A2,A1,A1,A1,,,,,,A1,A1,A1,-A1,A2,A1,A1,A1,-A1,A1,A1,A1,A1,-A1,A1,25.711251,25.711251,0.0,0.0,37.398183,37.398183,37.398183,37.398183,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,27.0,1.0,2012-20-07
9,SI,SC00075273,MS,100075038,DE,2,SI,9,PB,0,3/17/1996,3,1,0.0,0.0,0.0,,0,0,0,,0,,0,0,9,9,0,3,0,3,2,0,1,0,6,1,1,0,1,1,0,1.0,0,0,0.0,1,0,1,1,0,0,1,3,1,3,1,1,1,1,1,1,0,1,0,1,1,1.0,0.0,2.0,2.0,4.0,4.0,1,0.0,0.0,0.0,,0,1,0,,0,,0,0,1,0.0,0.0,0.0,,0,0,0,,0,,0,0,0.0,1,1,0,1,0,0,0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,2,3,1,2,2,0,2,2,2,1,0,1,1,1,0,0,1,3,1,0,2,0.0,1,0,0,1,3,1,2,2,2.0,0,0,0,1,0,1,1,0,0.0,0,1,1,0,0,0.0,,,,,,0,0,,,,,,1.0,1,1,1,1,1,1,1,1,0,,,,,,1.0,1,1,0,0,0,0,0,0,0,1,16.0,20,45.0,45,2.0,2,3.0,3,26.0,26,0,0,1,3,0,0,0,0,0,0,0,3,0,2,3,1,3,1,2,1,2,2,1,2,1,2,3,0,0,0,0,0,4,0,4,1,1,1,2,3,1,2,1,4,1,1,4,2,1,2,1,1,1,2,0,0,1,0,0,0,2,2,1,1,1,1,2,2,1,3,3,2,3,4,4.0,4,3,4,2,2,2,2,1,2,1,1,0,0,0,0,0,0,0.0,0,0,1,1,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,1,0,0,1,1,0,0,4115,2132,53,71.0,71.0,12.0,12.0,12.0,-0.228,9,1,3,7,2,2,2.5,1,1,0,2,5,1,3,1,0,2,0,1,0,0.0,1.0,1,0.0,0.0,2.0,0,0,7,2.21,0,0.0,0.42,0,14,0,-0.38000799759,0,0,1.5,1.65,1.21,1.749,0.64870061884,2.282,1.49102288,20,2,1,1,2,-1.61252416558,-0.28036578552,-0.79913591727,-0.09981358421,-0.361331283172,,,,,,-2.29891924005,-1.2014523217,-2.13630707855,-4.20234005728,-1.16277341397,-0.03335334258,-1.33464606783,-0.5799493385136,-1.5938047885,-1.04873617015,-1.50637654001,-1.4563503385,-1.67585557246,-3.50183228247,-1.28870102397,,,,,,-A1,-A1,-A1,-A1,-A1,A1,A1,A1,-A1,A1,A2,A1,A2,A1,A1,A1,A1,A1,A1,A1,5.613232,5.613232,8.419848,8.419848,8.419848,8.419848,0.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,11.0,2.0,2012-20-07


### Count null values

In [10]:
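# flag each column that contains a null, then count the flags
# (this returns the number of columns, not the number of null values)
lang.isnull().any().count()

499

NOTE: The result above (499) is the number of columns in the df, not a count of null values: `.any()` produces one True/False flag per column and `.count()` counts those flags. It therefore says nothing yet about how much data is missing, so I will evaluate the null values for each feature during the EDA before deciding how to handle them.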
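
To measure missingness directly, the per-column null flags can be summed rather than counted. The following is a minimal sketch on a toy frame standing in for `lang` (the toy columns and values are made up for illustration). It also counts empty strings separately, since `isnull()` does not treat them as missing.

```python
import numpy as np
import pandas as pd

# toy stand-in for `lang`; columns and values are illustrative only
toy = pd.DataFrame({
    "a": [1.0, np.nan, 3.0],
    "b": ["x", "", None],
    "c": [1, 2, 3],
})

cols_with_nulls = int(toy.isnull().any().sum())  # columns with at least one null
total_nulls = int(toy.isnull().sum().sum())      # null cells across the whole frame
nulls_per_column = toy.isnull().sum()            # null count for each column
empty_strings = int((toy == "").sum().sum())     # empty strings are not nulls
```

On the real data, `nulls_per_column.sort_values(ascending=False).head()` would list the most affected features first.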

### Check data types

In [11]:
# check data types
lang.dtypes.head()

country_id           object
school_id            object
main_study_sample    object
respondent_id         int64
targetLanguage_id    object
dtype: object

NOTE: Most of the data throughout the df are of type object, which may not be the most suitable for analysis. One of the first data cleaning tasks will be to set the data type appropriately according to the way I will be using the data.
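
To see how widespread the object columns are before cleaning, the dtypes can be tallied across the whole df instead of inspecting only the first five. A minimal sketch on a toy frame standing in for `lang` (three columns only, with values taken from the sample rows):

```python
import pandas as pd

# toy stand-in for `lang`; only three of the 499 columns are mimicked here
toy = pd.DataFrame({
    "country_id": pd.Series(["EL", "SI"], dtype=object),
    "TESTING_grade": pd.Series([" 9", " 9"], dtype=object),
    "respondent_id": [100070049, 100075038],
})

# number of columns per dtype
dtype_counts = toy.dtypes.astype(str).value_counts()

# names of the columns that are still stored as object
object_columns = toy.select_dtypes(include="object").columns.tolist()
```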

### Remove whitespace throughout df

In [12]:
# a high number of numeric fields have been imported as type 'object'
# check an example for whitespace to see if that is contributing
lang['TESTING_grade'][0]

' 9'

In [13]:
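# remove leading/trailing whitespace from string values throughout df
# (non-string values such as NaN are left untouched)
for column in lang.select_dtypes(include='object').columns:
    lang[column] = lang[column].map(lambda x: x.strip() if isinstance(x, str) else x)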

In [14]:
# check to see that whitespace has been removed
lang['TESTING_grade'][0]

'9'

### Convert object columns to numeric where possible

In [15]:
# check an object column that should be numeric for empty strings
numeric_mask = lang['TESTING_grade'] == ''
no_test_grade = lang[numeric_mask]

In [16]:
# review rows with no test grade
no_test_grade.shape

(573, 499)

DECISION: After reviewing the 573 rows in this subset, I have decided to drop these rows on the basis of the following:
- the data in these rows contains only test results and none of the supporting feature information that I would use as predictors
- the data set overall is large enough that I should have sufficient data without these rows to be able to build a robust model
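
The first point in the decision above can be checked programmatically rather than by eye. The sketch below is an assumption about how that might be done: it uses a toy frame, and `predictor_cols` is a hypothetical list of feature columns, not the project's actual selection. It computes, for each row without a test grade, the share of predictor cells that are empty.

```python
import pandas as pd

# toy stand-in for `lang`; feature_1/feature_2/test_score are hypothetical names
toy = pd.DataFrame({
    "TESTING_grade": ["9", "", "10", ""],
    "feature_1": ["1", "", "0", ""],
    "feature_2": ["3", "", "2", "4"],
    "test_score": ["A1", "A2", "B1", "A1"],
})

predictor_cols = ["feature_1", "feature_2"]  # hypothetical predictor list

# rows that would be dropped
subset = toy[toy["TESTING_grade"] == ""]

# share of empty predictor cells in each of those rows (1.0 = nothing usable)
empty_share = (subset[predictor_cols] == "").mean(axis=1)
fully_empty_rows = int((empty_share == 1.0).sum())
```

If `fully_empty_rows` equals the number of rows in the subset, dropping them loses no predictor information.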

In [17]:
# drop rows with empty strings in the 'TESTING_grade' column
lang = lang[lang['TESTING_grade'] != '']

In [18]:
# confirm that no empty strings remain in this field
# note: the int64 dtype shown below belongs to the counts; the field itself is still type object
# I will check the impact on the other empty columns as part of the EDA
lang['TESTING_grade'].value_counts()

9     23226
10    17372
8      6148
11     5115
Name: TESTING_grade, dtype: int64

In [19]:
# resetting index to account for the dropped rows
lang.reset_index(drop=True, inplace=True)

In [20]:
#lang.apply(pd.to_numeric, errors='ignore')

NOTE: I will review all data types and set accordingly as part of the EDA stage of the project.
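
The commented-out line above would not change `lang` even if run, because the result of `apply` is never assigned. As far as I am aware, `errors='ignore'` is also deprecated in recent pandas releases. One possible alternative, sketched under those assumptions, converts an object column only when every non-empty value parses as a number. The helper name `to_numeric_where_possible` and the toy columns are hypothetical.

```python
import pandas as pd

def to_numeric_where_possible(df):
    """Return a copy where object columns become numeric only if
    every non-empty value in the column parses as a number."""
    out = df.copy()
    for col in out.columns:
        if pd.api.types.is_numeric_dtype(out[col]):
            continue  # already numeric
        if pd.api.types.is_datetime64_any_dtype(out[col]):
            continue  # leave dates alone
        raw = out[col].astype(object)
        # strip strings, keep missing values as they are
        stripped = raw.where(raw.isna(), raw.astype(str).str.strip())
        blank = stripped.isna() | (stripped == "")
        converted = pd.to_numeric(stripped.where(~blank), errors="coerce")
        # accept the conversion only if no real value was lost
        if converted[~blank].notna().all():
            out[col] = converted
    return out

# toy stand-in for `lang`
toy = pd.DataFrame({
    "grade": pd.Series(["9", "10", ""], dtype=object),
    "level": pd.Series(["A1", "A2", "B1"], dtype=object),
    "score": pd.Series([" 1.5", "2", None], dtype=object),
})
result = to_numeric_where_possible(toy)
```

Empty strings become NaN in converted columns, while columns such as CEFR levels (`A1`, `A2`) are left as text.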

### Convert date columns to datetime format

In [21]:
# convert column version_stu to datetime format
# (values are stored as year-day-month, e.g. 2012-20-07)
lang['version_stu'] = pd.to_datetime(lang['version_stu'], format='%Y-%d-%m', errors='raise')

In [22]:
# check conversion applied correctly
lang['version_stu'].dtype

dtype('<M8[ns]')

NOTE: In the EDA phase I will convert SQt02i01C to datetime and do some feature engineering from it.
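
A minimal sketch of what that conversion and feature engineering might look like. It assumes that the values such as `3/17/1996` in the sample rows belong to `SQt02i01C` and are in month/day/year order. The reference date is hypothetical and would need to be replaced with the actual testing date.

```python
import pandas as pd

# toy stand-in for `lang`; values mimic the sample rows plus one empty entry
toy = pd.DataFrame({"SQt02i01C": ["1/3/1996", "3/17/1996", ""]})

# unparseable or empty values become NaT instead of raising
dob = pd.to_datetime(toy["SQt02i01C"], format="%m/%d/%Y", errors="coerce")

reference_date = pd.Timestamp("2011-03-01")  # hypothetical testing date

# example engineered features: approximate age in years and month of birth
toy["age_at_reference"] = (reference_date - dob).dt.days / 365.25
toy["birth_month"] = dob.dt.month
```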

### Map parents' professions

In [87]:
# create tuple of fathers' profession keys to zip with profession names
iscom_keys = (1000,1100,1110,1120,1130,1140,1141,1142,1143,1200,1210,1220,\
              1221,1222,1223,1224,1225,1226,1227,1228,1229,1230,1231,1232,\
              1233,1234,1235,1236,1237,1239,1240,1250,1251,1252,1300,1310,\
              1311,1312,1313,1314,1315,1316,1317,1318,1319,2000,2100,2110,\
              2111,2112,2113,2114,2120,2121,2122,2130,2131,2132,2139,2140,\
              2141,2142,2143,2144,2145,2146,2147,2148,2149,2200,2210,2211,\
              2212,2213,2220,2221,2222,2223,2224,2229,2230,2300,2310,2320,\
              2321,2322,2330,2331,2332,2340,2350,2351,2352,2359,2400,2410,\
              2411,2412,2419,2420,2421,2422,2429,2430,2431,2432,2440,2441,\
              2442,2443,2444,2445,2446,2450,2451,2452,2453,2454,2455,2460,\
              3000,3100,3110,3111,3112,3113,3114,3115,3116,3117,3118,3119,\
              3120,3121,3122,3123,3130,3131,3132,3133,3139,3140,3141,3142,\
              3143,3144,3145,3150,3151,3152,3200,3210,3211,3212,3213,3220,\
              3221,3222,3223,3224,3225,3226,3227,3228,3229,3230,3231,3232,\
              3240,3241,3242,3300,3310,3320,3330,3340,3400,3410,3411,3412,\
              3413,3414,3415,3416,3417,3419,3420,3421,3422,3423,3429,3430,\
              3431,3432,3433,3434,3439,3440,3441,3442,3443,3444,3449,3450,\
              3451,3452,3460,3470,3471,3472,3473,3474,3475,3480,4000,4100,\
              4110,4111,4112,4113,4114,4115,4120,4121,4122,4130,4131,4132,\
              4133,4140,4141,4142,4143,4144,4190,4200,4210,4211,4212,4213,\
              4214,4215,4220,4221,4222,4223,5000,5100,5110,5111,5112,5113,\
              5120,5121,5122,5123,5130,5131,5132,5133,5139,5140,5141,5142,\
              5143,5149,5150,5151,5152,5160,5161,5162,5163,5164,5169,5200,\
              5210,5220,5230,6000,6100,6110,6111,6112,6113,6114,6120,6121,\
              6122,6123,6124,6129,6130,6131,6132,6133,6134,6140,6141,6142,\
              6150,6151,6152,6153,6154,6200,6210,7000,7100,7110,7111,7112,\
              7113,7120,7121,7122,7123,7124,7129,7130,7131,7132,7133,7134,\
              7135,7136,7137,7140,7141,7142,7143,7200,7210,7211,7212,7213,\
              7214,7215,7216,7220,7221,7222,7223,7224,7230,7231,7232,7233,\
              7234,7240,7241,7242,7243,7244,7245,7300,7310,7311,7312,7313,\
              7320,7321,7322,7323,7324,7330,7331,7332,7340,7341,7342,7343,\
              7344,7345,7346,7400,7410,7411,7412,7413,7414,7415,7416,7420,\
              7421,7422,7423,7424,7430,7431,7432,7433,7434,7435,7436,7437,\
              7440,7441,7442,7500,7510,7520,7530,8000,8100,8110,8111,8112,\
              8113,8120,8121,8122,8123,8124,8130,8131,8139,8140,8141,8142,\
              8143,8150,8151,8152,8153,8154,8155,8159,8160,8161,8162,8163,\
              8170,8171,8172,8200,8210,8211,8212,8220,8221,8222,8223,8224,\
              8229,8230,8231,8232,8240,8250,8251,8252,8253,8260,8261,8262,\
              8263,8264,8265,8266,8269,8270,8271,8272,8273,8274,8275,8276,\
              8277,8278,8279,8280,8281,8282,8283,8284,8285,8286,8290,8300,\
              8310,8311,8312,8320,8321,8322,8323,8324,8330,8331,8332,8333,\
              8334,8340,8400,9000,9100,9110,9111,9112,9113,9120,9130,9131,\
              9132,9133,9140,9141,9142,9150,9151,9152,9153,9160,9161,9162,\
              9200,9210,9211,9212,9213,9300,9310,9311,9312,9313,9320,9321,\
              9322,9330,9331,9332,9333,9501,9502,9503,9504,9505,9997,9998,\
              9999)
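
Since these keys are meant to be zipped with the descriptions defined in the next cell, it may be worth guarding the pairing. The sketch below uses a hypothetical helper, `build_code_map`, and three-element toy tuples. It checks that both tuples have the same length and collapses runs of whitespace, because a backslash line continuation inside a string literal keeps the next line's indentation as part of the string.

```python
def build_code_map(keys, values):
    """Pair codes with descriptions, failing loudly on a mismatch."""
    if len(keys) != len(values):
        raise ValueError(f"{len(keys)} keys but {len(values)} values")
    if len(set(keys)) != len(keys):
        raise ValueError("duplicate keys found")
    # " ".join(s.split()) collapses any run of whitespace to a single space
    return {key: " ".join(value.split()) for key, value in zip(keys, values)}

# toy tuples; the last value mimics a string continued with a backslash
toy_keys = (1000, 1100, 1110)
toy_values = ("LEGISLATORS, SENIOR OFFICIALS & MANAGERS",
              "LEGISLATORS & SENIOR OFFICIALS",
              "LEGISLATORS [incl. Member of Parliament,\
                Member of Local Council]")

profession_map = build_code_map(toy_keys, toy_values)
```

The resulting dictionary could then be passed to `Series.map` on the relevant profession column.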

In [88]:
# create tuple of fathers' profession descriptions to zip with profession keys
iscom_values = ("LEGISLATORS, SENIOR OFFICIALS & MANAGERS","LEGISLATORS & SENIOR OFFICIALS",\
                "LEGISLATORS [incl. Member of Parliament, Member of Local Council]",\
                "SENIOR [NATIONAL] GOVERNMENT OFFICIALS [incl. Minister, Ambassador]",\
                "[SENIOR LOCAL GOVERNMENT OFFICIALS] [incl. Local Government Senior\
                Officials, Mayor]","SENIOR OFFICIALS SPECIAL-INTEREST ORGANIZATIONS",\
                "Senior officials political-party organizations [incl. Politician]",\
                "Senior officials economic-interest organizations [incl. Union Leader,\
                Director Employers’ Organization]","Senior officials special-interest\
                organizations [incl. Lodge Official, Official Red Cross]",\
                "CORPORATE MANAGERS [LARGE ENTERPRISES]","[LARGE ENTERPRISES] DIRECTORS &\
                CHIEF EXECUTIVES [incl. CEO, Large Business Owner 25+ employees]",\
                "[LARGE ENTERPRISE OPERATION] DEPARTMENT MANAGERS [incl. Manager in\
                establishment with 25+ employees]",\
                "Production department managers agriculture & fishing",\
                "Production department managers manufacturing [incl. Factory Manager nfs]",\
                "Production department managers construction",\
                "Production department managers wholesale & retail trade [incl. Floor Manager]",\
                "Production department managers restaurants & hotels",\
                "Production department managers transportation, storage & communications\
                [incl. Postmaster, Stationmaster]",\
                "Production department managers business services [incl. Banker, Bank Manager]",\
                "Production department managers personal care, cleaning, etc.",\
                "Production department managers nec [incl. Impresario, Film Producer,\
                College Dean, School Principal]",\
                "[LARGE ENTERPRISES] OTHER DEPARTMENT MANAGERS",\
                "Finance & administration department managers [incl. Company Secretary]",\
                "Personnel & industrial relations department managers","Sales & marketing\
                department managers","Advertising & public relations department managers",\
                "Supply & distribution department managers",\
                "Computing services department managers",\
                "Research & development department managers",\
                "Other department managers nec","OFFICE MANAGERS [incl. Clerical Supervisor]",\
                "MILITARY OFFICERS","Higher military officers [Captain and above]",\
                "Lower-grade commissioned officers [incl. Army Lieutenant]",\
                "[SMALL ENTERPRISE] GENERAL MANAGERS",\
                "[SMALL ENTERPRISE] GENERAL MANAGERS [incl. Businessman, Trader, Manager nfs]",\
                "[Small enterprise] General managers agriculture, forestry & fishing [incl.\
                Farm Manager, Self-employed Farmer with perso",\
                "[Small enterprise] General managers manufacturing",\
                "[Small enterprise] General managers construction [incl. Building Contractor]",\
                "[Small enterprise] General managers wholesale & retail trade [incl.\
                Shop Owner/Manager, Retail Owner/Manager, Merchant]",\
                "[Small enterprise] General managers restaurants & hotels [incl. Manager\
                Camping Site, Bar Owner/Manager, Restaurateur]",\
                "[Small enterprise] General managers transp., storage & communications\
                [incl. Owner Small Transport Company]",\
                "[Small enterprise] General managers business services\
                [incl. Manager Insurance Agency]",\
                "[Small enterprise] General managers personal care, cleaning, etc. services\
                [incl. Owner Laundry]","[Small enterprise] General managers nec\
                [incl. Manager Travel Agency, Manager Fitness Center, Garage Owner]",\
                "PROFESSIONALS","PHYSICAL, MATHEMATICAL & ENGINEERING SCIENCE PROFESSIONALS",\
                "PHYSICISTS, CHEMISTS & RELATED PROFESSIONALS","Physicists & astronomers",\
                "Meteorologists","Chemists","Geologists & geophysicists [incl. Geodesist]",\
                "MATHEMATICIANS, STATISTICIANS AND RELATED PROFESSIONALS",\
                "Mathematicians, etc. professionals","Statisticians [incl. Actuary]",\
                "COMPUTING PROFESSIONALS",\
                "Computer systems designers & analysts [incl. Software Engineer]",\
                "Computer programmers","Computing professionals nec",\
                "ARCHITECTS, ENGINEERS AND RELATED PROFESSIONALS",\
                "Architects town & traffic planners [incl. Landscape Architect]",\
                "Civil engineers [incl. Construction Engineer]","Electrical engineers",\
                "Electronics & telecommunications engineers","Mechanical engineers",\
                "Chemical engineers","Mining engineers, metallurgists, etc. professionals",\
                "Cartographers & surveyors",\
                "Architects, engineers, etc. professionals nec [incl. Consultant]",\
                "LIFE SCIENCE & HEALTH PROFESSIONALS","LIFE SCIENCE PROFESSIONALS",\
                "Biologists, botanists, zoologists, etc. professionals",\
                "Pharmacologists, pathologists, etc. professionals [incl. Biochemist]",\
                "Agronomists, etc. professionals","HEALTH PROFESSIONALS (EXCEPT NURSING)",\
                "Medical doctors","Dentists","Veterinarians","Pharmacists",\
                "Health professionals except nursing nec",\
                "NURSING & MIDWIFERY PROFESSIONALS [incl. Registered Nurses, Registered\
                Midwives, Nurse nfs]","TEACHING PROFESSIONALS",\
                "HIGHER EDUCATION TEACHING PROFESSIONALS [incl. University Professor]",\
                "SECONDARY EDUCATION TEACHING PROFESSIONALS",\
                "[Secondary teachers, academic track] [incl. Middle-School Teacher]",\
                "[Secondary teachers, vocational track] [incl. Vocational Instructor]",\
                "PRIMARY & PRE-PRIMARY EDUCATION TEACHING PROFESSIONALS",\
                "Primary education teaching professionals",\
                "Pre-primary education teaching professionals [incl. Kindergarten Teacher]",\
                "SPECIAL EDUCATION TEACHING PROFESSIONALS [incl. Remedial Teacher,\
                Teacher of the Blind]","OTHER TEACHING PROFESSIONALS",\
                "Education methods specialists [incl. Curricula Developer]",\
                "School inspectors","Other teaching professionals nec",\
                "OTHER PROFESSIONALS [incl. Professional nfs, Administrative Professional]",\
                "BUSINESS PROFESSIONALS","Accountants [incl. Auditor]",\
                "Personnel & careers professionals [incl. Job Analyst, Student Counselor]",\
                "Business professionals nec [incl. Publicity Agent, Patent Agent,\
                Home Economist, Market Researcher]","LEGAL PROFESSIONALS","Lawyers","Judges",\
                "Legal professionals nec [incl. Notary, Notary Public]",\
                "ARCHIVISTS, LIBRARIANS AND RELATED INFORMATION PROFESSIONALS",\
                "Archivists & curators","Librarians, etc. information professionals\
                [incl. Documentalist, Health Records Technician]",\
                "SOCIAL SCIENCE AND RELATED PROFESSIONALS","Economists",\
                "Sociologists, anthropologists, etc. professionals",\
                "Philosophers, historians & political scientists",\
                "Philologists, translators & interpreters",\
                "Psychologists","Social work professionals [incl. Welfare Worker]",\
                "WRITERS & CREATIVE OR PERFORMING ARTISTS",\
                "Authors, journalists & other writers [incl. Editor, Technical Writer]",\
                "Sculptors, painters, etc. artists","Composers, musicians & singers",\
                "Choreographers & dancers","Film, stage, etc. actors & directors",\
                "RELIGIOUS PROFESSIONALS [incl. Priest, Chaplain, Theologian,\
                Professional Nun]","TECHNICIANS AND ASSOCIATE PROFESSIONALS",\
                "PHYSICAL & ENGINEERING SCIENCE ASSOCIATE PROFESSIONALS",\
                "PHYSICAL & ENGINEERING SCIENCE TECHNICIANS",\
                "Chemical & physical science technicians","Civil engineering technicians",\
                "Electrical engineering technicians",\
                "Electronics & telecommunications engineering technicians",\
                "Mechanical engineering technicians","Chemical engineering technicians",\
                "Mining & metallurgical technicians",\
                "Draftspersons [incl. Technical Illustrator]",\
                "Physical & engineering science technicians nec [incl. Quantity Surveyor]",\
                "COMPUTER ASSOCIATE PROFESSIONALS",\
                "Computer assistants [incl. Assistant Users’ Services]",\
                "Computer equipment operators [incl. Computer Printer Equipment Operator]",\
                "Industrial robot controllers","OPTICAL & ELECTRONIC EQUIPMENT OPERATORS",\
                "Photographers & electronic equipment operators [incl. Cameraman, Sound Mixer]",\
                "Broadcasting & telecommunications equipment operators",\
                "Medical equipment operators [incl. X-ray Technician]",\
                "Optical & electronic equipment operators nec\
                [incl. Cinema Projectionist, Telegrapher]",\
                "SHIP & AIRCRAFT CONTROLLERS & TECHNICIANS","Ships engineers",\
                "Ships deck officers & pilots [incl. River Boat Captain]",\
                "Aircraft pilots, etc. associate professionals","Air traffic controllers",\
                "Air traffic safety technicians","SAFETY & QUALITY INSPECTORS",\
                "Building & fire inspectors","Safety, health & quality inspectors\
                [incl. Occupational Safety Inspector, Inspector nfs]",\
                "LIFE SCIENCE & HEALTH ASSOCIATE PROFESSIONALS",\
                "LIFE SCIENCE TECHNICIANS AND RELATED ASSOCIATE PROFESSIONALS",\
                "Life science technicians [incl. Medical Laboratory Assistant,\
                Medical Technician nfs, Physical and Life Science Technici",\
                "Agronomy & forestry technicians","Farming & forestry advisers",\
                "MODERN HEALTH ASSOCIATE PROFESSIONALS EXCEPT NURSING",\
                "Medical assistants","Sanitarians","Dieticians & nutritionists",\
                "Optometrists & opticians [incl. Dispensing Optician]",\
                "Dental assistants [incl. Oral Hygienist]",\
                "Physiotherapists, etc. associate professionals\
                [incl. Chiropractor, Masseur, Osteopath]",\
                "Veterinary assistants [incl. Veterinarian Vaccinator]",\
                "Pharmaceutical assistants",\
                "Modern health associate professionals except nursing nec\
                [incl. Homeopath, Speech Therapist, Occupational Therapist]",\
                "NURSING & MIDWIFERY ASSOCIATE PROFESSIONALS",\
                "Nursing associate professionals [incl. Trainee Nurses]",\
                "Midwifery associate professionals [incl. Trainee Midwife]",\
                "TRADITIONAL MEDICINE PRACTITIONERS & FAITH HEALERS",\
                "Traditional medicine practitioners [incl. Herbalist]","Faith healers",\
                "TEACHING ASSOCIATE PROFESSIONALS",\
                "PRIMARY EDUCATION TEACHING ASSOCIATE PROFESSIONALS [incl. Teacher’s Aid]",\
                "PRE-PRIMARY EDUCATION TEACHING ASSOCIATE PROFESSIONALS\
                [incl. Kindergarten Teacher’s Aid]",\
                "SPECIAL EDUCATION TEACHING ASSOCIATE PROFESSIONALS",\
                "OTHER TEACHING ASSOCIATE PROFESSIONALS","OTHER ASSOCIATE PROFESSIONALS",\
                "FINANCE & SALES ASSOCIATE PROFESSIONALS",\
                "Securities & finance dealers & brokers",\
                "Insurance representative [incl. Insurance Agent, Underwriter]",\
                "[Real] estate agents [incl. Real Estate Broker]",\
                "Travel consultants & organizers",\
                "Technical & commercial sales representatives\
                [incl. Traveling Salesman, Technical Salesman]",\
                "Buyers","Appraisers, valuers & auctioneers [incl. Claims Adjuster]",\
                "Finance & sales associate professionals nec",\
                "BUSINESS SERVICES AGENTS AND TRADE BROKERS",\
                "Trade brokers","Clearing & forwarding agents",\
                "Employment agents & labor contractors",\
                "Business services agents & trade brokers nec\
                [incl. Literary Agent, Sports Promoter, Salesman Advertisements]",\
                "ADMINISTRATIVE ASSOCIATE PROFESSIONALS",\
                "Administrative secretaries, etc. associate professionals",\
                "Legal, etc. business associate professionals [incl. Bailiff, Law Clerk]",\
                "Bookkeepers","Statistical, mathematical, etc. associate professionals",\
                "Administrative associate professionals nec [incl. Management Assistant]",\
                "CUSTOMS, TAX AND RELATED GOVERNMENT ASSOCIATE PROFESSIONALS\
                [incl. Administrative Associate Professional, Executive Civi",\
                "Customs & border inspectors","Government tax & excise officials",\
                "Government social benefits officials","Government licensing officials",\
                "Customs tax, etc. government associate professionals nec\
                [incl. Price Inspector, Electoral Official, Middle-Rank Civil S",\
                "POLICE INSPECTORS & DETECTIVES/[ARMY]",\
                "Police inspectors & detectives [incl. Police Investigator, Private Detective]",\
                "[Armed forces non commissioned officers] [incl. Sergeant]",\
                "SOCIAL WORK ASSOCIATE PROFESSIONALS",\
                "ARTISTIC, ENTERTAINMENT & SPORTS ASSOCIATE PROFESSIONALS",\
                "Decorators & commercial designers [incl. Window Dresser, Interior\
                Decorator, Furniture Designer, Book Illustrator, Tattoo",\
                "Radio, television & other announcers",\
                "Street nightclub, etc. musicians, singers & dancers\
                [incl. Band Leader, Chorus Dancer, Nightclub Singer]",\
                "Clowns, magicians, acrobats, etc. associate professionals\
                [incl. Striptease Artist, Juggler]",\
                "Athletes, sports persons, etc. associate professionals\
                [incl. Trainer, Umpire]",\
                "RELIGIOUS ASSOCIATE PROFESSIONALS\
                [incl. Evangelist, Lay Preacher, Salvationist]",\
                "CLERKS","OFFICE CLERKS [incl. Clerk nfs, Government Office Clerk nfs]",\
                "SECRETARIES & KEYBOARD-OPERATING CLERKS","Stenographers & typists",\
                "Word-processor, etc. operators [incl. Teletypist]",\
                "Data-entry operators [incl. Key Puncher]",\
                "Calculating-machine operators [incl. Bookkeeping Machine Operator]",\
                "Secretaries","NUMERICAL CLERKS",\
                "Accounting & bookkeeping clerks [incl. Payroll Clerk]",\
                "Statistical & finance clerks [incl. Credit Clerk]",\
                "MATERIAL-RECORDING & TRANSPORT CLERKS",\
                "Stock clerks [incl. Weighing Clerk, Storehouse Clerk]",\
                "Production clerks [incl. Planning Clerks]",\
                "Transport clerks [incl. Dispatcher, Expeditor]",\
                "LIBRARY, MAIL AND RELATED CLERKS","Library & filing clerks",\
                "Mail carriers & sorting clerks","Coding proofreading, etc. clerks",\
                "Scribes, etc. workers [incl. Form Filling Assistance Clerk]",\
                "OTHER OFFICE CLERKS [incl. Address Clerk, Timekeeper,\
                Office Boy, Photocopy Machine Operator]",\
                "CUSTOMER SERVICES CLERKS [incl. Customer Service Clerk nfs]",\
                "CASHIERS, TELLERS AND RELATED CLERKS",\
                "Cashiers & ticket clerks [incl. Bank Cashier, Store Cashier, Toll Collector]",\
                "Tellers & other counter clerks [incl. Bank Teller, Post Office Clerk]",\
                "Bookmakers & croupiers","Pawnbrokers & money-lenders",\
                "Debt-collectors, etc. workers","CLIENT INFORMATION CLERKS",\
                "Travel agency, etc. clerks",\
                "Receptionists & information clerks [incl. Medical Receptionist]",\
                "Telephone switchboard operators [incl. Telephone Operator]",\
                "SERVICE WORKERS & SHOP & MARKET SALES WORKERS",\
                "PERSONAL & PROTECTIVE SERVICES WORKERS","TRAVEL ATTENDANTS, ETC.",\
                "Travel attendants & travel stewards [incl. Airplane Steward, Airplane Purser]",\
                "Transport conductors [incl. Train Conductor]","Travel, museum guides",\
                "HOUSEKEEPING & RESTAURANT SERVICES WORKERS",\
                "Housekeepers, etc. workers [incl. Butler, Matron, Dormitory Warden,\
                Estate Manager, Property Manager, Building Superinten",\
                "Cooks","Waiters, waitresses & bartenders","PERSONAL CARE AND RELATED WORK",\
                "Child-care workers [incl. Nursemaid, Governess]",\
                "Institution-based personal care workers\
                [incl. Ambulance Man, Hospital Orderly]",\
                "Home-based personal care workers [incl. Attendant]",\
                "[Other] care, etc. workers nec [incl. Animal Feeder]",\
                "OTHER PERSONAL SERVICES WORKERS",\
                "Hairdressers, barbers, beauticians, etc. workers",\
                "Companions & valets [incl. Personal Maid]",\
                "Undertakers & embalmers [incl. Funeral Director]",\
                "Other personal services workers nec\
                [incl. Escort, Dancing Partner, Prostitute]",\
                "ASTROLOGERS, FORTUNE-TELLERS AND RELATED WORKERS","Astrologers, etc. workers",\
                "Fortune-tellers, palmists, etc. workers","PROTECTIVE SERVICES WORKERS",\
                "Firefighters","Police officers [incl. Policeman, Constable, Marshal]",\
                "Prison guards","[Armed forces, soldiers] [incl. Enlisted Man]",\
                "Protective services workers nec [incl. Night Guard, Bodyguard, Coast Guard]",\
                "[SALESPERSONS, MODELS & DEMONSTRATORS]",\
                "FASHION & OTHER MODELS [incl. Mannequin, Artist’s Model]",\
                "SHOP SALESPERSONS & DEMONSTRATORS [incl. Shop Assistant,\
                Gas Station Attendant, Retail Assistant]",\
                "STALL & MARKET SALESPERSONS","SKILLED AGRICULTURAL & FISHERY WORKERS",\
                "MARKET-ORIENTED SKILLED AGRICULTURAL & FISHERY WORKERS\
                [This category includes skilled farm workers and self-employed sm",\
                "MARKET GARDENERS & CROP GROWERS",\
                "Field crop & vegetable growers [incl. Specialized Crop\
                Farmers, Specialized Crop Farm Workers]",\
                "Tree & shrub crop growers [incl. Skilled Rubber Worker,\
                Coffee Farmer, Tea Grower, Fruit Tree Pruner]",\
                "Gardeners, horticultural & nursery growers\
                [incl. Bulb Grower, Market Gardener]",\
                "Mixed-crop growers [incl. Share Cropper]",\
                "MARKET-ORIENTED ANIMAL PRODUCERS AND RELATED WORKERS",\
                "Dairy & livestock producers [incl. Cattle Breeder,\
                Dairy Farmer, Grazier, Shepherd]",\
                "Poultry producers [incl. Chicken Farmer, Skilled Hatchery Worker]",\
                "Apiarists & sericulturists [incl. Beekeeper, Silkworm Raiser]",\
                "Mixed-animal producers","Market-oriented animal producers, etc. workers nec\
                [incl. Bird Breeder, Gamekeeper, Kennel Keeper, Dog Trainer, Animal C",\
                "MARKET-ORIENTED CROP & ANIMAL PRODUCERS","[Mixed farmers]",\
                "[Farm foremen/supervisor]","[Farmers nfs]",\
                "[Skilled farm workers nfs]","FORESTRY AND RELATED WORKERS",\
                "Forestry workers & loggers [incl. Forestery, Rafter, Timber Cruiser]",\
                "Charcoal burners, etc. workers","FISHERY WORKERS, HUNTERS & TRAPPERS",\
                "Aquatic-life cultivation workers [incl. Oyster Farmer,\
                Pearl Cultivator, Fish Hatcher]",\
                "Inland & coastal waters fishery workers [incl. Sponge Diver, Fisherman]",\
                "Deep-sea fishery workers [incl. Fisherman nfs, Trawler Crewman]",\
                "Hunters & trappers [incl. Whaler]",\
                "SUBSISTENCE AGRICULTURAL & FISHERY WORKERS",\
                "SUBSISTENCE AGRICULTURAL & FISHERY WORKERS",\
                "CRAFT AND RELATED TRADES WORKERS",\
                "EXTRACTION & BUILDING TRADES WORKERS",\
                "MINERS, SHOTFIRERS, STONE CUTTERS & CARVERS",\
                "Miners & quarry workers [incl. Miner nfs]","Shotfirers & blasters",\
                "Stone splitters, cutters & carvers [incl. Tombstone Carver]",\
                "BUILDING FRAME AND RELATED TRADES WORKERS",\
                "Builders traditional materials","Bricklayers & stonemasons [incl. Pavior]",\
                "Concrete placers, concrete finishers, etc. workers [incl. Terrazzo Worker]",\
                "Carpenters & joiners","Building frame, etc. trades workers nec\
                [incl. Construction Worker nfs, Billboard Erector, Demolition Worker, Scaffolder]",\
                "BUILDING FINISHERS AND RELATED TRADES WORKERS","Roofers",\
                "Floor layers & tile setters [incl. Parquetry Worker]",\
                "Plasterers [incl. Stucco Mason]","Insulation workers",\
                "Glaziers","Plumbers & pipe fitters [incl. Well Digger]",\
                "Building, etc. electricians",\
                "PAINTERS, BUILDING STRUCTURE CLEANERS AND RELATED TRADES WORKERS",\
                "Painters, etc. workers [incl. Construction Painter, Paperhanger]",\
                "Varnishers, etc. painters [incl. Automobile Painter]",\
                "Building structure cleaners [incl. Chimney Sweep,\
                Sandblaster, Boiler Engine Cleaner]",\
                "METAL, MACHINERY AND RELATED TRADES WORKERS",\
                "METAL MOLDERS, WELDERS, SHEETMETAL WORKERS STRUCTURAL METAL",\
                "Metal molders & coremakers",\
                "Welders & flamecutters [incl. Brazier, Solderer]",\
                "Sheet-metal workers [incl. Panel Beater, Coppersmith, Tinsmith]",\
                "Structural-metal preparers & erectors [incl. Ship Plater, Riveter, Shipwright]",\
                "Riggers & cable splicers","Underwater workers [incl. Frogman]",\
                "BLACKSMITHS, TOOL-MAKERS AND RELATED TRADES WORKERS",\
                "Blacksmiths, hammer-smiths & forging press workers [incl. Toolsmith]",\
                "Tool-makers, etc. workers [incl. Locksmith]",\
                "Machine-tool setters & setter-operators [incl. Metal driller, Turner]",\
                "Metal wheel-grinders, polishers & tool sharpeners",\
                "MACHINERY MECHANICS & FITTERS",\
                "Motor vehicle mechanics & fitters [incl. Bicycle Repairman",\
                "Aircraft engine mechanics & fitters",\
                "[Industrial & agricultural] machinery mechanics & fitters\
                [incl. Mechanic Heavy Equipment, Millwright]",\
                "[Unskilled garage worker] [incl. Oiler-Greaser]",\
                "ELECTRICAL & ELECTRONIC EQUIPMENT MECHANICS & FITTERS",\
                "Electrical mechanics & fitters [incl. Office Machine Repairman]",\
                "Electronics fitters","Electronics mechanics & servicers",\
                "Telegraph & telephone installers & servicers",\
                "Electrical line installers, repairers & cable jointers",\
                "PRECISION, HANDICRAFT, PRINTING AND RELATED TRADES WORKERS",\
                "PRECISION WORKERS IN METAL AND RELATED MATERIALS",\
                "Precision-instrument makers & repairers [incl. Dental Mechanic, Watch Maker]",\
                "Musical-instrument makers & tuners",\
                "Jewelry & precious-metal workers [incl. Diamond Cutter, Goldsmith]",\
                "POTTERS, GLASS-MAKERS AND RELATED TRADES WORKERS",\
                "Abrasive wheel formers, potters, etc. workers",\
                "Glass-makers, cutters, grinders & finishers",\
                "Glass engravers & etchers",\
                "Glass ceramics, etc. decorative painters\
                [incl. Decorative Painter, Signpainter]",\
                "HANDICRAFT WORKERS IN WOOD, TEXTILE, LEATHER, ETC.",\
                "Handicraft workers in wood, etc. materials\
                [incl. Candle Maker, Straw-Hat Maker]",\
                "Handicraft workers in textile, leather, etc. materials [incl. Carpet Weaver]",\
                "PRINTING AND RELATED TRADES WORKERS",\
                "Compositors, typesetters, etc. workers [incl. Phototypesetter, Linotypist]",\
                "Stereotypers & electrotypers","Printing engravers & etchers",\
                "Photographic, etc. workers [incl. Darkroom worker]",\
                "Bookbinders, etc. workers","Silkscreen, block & textile printers",\
                "OTHER CRAFT AND RELATED TRADES WORKERS",\
                "FOOD PROCESSING AND RELATED TRADES WORKERS",\
                "Butchers, fishmongers, etc. food preparers",\
                "Bakers, pastry-cooks & confectionery makers","Dairy-products makers",\
                "Fruit, vegetable, etc. preservers","Food & beverage tasters & graders",\
                "Tobacco preparers & tobacco products makers",\
                "WOOD TREATERS, CABINET-MAKERS AND RELATED TRADES WORKERS",\
                "Wood treaters [incl. Wood Grader, Wood Impregnator]",\
                "Cabinet-makers, etc. workers [incl. Cartwright, Cooper]",\
                "Woodworking-machine setters & setter-operators [incl. Wood-Turner]",\
                "Basketry weavers, brush makers, etc. workers [incl. Broom Maker]",\
                "TEXTILE, GARMENT AND RELATED TRADES WORKERS", "Fiber preparers",\
                "Weavers, knitters, etc. workers",\
                "Tailors, dressmakers & hatters [incl. Milliner]",\
                "Furriers, etc. workers",\
                "Textile, leather, etc. pattern-makers & cutters",\
                "Sewers, embroiderers, etc. workers","Upholsterers, etc. workers",\
                "PELT, LEATHER & SHOEMAKING TRADES WORKERS",\
                "Pelt dressers, tanners & fellmongers",\
                "Shoe-makers, etc. workers","[SKILLED WORKERS NFS]",\
                "[MANUAL FOREMEN NFSUNON-FARM]",\
                "[SKILLED WORKERS NFS] [incl. Craftsman, Artisan, Tradesman]",\
                "[APPRENTICE SKILLED WORK NFS]","PLANT & MACHINE OPERATORS & ASSEMBLERS",\
                "STATIONARY-PLANT AND RELATED OPERATORS",\
                "MINING- & MINERAL-PROCESSING PLANT OPERATORS",\
                "Mining-plant operators","Mineral-ore- & stone-processing plant operators",\
                "Well-drillers & borers, etc. workers","METAL-PROCESSING PLANT OPERATORS",\
                "Ore & metal furnace operators",\
                "Metal melters, casters & rolling-mill operators",\
                "Metal heat-treating plant operators","Metal drawers & extruders",\
                "GLASS, CERAMICS AND RELATED PLANT OPERATORS",\
                "Glass & ceramics kiln, etc. machine operators",\
                "Glass, ceramics, etc. plant operators nec",\
                "WOOD-PROCESSING & PAPERMAKING PLANT OPERATORS",\
                "Wood-processing plant operators [incl. Sawyer]",\
                "Paper-pulp plant operators",\
                "Papermaking plant operators","CHEMICAL-PROCESSING PLANT OPERATORS",\
                "Crushing grinding & chemical-mixing machinery operators",\
                "Chemical heat-treating plant operators",\
                "Chemical-filtering & separating-equipment operators",\
                "Chemical-still & reactor operators",\
                "Petroleum & natural-gas refining plant operators",\
                "Chemical-processing plant operators nec",\
                "POWER-PRODUCTION AND RELATED PLANT OPERATORS",\
                "Power-production plant operators",\
                "Steam-engine & boiler operators [incl. Stoker,\
                Ship Engine Room Ratings]",\
                "Incinerator water-treatment, etc. plant operators\
                [incl. Sewage Plant Operator]",\
                "AUTOMATED ASSEMBLY-LINE & INDUSTRIAL-ROBOT OPERTORS",\
                "Automated assembly-line operators",\
                "Industrial-robot operators","MACHINE OPERATORS & ASSEMBLERS",\
                "METAL- & MINERAL-PRODUCTS MACHINE OPERATORS",\
                "Machine-tool operators [incl. Machine Operator nfs]",\
                "Cement & other mineral products machine operators",\
                "CHEMICAL-PRODUCTS MACHINE OPERATORS",\
                "Pharmaceutical & toiletry products machine operators",\
                "Ammunition & explosive-products machine operators",\
                "Metal-finishing, -plating, & -coating machine operators\
                [incl. Electroplater, Fettler]",\
                "Photographic-products machine operators",\
                "Chemical-products machine operators nec",\
                "RUBBER- & PLASTIC-PRODUCTS MACHINE OPERATORS",\
                "Rubber-products machine operators",\
                "Plastic-products machine operators",\
                "WOOD-PRODUCTS MACHINE OPERATORS",\
                "PRINTING, BINDING & PAPER-PRODUCTS MACHINE OPERATORS",\
                "Printing-machine operators",\
                "Bookbinding-machine operators","Paper-products machine operators",\
                "TEXTILE, FUR & LEATHER-PRODUCTS MACHINE OPERATORS",\
                "Fiber-preparing, spinning & winding machine operators",\
                "Weaving- & knitting-machine operators","Sewing-machine operators",\
                "Bleaching-, dyeing- & cleaning-machine operators [incl. Launderer]",\
                "Fur- & leather-preparing-maching operators",\
                "Shoemaking-, etc. machine operators",\
                "Textile, fur & leather-products machine operators nec",\
                "FOOD AND RELATED PRODUCTS MACHINE OPERATORS",\
                "Meat- & fish-processing machine operators",\
                "Dairy-products machine operators",\
                "Grain- & spice-milling machine operators",\
                "Baked-goods cereal & chocolate products machine operators",\
                "Fruit-, vegetable- & nut-processing machine operators",\
                "Sugar-production machine operators",\
                "Tea-, coffee- & cocoa-processing machine operators",\
                "Brewers-, wine & other beverage machine operators",\
                "Tobacco-production machine operators",\
                "ASSEMBLERS","Mechanical-machinery assemblers\
                [incl. Car Assembly-Line Worker]",\
                "Electrical-equipment assemblers","Electronic-equipment assemblers",\
                "Metal, rubber & plastic products assemblers",\
                "Wood, etc. products assemblers",\
                "Paperboard, textile, etc. products assemblers",\
                "OTHER MACHINE OPERATORS & ASSEMBLERS",\
                "DRIVERS & MOBILE-PLANT OPERATORS",\
                "LOCOMOTIVE-ENGINE DRIVERS AND RELATED WORKERS",\
                "Locomotive-engine drivers","Railway brakers signalers & shunters",\
                "MOTOR-VEHICLE DRIVERS [incl. Driver nfs]","Motorcycle drivers",\
                "Car, taxi & van drivers [incl. Taxi Owner nfs]","Bus & tram drivers",\
                "Heavy truck & lorry drivers",\
                "AGRICULTURAL & OTHER MOBILE PLANT OPERATORS",\
                "Motorized farm & forestry plant operators\
                [incl. Tractor Driver, Combine Harvester Operator]",\
                "Earth-moving, etc. plant operators [incl. Bulldozer\
                Driver, Dredge Operator, Road Roller Driver]",\
                "Crane, hoist, etc. plant operators","Lifting-truck operators",\
                "SHIPS DECK CREWS AND RELATED WORKERS [incl. Boatman,\
                Deck Hand, Sailor, Ship Deck Ratings]",\
                "SEMISKILLED WORKERS NFS [incl. Production\
                Process Worker nfs, Factory Worker nfs]",\
                "ELEMENTARY OCCUPATIONS","SALES & SERVICES ELEMENTARY OCCUPATIONS",\
                "STREET VENDORS AND RELATED WORKERS","Street food vendors",\
                "Street vendors nonfood products [incl. Hawker,\
                Peddler, Newsvendor, Rag Picker, Scavenger]",\
                "Door-to-door & telephone salespersons [incl. Solicitor, Canvasser]",\
                "STREET SERVICES ELEMENTARY OCCUPATIONS\
                [incl. Billposter, Shoeshiner, Car Window Washer]",\
                "DOMESTIC AND RELATED HELPERS, CLEANERS & LAUNDERERS",\
                "Domestic helpers & cleaners [incl. Housemaid, Housekeeper nfs]",\
                "Helpers & cleaners in establishments [Kitchen Hand, Chambermaid]",\
                "Hand-launderers & pressers",\
                "BUILDING CARETAKERS, WINDOW AND RELATED CLEANERS",\
                "Building caretakers [incl. Janitor, Sexton, Verger]",\
                "Vehicle, window, etc. cleaners",\
                "MESSENGERS, PORTERS, DOORKEEPERS AND RELATED WORKERS",\
                "Messengers, package & luggage porters & deliverers\
                [incl. Elevator Attendant, Bellboy, Messenger]",\
                "Doorkeepers, watchpersons, etc. workers [incl. Amusement\
                Park Attendant, Ticket Collector, Usher, Watchman nfs, Park Atte",\
                "Vending-machine money collectors, meter readers, etc. workers",\
                "GARBAGE COLLECTORS AND RELATED LABORERS",\
                "Garbage collectors [incl. Dustman]",\
                "Sweepers, etc. laborers [incl. Odd-JobWorker]",\
                "AGRICULTURAL, FISHERY AND RELATED LABORERS",\
                "AGRICULTURAL, FISHERY AND RELATED LABORERS",\
                "Farm-hands & laborers [incl. Cow Herd, Farm Helper, Fruit Picker]",\
                "Forestry laborers","Fishery, hunting & trapping laborers",\
                "LABORERS IN MINING, CONSTRUCTION, MANUFACTURING & TRANSPORT\
                [incl. Unskilled Worker nfs]",\
                "MINING & CONSTRUCTION LABORERS","Mining & quarrying laborers",\
                "Construction & maintenance laborers: roads, dams, etc.\
                [incl. Navvy, Shoveller, Railway Trackworker]",\
                "Building construction laborers [incl. Handyman, Hod Carrier]",\
                "MANUFACTURING LABORERS",\
                "Assembling laborers [incl. Sorter, Bottle Sorter,\
                Winder, Checker nfs, Grader nfs]",\
                "Handpackers & other manufacturing laborers [incl. Crater, Labeler]",\
                "TRANSPORT LABORERS & FREIGHT HANDLERS",\
                "Hand or pedal vehicle drivers [incl. Rickshaw Driver]",\
                "Drivers of animal-drawn vehicles & machinery",\
                "Freight handlers [incl. Docker, Loader, Longshoreman",\
                "Housewife","Student",\
                "Social beneficiary (unemployed, retired, sickness, etc.)","Dont know",\
                "Vague(a good job, a quiet job, a well paid job, an office job, etc.)",\
                "N/A","Invalid","Missing"
)

In [89]:
# confirm lengths match
print(len(iscom_keys), len(iscom_values))

541 541
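
Several descriptions above are written with a backslash continuation inside the string literal, so Python keeps the indentation of the continued line as part of the label. The following is a minimal sketch of one way to collapse that whitespace when zipping keys and descriptions into a lookup. The helper `clean_label` and the two-entry sample tuples are hypothetical stand-ins for illustration and are not part of the notebook.

```python
import re

def clean_label(label):
    """Collapse any run of whitespace (e.g. continuation indentation) to one space."""
    return re.sub(r"\s+", " ", label).strip()

# Hypothetical two-entry sample standing in for iscom_keys / iscom_values
sample_keys = (7233, 9997)
sample_values = (
    "[Industrial & agricultural] machinery mechanics & fitters\
                [incl. Mechanic Heavy Equipment, Millwright]",
    "N/A",
)

# Pair each code with its cleaned description
sample_lookup = {k: clean_label(v) for k, v in zip(sample_keys, sample_values)}
print(sample_lookup[7233])
```

The same comprehension could be applied to the full key and description tuples once their lengths have been confirmed to match.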


In [90]:
# create tuple of mothers' profession keys to zip with profession names
iscof_keys = (1000,1100,1110,1120,1130,1140,1141,1142,1143,1200,1210,1220,\
              1221,1222,1223,1224,1225,1226,1227,1228,1229,1230,1231,1232,\
              1233,1234,1235,1236,1237,1239,1240,1250,1251,1252,1300,1310,\
              1311,1312,1313,1314,1315,1316,1317,1318,1319,2000,2100,2110,\
              2111,2112,2113,2114,2120,2121,2122,2130,2131,2132,2139,2140,\
              2141,2142,2143,2144,2145,2146,2147,2148,2149,2200,2210,2211,\
              2212,2213,2220,2221,2222,2223,2224,2229,2230,2300,2310,2320,\
              2321,2322,2330,2331,2332,2340,2350,2351,2352,2359,2400,2410,\
              2411,2412,2419,2420,2421,2422,2429,2430,2431,2432,2440,2441,\
              2442,2443,2444,2445,2446,2450,2451,2452,2453,2454,2455,2460,\
              3000,3100,3110,3111,3112,3113,3114,3115,3116,3117,3118,3119,\
              3120,3121,3122,3123,3130,3131,3132,3133,3139,3140,3141,3142,\
              3143,3144,3145,3150,3151,3152,3200,3210,3211,3212,3213,3220,\
              3221,3222,3223,3224,3225,3226,3227,3228,3229,3230,3231,3232,\
              3240,3241,3242,3300,3310,3320,3330,3340,3400,3410,3411,3412,\
              3413,3414,3415,3416,3417,3419,3420,3421,3422,3423,3429,3430,\
              3431,3432,3433,3434,3439,3440,3441,3442,3443,3444,3449,3450,\
              3451,3452,3460,3470,3471,3472,3473,3474,3475,3480,4000,4100,\
              4110,4111,4112,4113,4114,4115,4120,4121,4122,4130,4131,4132,\
              4133,4140,4141,4142,4143,4144,4190,4200,4210,4211,4212,4213,\
              4214,4215,4220,4221,4222,4223,5000,5100,5110,5111,5112,5113,\
              5120,5121,5122,5123,5130,5131,5132,5133,5139,5140,5141,5142,\
              5143,5149,5150,5151,5152,5160,5161,5162,5163,5164,5169,5200,\
              5210,5220,5230,6000,6100,6110,6111,6112,6113,6114,6120,6121,\
              6122,6123,6124,6129,6130,6131,6132,6133,6134,6140,6141,6142,\
              6150,6151,6152,6153,6154,6200,6210,7000,7100,7110,7111,7112,\
              7113,7120,7121,7122,7123,7124,7129,7130,7131,7132,7133,7134,\
              7135,7136,7137,7140,7141,7142,7143,7200,7210,7211,7212,7213,\
              7214,7215,7216,7220,7221,7222,7223,7224,7230,7231,7232,7233,\
              7234,7240,7241,7242,7243,7244,7245,7300,7310,7311,7312,7313,\
              7320,7321,7322,7323,7324,7330,7331,7332,7340,7341,7342,7343,\
              7344,7345,7346,7400,7410,7411,7412,7413,7414,7415,7416,7420,\
              7421,7422,7423,7424,7430,7431,7432,7433,7434,7435,7436,7437,\
              7440,7441,7442,7500,7510,7520,7530,8000,8100,8110,8111,8112,\
              8113,8120,8121,8122,8123,8124,8130,8131,8139,8140,8141,8142,\
              8143,8150,8151,8152,8153,8154,8155,8159,8160,8161,8162,8163,\
              8170,8171,8172,8200,8210,8211,8212,8220,8221,8222,8223,8224,\
              8229,8230,8231,8232,8240,8250,8251,8252,8253,8260,8261,8262,\
              8263,8264,8265,8266,8269,8270,8271,8272,8273,8274,8275,8276,\
              8277,8278,8279,8280,8281,8282,8283,8284,8285,8286,8290,8300,\
              8310,8311,8312,8320,8321,8322,8323,8324,8330,8331,8332,8333,\
              8334,8340,8400,9000,9100,9110,9111,9112,9113,9120,9130,9131,\
              9132,9133,9140,9141,9142,9150,9151,9152,9153,9160,9161,9162,\
              9200,9210,9211,9212,9213,9300,9310,9311,9312,9313,9320,9321,\
              9322,9330,9331,9332,9333,9501,9502,9503,9504,9505,9997,9998,\
              9999)
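
If the zipped pairs are later loaded into a dict, a repeated code would silently overwrite an earlier entry, and a length check alone would not reveal it. A minimal sketch of a duplicate check follows; the helper `find_duplicates` and the toy tuple are hypothetical and only illustrate the idea.

```python
from collections import Counter

def find_duplicates(keys):
    """Return the codes that occur more than once, in sorted order."""
    return sorted(k for k, n in Counter(keys).items() if n > 1)

# Toy tuple with one deliberately repeated code (not the real key list)
toy_keys = (1000, 1100, 1110, 1100, 9999)
print(find_duplicates(toy_keys))
```

Calling the helper on `iscof_keys` would be expected to return an empty list if every code is unique.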

In [91]:
# create tuple of mothers' profession descriptions to zip with profession keys
iscof_values = ("LEGISLATORS, SENIOR OFFICIALS & MANAGERS","LEGISLATORS & SENIOR OFFICIALS",\
                "LEGISLATORS [incl. Member of Parliament, Member of Local Council]",\
                "SENIOR [NATIONAL] GOVERNMENT OFFICIALS [incl. Minister, Ambassador]",\
                "[SENIOR LOCAL GOVERNMENT OFFICIALS] [incl. Local Government Senior\
                Officials, Mayor]","SENIOR OFFICIALS SPECIAL-INTEREST ORGANIZATIONS",\
                "Senior officials political-party organizations [incl. Politician]",\
                "Senior officials economic-interest organizations [incl. Union Leader,\
                Director Employers’ Organization]","Senior officials special-interest\
                organizations [incl. Lodge Official, Official Red Cross]",\
                "CORPORATE MANAGERS [LARGE ENTERPRISES]","[LARGE ENTERPRISES] DIRECTORS &\
                CHIEF EXECUTIVES [incl. CEO, Large Business Owner 25+ employees]",\
                "[LARGE ENTERPRISE OPERATION] DEPARTMENT MANAGERS [incl. Manager in\
                establishment with 25+ employees]",\
                "Production department managers agriculture & fishing",\
                "Production department managers manufacturing [incl. Factory Manager nfs]",\
                "Production department managers construction",\
                "Production department managers wholesale & retail trade [incl. Floor Manager]",\
                "Production department managers restaurants & hotels",\
                "Production department managers transportation, storage & communications\
                [incl. Postmaster, Stationmaster]",\
                "Production department managers business services [incl. Banker, Bank Manager]",\
                "Production department managers personal care, cleaning, etc.",\
                "Production department managers nec [incl. Impresario, Film Producer,\
                College Dean, School Principal]",\
                "[LARGE ENTERPRISES] OTHER DEPARTMENT MANAGERS",\
                "Finance & administration department managers [incl. Company Secretary]",\
                "Personnel & industrial relations department managers","Sales & marketing\
                department managers","Advertising & public relations department managers",\
                "Supply & distribution department managers",\
                "Computing services department managers",\
                "Research & development department managers",\
                "Other department managers nec","OFFICE MANAGERS [incl. Clerical Supervisor]",\
                "MILITARY OFFICERS","Higher military officers [Captain and above]",\
                "Lower-grade commissioned officers [incl. Army Lieutenant]",\
                "[SMALL ENTERPRISE] GENERAL MANAGERS",\
                "[SMALL ENTERPRISE] GENERAL MANAGERS [incl. Businessman, Trader, Manager nfs]",\
                "[Small enterprise] General managers agriculture, forestry & fishing [incl.\
                Farm Manager, Self-employed Farmer with perso",\
                "[Small enterprise] General managers manufacturing",\
                "[Small enterprise] General managers construction [incl. Building Contractor]",\
                "[Small enterprise] General managers wholesale & retail trade [incl.\
                Shop Owner/Manager, Retail Owner/Manager, Merchant]",\
                "[Small enterprise] General managers restaurants & hotels [incl. Manager\
                Camping Site, Bar Owner/Manager, Restaurateur]",\
                "[Small enterprise] General managers transp., storage & communications\
                [incl. Owner Small Transport Company]",\
                "[Small enterprise] General managers business services\
                [incl. Manager Insurance Agency]",\
                "[Small enterprise] General managers personal care, cleaning, etc. services\
                [incl. Owner Laundry]","[Small enterprise] General managers nec\
                [incl. Manager Travel Agency, Manager Fitness Center, Garage Owner]",\
                "PROFESSIONALS","PHYSICAL, MATHEMATICAL & ENGINEERING SCIENCE PROFESSIONALS",\
                "PHYSICISTS, CHEMISTS & RELATED PROFESSIONALS","Physicists & astronomers",\
                "Meteorologists","Chemists","Geologists & geophysicists [incl. Geodesist]",\
                "MATHEMATICIANS, STATISTICIANS AND RELATED PROFESSIONALS",\
                "Mathematicians, etc. professionals","Statisticians [incl. Actuary]",\
                "COMPUTING PROFESSIONALS",\
                "Computer systems designers & analysts [incl. Software Engineer]",\
                "Computer programmers","Computing professionals nec",\
                "ARCHITECTS, ENGINEERS AND RELATED PROFESSIONALS",\
                "Architects town & traffic planners [incl. Landscape Architect]",\
                "Civil engineers [incl. Construction Engineer]","Electrical engineers",\
                "Electronics & telecommunications engineers","Mechanical engineers",\
                "Chemical engineers","Mining engineers, metallurgists, etc. professionals",\
                "Cartographers & surveyors",\
                "Architects, engineers, etc. professionals nec [incl. Consultant]",\
                "LIFE SCIENCE & HEALTH PROFESSIONALS","LIFE SCIENCE PROFESSIONALS",\
                "Biologists, botanists, zoologists, etc. professionals",\
                "Pharmacologists, pathologists, etc. professionals [incl. Biochemist]",\
                "Agronomists, etc. professionals","HEALTH PROFESSIONALS (EXCEPT NURSING)",\
                "Medical doctors","Dentists","Veterinarians","Pharmacists",\
                "Health professionals except nursing nec",\
                "NURSING & MIDWIFERY PROFESSIONALS [incl. Registered Nurses, Registered\
                Midwives, Nurse nfs]","TEACHING PROFESSIONALS",\
                "HIGHER EDUCATION TEACHING PROFESSIONALS [incl. University Professor]",\
                "SECONDARY EDUCATION TEACHING PROFESSIONALS",\
                "[Secondary teachers, academic track] [incl. Middle-School Teacher]",\
                "[Secondary teachers, vocational track] [incl. Vocational Instructor]",\
                "PRIMARY & PRE-PRIMARY EDUCATION TEACHING PROFESSIONALS",\
                "Primary education teaching professionals",\
                "Pre-primary education teaching professionals [incl. Kindergarten Teacher]",\
                "SPECIAL EDUCATION TEACHING PROFESSIONALS [incl. Remedial Teacher,\
                Teacher of the Blind]","OTHER TEACHING PROFESSIONALS",\
                "Education methods specialists [incl. Curricula Developer]",\
                "School inspectors","Other teaching professionals nec",\
                "OTHER PROFESSIONALS [incl. Professional nfs, Administrative Professional]",\
                "BUSINESS PROFESSIONALS","Accountants [incl. Auditor]",\
                "Personnel & careers professionals [incl. Job Analyst, Student Counselor]",\
                "Business professionals nec [incl. Publicity Agent, Patent Agent,\
                Home Economist, Market Researcher]","LEGAL PROFESSIONALS","Lawyers","Judges",\
                "Legal professionals nec [incl. Notary, Notary Public]",\
                "ARCHIVISTS, LIBRARIANS AND RELATED INFORMATION PROFESSIONALS",\
                "Archivists & curators","Librarians, etc. information professionals\
                [incl. Documentalist, Health Records Technician]",\
                "SOCIAL SCIENCE AND RELATED PROFESSIONALS","Economists",\
                "Sociologists, anthropologists, etc. professionals",\
                "Philosophers, historians & political scientists",\
                "Philologists, translators & interpreters",\
                "Psychologists","Social work professionals [incl.WelfareWorker]",\
                "WRITERS & CREATIVE OR PERFORMING ARTISTS",\
                "Authors, journalists & other writers [incl. Editor, TechnicalWriter]",\
                "Sculptors, painters, etc. artists","Composers, musicians & singers",\
                "Choreographers & dancers","Film, stage, etc. actors & directors",\
                "RELIGIOUS PROFESSIONALS [incl. Priest, Chaplain, Theologian,\
                Professional Nun]","TECHNICIANS AND ASSOCIATE PROFESSIONALS",\
                "PHYSICAL & ENGINEERING SCIENCE ASSOCIATE PROFESSIONALS",\
                "PHYSICAL & ENGINEERING SCIENCE TECHNICIANS",\
                "Chemical & physical science technicians","Civil engineering technicians",\
                "Electrical engineering technicians",\
                "Electronics & telecommunications engineering technicians",\
                "Mechanical engineering technicians","Chemical engineering technicians",\
                "Mining & metallurgical technicians",\
                "Draftspersons [incl. Technical Illustrator]",\
                "Physical & engineering science technicians nec [incl. Quantity Surveyor]",\
                "COMPUTER ASSOCIATE PROFESSIONALS",\
                "Computer assistants [incl. Assistant Users’ Services]",\
                "Computer equipment operators [incl. Computer Printer Equipment Operator]",\
                "Industrial robot controllers","OPTICAL & ELECTRONIC EQUIPMENT OPERATORS",\
                "Photographers & electronic equipment operators [incl. Cameraman, Sound Mixer]",\
                "Broadcasting & telecommunications equipment operators",\
                "Medical equipment operators [incl. X-ray Technician]",\
                "Optical & electronic equipment operators nec\
                [incl. Cinema Projectionist, Telegrapher]",\
                "SHIP & AIRCRAFT CONTROLLERS & TECHNICIANS","Ships engineers",\
                "Ships deck officers & pilots [incl. River Boat Captain]",\
                "Aircraft pilots, etc. associate professionals","Air traffic controllers",\
                "Air traffic safety technicians","SAFETY & QUALITY INSPECTORS",\
                "Building & fire inspectors","Safety, health & quality inspectors\
                [incl. Occupational Safety Inspector, Inspector nfs]",\
                "LIFE SCIENCE & HEALTH ASSOCIATE PROFESSIONALS",\
                "LIFE SCIENCE TECHNICIANS AND RELATED ASSOCIATE PROFESSIONALS",\
                "Life science technicians [incl. Medical Laboratory Assistant,\
                Medical Technician nfs, Physical and Life Science Technici",\
                "Agronomy & forestry technicians","Farming & forestry advisers",\
                "MODERN HEALTH ASSOCIATE PROFESSIONALS EXCEPT NURSING",\
                "Medical assistants","Sanitarians","Dieticians & nutritionists",\
                "Optometrists & opticians [incl. Dispensing Optician]",\
                "Dental assistants [incl. Oral Hygienist]",\
                "Physiotherapsits, etc. associate professionals\
                [incl. Chiropractor, Masseur, Osteopath]",\
                "Veterinary assistants [incl. Veterinarian Vaccinator]",\
                "Pharmaceutical assistants",\
                "Modern health associate professionals except nursing nec\
                [incl. Homeopath, Speech Therapist, Occupational Therapist]",\
                "NURSING & MIDWIFERYASSOCIATE PROFESSIONALS",\
                "Nursing associate professionals [incl. Trainee Nurses]",\
                "Midwifery associate professionals [incl. Trainee Midwife]",\
                "TRADITIONAL MEDICINE PRACTITIONERS & FAITH HEALERS",\
                "Traditional medicine practitioners [incl. Herbalist]","Faith healers",\
                "TEACHING ASSOCIATE PROFESSIONALS",\
                "PRIMARY EDUCATION TEACHING ASSOCIATE PROFESSIONALS [incl. Teacher’s Aid]",\
                "PRE-PRIMARY EDUCATION TEACHING ASSOCIATE PROFESSIONALS\
                [incl. Kindergarten Teacher’s Aid]",\
                "SPECIAL EDUCATION TEACHING ASSOCIATE PROFESSIONALS",\
                "OTHER TEACHING ASSOCIATE PROFESSIONALS","OTHER ASSOCIATE PROFESSIONALS",\
                "FINANCE & SALES ASSOCIATE PROFESSIONALS",\
                "Securities & finance dealers & brokers",\
                "Insurance representative [incl. Insurance Agent, Underwriter]",\
                "[Real] estate agents [incl. Real Estate Broker]",\
                "Travel consultants & organizers",\
                "Technical & commercial sales representatives\
                [incl. Traveling Salesman, Technical Salesman]",\
                "Buyers","Appraisers, valuers & auctioneers [incl. Claims Adjuster]",\
                "Finance & sales associate professionals nec",\
                "BUSINESS SERVICES AGENTS AND TRADE BROKERS",\
                "Trade brokers","Clearing & forwarding agents",\
                "Employment agents & labor contractors",\
                "Business services agents & trade brokers nec\
                [incl. Literary Agent, Sports Promoter, Salesman Advertisements]",\
                "ADMINISTRATIVE ASSOCIATE PROFESSIONALS",\
                "Administrative secretaries, etc. associate professionals",\
                "Legal, etc. business associate professionals [incl. Bailiff, Law Clerk]",\
                "Bookkeepers","Statistical, mathematical, etc. associate professionals",\
                "Administrative associate professionals nec [incl. Management Assistant]",\
                "CUSTOMS, TAX AND RELATED GOVERNMENT ASSOCIATE PROFESSIONALS\
                [incl. Administrative Associate Professional, Executive Civi",\
                "Customs & border inspectors","Government tax & excise officials",\
                "Government social benefits officials","Government licensing officials",\
                "Customs tax, etc. government associate professionals nec\
                [incl. Price Inspector, Electoral Official, Middle-Rank Civil S",\
                "POLICE INSPECTORS & DETECTIVES/[ARMY]",\
                "Police inspectors & detectives [incl. Police Investigator, Private Detective]",\
                "[Armed forces non commissioned officers] [incl. Sergeant]",\
                "SOCIAL WORK ASSOCIATE PROFESSIONALS",\
                "ARTISTIC, ENTERTAINMENT & SPORTS ASSOCIATE PROFESSIONALS",\
                "Decorators & commercial designers [incl.Window Dresser, Interior\
                Decorator, Furniture Designer, Book Illustrator, Tattoo",\
                "Radio, television & other announcers",\
                "Street nightclub, etc. musicians, singers & dancers\
                [incl. Band Leader, Chorus Dancer, Nightclub Singer]",\
                "Clowns, magicians, acrobats, etc. associate professionals\
                [incl. Striptease Artist, Juggler]",\
                "Athletes, sports persons, etc. associate professionals\
                [incl. Trainer, Umpire]",\
                "RELIGIOUS ASSOCIATE PROFESSIONALS\
                [incl. Evangelist, Lay Preacher, Salvationist]",\
                "CLERKS","OFFICE CLERKS [incl. Clerk nfs, Government Office Clerk nfs]",\
                "SECRETARIES & KEYBOARD-OPERATING CLERKS","Stenographers & typists",\
                "Word-processor, etc. operators [incl. Teletypist]",\
                "Data-entry operators [incl. Key Puncher]",\
                "Calculating-machine operators [incl. Bookkeeping Machine Operator]",\
                "Secretaries","NUMERICAL CLERKS",\
                "Accounting & bookkeeping clerks [incl. Payroll Clerk]",\
                "Statistical & finance clerks [incl. Credit Clerk]",\
                "MATERIAL-RECORDING & TRANSPORT CLERKS",\
                "Stock clerks [incl.Weighing Clerk, Storehouse Clerk]",\
                "Production clerks [incl. Planning Clerks]",\
                "Transport clerks [incl. Dispatcher, Expeditor]",\
                "LIBRARY, MAIL AND RELATED CLERKS","Library & filing clerks",\
                "Mail carriers & sorting clerks","Coding proofreading, etc. clerks",\
                "Scribes, etc. workers [incl. Form Filling Assistance Clerk]",\
                "OTHER OFFICE CLERKS [incl. Address Clerk, Timekeeper,\
                Office Boy, Photocopy Machine Operator]",\
                "CUSTOMER SERVICES CLERKS [incl. Customer Service Clerk nfs]",\
                "CASHIERS, TELLERS AND RELATED CLERKS",\
                "Cashiers & ticket clerks [incl. Bank Cashier, Store Cashier, Toll Collector]",\
                "Tellers & other counter clerks [incl. Bank Teller, Post Office Clerk]",\
                "Bookmakers & croupiers","Pawnbrokers & money-lenders",\
                "Debt-collectors, etc. workers","CLIENT INFORMATION CLERKS",\
                "Travel agency, etc. clerks",\
                "Receptionists & information clerks [incl. Medical Receptionist]",\
                "Telephone switchboard operators [incl. Telephone Operator]",\
                "SERVICE WORKERS & SHOP & MARKET SALES WORKERS",\
                "PERSONAL & PROTECTIVE SERVICES WORKERS","TRAVELATTENDANTS, ETC.",\
                "Travel attendants & travel stewards [incl. Airplane Steward, Airplane Purser]",\
                "Transport conductors [incl. Train Conductor]","Travel, museum guides",\
                "HOUSEKEEPING & RESTAURANT SERVICES WORKERS",\
                "Housekeepers, etc. workers [incl. Butler, Matron, DormitoryWarden,\
                Estate Manager, Property Manager, Building Superinten",\
                "Cooks","Waiters, waitresses & bartenders","PERSONAL CARE AND RELATED WORK",\
                "Child-care workers [incl. Nursemaid, Governess]",\
                "Institution-based personal care workers\
                [incl. Ambulance Man, Hospital Orderly]",\
                "Home-based personal care workers [incl. Attendant]",\
                "[Other] care, etc. workers nec [incl. Animal Feeder]",\
                "OTHER PERSONAL SERVICES WORKERS",\
                "Hairdressers, barbers, beauticians, etc. workers",\
                "Companions & valets [incl. Personal Maid]",\
                "Undertakers & embalmers [incl. Funeral Director]",\
                "Other personal services workers nec\
                [incl. Escort, Dancing Partner, Prostitute]",\
                "ASTROLOGERS, FORTUNE-TELLERS AND RELATED WORKERS","Astrologers, etc. workers",\
                "Fortune-tellers, palmists, etc. workers","PROTECTIVE SERVICES WORKERS",\
                "Firefighters","Police officers [incl. Policeman, Constable, Marshal]",\
                "Prison guards","[Armed forces, soldiers] [incl. Enlisted Man]",\
                "Protective services workers nec [incl. Night Guard, Bodyguard, Coast Guard]",\
                "[SALESPERSONS, MODELS & DEMONSTRATORS]",\
                "FASHION & OTHER MODELS [incl. Mannequin, Artist’s Model]",\
                "SHOP SALESPERSONS & DEMONSTRATORS [incl. Shop Assistant,\
                Gas Station Attendant, Retail Assistant]",\
                "STALL & MARKET SALESPERSONS","SKILLED AGRICULTURAL & FISHERY WORKERS",\
                "MARKET-ORIENTED SKILLED AGRICULTURAL & FISHERY WORKERS\
                [This category includes skilled farm workers and self-employed sm",\
                "MARKET GARDENERS & CROP GROWERS",\
                "Field crop & vegetable growers [incl. Specialized Crop\
                Farmers, Specialized Crop FarmWorkers]",\
                "Tree & shrub crop growers [incl. Skilled RubberWorker,\
                Coffee Farmer, Tea Grower, Fruit Tree Pruner]",\
                "Gardeners, horticultural & nursery growers\
                [incl. Bulb Grower, Market Gardener]",\
                "Mixed-crop growers [incl. Share Cropper]",\
                "MARKET-ORIENTED ANIMAL PRODUCERS AND RELATED WORKERS",\
                "Dairy & livestock producers [incl. Cattle Breeder,\
                Dairy Farmer, Grazier, Shepherd]",\
                "Poultry producers [incl. Chicken Farmer, Skilled HatcheryWorker]",\
                "Apiarists & sericulturists [incl. Beekeeper, Silkworm Raiser]",\
                "Mixed-animal producers","Market-oriented animal producers, etc. workers nec\
                [incl. Bird Breeder, Gamekeeper, Kennel Keeper, Dog Trainer, Animal C",\
                "MARKET-ORIENTED CROP & ANIMAL PRODUCERS","[Mixed farmers]",\
                "[Farm foremen/supervisor]","[Farmers nfs]",\
                "[Skilled farm workers nfs]","FORESTRY AND RELATED WORKERS",\
                "Forestry workers & loggers [incl. Forestery, Rafter, Timber Cruiser]",\
                "Charcoal burners, etc. workers","FISHERY WORKERS, HUNTERS & TRAPPERS",\
                "Aquatic-life cultivation workers [incl. Oyster Farmer,\
                Pearl Cultivator, Fish Hatcher]",\
                "Inland & coastal waters fishery workers [incl. Sponge Diver, Fisherman]",\
                "Deep-sea fishery workers [incl. Fisherman nfs, Trawler Crewman]",\
                "Hunters & trappers [incl. Whaler]",\
                "SUBSISTENCE AGRICULTURAL & FISHERY WORKERS",\
                "SUBSISTENCE AGRICULTURAL & FISHERY WORKERS",\
                "CRAFT AND RELATED TRADES WORKERS",\
                "EXTRACTION & BUILDING TRADES WORKERS",\
                "MINERS, SHOTFIRERS, STONE CUTTERS & CARVERS",\
                "Miners & quarry workers [incl. Miner nfs]","Shotfirers & blasters",\
                "Stone splitters, cutters & carvers [incl. Tombstone Carver]",\
                "BUILDING FRAME AND RELATED TRADES WORKERS",\
                "Builders traditional materials","Bricklayers & stonemasons [incl. Pavior]",\
                "Concrete placers, concrete finishers, etc. workers [incl. TerrazzoWorker]",\
                "Carpenters & joiners","Building frame, etc. trades workers nec\
                [incl. ConstructionWorker nfs, Billboard Erector, DemolitionWorker, Scaffolder]",\
                "BUILDING FINISHERS AND RELATED TRADES WORKERS","Roofers",\
                "Floor layers & tile setters [incl. ParquetryWorker]",\
                "Plasterers [incl. Stucco Mason]","Insulation workers",\
                "Glaziers","Plumbers & pipe fitters [incl.Well Digger]",\
                "Building, etc. electricians",\
                "PAINTERS, BUILDING STRUCTURE CLEANERS AND RELATED TRADES WORKERS",\
                "Painters, etc. workers [incl. Construction Painter, Paperhanger]",\
                "Varnishers, etc. painters [incl. Automobile Painter]",\
                "Building structure cleaners [incl. Chimney Sweep,\
                Sandblaster, Boiler Engine Cleaner]",\
                "METAL, MACHINERY AND RELATED TRADES WORKERS",\
                "METAL MOLDERS, WELDERS, SHEETMETAL WORKERS STRUCTURAL METAL",\
                "Metal molders & coremakers",\
                "Welders & flamecutters [incl. Brazier, Solderer]",\
                "Sheet-metal workers [incl. Panel Beater, Coppersmith, Tinsmith]",\
                "Structural-metal preparers & erectors [incl. Ship Plater, Riveter, Shipwright]",\
                "Riggers & cable splicers","Underwater workers [incl. Frogman]",\
                "BLACKSMITHS, TOOL-MAKERS AND RELATED TRADES WORKERS",\
                "Blacksmiths, hammer-smiths & forging press workers [incl. Toolsmith]",\
                "Tool-makers, etc. workers [incl. Locksmith]",\
                "Machine-tool setters & setter-operators [incl. Metal driller, Turner]",\
                "Metal wheel-grinders, polishers & tool sharpeners",\
                "MACHINERY MECHANICS & FITTERS",\
                "Motor vehicle mechanics & fitters [incl. Bicycle Repairman",\
                "Aircraft engine mechanics & fitters",\
                "[Industrial & agricultural] machinery mechanics & fitters\
                [incl. Mechanic Heavy Equipment, Millwright]",\
                "[Unskilled garage worker] [incl. Oiler-Greaser]",\
                "ELECTRICAL & ELECTRONIC EQUIPMENT MECHANICS & FITTERS",\
                "Electrical mechanics & fitters [incl. Office Machine Repairman]",\
                "Electronics fitters","Electronics mechanics & servicers",\
                "Telegraph & telephone installers & servicers",\
                "Electrical line installers, repairers & cable jointers",\
                "PRECISION, HANDICRAFT, PRINTING AND RELATED TRADES WORKERS",\
                "PRECISION WORKERS IN METAL AND RELATED MATERIALS",\
                "Precision-instrument makers & repairers [incl. Dental Mechanic,Watch Maker]",\
                "Musical-instrument makers & tuners",\
                "Jewelry & precious-metal workers [incl. Diamond Cutter, Goldsmith]",\
                "POTTERS, GLASS-MAKERS AND RELATED TRADES WORKERS",\
                "Abrasive wheel formers, potters, etc. workers",\
                "Glass-makers, cutters, grinders & finishers",\
                "Glass engravers & etchers",\
                "Glass ceramics, etc. decorative painters\
                [incl. Decorative Painter, Signpainter]",\
                "HANDICRAFT WORKERS IN WOOD, TEXTILE, LEATHER, ETC.",\
                "Handicraft workers in wood, etc. materials\
                [incl. Candle Maker, Straw-Hat Maker]",\
                "Handicraft workers in textile, leather, etc. materials [incl. CarpetWeaver]",\
                "PRINTING AND RELATED TRADES WORKERS",\
                "Compositors, typesetters, etc. workers [incl. Phototypesetter, Linotypist]",\
                "Stereotypers & electrotypers","Printing engravers & etchers",\
                "Photographic, etc. workers [incl. Darkroom worker]",\
                "Bookbinders, etc. workers","Silkscreen, block & textile printers",\
                "OTHER CRAFT AND RELATED TRADES WORKERS",\
                "FOOD PROCESSING AND RELATED TRADES WORKERS",\
                "Butchers, fishmongers, etc. food preparers",\
                "Bakers, pastry-cooks & confectionery makers","Dairy-products makers",\
                "Fruit, vegetable, etc. preservers","Food & beverage tasters & graders",\
                "Tobacco preparers & tobacco products makers",\
                "WOOD TREATERS, CABINET-MAKERS AND RELATED TRADES WORKERS",\
                "Wood treaters [incl.Wood Grader,Wood Impregnator]",\
                "Cabinet-makers, etc. workers [incl. Cartwright, Cooper]",\
                "Woodworking-machine setters & setter-operators [incl.Wood-Turner]",\
                "Basketry weavers, brush makers, etc. workers [incl. Broom Maker]",\
                "TEXTILE, GARMENT AND RELATED TRADES WORKERS", "Fiber preparers",\
                "Weavers, knitters, etc. workers",\
                "Tailors, dressmakers & hatters [incl. Milliner]",\
                "Furriers, etc. workers",\
                "Textile, leather, etc. pattern-makers & cutters",\
                "Sewers, embroiderers, etc. workers","Upholsterers, etc. workers",\
                "PELT, LEATHER & SHOEMAKING TRADES WORKERS",\
                "Pelt dressers, tanners & fellmongers",\
                "Shoe-makers, etc. workers","[SKILLED WORKERS NFS]",\
                "[MANUAL FOREMEN NFSUNON-FARM]",\
                "[SKILLED WORKERS NFS] [incl. Craftsman, Artisan, Tradesman]",\
                "[APPRENTICE SKILLED WORK NFS]","PLANT & MACHINE OPERATORS & ASSEMBLERS",\
                "STATIONARY-PLANT AND RELATED OPERATORS",\
                "MINING- & MINERAL-PROCESSING PLANT OPERATORS",\
                "Mining-plant operators","Mineral-ore- & stone-processing plant operators",\
                "Well-drillers & borers, etc. workers","METAL-PROCESSING PLANT OPERATORS",\
                "Ore & metal furnace operators",\
                "Metal melters, casters & rolling-mill operators",\
                "Metal heat-treating plant operators","Metal drawers & extruders",\
                "GLASS, CERAMICS AND RELATED PLANT OPERATORS",\
                "Glass & ceramics kiln, etc. machine operators",\
                "Glass, ceramics, etc. plant operators nec",\
                "WOOD-PROCESSING & PAPERMAKING PLANT OPERATORS",\
                "Wood-processing plant operators [incl. Sawyer]",\
                "Paper-pulp plant operators",\
                "Papermaking plant operators","CHEMICAL-PROCESSING PLANT OPERATORS",\
                "Crushing grinding & chemical-mixing machinery operators",\
                "Chemical heat-treating plant operators",\
                "Chemical-filtering & separating-equipment operators",\
                "Chemical-still & reactor operators",\
                "Petroleum & natural-gas refining plant operators",\
                "Chemical-processing plant operators nec",\
                "POWER-PRODUCTION AND RELATED PLANT OPERATORS",\
                "Power-production plant operators",\
                "Steam-engine & boiler operators [incl. Stoker,\
                Ship Engine Room Ratings]",\
                "Incinerator water-treatment, etc. plant operators\
                [incl. Sewage Plant Operator]",\
                "AUTOMATED ASSEMBLY-LINE & INDUSTRIAL-ROBOT OPERTORS",\
                "Automated assembly-line operators",\
                "Industrial-robot operators","MACHINE OPERATORS & ASSEMBLERS",\
                "METAL- & MINERAL-PRODUCTS MACHINE OPERATORS",\
                "Machine-tool operators [incl. Machine Operator nfs]",\
                "Cement & other mineral products machine operators",\
                "CHEMICAL-PRODUCTS MACHINE OPERATORS",\
                "Pharmaceutical & toiletry products machine operators",\
                "Ammunition & explosive-products machine operators",\
                "Metal-finishing, -plating, & -coating machine operators\
                [incl. Electroplater, Fettler]",\
                "Photographic-products machine operators",\
                "Chemical-products machine operators nec",\
                "RUBBER- & PLASTIC-PRODUCTS MACHINE OPERATORS",\
                "Rubber-products machine operators",\
                "Plastic-products machine operators",\
                "WOOD-PRODUCTS MACHINE OPERATORS",\
                "PRINTING, BINDING & PAPER-PRODUCTS MACHINE OPERATORS",\
                "Printing-machine operators",\
                "Bookbinding-machine operators","Paper-products machine operators",\
                "TEXTILE, FUR & LEATHER-PRODUCTS MACHINE OPERATORS",\
                "Fiber-preparing, spinning & winding machine operators",\
                "Weaving- & knitting-machine operators","Sewing-machine operators",\
                "Bleaching-, dyeing- & cleaning-machine operators [incl. Launderer]",\
                "Fur- & leather-preparing-maching operators",\
                "Shoemaking-, etc. machine operators",\
                "Textile, fur & leather-products machine operators nec",\
                "FOOD AND RELATED PRODUCTS MACHINE OPERATORS",\
                "Meat- & fish-processing machine operators",\
                "Dairy-products machine operators",\
                "Grain- & spice-milling machine operators",\
                "Baked-goods cereal & chocolate products machine operators",\
                "Fruit-, vegetable- & nut-processing machine operators",\
                "Sugar-production machine operators",\
                "Tea-, coffee- & cocoa-processing machine operators",\
                "Brewers-, wine & other beverage machine operators",\
                "Tobacco-production machine operators",\
                "ASSEMBLERS","Mechanical-machinery assemblers\
                [incl. Car Assembly-LineWorker]",\
                "Electrical-equipment assemblers","Electronic-equipment assemblers",\
                "Metal, rubber & plastic products assemblers",\
                "Wood, etc. products assemblers",\
                "Paperboard, textile, etc. products assemblers",\
                "OTHER MACHINE OPERATORS & ASSEMBLERS",\
                "DRIVERS & MOBILE-PLANT OPERATORS",\
                "LOCOMOTIVE-ENGINE DRIVERS AND RELATED WORKERS",\
                "Locomotive-engine drivers","Railway brakers signalers & shunters",\
                "MOTOR-VEHICLE DRIVERS [incl. Driver nfs]","Motorcycle drivers",\
                "Car, taxi & van drivers [incl. Taxi Owner nfs]","Bus & tram drivers",\
                "Heavy truck & lorry drivers",\
                "AGRICULTURAL & OTHER MOBILE PLANT OPERATORS",\
                "Motorized farm & forestry plant operators\
                [incl. Tractor Driver, Combine Harvester Operator]",\
                "Earth-moving, etc. plant operators [incl. Bulldozer\
                Driver, Dredge Operator, RoadRoller Driver]",\
                "Crane, hoist, etc. plant operators","Lifting-truck operators",\
                "SHIPS DECK CREWS AND RELATED WORKERS [incl. Boatman,\
                Deck Hand, Sailor, Ship Deck Ratings]",\
                "SEMISKILLED WORKERS NFS [incl. Production\
                ProcessWorker nfs, FactoryWorker nfs]",\
                "ELEMENTARY OCCUPATIONS","SALES & SERVICES ELEMENTARY OCCUPATIONS",\
                "STREET VENDORS AND RELATED WORKERS","Street food vendors",\
                "Street vendors nonfood products [incl. Hawker,\
                Peddler, Newsvendor, Rag Picker, Scavenger]",\
                "Door-to-door & telephone salespersons [incl. Solicitor, Canvasser]",\
                "STREET SERVICES ELEMENTARY OCCUPATIONS\
                [incl. Billposter, Shoeshiner, CarWindowWasher]",\
                "DOMESTIC AND RELATED HELPERS, CLEANERS & LAUNDERERS",\
                "Domestic helpers & cleaners [incl. Housemaid, Housekeeper nfs]",\
                "Helpers & cleaners in establishments [Kitchen Hand, Chambermaid]",\
                "Hand-launderers & pressers",\
                "BUILDING CARETAKERS, WINDOW AND RELATED CLEANERS",\
                "Building caretakers [incl. Janitor, Sexton, Verger]",\
                "Vehicle, window, etc. cleaners",\
                "MESSENGERS, PORTERS, DOORKEEPERS AND RELATED WORKERS",\
                "Messengers, package & luggage porters & deliverers\
                [incl. Elevator Attendant, Bellboy, Messenger]",\
                "Doorkeepers, watchpersons, etc. workers [incl. Amusement\
                Park Attendant, Ticket Collector, Usher,Watchman nfs, Park Atte",\
                "Vending-maching money collectors, meter readers, etc. workers",\
                "GARBAGE COLLECTORS AND RELATED LABORERS",\
                "Garbage collectors [incl. Dustman]",\
                "Sweepers, etc. laborers [incl. Odd-JobWorker]",\
                "AGRICULTURAL, FISHERY AND RELATED LABORERS",\
                "AGRICULTURAL, FISHERY AND RELATED LABORERS",\
                "Farm-hands & laborers [incl. Cow Herd, Farm Helper, Fruit Picker]",\
                "Forestry laborers","Fishery, hunting & trapping laborers",\
                "LABORERS IN MINING, CONSTRUCTION, MANUFACTURING & TRANSPORT\
                [incl. UnskilledWorker nfs]",\
                "MINING & CONSTRUCTION LABORERS","Mining & quarrying laborers",\
                "Construction & maintenance laborers: roads, dams, etc.\
                [incl. Navvy, Shoveller, Railway Trackworker]",\
                "Building construction laborers [incl. Handyman, Hod Carrier]",\
                "MANUFACTURING LABORERS",\
                "Assembling laborers [incl. Sorter, Bottle Sorter,\
                Winder, Checker nfs, Grader nfs]",\
                "Handpackers & other manufacturing laborers [incl. Crater, Labeler]",\
                "TRANSPORT LABORERS & FREIGHT HANDLERS",\
                "Hand or pedal vehicle drivers [incl. Rickshaw Driver]",\
                "Drivers of animal-drawn vehicles & machinery",\
                "Freight handlers [incl. Docker, Loader, Longshoreman",\
                "Housewife","Student",\
                "Social beneficiary (unemployed, retired, sickness, etc.)","Dont know",\
                "Vague(a good job, a quiet job, a well paid job, an office job, etc.)",\
                "N/A","Invalid","Missing"
)

In [92]:
# check length of codes/descriptions
len(iscof_values), len(iscof_keys)

(541, 541)

In [93]:
# create dictionaries of professions
prof_dict_m = {str(key):description for key, description in zip(iscom_keys, iscom_values)}
prof_dict_f = {str(key):description for key, description in zip(iscof_keys, iscof_values)}

In [94]:
# check that mappings concur between the two dictionaries
prof_dict_f['1000'], prof_dict_m['1000']

('LEGISLATORS, SENIOR OFFICIALS & MANAGERS',
 'LEGISLATORS, SENIOR OFFICIALS & MANAGERS')
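The check above compares a single code. As a rough sketch, the two dictionaries could be compared across every code at once. The helper name `compare_mappings` and the toy dictionaries below are hypothetical stand-ins for `prof_dict_m` and `prof_dict_f`, not the real data.

```python
# Toy stand-ins for prof_dict_m / prof_dict_f (made-up codes and labels).
prof_dict_m_demo = {"0001": "Group one", "0002": "Cooks", "0003": "Missing"}
prof_dict_f_demo = {"0001": "Group one", "0002": "Cook", "0004": "Invalid"}


def compare_mappings(a, b):
    """Return (mismatched descriptions, keys only in a, keys only in b)."""
    shared = a.keys() & b.keys()
    mismatched = {k: (a[k], b[k]) for k in sorted(shared) if a[k] != b[k]}
    only_a = sorted(a.keys() - b.keys())
    only_b = sorted(b.keys() - a.keys())
    return mismatched, only_a, only_b


mismatched, only_m, only_f = compare_mappings(prof_dict_m_demo, prof_dict_f_demo)
print(mismatched)  # codes whose descriptions differ between the two dictionaries
print(only_m)      # codes that only appear in the first dictionary
print(only_f)      # codes that only appear in the second dictionary
```

If all three results come back empty for the real dictionaries, the two mappings agree on every code.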

In [95]:
# map profession descriptions to df
lang['ISCO_M_description'] = lang['ISCO_M'].map(prof_dict_m)
lang['ISCO_F_description'] = lang['ISCO_F'].map(prof_dict_f)

Note: Checking the list of professions provided shows that many are missing, so I will need to extend the lists manually and remap the columns to ensure all professions are covered. This is an action for the next phase.
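One possible way to see which codes still need a description is to list the codes that are present in a column but came back empty after mapping. This is only a sketch on a toy dataframe: `demo`, `demo_dict` and the code values are invented for illustration, and it assumes the codes are stored as strings, as the dictionary keys are.

```python
import pandas as pd

# Toy data: "7777" is an invented code with no entry in the dictionary.
demo = pd.DataFrame({"ISCO_M": ["1000", "0002", "7777", "7777", None]})
demo_dict = {"1000": "LEGISLATORS, SENIOR OFFICIALS & MANAGERS", "0002": "Cooks"}

# Same mapping step as in the notebook, applied to the toy column.
demo["ISCO_M_description"] = demo["ISCO_M"].map(demo_dict)

# Codes that exist in the data but have no description after mapping.
is_unmapped = demo["ISCO_M"].notna() & demo["ISCO_M_description"].isna()
unmapped_counts = demo.loc[is_unmapped, "ISCO_M"].value_counts()
print(unmapped_counts)
```

Sorting by frequency shows which missing codes affect the most rows and are worth adding to the lists first.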

### Convert numerical language codes to text

In [96]:
# mapping for column I02_ST_M_S38A
I02_ST_M_S38A_keys = (0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19)
I02_ST_M_S38A_values = ("Ancient Greek","Arabic","Bengali",\
                        "Chinese","Dutch","English","Finnish",\
                        "French","German","Hebrew","Italian",\
                        "Japanese","Latin","Portuguese",\
                        "Russian","Sami languages","Spanish",\
                        "Swedish","Turkish","Urdu")

In [97]:
# create and map dictionary for column I02_ST_M_S38A
I02_ST_M_S38A_dict = {str(key):description for key, description\
                      in zip(I02_ST_M_S38A_keys, I02_ST_M_S38A_values)}
lang['I02_ST_M_S38A_str'] = lang['I02_ST_M_S38A'].map(I02_ST_M_S38A_dict)

In [98]:
# confirm values have been mapped
lang['I02_ST_M_S38A_str'].value_counts()

English           36281
French             7532
German             3712
Dutch               981
Spanish             380
Russian             374
Italian             270
Ancient Greek       220
Latin                98
Arabic               55
Finnish              20
Urdu                  8
Chinese               8
Turkish               6
Portuguese            5
Sami languages        3
Bengali               2
Name: I02_ST_M_S38A_str, dtype: int64

Note: Many categorical fields are stored as numeric data types, so in the EDA phase I will need to check whether the numerical values make sense as numbers or whether I should map the string descriptions onto the dataframe and dummify the variables instead. This is an action for the next stage of the project.
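As a minimal sketch of the "map then dummify" option, `pd.get_dummies` can turn a mapped string column into one indicator column per category. The toy dataframe and the `S38A` prefix are assumptions for illustration. The three codes are taken from the mapping above.

```python
import pandas as pd

# Toy column using three codes from the mapping above (5, 7, 8).
demo = pd.DataFrame({"I02_ST_M_S38A": ["5", "7", "5", "8"]})
lang_names = {"5": "English", "7": "French", "8": "German"}
demo["I02_ST_M_S38A_str"] = demo["I02_ST_M_S38A"].map(lang_names)

# One indicator column per language; the prefix is an arbitrary choice.
dummies = pd.get_dummies(demo["I02_ST_M_S38A_str"], prefix="S38A")
demo = pd.concat([demo, dummies], axis=1)
print(demo.head())
```

Dummifying treats the codes as unordered labels, which is the safer choice whenever the numeric code carries no natural order.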

## Data Distribution

In [99]:
# check distribution of data across countries
lang['country_id'].value_counts()

ES        7651
BE nl     3634
HR        3334
PL        3320
PT        3268
SI        3222
BG        3181
FR        3049
EE        3016
EL        2949
SE        2923
NL        2841
UK-ENG    2835
BE fr     2689
MT        2280
BE de     1669
Name: country_id, dtype: int64

In [100]:
# check distribution of data across languages
lang['targetLanguage_id'].value_counts()

EN    25582
DE    11883
FR    10348
ES     2945
IT     1103
Name: targetLanguage_id, dtype: int64

In [101]:
# check how many different schools are reported in data
lang['school_id'].nunique()

2136

In [102]:
# check proportion of data for first/second target language
lang['TL'].value_counts()

1    27333
2    24528
Name: TL, dtype: int64

In [103]:
# check how languages are split for first/second target language
lang.groupby('TL')['targetLanguage_id'].value_counts()

TL  targetLanguage_id
1   EN                   23093
    FR                    4240
2   DE                   11883
    FR                    6108
    ES                    2945
    EN                    2489
    IT                    1103
Name: targetLanguage_id, dtype: int64

## Target Distribution

In [104]:
# subset only to categorical results data
results_only = lang.loc[:,["PL1_READ","PL2_READ","PL3_READ","PL4_READ","PL5_READ",\
                           "PL1_LIST","PL2_LIST","PL3_LIST","PL4_LIST","PL5_LIST",\
                           "PL1_WRIT1","PL2_WRIT1","PL3_WRIT1","PL4_WRIT1","PL5_WRIT1",\
                           "PL1_WRIT2","PL2_WRIT2","PL3_WRIT2","PL4_WRIT2","PL5_WRIT2",\
                           "PL1_WRIT_C","PL2_WRIT_C","PL3_WRIT_C","PL4_WRIT_C","PL5_WRIT_C"]]

In [105]:
# check shape of df
results_only.shape

(51861, 25)

In [106]:
# count results across whole df to ensure there is sufficient data across all achievement levels
results_only.stack().value_counts()

       453805
A1     270918
A2     159972
B1     153631
-A1    137106
B2     121093
dtype: int64

Note: There is sufficient results data across all achievement levels for me to use these fields as my target(s). As part of the EDA I will decide exactly how to classify these results.
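To illustrate the grouping option, the sketch below collapses the five reported levels into broader CEFR bands (A is the basic user band, B the independent user band). The grouping itself and the band labels are an assumption, not a decision made in this project. Blank entries are left unmapped so they surface as missing values.

```python
import pandas as pd

# Toy results column, including one blank entry like those in the counts above.
levels = pd.Series(["-A1", "A1", "A2", "B1", "B2", "A1", " "])

# Hypothetical grouping into broader bands.
cefr_groups = {"-A1": "Pre-A1", "A1": "A", "A2": "A", "B1": "B", "B2": "B"}

# Strip whitespace first so blank strings do not match any level and map to NaN.
grouped = levels.str.strip().map(cefr_groups)
print(grouped.value_counts())
print(grouped.isna().sum())  # number of blank / unmapped entries
```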

## Export Data to .csv

In [107]:
# set path to save csv
path='/Users/lizspiking/GeneralAssembly/DSI5-lessons/projects/project-capstone/'

# export data to .csv to allow for import into new notebook for the next stage of the project
lang.to_csv(path + 'lang.csv', index=False, encoding='utf-8')

## Notes on data fields

SQt04i00 (col 13) to SQt04i12 (col 25):
- first language

SQt05i01 (col 26) to SQt06i01 (col 28):
- school grade & study programme

SQt09i01 (col 29) to SQt17i01 (col 35):
- parents' education

SQt19i01 (col 37) to SQt22i05 (col 58):
- home possessions (books, electronics, etc)

SQt23i01 (col 59) to SQt24i06 (col 73):
- exposure to computers

SQt25i00 (col 74) to SQt33i10 (col 138):
- exposure to languages outside school and opinions on the usefulness of foreign languages and the target language

SQt34i01 (col 139) to SQt36i09 (col 165):
- students' opinion of non-language subjects and the amount of time spent studying for them

SQt37i00 (col 166) to SQt37i10 (col 176):
- languages studied and how they rank in the most studied languages in that country
- the languages themselves differ from country to country (i.e. the most studied language in Spain might be English but in Luxembourg it might be German), so some mapping will be needed here to ensure the correct language is represented

SQt39i00 (col 178) to SQt40i14 (col 207):
- school year in which the student began studying foreign languages and the target language
- I will review the data in these columns together to understand the distribution of the responses

SQt45i01 (col 219) to SQt46i07 (col 231):
- language trips

SQt48i01 (col 233) to SQt48i07 (col 239):
- perceived difficulty of language

SQt49i01 (col 240) to SQt53i06 (col 266):
- frequency of language activities in class

SQt54i01 (col 267) to SQt56i02 (col 280):
- perception of language classes/learning experience of others

SQt57i01 (col 281) to SQt59i02 (col 296):
- emphasis on language in lessons, frequency of tests

SQt60i01 (col 297) to SQt61i07 (col 304):
- time spent studying for tests
- perceived importance of grade for competence

SQt62i01 (col 305) to SQt62i09 (col 313):
- frequency of computer use

SQt63i01 (col 314) to SQt64i06 (col 321):
- time spent on homework
- extra lessons

SQtA1i01 (col 322) to SQtA4i04 (col 337):
- what the student can do in various language disciplines (reading, writing, etc)

ISCO_M (col 338) to HOMEPOS (col 346):
- info about parents/home

PV1_LIST (col 398) to PL5_WRIT_C (col 447):
- test results, according to Common European Framework of Reference for Languages
- these are categorical results rather than continuous scores
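The field ranges listed above could be kept in a small lookup and selected by column label, since `.loc` label slices include both endpoints. This is a sketch only: `field_groups`, `select_group` and the toy dataframe are hypothetical, and it assumes the column names are unique and appear in the order given in the notes.

```python
import pandas as pd

# Toy dataframe with a few of the column names from the notes above.
demo = pd.DataFrame(
    [[0, 1, 2, 3, 4]],
    columns=["other_col", "SQt04i00", "SQt04i01", "SQt04i12", "SQt05i01"],
)

# Hypothetical lookup: group name -> (first column, last column).
field_groups = {"first_language": ("SQt04i00", "SQt04i12")}


def select_group(df, name):
    """Return the block of columns belonging to a named field group."""
    start, end = field_groups[name]
    return df.loc[:, start:end]  # label slices are inclusive at both ends


first_language_cols = select_group(demo, "first_language")
print(list(first_language_cols.columns))
```

Selecting by label rather than by column number means the groups stay correct if columns are added or dropped earlier in the dataframe.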

# Parking Lot

I have a terrible memory so anything I have looked at and thought I might need to do something with later, I have moved here as a reminder. Please ignore this section as it won't be part of my final submission!

In [80]:
# alternative column headers giving more detail about what each field represents.
# I will probably map these later
columns_expanded = ['country_id', 'school_id', 'main_study_sample',\
           'respondent_id', 'target_lang_id', 'target_lang_no',\
           'quest_lang_id', 'testing_grade', 'test_mode',\
           'gender', 'dob', 'home_location', 'first_language1',\
           'first_language2', 'first_language3', 'first_language4',\
          'first_language5', 'first_language_target',\
           'first_lang_non_indig1', 'first_lang_non_indig2',\
          'first_lang_non_indig3', 'first_lang_non_indig4',\
          'first_lang_non_indig5', 'first_lang_other_euro1',\
          'first_lang_other_euro2', 'grade', 'grade_clean',\
           'study_prog', 'empl_status_mother', 'empl_status_father',\
          'mother_highest_school_level', 'father_highest_school_level',\
          'birth_country', 'birth_country_mother', 'birth_country_father',\
          'years_in_country', 'desk_at_home', 'own_room',\
           'quiet_place_to_study', 'books_to_help', 'computer',\
          'educational_software', 'internet', 'dictionary',\
           'classic_literature', 'poetry_books', 'art_works',\
          'dishwasher', 'dvd_player', 'country_specific_item1',\
          'country_specific_item2', 'country_specific_item3',\
          'no_of_books', 'no_of_mobiles', 'no_of_tvs', 'no_of_computers',\
          'no_of_cars', 'no_of_bathrooms', 'own_computer', 'internet_access',\
          'printer', 'cd_dvd_writer', 'scanner', 'usb_stick',\
          'games_console', 'own_mp3_player', 'own_mobile',\
          'freq_computer_use_homework', 'freq_computer_use_lang',\
          'freq_computer_use_info', 'freq_computer_use_games',\
          'freq_computer_use_entertainment', 'freq_computer_use_contact',\
          'lang_home1', 'lang_home2', 'lang_home3', 'lang_home4',\
          'lang_home5', 'lang_home_target', 'lang_home_non_indig1',\
          'lang_home_non_indig2', 'lang_home_non_indig3',\
          'lang_home_non_indig4', 'lang_home_non_indig5',\
          'lang_home_other_euro', 'lang_home_other_non_euro',\
           'lang_used_at_home_indig1', 'lang_used_at_home_indig2',\
          'lang_used_at_home_indig3', 'lang_used_at_home_indig4',\
          'lang_used_at_home_indig5', 'lang_used_at_home_target',\
          'lang_used_at_home_non_indig1', 'lang_used_at_home_non_indig2',\
          'lang_used_at_home_non_indig3', 'lang_used_at_home_non_indig4',\
          'lang_used_at_home_non_indig5', 'lang_used_at_home_other_euro',\
          'lang_used_at_home_non_euro', 'lang_most_used_at_home',\
          'target_lang_level_father', 'target_lang_level_mother',\
          'target_lang_penfriend', 'target_lang_relatives_in_country',\
          'target_lang_friends_in_country', 'target_lang_tourist_home_country',\
          'target_lang_people_in_residence', 'target_lang_internet',\
          'target_lang_holidays', 'target_lang_write_friends_freq', 'target_lang_speak_relatives_freq',\
          'target_lang_speak_friends_freq', 'target_language_speak_residence_freq',\
          'target_lang_speak_tourists_freq', 'target_lang_speak_internet_freq',\
          'target_lang_songs_freq', 'target_lang_movies_no_subtitles_freq',\
          'target_lang_movies_subtitles_freq', 'target_lang_tv_no_subtitles_freq',\
          'target_lang_tv_subtitles_freq', 'target_lang_games_freq',\
          'target_lang_books_freq', 'target_lang_mag_comic_freq',\
          'target_lang_website_freq', 'perceived_competence_people_country',\
          'perceived_competence_father', 'perceived_competence_mother',\
          'perceived_competence_self', 'target_lang_usefulness_travel',\
          'target_lang_usefulness_personal_life', 'target_lang_usefulness_education',\
          'target_lang_usefulness_work', 'target_lang_usefulness_getting_job',\
           'target_lang_usefulness_contact_foreigners', 'target_lang_usefulness_satisfaction',\
           'target_lang_usefulness_computers', 'target_lang_usefulness_reading',\
           'target_lang_usefulness_entertainment', 'enjoyment_maths',\
           'enjoyment_science', 'enjoyment_humanities', 'enjoyment_arts',\
           'enjoyment_questionnaire_lang', 'enjoyment_target_lang',\
           'enjoyment_other_langs', 'enjoyment_vocational_subjects',\
           'enjoyment_sports', 'usefulness_maths', 'usefulness_science',\
           'usefulness_humanities', 'usefulness_arts', 'usefulness_questionnaire_lang',\
           'usefulness_target_lang', 'usefulness_other_langs', 'usefulness_sports',\
           'hw_time_maths', 'hw_time_science', 'hw_time_humanities', 'hw_time_arts',\
           'hw_time_questionnaire_lang', 'hw_time_target_lang', 'hw_time_other_langs',\
           'hw_time_vocational_subjects', 'hw_time_sports', 'first_widely_taught_lang_in_country',\
           'second_widely_taught_lang_in_country', 'third_widely_taught_lang_in_country',\
           'fourth_widely_taught_lang_in_country', 'fifth_widely_taught_lang_in_country',\
           'sixth_widely_taught_lang_in_country', 'seventh_widely_taught_lang_in_country',\
           'eighth_widely_taught_lang_in_country', 'ninth_widely_taught_lang_in_country',\
           'tenth_widely_taught_lang_in_country', 'other_lang_taught', 'first_lang_studied',\
           'lang_taught_2nd_ISCED3', 'lang_taught_1st_ISCED3', 'lang_taught_6th_ISCED2',\
           'lang_taught_5th_ISCED2', 'lang_taught_4th_ISCED2', 'lang_taught_3rd_ISCED2',\
           'lang_taught_2nd_ISCED2', 'lang_taught_1st_ISCED2', 'lang_taught_6th_ISCED1',\
           'lang_taught_5th_ISCED1', 'lang_taught_4th_ISCED1', 'lang_taught_3rd_ISCED1',\
           'lang_taught_2nd_ISCED1', 'lang_taught_1st_ISCED1', 'lang_taught_pre_1st_ISCED1',\
           'target_lang_taught_2nd_ISCED3', 'target_lang_taught_1st_ISCED3',\
           'target_lang_taught_6th_ISCED2', 'target_lang_taught_5th_ISCED2',\
           'target_lang_taught_4th_ISCED2', 'target_lang_taught_3rd_ISCED2',\
           'target_lang_taught_2nd_ISCED2', 'target_lang_taught_1st_ISCED2',\
           'target_lang_taught_6th_ISCED1', 'target_lang_taught_5th_ISCED1',\
           'target_lang_taught_4th_ISCED1', 'target_lang_taught_3rd_ISCED1',\
           'target_lang_taught_2nd_ISCED1', 'target_lang_taught_1st_ISCED1',\
           'target_lang_taught_pre_1st_ISCED1', 'no_langs_studied_pre_target_lang',\
           'class_size', 'class_size_clean', 'class_duration', 'class_duration_clean',\
           'target_lang_classes_per_week', 'target_lang_classes_per_week_clean',\
           'lang_classes_per_week', 'lang_classes_per_week_clean', 'total_classes_per_week',\
           'total_classes_per_week_clean', 'freq_school_trips_target_lang_country',\
           'freq_school_trips_non_target_lang_country', 'freq_trips_family_target_lang_country',\
           'freq_trips_family_non_target_lang_country', 'freq_visits_from_school_class_target_lang',\
           'freq_visits_from_school_class_non_target_lang', 'freq_collab_schools_abroad',\
           'freq_lang_clubs', 'freq_lang_competitions', 'freq_european_day_of_langs',\
           'freq_extracurricular_lang_projects', 'freq_writing_students_abroad',\
           'freq_lang_field_trips', 'compulsory_target_lang_learning',\
           'perceived_difficulty_target_lang_writing', 'perceived_difficulty_target_lang_speaking',\
           'perceived_difficulty_target_lang_understanding_spoken',\
           'perceived_difficulty_target_lang_grammar',\
           'perceived_difficulty_target_lang_reading',\
           'perceived_difficulty_target_lang_pronunciation',\
           'perceived_difficulty_target_lang_vocab',\
           'freq_teacher_speaks_target_lang_whole_class',\
           'freq_teacher_speaks_target_lang_few_students',\
           'freq_students_speak_target_lang_to_teacher',\
           'freq_students_speak_target_lang_group_work',\
           'freq_students_speak_target_lang_before_whole_class',\
           'freq_target_lang_audio_materials_in_class',\
           'freq_target_lang_video_in_class',\
           'freq_target_lang_text_in_class',\
           'freq_target_lang_internet_in_class',\
           'freq_target_lang_computer_programmes_in_class',\
           'freq_target_lang_pc_labs', 'freq_target_lang_textbooks',\
           'freq_target_lang_books_fiction', 'freq_target_lang_lesson_materials',\
           'perceived_usefulness_for_writing_target_lang_textbooks',\
           'perceived_usefulness_for_speaking_target_lang_textbooks',\
           'perceived_usefulness_for_understanding_target_lang_textbooks',\
           'perceived_usefulness_for_grammar_target_lang_textbooks',\
           'perceived_usefulness_for_reading_target_lang_textbooks',\
           'perceived_usefulness_for_pronunciation_target_lang_textbooks',\
           'perceived_usefulness_for_vocab_target_lang_textbooks',\
           'freq_group_work_target_lang', 'freq_indiv_work_target_lang',\
           'freq_group_students_speak_before_class_target_lang_classes',\
           'freq_indiv_student_speaks_before_class_target_lang_classes',\
           'freq_teacher_speaks_to_whole_class_target_lang_classes',\
           'freq_teacher_speaks_to_few_students_target_language_classes',\
           'perception_target_lang_teacher_is_good',\
           'perception_target_lang_teacher_get_along',\
           'perception_target_lang_teacher_makes_effort_interesting',
           'perception_target_lang_teacher_helpful',\
           'perception_target_lang_teacher_like',\
           'perception_target_lang_teacher_strict',\
           'perception_target_lang_lessons_interesting',\
           'perception_target_lang_lessons_enjoyable',\
           'perception_target_lang_lessons_good',\
           'perception_target_lang_lessons_waste_of_time',\
           'perception_target_lang_lessons_easy',\
           'perception_target_lang_lessons_boring',\
           'perceived_difficulty_learning_target_lang_usual_speakers',\
           'perceived_difficulty_learning_target_lang_students_in_class',\
           'freq_emphasis_between_target_and_other_lang_writing',\
           'freq_emphasis_between_target_and_other_lang_speaking',\
           'freq_emphasis_between_target_and_other_lang_understanding_spoken',\
           'freq_emphasis_between_target_and_other_lang_grammar',\
           'freq_emphasis_between_target_and_other_lang_reading',\
           'freq_emphasis_between_target_and_other_lang_pronunciation',\
           'freq_emphasis_between_target_and_other_lang_vocab',\
           'freq_learning_writing_target_lang', 'freq_learning_speaking_target_lang',\
           'freq_learning_understanding_spoken_target_lang', 'freq_learning_grammar_target_lang',\
           'freq_learning_reading_target_lang', 'freq_learning_pronunciation_target_lang',\
           'freq_learning_vocab_target_lang', 'freq_test_target_lang',\
           'freq_feedback_on_assignments', 'time_spent_studying_for_tests',\
           'perceived_importance_for_target_lang_final_grade_writing',\
           'perceived_importance_for_target_lang_final_grade_speaking',\
           'perceived_importance_for_target_lang_final_grade_understanding_spoken',\
           'perceived_importance_for_target_lang_final_grade_grammar',\
           'perceived_importance_for_target_lang_final_grade_reading',\
           'perceived_importance_for_target_lang_final_grade_pronunciation',\
           'perceived_importance_for_target_lang_final_grade_vocab',\
           'freq_computer_use_target_lang_info_finding',\
           'freq_computer_use_target_lang_homework',\
           'freq_computer_use_target_lang_writing',\
           'freq_computer_use_target_lang_speaking',\
           'freq_computer_use_target_lang_understanding_spoken',\
           'freq_computer_use_target_lang_grammar',\
           'freq_computer_use_target_lang_reading',\
           'freq_computer_use_target_lang_pronunciation',\
           'freq_computer_use_target_lang_vocab',\
           'time_spent_homework_target_lang',\
           'time_spent_homework_other_langs',\
           'enrichment_lessons_target_lang',\
           'enrichment_lessons_other_langs',\
           'remedial_lessons_target_lang', 'remedial_lessons_other_langs',\
           'extra_lessons_questionnaire_lang',
           'extra_lessons_not_questionnaire_lang_spoken_at_home',\
           'target_lang_can_understand_familiar_words_simple_sentences_reading',\
           'target_lang_can_find_info_ads_timetables_reading',\
           'target_lang_can_find_main_points_simple_news_articles_familiar_subjects_reading',\
           'target_lang_can_read_long_complex_texts_locating_details_quickly_reading',\
           'target_lang_can_understand_qs_instructions_if_spoken_slowly_listening',\
           'target_lang_can_understand_if_spoken_slowly_clearly_directly_listening',\
           'target_lang_can_understand_long_complicated_lectures_on_familiar_topic_listening',\
           'target_lang_can_write_few_words_phrases_relating_to_myself',\
           'target_lang_can_write_basic_description_of_events',\
           'target_lang_can_write_detailed_letters_relating_to_myself',\
           'target_lang_can_write_detailed_review_film_book_play',\
           'target_lang_can_ask_answer_simple_questions_familiar_topics',\
           'target_lang_can_tell_simple_story',\
           'target_lang_can_maintain_conversation_familiar_topics',\
           'target_lang_can_explain_viewpoint_advantages_disadvantages',\
           'occupation_mother', 'occupation_father', 'socio_economic_index_occupation_mother',\
           'socio_economic_index_occupation_father', 'highest_occupational_status',\
           'mother_education_years', 'father_education_years', 'highest_parental_education_years',\
           'home_possessions', 'duration_FL_education', 'onset_FL_education',\
           'duration_TL_education', 'onset_TL_education', 'target_lang_lesson_time_week',\
           'foreign_lang_lesson_time_week', 'target_lang_learning_time_tests',\
           'target_lang_learning_time_homework', 'foreign_lang_learning_time_homework',\
           'no_ancient_foreign_langs_learnt', 'no_modern_foreign_langs_learnt',\
           'first_foreign_lang_learnt_school', 'no_langs_studied_before_target_lang',\
           'home_location', 'no_first_languages', 'target_lang_first_lang',\
           'no_langs_at_home_exposure', 'target_lang_exposure_home', 'no_langs_at_home_use',\
           'target_lang_used_home', 'target_lang_most_spoken_lang_home', 'target_lang_parents_knowledge',\
           'target_lang_exposure_through_home_env', 'target_lang_use_through_home_env',\
           'target_lang_exposure_use_through_media', 'target_lang_exposure_use_visits_abroad',\
           'foreign_lang_enrichment_remedial_lessons', 'ICT_facilities_at_home',\
           'freq_ICT_use_outside_school', 'freq_ICT_use_lang_learning',\
           'received_opportunities_target_lang_school_projects',\
           'gender', 'age', 'immigration_background', 'economic_social_cultural_status',\
           'received_help_mastering_host_lang', 'received_formal_education_lang_of_origin',\
           'teachers_use_target_lang_during_foreign_lang_lessons',\
           'students_use_target_lang_during_foreign_lang_lessons',\
           'resource_use_target_lang_lessons', 'perceived_emphasis_similarities_between_known_languages',\
           'indicator_for_perception_usefulness_target_lang_and_learning',\
           'perceived_difficulty_target_lang_learning', 'indicator_perception_target_lang_lessons_teacher_textbook',\
           'class_size', 'programme_level', 'program_designation', 'programme_orientation_clean',\
           'compulsory_target_lang_learning', 'plausible_value_1_listening',\
           'plausible_value_2_listening', 'plausible_value_3_listening',\
           'plausible_value_4_listening', 'plausible_value_5_listening',\
           'plausible_value_1_reading', 'plausible_value_2_reading',\
           'plausible_value_3_reading', 'plausible_value_4_reading',\
           'plausible_value_5_reading', 'plausible_value_1_writing_aspect_1',\
           'plausible_value_2_writing_aspect_1', 'plausible_value_3_writing_aspect_1',\
           'plausible_value_4_writing_aspect_1', 'plausible_value_5_writing_aspect_1',\
           'plausible_value_1_writing_aspect_2', 'plausible_value_2_writing_aspect_2',\
           'plausible_value_3_writing_aspect_2', 'plausible_value_4_writing_aspect_2',\
           'plausible_value_5_writing_aspect_2', 'plausible_value_1_writing_combined',\
           'plausible_value_2_writing_combined', 'plausible_value_3_writing_combined',\
           'plausible_value_4_writing_combined', 'plausible_value_5_writing_combined',\
           'plausible_level_1_reading', 'plausible_level_2_reading',\
           'plausible_level_3_reading', 'plausible_level_4_reading',\
           'plausible_level_5_reading', 'plausible_level_1_listening',\
           'plausible_level_2_listening', 'plausible_level_3_listening',\
           'plausible_level_4_listening', 'plausible_level_5_listening',\
           'plausible_level_1_writing_aspect_1', 'plausible_level_2_writing_aspect_1',\
           'plausible_level_3_writing_aspect_1', 'plausible_level_4_writing_aspect_1',\
           'plausible_level_5_writing_aspect_1', 'plausible_level_1_writing_aspect_2',\
           'plausible_level_2_writing_aspect_2', 'plausible_level_3_writing_aspect_2',\
           'plausible_level_4_writing_aspect_2', 'plausible_level_5_writing_aspect_2',\
           'plausible_level_1_writing_combined', 'plausible_level_2_writing_combined',\
           'plausible_level_3_writing_combined', 'plausible_level_4_writing_combined',\
           'plausible_level_5_writing_combined', 'final_student_weight_listening_untrimmed',\
           'final_student_weight_listening_trimmed', 'final_student_weight_reading_untrimmed',\
           'final_student_weight_reading_trimmed', 'final_student_weight_writing_untrimmed',\
           'final_student_weight_writing_trimmed', 'final_student_weight_questionnaire_untrimmed',\
           'final_student_weight_questionnaire_trimmed', 'student_replicate_weight_1',\
           'student_replicate_weight_2', 'student_replicate_weight_3',\
           'student_replicate_weight_4', 'student_replicate_weight_5',\
           'student_replicate_weight_6', 'student_replicate_weight_7',\
           'student_replicate_weight_8', 'student_replicate_weight_9',\
           'student_replicate_weight_10', 'student_replicate_weight_11',\
           'student_replicate_weight_12', 'student_replicate_weight_13',\
           'student_replicate_weight_14', 'student_replicate_weight_15',\
           'student_replicate_weight_16', 'student_replicate_weight_17',\
           'student_replicate_weight_18', 'student_replicate_weight_19',\
           'student_replicate_weight_20', 'student_replicate_weight_21',\
           'student_replicate_weight_22', 'student_replicate_weight_23',\
           'student_replicate_weight_24', 'student_replicate_weight_25',\
           'student_replicate_weight_26', 'student_replicate_weight_27',\
           'student_replicate_weight_28', 'student_replicate_weight_29',\
           'student_replicate_weight_30', 'student_replicate_weight_31',\
           'student_replicate_weight_32', 'student_replicate_weight_33',\
           'student_replicate_weight_34', 'student_replicate_weight_35',\
           'student_replicate_weight_36', 'student_replicate_weight_37',\
           'student_replicate_weight_38', 'student_replicate_weight_39',\
           'student_replicate_weight_40', 'student_replicate_weight_41',\
           'jack_knife_zone', 'jack_knife_replicate', 'version_student_db_date_of_release']
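
Python silently concatenates adjacent string literals, so a missing comma in a long list of column names fuses two names into one without raising an error. The sketch below shows one possible sanity check for such a list. The function name `check_column_names`, the `expected_count` parameter and the toy `sample` list are illustrative assumptions, not part of the original notebook.

```python
from collections import Counter


def check_column_names(names, expected_count=None):
    """Return (duplicates, length_ok) for a list of column names.

    duplicates: sorted names that occur more than once.
    length_ok:  True if no expected_count was given or the length matches it.
    """
    counts = Counter(names)
    duplicates = sorted(name for name, n in counts.items() if n > 1)
    length_ok = expected_count is None or len(names) == expected_count
    return duplicates, length_ok


# Toy example (hypothetical data): the missing comma after
# 'usefulness_science' fuses it with the next literal, so the list
# holds 3 names instead of the intended 4.
sample = ['class_size', 'usefulness_science'
          'usefulness_humanities', 'class_size']

duplicates, length_ok = check_column_names(sample, expected_count=4)
print(duplicates)
print(length_ok)
```

Comparing the list length with the number of columns in the loaded data, if that number is known, would flag a fused name before the list is assigned as column headers.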

In [108]:
# additional dictionary of missing professions
prof_dict_f2 = {
6000: 'SKILLED AGRICULTURAL & FISHERY WORKERS',
6100: 'MARKET-ORIENTED SKILLED AGRICULTURAL & FISHERY WORKERS [This category includes skilled farm workers and self-employed sm',
6110: 'MARKET GARDENERS & CROP GROWERS',
6111: 'Field crop & vegetable growers [incl. Specialized Crop Farmers, Specialized Crop Farm Workers]',
6112: 'Tree & shrub crop growers [incl. Skilled Rubber Worker, Coffee Farmer, Tea Grower, Fruit Tree Pruner]',
6113: 'Gardeners, horticultural & nursery growers [incl. Bulb Grower, Market Gardener]',
6114: 'Mixed-crop growers [incl. Share Cropper]',
6120: 'MARKET-ORIENTED ANIMAL PRODUCERS AND RELATED WORKERS',
6121: 'Dairy & livestock producers [incl. Cattle Breeder, Dairy Farmer, Grazier, Shepherd]',
6122: 'Poultry producers [incl. Chicken Farmer, Skilled Hatchery Worker]',
6123: 'Apiarists & sericulturists [incl. Beekeeper, Silkworm Raiser]',
6124: 'Mixed-animal producers',
6129: 'Market-oriented animal producers, etc. workers nec [incl. Bird Breeder, Gamekeeper, Kennel Keeper, Dog Trainer, Animal C',
6130: 'MARKET-ORIENTED CROP & ANIMAL PRODUCERS',
6131: '[Mixed farmers]',
6132: '[Farm foremen/supervisor]',
6133: '[Farmers nfs]',
6134: '[Skilled farm workers nfs]',
6140: 'FORESTRY AND RELATED WORKERS',
6141: 'Forestry workers & loggers [incl. Forester, Rafter, Timber Cruiser]',
6142: 'Charcoal burners, etc. workers',
6150: 'FISHERY WORKERS, HUNTERS & TRAPPERS',
6151: 'Aquatic-life cultivation workers [incl. Oyster Farmer, Pearl Cultivator, Fish Hatcher]',
6152: 'Inland & coastal waters fishery workers [incl. Sponge Diver, Fisherman]',
6153: 'Deep-sea fishery workers [incl. Fisherman nfs, Trawler Crewman]',
6154: 'Hunters & trappers [incl. Whaler]',
6200: 'SUBSISTENCE AGRICULTURAL & FISHERY WORKERS',
6210: 'SUBSISTENCE AGRICULTURAL & FISHERY WORKERS',
7000: 'CRAFT AND RELATED TRADES WORKERS',
7100: 'EXTRACTION & BUILDING TRADES WORKERS',
7110: 'MINERS, SHOTFIRERS, STONE CUTTERS & CARVERS',
7111: 'Miners & quarry workers [incl. Miner nfs]',
7112: 'Shotfirers & blasters',
7113: 'Stone splitters, cutters & carvers [incl. Tombstone Carver]',
7120: 'BUILDING FRAME AND RELATED TRADES WORKERS',
7121: 'Builders traditional materials',
7122: 'Bricklayers & stonemasons [incl. Pavior]',
7123: 'Concrete placers, concrete finishers, etc. workers [incl. Terrazzo Worker]',
7124: 'Carpenters & joiners',
7129: 'Building frame, etc. trades workers nec [incl. Construction Worker nfs, Billboard Erector, Demolition Worker, Scaffolder]',
7130: 'BUILDING FINISHERS AND RELATED TRADES WORKERS',
7131: 'Roofers',
7132: 'Floor layers & tile setters [incl. Parquetry Worker]',
7133: 'Plasterers [incl. Stucco Mason]',
7134: 'Insulation workers',
7135: 'Glaziers',
7136: 'Plumbers & pipe fitters [incl. Well Digger]',
7137: 'Building, etc. electricians',
7140: 'PAINTERS, BUILDING STRUCTURE CLEANERS AND RELATED TRADES WORKERS',
7141: 'Painters, etc. workers [incl. Construction Painter, Paperhanger]',
7142: 'Varnishers, etc. painters [incl. Automobile Painter]',
7143: 'Building structure cleaners [incl. Chimney Sweep, Sandblaster, Boiler Engine Cleaner]',
7200: 'METAL, MACHINERY AND RELATED TRADES WORKERS',
7210: 'METAL MOLDERS, WELDERS, SHEETMETAL WORKERS STRUCTURAL METAL',
7211: 'Metal molders & coremakers',
7212: 'Welders & flamecutters [incl. Brazier, Solderer]',
7213: 'Sheet-metal workers [incl. Panel Beater, Coppersmith, Tinsmith]',
7214: 'Structural-metal preparers & erectors [incl. Ship Plater, Riveter, Shipwright]',
7215: 'Riggers & cable splicers',
7216: 'Underwater workers [incl. Frogman]',
7220: 'BLACKSMITHS, TOOL-MAKERS AND RELATED TRADES WORKERS',
7221: 'Blacksmiths, hammer-smiths & forging press workers [incl. Toolsmith]',
7222: 'Tool-makers, etc. workers [incl. Locksmith]',
7223: 'Machine-tool setters & setter-operators [incl. Metal driller, Turner]',
7224: 'Metal wheel-grinders, polishers & tool sharpeners',
7230: 'MACHINERY MECHANICS & FITTERS',
7231: 'Motor vehicle mechanics & fitters [incl. Bicycle Repairman]',
7232: 'Aircraft engine mechanics & fitters',
7233: '[Industrial & agricultural] machinery mechanics & fitters [incl. Mechanic Heavy Equipment, Millwright]',
7234: '[Unskilled garage worker] [incl. Oiler-Greaser]',
7240: 'ELECTRICAL & ELECTRONIC EQUIPMENT MECHANICS & FITTERS',
7241: 'Electrical mechanics & fitters [incl. Office Machine Repairman]',
7242: 'Electronics fitters',
7243: 'Electronics mechanics & servicers',
7244: 'Telegraph & telephone installers & servicers',
7245: 'Electrical line installers, repairers & cable jointers',
7300: 'PRECISION, HANDICRAFT, PRINTING AND RELATED TRADES WORKERS',
7310: 'PRECISION WORKERS IN METAL AND RELATED MATERIALS',
7311: 'Precision-instrument makers & repairers [incl. Dental Mechanic, Watch Maker]',
7312: 'Musical-instrument makers & tuners',
7313: 'Jewelry & precious-metal workers [incl. Diamond Cutter, Goldsmith]',
7320: 'POTTERS, GLASS-MAKERS AND RELATED TRADES WORKERS',
7321: 'Abrasive wheel formers, potters, etc. workers',
7322: 'Glass-makers, cutters, grinders & finishers',
7323: 'Glass engravers & etchers',
7324: 'Glass ceramics, etc. decorative painters [incl. Decorative Painter, Signpainter]',
7330: 'HANDICRAFT WORKERS IN WOOD, TEXTILE, LEATHER, ETC.',
7331: 'Handicraft workers in wood, etc. materials [incl. Candle Maker, Straw-Hat Maker]',
7332: 'Handicraft workers in textile, leather, etc. materials [incl. Carpet Weaver]',
7340: 'PRINTING AND RELATED TRADES WORKERS',
7341: 'Compositors, typesetters, etc. workers [incl. Phototypesetter, Linotypist]',
7342: 'Stereotypers & electrotypers',
7343: 'Printing engravers & etchers',
7344: 'Photographic, etc. workers [incl. Darkroom worker]',
7345: 'Bookbinders, etc. workers',
7346: 'Silkscreen, block & textile printers',
7400: 'OTHER CRAFT AND RELATED TRADES WORKERS',
7410: 'FOOD PROCESSING AND RELATED TRADES WORKERS',
7411: 'Butchers, fishmongers, etc. food preparers',
7412: 'Bakers, pastry-cooks & confectionery makers',
7413: 'Dairy-products makers',
7414: 'Fruit, vegetable, etc. preservers',
7415: 'Food & beverage tasters & graders',
7416: 'Tobacco preparers & tobacco products makers',
7420: 'WOOD TREATERS, CABINET-MAKERS AND RELATED TRADES WORKERS',
7421: 'Wood treaters [incl. Wood Grader, Wood Impregnator]',
7422: 'Cabinet-makers, etc. workers [incl. Cartwright, Cooper]',
7423: 'Woodworking-machine setters & setter-operators [incl. Wood-Turner]',
7424: 'Basketry weavers, brush makers, etc. workers [incl. Broom Maker]',
7430: 'TEXTILE, GARMENT AND RELATED TRADES WORKERS',
7431: 'Fiber preparers',
7432: 'Weavers, knitters, etc. workers',
7433: 'Tailors, dressmakers & hatters [incl. Milliner]',
7434: 'Furriers, etc. workers',
7435: 'Textile, leather, etc. pattern-makers & cutters',
7436: 'Sewers, embroiderers, etc. workers',
7437: 'Upholsterers, etc. workers',
7440: 'PELT, LEATHER & SHOEMAKING TRADES WORKERS',
7441: 'Pelt dressers, tanners & fellmongers',
7442: 'Shoe-makers, etc. workers',
7500: '[SKILLED WORKERS NFS]',
7510: '[MANUAL FOREMEN NFS - NON-FARM]',
7520: '[SKILLED WORKERS NFS] [incl. Craftsman, Artisan, Tradesman]',
7530: '[APPRENTICE SKILLED WORK NFS]',
8000: 'PLANT & MACHINE OPERATORS & ASSEMBLERS',
8100: 'STATIONARY-PLANT AND RELATED OPERATORS',
8110: 'MINING- & MINERAL-PROCESSING PLANT OPERATORS',
8111: 'Mining-plant operators',
8112: 'Mineral-ore- & stone-processing plant operators',
8113: 'Well-drillers & borers, etc. workers',
8120: 'METAL-PROCESSING PLANT OPERATORS',
8121: 'Ore & metal furnace operators',
8122: 'Metal melters, casters & rolling-mill operators',
8123: 'Metal heat-treating plant operators',
8124: 'Metal drawers & extruders',
8130: 'GLASS, CERAMICS AND RELATED PLANT OPERATORS',
8131: 'Glass & ceramics kiln, etc. machine operators',
8139: 'Glass, ceramics, etc. plant operators nec',
8140: 'WOOD-PROCESSING & PAPERMAKING PLANT OPERATORS',
8141: 'Wood-processing plant operators [incl. Sawyer]',
8142: 'Paper-pulp plant operators',
8143: 'Papermaking plant operators',
8150: 'CHEMICAL-PROCESSING PLANT OPERATORS',
8151: 'Crushing grinding & chemical-mixing machinery operators',
8152: 'Chemical heat-treating plant operators',
8153: 'Chemical-filtering & separating-equipment operators',
8154: 'Chemical-still & reactor operators',
8155: 'Petroleum & natural-gas refining plant operators',
8159: 'Chemical-processing plant operators nec',
8160: 'POWER-PRODUCTION AND RELATED PLANT OPERATORS',
8161: 'Power-production plant operators',
8162: 'Steam-engine & boiler operators [incl. Stoker, Ship Engine Room Ratings]',
8163: 'Incinerator water-treatment, etc. plant operators [incl. Sewage Plant Operator]',
8170: 'AUTOMATED ASSEMBLY-LINE & INDUSTRIAL-ROBOT OPERATORS',
8171: 'Automated assembly-line operators',
8172: 'Industrial-robot operators',
8200: 'MACHINE OPERATORS & ASSEMBLERS',
8210: 'METAL- & MINERAL-PRODUCTS MACHINE OPERATORS',
8211: 'Machine-tool operators [incl. Machine Operator nfs]',
8212: 'Cement & other mineral products machine operators',
8220: 'CHEMICAL-PRODUCTS MACHINE OPERATORS',
8221: 'Pharmaceutical & toiletry products machine operators',
8222: 'Ammunition & explosive-products machine operators',
8223: 'Metal-finishing, -plating, & -coating machine operators [incl. Electroplater, Fettler]',
8224: 'Photographic-products machine operators',
8229: 'Chemical-products machine operators nec',
8230: 'RUBBER- & PLASTIC-PRODUCTS MACHINE OPERATORS',
8231: 'Rubber-products machine operators',
8232: 'Plastic-products machine operators',
8240: 'WOOD-PRODUCTS MACHINE OPERATORS',
8250: 'PRINTING, BINDING & PAPER-PRODUCTS MACHINE OPERATORS',
8251: 'Printing-machine operators',
8252: 'Bookbinding-machine operators',
8253: 'Paper-products machine operators',
8260: 'TEXTILE, FUR & LEATHER-PRODUCTS MACHINE OPERATORS',
8261: 'Fiber-preparing, spinning & winding machine operators',
8262: 'Weaving- & knitting-machine operators',
8263: 'Sewing-machine operators',
8264: 'Bleaching-, dyeing- & cleaning-machine operators [incl. Launderer]',
8265: 'Fur- & leather-preparing-machine operators',
8266: 'Shoemaking-, etc. machine operators',
8269: 'Textile, fur & leather-products machine operators nec',
8270: 'FOOD AND RELATED PRODUCTS MACHINE OPERATORS',
8271: 'Meat- & fish-processing machine operators',
8272: 'Dairy-products machine operators',
8273: 'Grain- & spice-milling machine operators',
8274: 'Baked-goods cereal & chocolate products machine operators',
8275: 'Fruit-, vegetable- & nut-processing machine operators',
8276: 'Sugar-production machine operators',
8277: 'Tea-, coffee- & cocoa-processing machine operators',
8278: 'Brewers-, wine & other beverage machine operators',
8279: 'Tobacco-production machine operators',
8280: 'ASSEMBLERS',
8281: 'Mechanical-machinery assemblers [incl. Car Assembly-Line Worker]',
8282: 'Electrical-equipment assemblers',
8283: 'Electronic-equipment assemblers',
8284: 'Metal, rubber & plastic products assemblers',
8285: 'Wood, etc. products assemblers',
8286: 'Paperboard, textile, etc. products assemblers',
8290: 'OTHER MACHINE OPERATORS & ASSEMBLERS',
8300: 'DRIVERS & MOBILE-PLANT OPERATORS',
8310: 'LOCOMOTIVE-ENGINE DRIVERS AND RELATED WORKERS',
8311: 'Locomotive-engine drivers',
8312: 'Railway brakers signalers & shunters',
8320: 'MOTOR-VEHICLE DRIVERS [incl. Driver nfs]',
8321: 'Motorcycle drivers',
8322: 'Car, taxi & van drivers [incl. Taxi Owner nfs]',
8323: 'Bus & tram drivers',
8324: 'Heavy truck & lorry drivers',
8330: 'AGRICULTURAL & OTHER MOBILE PLANT OPERATORS',
8331: 'Motorized farm & forestry plant operators [incl. Tractor Driver, Combine Harvester Operator]',
8332: 'Earth-moving, etc. plant operators [incl. Bulldozer Driver, Dredge Operator, Road Roller Driver]',
8333: 'Crane, hoist, etc. plant operators',
8334: 'Lifting-truck operators',
8340: 'SHIPS DECK CREWS AND RELATED WORKERS [incl. Boatman, Deck Hand, Sailor, Ship Deck Ratings]',
8400: 'SEMISKILLED WORKERS NFS [incl. Production Process Worker nfs, Factory Worker nfs]',
9000: 'ELEMENTARY OCCUPATIONS',
9100: 'SALES & SERVICES ELEMENTARY OCCUPATIONS',
9110: 'STREET VENDORS AND RELATED WORKERS',
9111: 'Street food vendors',
9112: 'Street vendors nonfood products [incl. Hawker, Peddler, Newsvendor, Rag Picker, Scavenger]',
9113: 'Door-to-door & telephone salespersons [incl. Solicitor, Canvasser]',
9120: 'STREET SERVICES ELEMENTARY OCCUPATIONS [incl. Billposter, Shoeshiner, Car Window Washer]',
9130: 'DOMESTIC AND RELATED HELPERS, CLEANERS & LAUNDERERS',
9131: 'Domestic helpers & cleaners [incl. Housemaid, Housekeeper nfs]',
9132: 'Helpers & cleaners in establishments [Kitchen Hand, Chambermaid]',
9133: 'Hand-launderers & pressers',
9140: 'BUILDING CARETAKERS, WINDOW AND RELATED CLEANERS',
9141: 'Building caretakers [incl. Janitor, Sexton, Verger]',
9142: 'Vehicle, window, etc. cleaners',
9150: 'MESSENGERS, PORTERS, DOORKEEPERS AND RELATED WORKERS',
9151: 'Messengers, package & luggage porters & deliverers [incl. Elevator Attendant, Bellboy, Messenger]',
9152: 'Doorkeepers, watchpersons, etc. workers [incl. Amusement Park Attendant, Ticket Collector, Usher, Watchman nfs, Park Atte',
9153: 'Vending-machine money collectors, meter readers, etc. workers',
9160: 'GARBAGE COLLECTORS AND RELATED LABORERS',
9161: 'Garbage collectors [incl. Dustman]',
9162: 'Sweepers, etc. laborers [incl. Odd-Job Worker]',
9200: 'AGRICULTURAL, FISHERY AND RELATED LABORERS',
9210: 'AGRICULTURAL, FISHERY AND RELATED LABORERS',
9211: 'Farm-hands & laborers [incl. Cow Herd, Farm Helper, Fruit Picker]',
9212: 'Forestry laborers',
9213: 'Fishery, hunting & trapping laborers',
9300: 'LABORERS IN MINING, CONSTRUCTION, MANUFACTURING & TRANSPORT [incl. Unskilled Worker nfs]',
9310: 'MINING & CONSTRUCTION LABORERS',
9311: 'Mining & quarrying laborers',
9312: 'Construction & maintenance laborers: roads, dams, etc. [incl. Navvy, Shoveller, Railway Trackworker]',
9313: 'Building construction laborers [incl. Handyman, Hod Carrier]',
9320: 'MANUFACTURING LABORERS',
9321: 'Assembling laborers [incl. Sorter, Bottle Sorter, Winder, Checker nfs, Grader nfs]',
9322: 'Handpackers & other manufacturing laborers [incl. Crater, Labeler]',
9330: 'TRANSPORT LABORERS & FREIGHT HANDLERS',
9331: 'Hand or pedal vehicle drivers [incl. Rickshaw Driver]',
9332: 'Drivers of animal-drawn vehicles & machinery',
9333: 'Freight handlers [incl. Docker, Loader, Longshoreman]',
9501: 'Housewife',
9502: 'Student',
9503: 'Social beneficiary (unemployed, retired, sickness, etc.)',
9504: "Don't know",
9505: 'Vague (a good job, a quiet job, a well paid job, an office job, etc.)',
9997: 'N/A',
9998: 'Invalid',
9999: 'Missing'
}
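
A code-to-label dictionary like the one above can be merged with other lookup dictionaries and applied to the occupation codes. The following is a minimal sketch under stated assumptions. The helper names `merge_code_dicts` and `label_codes`, the two small stand-in dictionaries and the `'Unknown code'` fallback are hypothetical and only illustrate the idea, not how the notebook itself applies `prof_dict_f2`.

```python
def merge_code_dicts(*dicts):
    """Merge code-to-label dictionaries; later dictionaries win on clashes."""
    merged = {}
    for d in dicts:
        merged.update(d)
    return merged


def label_codes(codes, lookup, default='Unknown code'):
    """Translate occupation codes to labels, using `default` for unseen codes."""
    return [lookup.get(code, default) for code in codes]


# Hypothetical stand-ins: a few entries copied from the dictionary above,
# split in two to mimic a main dictionary plus an additional one.
part_one = {6000: 'SKILLED AGRICULTURAL & FISHERY WORKERS',
            6110: 'MARKET GARDENERS & CROP GROWERS'}
part_two = {9501: 'Housewife', 9999: 'Missing'}

lookup = merge_code_dicts(part_one, part_two)
labels = label_codes([6110, 9501, 1234], lookup)
print(labels)
```

With a pandas column such as `occupation_mother`, the same lookup could be applied through `Series.map`, which leaves codes absent from the dictionary as `NaN` so they are easy to count and inspect.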