### *Data Collection - European Parliament*
## Preparing Raw Data
---
**Sample Text 1**
Title: European Skills Agenda for sustainable competitiveness, social fairness and resilience <br>
Date: February 08, 2021 - Brussels

In [4]:
# Import the libraries used in this notebook
import os.path
import pprint
import re
import time
import urllib.request
from urllib import request

import nltk
import pandas as pd
import requests
from bs4 import BeautifulSoup
from nltk import FreqDist, word_tokenize
from requests_html import HTMLSession

---
### Process: Trimming the debate by inserting the original or translated English text and tokenizing it
*Note*: Due to time constraints, the process has been streamlined.

- English parts of the debate are added manually as a string and then tokenized.

- Non-English parts are handled the same way for all EU Parliament debates: they are copied from the original web pages, translated to English with one consistent tool, Google Translate (https://translate.google.com/?hl=de&tab=TT), and pasted in as a string.

- Afterwards, the usual steps are applied (tokenizing, standardizing).

Because of this changed process, the URL and the web scraping step are technically no longer necessary; they are included for completeness.
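The steps listed above can be sketched as one small pipeline instead of one cell per speech. This is a minimal sketch, not the notebook's actual code: the names `simple_tokenize`, `standardize`, and `prepare_speeches` are hypothetical, the two sample strings are shortened stand-ins for the pasted speeches, and "standardizing" is assumed here to mean lowercasing and dropping punctuation-only tokens, since this section does not define it. A regex-based tokenizer stands in for NLTK's `word_tokenize` so the example runs without downloading NLTK data.

```python
import re


def simple_tokenize(text):
    """Stand-in tokenizer: words (keeping inner hyphens/apostrophes) or single punctuation marks."""
    return re.findall(r"\w+(?:[-'’]\w+)*|[^\w\s]", text)


def standardize(tokens):
    """Assumed standardization: lowercase and keep only tokens that contain a letter."""
    return [tok.lower() for tok in tokens if any(ch.isalpha() for ch in tok)]


def prepare_speeches(speeches, tokenizer=simple_tokenize):
    """Map each speech key to its standardized token list."""
    return {key: standardize(tokenizer(text)) for key, text in speeches.items()}


# Hypothetical miniature corpus; the notebook pastes the full speeches instead.
speeches = {
    "raw1_2": "Mr President , the COVID-19 pandemic has highlighted the digital skills gap .",
    "raw1_3": "Madam President, honourable Members, first I want to thank you.",
}

prepared = prepare_speeches(speeches)
print(prepared["raw1_2"])
```

To use NLTK as in the cells below, pass it in explicitly: `prepare_speeches(speeches, tokenizer=word_tokenize)`. Keeping the speeches in one dictionary means a newly pasted speech needs only a new entry rather than a new tokenizing cell.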

In [5]:
# url = "https://www.europarl.europa.eu/doceo/document/CRE-9-2021-02-08-ITM-018_EN.html"
# html = requests.get(url)
# raw = BeautifulSoup(html.content, 'html.parser').get_text()
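For completeness, the scraping step could look roughly like the sketch below. It is an illustration under stated assumptions rather than the method used in this notebook: it swaps `requests` and `BeautifulSoup` for the standard library (`urllib.request` and `html.parser`) so it has no third-party dependencies, and the names `html_to_text` and `fetch_debate_text`, the timeout, and the `User-Agent` header are hypothetical choices. The extracted text will not match `BeautifulSoup(...).get_text()` exactly.

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen


class _TextExtractor(HTMLParser):
    """Collect visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def html_to_text(html):
    """Return the visible text of an HTML string, chunks joined by single spaces."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def fetch_debate_text(url, timeout=30):
    """Download a page and return its visible text (network access required)."""
    req = Request(url, headers={"User-Agent": "Mozilla/5.0"})  # assumed header
    with urlopen(req, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        html = response.read().decode(charset, errors="replace")
    return html_to_text(html)


# Offline demonstration on a made-up HTML snippet.
sample = "<html><head><style>p{}</style></head><body><p>Mr President,</p><p>dear colleagues.</p></body></html>"
print(html_to_text(sample))
```

Calling `fetch_debate_text(url)` with the debate URL from the cell above would return the whole page text, including navigation and non-English passages, which is why the manual copy-and-translate process replaced this step.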

In [6]:
raw1_2 = (
    'Vilija Blinkevičiūtė , Author . Mr President , Commissioner , dear colleagues , the green course , digitalization has enormous impact on our lives , our work , our communication . Ongoing demographic change , globalization and now in particular the COVID-19 pandemic are profoundly changing and affecting the way we work , forcing a new and fundamental review of whether existing skills and qualifications are a new reality . The Commission Communication " A European Skills Agenda for Sustainable Competitiveness , Social Fairness and Resilience " seeks to address these challenges , in particular by ensuring that a fundamental shift in skills and lifelong learning are at the heart of the agenda of European social rights , is a reality throughout the European Union . In this context, the Committee on Employment and Social Affairs of the European Parliament would like to ask the Commission the following four questions . And the first question concerns the establishment of concrete deadlines for the actions described in the communication of the Commission . In order for the competences to reflect the labor market needs and contribute to a rapid and effective recovery from the COVID crisis , the measures set out in the competences agenda must be implemented as soon as possible . So could the Commission provide an overview with concrete dates for the implementation of the identified actions ? Question 2 : The COVID-19 pandemic has highlighted the digital skills gap and exacerbated existing inequalities in the education system , including gender inequalities . In addition , the number of early school leavers has increased , especially among the most disadvantaged groups in society . So how does the Commission intend to ensure that particular attention is paid to digital literacy in both education and lifelong learning programs ? '
    'Can the Commission explain how it intends to ensure that education and lifelong learning programs are of high quality and inclusive in order to create equal opportunities for all, especially for those belonging to the most vulnerable groups in society or living in rural or remote areas? How will the Commission support Member States to strengthen their efforts to ensure high quality and inclusive education and lifelong learning programs ? The third problem : the implementation of the European Agenda for Skills requires adequate funding . For this agenda to be successful , significant public and private investment at national level is required , in addition to the planned funding from the European Union . Can the Commission therefore explain what concrete measures it will take to encourage national public funding and the raising of private capital and to reduce the enormous disparities between Member States in the field of lifelong learning ? And can the Commission explain how it will encourage companies to contribute to and finance the training of workers and apprentices? And a final question: Could the Commission list what should be done to help Member States adapt their national education systems to the needs of the labor market across the European Union? How to attract and develop the talents and skills of people , which are crucial not only for the personal development of people , but also for increasing the competitiveness of European companies ?'
)
tokens1_2 = word_tokenize(raw1_2)
for word in tokens1_2:
    print(word, end=' ')

Vilija Blinkevičiūtė , Author . Mr President , Commissioner , dear colleagues , the green course , digitalization has enormous impact on our lives , our work , our communication . Ongoing demographic change , globalization and now in particular the COVID-19 pandemic are profoundly changing and affecting the way we work , forcing a new and fundamental review of whether existing skills and qualifications are a new reality . The Commission Communication `` A European Skills Agenda for Sustainable Competitiveness , Social Fairness and Resilience `` seeks to address these challenges , in particular by ensuring that a fundamental shift in skills and lifelong learning are at the heart of the agenda of European social rights , is a reality throughout the European Union . In this context , the Committee on Employment and Social Affairs of the European Parliament would like to ask the Commission the following four questions . And the first question concerns the establishment of concrete deadline

In [7]:
raw1_3 = (
    'Nicolas Schmit, Member of the Commission. – Madam President, honourable Members, first I want to thank you, and in particular the Members of the Committee on Employment and Social Affairs, for this question on a very important topic. Indeed the Skills Agenda is the centrepiece of our efforts to bring to life the principles of the UN pillar of social rights and notably principle number one. This principle states that everyone, I quote, ‘has the right to quality and inclusive education, training and lifelong learning.’ We need to invest in people and in building a strong human capital to prepare and accompany the green and digital transitions, as well as to respond to the consequences of the COVID-19 crisis. Our goal is to empower people with education and training, with a view to boosting the creation of quality jobs and setting the conditions for a more innovative, competitive, inclusive and sustainable development model. You rightfully asked about the progress we made in implementing the European Skills Agenda. The agenda was adopted by the Commission on 1 July 2020. It is a five-year plan to help Europe develop more and better skills and to put them to good use. Since last July, 7 of the agenda’s 12 flagship actions have already been launched. Such actions include the Council recommendation on vocational education and training which we already had the pleasure of discussing in this House. We have also launched the new Europass platform. The remaining five actions will be launched in 2021, notably the initiatives on Individual Learning Accounts and on micro-credentials. The agenda is centred around inclusive learning. It is vital to address this point across all actions, for example in the Council recommendation on vocational education and training. We also underlined the importance of digital skills, and digital education and training. This is even more fundamental in the aftermath of the COVID-19 pandemic. '
    'Therefore the digital education plan for 2021-2027 adopted on 30 September 2020 puts forward inclusion as one of its guiding principles. We are also working on the Skills for Life initiatives to reach out to the most vulnerable and to create concrete opportunities for engaging with learning. Our ambitions will fail to reach that goal if we collectively do not invest. We need to make the best use of the unprecedented funding of the Union budget to tackle the economic and social consequences of this crisis. This includes of course the European Social Fund+ which can now be implemented following the agreement in trilogue between Parliament and Council’s negotiators very recently. Additionally, the Commission has encouraged Member States to use the Recovery and Resilience Facility where one of the seven flagship initiatives for reforms and investments is re-skill and upskill. The European Parliament has played a central role in strengthening the position of education and skills in the legal framework of the Recovery and Resilience Facility. I thank you for that as there can be no recovery and no resilience if we do not ensure adequate investment in skills and in quality jobs, especially also in the transition from one job to another. Member States should put a strong emphasis on investment in skills in their national recovery and resilience plans, but we are working now with Member States to make sure that upskilling and re-skilling is a top focus area. This is not all. Skills are a shared responsibility and it is essential for everybody to play their part. This is why we will put in place a number of measures to unlock private investment in skills. We will look at how fiscal frameworks can do better to support reforms and investment in skills, and how enterprises could better report on their employees’ skills development. All these efforts culminate in the Pact for Skills, which Commissioner Breton and I launched on 10 November 2020. '
    'The pact represents a shared engagement model for skills development. Partnerships should be created in order to respond to the skills challenges in different sectors and should involve social partners, too. To date, three skills partnerships are already in place and altogether EUR 11 billion have been committed by partners in industrial ecosystems to upskill their workforce in the coming years. Yes, the question of helping Member States to align their national education systems with the labour market is an important one. Of course the organisation and content of education and training is a matter for the Member States, but targeted and up-to-date skills intelligence can help align training with evolving labour market needs, as well as support individuals in their choices. As part of the Skills Agenda the Commission, in cooperation with Eurostat and the EU Agency Cedefop, is improving and making more widely available such skills intelligence at regional and sectoral level. Let me give you just one example. A Big Data initiative using artificial intelligence has analysed over 150 million job vacancies across Europe. This is an example of assessing skills needs in real time. The results are already online and the Commission is disseminating our findings with key stakeholders such as the European network of public employment services and with social partners. The Skills Agenda already in its name sets the path towards which we have to go: sustainable competitiveness, social fairness and resilience, and I know that I can count on this Parliament to go on this path.'
)
tokens1_3 = word_tokenize(raw1_3)
for word in tokens1_3:
    print(word, end=' ')

Nicolas Schmit , Member of the Commission . – Madam President , honourable Members , first I want to thank you , and in particular the Members of the Committee on Employment and Social Affairs , for this question on a very important topic . Indeed the Skills Agenda is the centrepiece of our efforts to bring to life the principles of the UN pillar of social rights and notably principle number one . This principle states that everyone , I quote , ‘ has the right to quality and inclusive education , training and lifelong learning. ’ We need to invest in people and in building a strong human capital to prepare and accompany the green and digital transitions , as well as to respond to the consequences of the COVID-19 crisis . Our goal is to empower people with education and training , with a view to boosting the creation of quality jobs and setting the conditions for a more innovative , competitive , inclusive and sustainable development model . You rightfully asked about the progress we ma

In [8]:
raw1_4 = 'Andrea Bocskor, on behalf of the PPE Group. - President! Resource scarcity, the transition to a resource-efficient, digital and climate-neutral economy, and the expanding use of artificial intelligence pose new challenges for the employment and labor markets. New professions are emerging as existing ones are transformed or eliminated. This is a clear signal that rapid change in skills and competences is needed for the competitiveness of the European Union and its Member States. The Covid19 pandemic accelerated the digital transition and showed the level of digital skills of different age groups. Due to security measures, most of our activities have moved into the digital space, with teleworking and distance learning becoming part of everyday life for millions, bringing to the surface the limits of our digital preparedness. I am pleased that the new European Skills Agenda also aims to help address the economic and labor market challenges posed by the Covid epidemic. I welcome the forward-looking proposals and it is important that it shows a sustainable path for the next generation. Modern, innovative and quality education and training, which are directly linked to the labor market and societal needs, must play a key role in skills development and help to equip young people with important skills that will prepare them for real life. Vocational training and lifelong learning must be given priority, as it is very important, through training and retraining, to be able to remain in the labor market at all times, and it is very important to think not only of young people, but also of adults and the elderly.'

In [9]:
tokens1_4 = word_tokenize(raw1_4)
tokens1_4

['Andrea',
 'Bocskor',
 ',',
 'on',
 'behalf',
 'of',
 'the',
 'PPE',
 'Group',
 '.',
 '-',
 'President',
 '!',
 'Resource',
 'scarcity',
 ',',
 'the',
 'transition',
 'to',
 'a',
 'resource-efficient',
 ',',
 'digital',
 'and',
 'climate-neutral',
 'economy',
 ',',
 'and',
 'the',
 'expanding',
 'use',
 'of',
 'artificial',
 'intelligence',
 'pose',
 'new',
 'challenges',
 'for',
 'the',
 'employment',
 'and',
 'labor',
 'markets',
 '.',
 'New',
 'professions',
 'are',
 'emerging',
 'as',
 'existing',
 'ones',
 'are',
 'transformed',
 'or',
 'eliminated',
 '.',
 'This',
 'is',
 'a',
 'clear',
 'signal',
 'that',
 'rapid',
 'change',
 'in',
 'skills',
 'and',
 'competences',
 'is',
 'needed',
 'for',
 'the',
 'competitiveness',
 'of',
 'the',
 'European',
 'Union',
 'and',
 'its',
 'Member',
 'States',
 '.',
 'The',
 'Covid19',
 'pandemic',
 'accelerated',
 'the',
 'digital',
 'transition',
 'and',
 'showed',
 'the',
 'level',
 'of',
 'digital',
 'skills',
 'of',
 'different',
 'age',


In [10]:
for word in tokens1_4:
    print(word, end=' ')

Andrea Bocskor , on behalf of the PPE Group . - President ! Resource scarcity , the transition to a resource-efficient , digital and climate-neutral economy , and the expanding use of artificial intelligence pose new challenges for the employment and labor markets . New professions are emerging as existing ones are transformed or eliminated . This is a clear signal that rapid change in skills and competences is needed for the competitiveness of the European Union and its Member States . The Covid19 pandemic accelerated the digital transition and showed the level of digital skills of different age groups . Due to security measures , most of our activities have moved into the digital space , with teleworking and distance learning becoming part of everyday life for millions , bringing to the surface the limits of our digital preparedness . I am pleased that the new European Skills Agenda also aims to help address the economic and labor market challenges posed by the Covid epidemic . I wel

In [11]:
raw1_5 = (
    'Lina Gálvez Muñoz, on behalf of the S&D Group. – Mr President, Commissioner Nicolas Schmit, all people and institutions agree on the importance of improving our qualifications and skills for our economies to advance in competitiveness, resilience and social justice, and to successfully transition to a green and digital economy. But the competences, are they the ones we need? Who defines it? How and who accesses them? Are all people in the same disposition to access them? Normally, when we approach the debate about competitions, we think that these are neutral and accessible to everyone and that getting them or not only depends on our merits and efforts. But is it really just an individual responsibility? I think not, that we do not have to individualize the responsibility of these processes, nor that they be neutral. For example, capabilities are not independent of gender or ethnic stereotypes and, therefore, neither are the opportunities to obtain them throughout our lives. And individualizing the responsibility of each person in relation to the skills they are capable of acquiring is unfair, because we do not all have the same opportunities, especially considering the educational inequalities with which we enter the labor market and which have increased, furthermore , with the pandemic. Therefore, from public institutions and from the European Parliament, we have to try to guarantee that the qualification and requalification processes are accessible to all people, with special attention to people from the most vulnerable groups, areas of the elderly levels of unemployment or rural and isolated areas. That these processes are free of stereotypes because we need all the talents for Europe to move forward in a more innovative and fair way. That learning processes are remunerated so that people from disadvantaged backgrounds do not play at a disadvantage. '
    'That these processes have a sufficient budget and the active participation of all social agents and that they include digital capacities to allow all citizens greater digital literacy... (the president cut off the speaker).'
)

In [12]:
tokens1_5 = word_tokenize(raw1_5)
tokens1_5

['Lina',
 'Gálvez',
 'Muñoz',
 ',',
 'on',
 'behalf',
 'of',
 'the',
 'S',
 '&',
 'D',
 'Group',
 '.',
 '–',
 'Mr',
 'President',
 ',',
 'Commissioner',
 'Nicolas',
 'Schmit',
 ',',
 'all',
 'people',
 'and',
 'institutions',
 'agree',
 'on',
 'the',
 'importance',
 'of',
 'improving',
 'our',
 'qualifications',
 'and',
 'skills',
 'for',
 'our',
 'economies',
 'to',
 'advance',
 'in',
 'competitiveness',
 ',',
 'resilience',
 'and',
 'social',
 'justice',
 ',',
 'and',
 'to',
 'successfully',
 'transition',
 'to',
 'a',
 'green',
 'and',
 'digital',
 'economy',
 '.',
 'But',
 'the',
 'competences',
 ',',
 'are',
 'they',
 'the',
 'ones',
 'we',
 'need',
 '?',
 'Who',
 'defines',
 'it',
 '?',
 'How',
 'and',
 'who',
 'accesses',
 'them',
 '?',
 'Are',
 'all',
 'people',
 'in',
 'the',
 'same',
 'disposition',
 'to',
 'access',
 'them',
 '?',
 'Normally',
 ',',
 'when',
 'we',
 'approach',
 'the',
 'debate',
 'about',
 'competitions',
 ',',
 'we',
 'think',
 'that',
 'these',
 'are',
 '

In [13]:
for word in tokens1_5:
    print(word, end=' ')

Lina Gálvez Muñoz , on behalf of the S & D Group . – Mr President , Commissioner Nicolas Schmit , all people and institutions agree on the importance of improving our qualifications and skills for our economies to advance in competitiveness , resilience and social justice , and to successfully transition to a green and digital economy . But the competences , are they the ones we need ? Who defines it ? How and who accesses them ? Are all people in the same disposition to access them ? Normally , when we approach the debate about competitions , we think that these are neutral and accessible to everyone and that getting them or not only depends on our merits and efforts . But is it really just an individual responsibility ? I think not , that we do not have to individualize the responsibility of these processes , nor that they be neutral . For example , capabilities are not independent of gender or ethnic stereotypes and , therefore , neither are the opportunities to obtain them througho

In [14]:
raw1_6 ='Dragoș Pîslaru, on behalf of the Renew Group. – Mr President, here we are, finally having a debate that our group, Renew Europe, has actually been calling for ever since June 2020, when we launched a paper called ‘Skills at the heart of Europe’. Skills are actually the key ingredient for our future. It’s about our small entrepreneurs, it’s about research, it’s about how we produce the vaccines for the population, and it’s at the core of productive activity. Skills are actually about the social underpinnings of independent living, having good wages and being able to provide for your family. Skills are about the future of our society. There is no Green Deal or digital transformation without proper skill investment. Here we are with the ambitious new Skills Agenda for Europe before us. We are in the Parliament, putting the right questions and coming with our own contribution to this very important debate. There are three major things to remember. It’s about coordination, how we better coordinate so that skills are going to be sought in all the policy areas, and how we are going to have efficient implementation and transform the words in this nice motion for resolution into facts. That’s the question for the Commission. How do we use that funding for the future? This is the most important thing. So repeat after me: skills, skills, skills. This is actually the future for Europe. Thank you very much, and we are looking forward, as the Parliament, to supporting the Commission in the implementation of the strategy.'
tokens1_6 = word_tokenize(raw1_6)
for word in tokens1_6:
    print(word, end=' ')

Dragoș Pîslaru , on behalf of the Renew Group . – Mr President , here we are , finally having a debate that our group , Renew Europe , has actually been calling for ever since June 2020 , when we launched a paper called ‘ Skills at the heart of Europe ’ . Skills are actually the key ingredient for our future . It ’ s about our small entrepreneurs , it ’ s about research , it ’ s about how we produce the vaccines for the population , and it ’ s at the core of productive activity . Skills are actually about the social underpinnings of independent living , having good wages and being able to provide for your family . Skills are about the future of our society . There is no Green Deal or digital transformation without proper skill investment . Here we are with the ambitious new Skills Agenda for Europe before us . We are in the Parliament , putting the right questions and coming with our own contribution to this very important debate . There are three major things to remember . It ’ s abou

In [15]:
raw1_7 = 'Julie Lechanteux, on behalf of the ID group. – Mr President, the debate of today is about the European Skills Agenda for sustainable competitiveness, social fairness and resilience. A well convoluted title to talk about the education, training of our young people and the job market that awaits them. Behind these bombastic formulas hides a harsher reality that you do not assume, as usual: that of an economic crisis which is only in its infancy, with the corollary of endemic unemployment to which uncontrolled immigration is no stranger. According to the latest Eurostat report of November 2020 concerning the European Union, there are more than 3 million young people under the age of 25 unemployed, which corresponds to an increase of almost 17% in just one year. In parallel with this damning observation, the European Commission recently affirmed that immigration would represent an inexhaustible source of talent. We walk on the head! When I see that the CEO of AstraZeneca is a Frenchman, I tell myself that geniuses, in France as in Europe, we are full of them. But for lack of political will and vision, and because of under-investment in training and scientific research, you scared them away. So, Mr. President, commit to resources to train, encourage and keep our young people instead of forcing them to choose between unemployment or expatriation, and put an end to your pact on migration, a real danger for our jobs and our civilisation.'

In [16]:
tokens1_7 = word_tokenize(raw1_7)
tokens1_7

['Julie',
 'Lechanteux',
 ',',
 'on',
 'behalf',
 'of',
 'the',
 'ID',
 'group',
 '.',
 '–',
 'Mr',
 'President',
 ',',
 'the',
 'debate',
 'of',
 'today',
 'is',
 'about',
 'the',
 'European',
 'Skills',
 'Agenda',
 'for',
 'sustainable',
 'competitiveness',
 ',',
 'social',
 'fairness',
 'and',
 'resilience',
 '.',
 'A',
 'well',
 'convoluted',
 'title',
 'to',
 'talk',
 'about',
 'the',
 'education',
 ',',
 'training',
 'of',
 'our',
 'young',
 'people',
 'and',
 'the',
 'job',
 'market',
 'that',
 'awaits',
 'them',
 '.',
 'Behind',
 'these',
 'bombastic',
 'formulas',
 'hides',
 'a',
 'harsher',
 'reality',
 'that',
 'you',
 'do',
 'not',
 'assume',
 ',',
 'as',
 'usual',
 ':',
 'that',
 'of',
 'an',
 'economic',
 'crisis',
 'which',
 'is',
 'only',
 'in',
 'its',
 'infancy',
 ',',
 'with',
 'the',
 'corollary',
 'of',
 'endemic',
 'unemployment',
 'to',
 'which',
 'uncontrolled',
 'immigration',
 'is',
 'no',
 'stranger',
 '.',
 'According',
 'to',
 'the',
 'latest',
 'Eurostat',

In [17]:
for word in tokens1_7:
    print(word, end=' ')

Julie Lechanteux , on behalf of the ID group . – Mr President , the debate of today is about the European Skills Agenda for sustainable competitiveness , social fairness and resilience . A well convoluted title to talk about the education , training of our young people and the job market that awaits them . Behind these bombastic formulas hides a harsher reality that you do not assume , as usual : that of an economic crisis which is only in its infancy , with the corollary of endemic unemployment to which uncontrolled immigration is no stranger . According to the latest Eurostat report of November 2020 concerning the European Union , there are more than 3 million young people under the age of 25 unemployed , which corresponds to an increase of almost 17 % in just one year . In parallel with this damning observation , the European Commission recently affirmed that immigration would represent an inexhaustible source of talent . We walk on the head ! When I see that the CEO of AstraZeneca 

In [18]:
raw1_8 = 'Eugenia Rodríguez Palop, on behalf of The Left Group. – Mr President, the European Skills Agenda cannot just facilitate the transition to expanding sectors and jobs. It is not about responding to the needs of a changing labor market that engulfs workers at will, makes training itineraries precarious and reinforces the instability of labor markets. We have to seriously think about the type of professionals we want, not just the markets we have. The Agenda must provide skills to those who need it most, thinking of them and eliminating class, race and gender biases that perpetuate inequality, and not only to do justice, but because to be competitive you have to have all the talents and potential capabilities. We know that diversity is profitable; For this reason, this proposal not only involves public authorities, but also companies: employers have to improve the qualifications of their workers, avoiding favoring the usual ones, facilitating training opportunities and contributing to their training. It is clear that we must continue to strive for the good of everyone.'
tokens1_8 = word_tokenize(raw1_8)
for word in tokens1_8:
    print(word, end=' ')

Eugenia Rodríguez Palop , on behalf of The Left Group . – Mr President , the European Skills Agenda can not just facilitate the transition to expanding sectors and jobs . It is not about responding to the needs of a changing labor market that engulfs workers at will , makes training itineraries precarious and reinforces the instability of labor markets . We have to seriously think about the type of professionals we want , not just the markets we have . The Agenda must provide skills to those who need it most , thinking of them and eliminating class , race and gender biases that perpetuate inequality , and not only to do justice , but because to be competitive you have to have all the talents and potential capabilities . We know that diversity is profitable ; For this reason , this proposal not only involves public authorities , but also companies : employers have to improve the qualifications of their workers , avoiding favoring the usual ones , facilitating training opportunities and 

In [19]:
raw1_9 ='Daniela Rondinelli (NI). - (IT) Mr President, ladies and gentlemen, investment in human capital is the key tool for overcoming the current economic and employment crisis. The Commission proposal, however, risks remaining a dead letter if we do not move towards an ambitious, concrete and monitorable European training model, articulated according to the green and digital transition. This approach, in fact, must envisage three indispensable conditions for quality training: the mutual recognition of the skills acquired; the definition of decent working conditions and wages for apprentices, such as to prevent any form of exploitation, and above all the relaunch of local skills pacts, so that training evolves while maintaining a strong link with the vocations and needs of the territories , thus accompanying the production transformation processes. Only thanks to this balance between the European and local dimensions will we be able to restore dignity to training, which represents an extraordinary source of job opportunities and an indispensable incubator of social awareness and active citizenship.'
tokens1_9 = word_tokenize(raw1_9)
for word in tokens1_9:
    print(word, end=' ')

Daniela Rondinelli ( NI ) . - ( IT ) Mr President , ladies and gentlemen , investment in human capital is the key tool for overcoming the current economic and employment crisis . The Commission proposal , however , risks remaining a dead letter if we do not move towards an ambitious , concrete and monitorable European training model , articulated according to the green and digital transition . This approach , in fact , must envisage three indispensable conditions for quality training : the mutual recognition of the skills acquired ; the definition of decent working conditions and wages for apprentices , such as to prevent any form of exploitation , and above all the relaunch of local skills pacts , so that training evolves while maintaining a strong link with the vocations and needs of the territories , thus accompanying the production transformation processes . Only thanks to this balance between the European and local dimensions will we be able to restore dignity to training , which 

In [20]:
raw1_10 = 'Romana Tomc (PPE). - Mr President. It is now clear to everyone that the coronavirus has already changed our labor market, and this debate is really coming at the right time. These days, decisions are also being made on the number of enrollment places in education systems. I will mention only two sectors today. First, high on the scale of staffing needs are those who deal with informatics, with computer science. This knowledge will be very important for our future. We in the European Parliament also emphasize the importance of digitalisation for our society, and if we want to achieve this, we must build on these technical professions. The second area is health. Many European health systems have fallen into an even greater crisis during the pandemic, not because of equipment, but because of staff, which is severely lacking in at least some parts of the European Union. However, the pandemic is a short-term challenge, the aging of the population is much greater when we need a lot of staff to take comprehensive care of our health. We need to think about all this today, not next year. All the calls for education systems to adapt to the market are indeed traditional, but once the first step needs to be taken. I hope that the Commission will do everything in its power to support such steps, thus ensuring the implementation of a skills program for a more competitive Europe.'
tokens1_10 = word_tokenize(raw1_10)
for word in tokens1_10:
    print(word, end=' ')

Romana Tomc ( PPE ) . - Mr President . It is now clear to everyone that the coronavirus has already changed our labor market , and this debate is really coming at the right time . These days , decisions are also being made on the number of enrollment places in education systems . I will mention only two sectors today . First , high on the scale of staffing needs are those who deal with informatics , with computer science . This knowledge will be very important for our future . We in the European Parliament also emphasize the importance of digitalisation for our society , and if we want to achieve this , we must build on these technical professions . The second area is health . Many European health systems have fallen into an even greater crisis during the pandemic , not because of equipment , but because of staff , which is severely lacking in at least some parts of the European Union . However , the pandemic is a short-term challenge , the aging of the population is much greater when 

In [21]:
raw1_11 = 'Marc Angel (S&D). – Mr President, according to Cedefop, almost half of all EU workers will need to update their skills and/or gain new ones to get or keep jobs and to embrace the opportunities of the digital and the green transitions, and therefore we Socialists and Democrats fully support the EU Commissioners’ efforts in implementing the Skills Agenda and the European Pillar of Social Rights by pushing Member States to join this path in order to leave no one behind. Member States must use money from EU funds – the Recovery Fund included – to adapt their education and training systems. Vulnerable groups must have an equal access to education, skilling and upskilling. The EU and Member States must avoid the creation of new gaps resulting from unequal access to technology, especially between generations, gender and between rural areas and cities. As Socialists and Democrats, we will stand with EU citizens and will support them to embrace transition without forgetting to strengthen, next to their skills, also their personal development and critical thinking. In a changing world, being aware of the necessity of the new skills agenda is crucial for both workers and employers.'
tokens1_11 = word_tokenize(raw1_11)
for word in tokens1_11:
    print(word, end=' ')

Marc Angel ( S & D ) . – Mr President , according to Cedefop , almost half of all EU workers will need to update their skills and/or gain new ones to get or keep jobs and to embrace the opportunities of the digital and the green transitions , and therefore we Socialists and Democrats fully support the EU Commissioners ’ efforts in implementing the Skills Agenda and the European Pillar of Social Rights by pushing Member States to join this path in order to leave no one behind . Member States must use money from EU funds – the Recovery Fund included – to adapt their education and training systems . Vulnerable groups must have an equal access to education , skilling and upskilling . The EU and Member States must avoid the creation of new gaps resulting from unequal access to technology , especially between generations , gender and between rural areas and cities . As Socialists and Democrats , we will stand with EU citizens and will support them to embrace transition without forgetting to 

In [22]:
raw1_12 = 'Radka Maxová (Renew). – Mr President, the coronavirus pandemic has completely changed our lives. Instead of meeting every day, we have moved online, including for work and education. This has significantly deepened existing inequalities and complicated the lives of people, and this is exactly where the European Skills Agenda can help. Within it, it is important to also think of people with disabilities and fellow citizens who, for various reasons, live on the margins of society or directly in poverty. These people are threatened not only by the current coronavirus crisis, but also by the transition to a more digital world and an economy linked to the green transition. It is therefore very important to ensure programs that develop precisely this education and these skills, which will be related to the change in the future labor market. The labor market will be strongly affected by digitalization and the transition to green energy. In these programs, we must not forget to support internet access for all. Only in this way will we use the current situation to move towards a fairer society and equal access for all our citizens.'
tokens1_12 = word_tokenize(raw1_12)
for word in tokens1_12:
    print(word, end=' ')

Radka Maxová ( Renew ) . – Mr President , the coronavirus pandemic has completely changed our lives . Instead of meeting every day , we have moved online , including for work and education . This has significantly deepened existing inequalities and complicated the lives of people , and this is exactly where the European Skills Agenda can help . Within it , it is important to also think of people with disabilities and fellow citizens who , for various reasons , live on the margins of society or directly in poverty . These people are threatened not only by the current coronavirus crisis , but also by the transition to a more digital world and an economy linked to the green transition . It is therefore very important to ensure programs that develop precisely this education and these skills , which will be related to the change in the future labor market . The labor market will be strongly affected by digitalization and the transition to green energy . In these programs , we must not forget to 

In [23]:
raw1_13 = 'Stelios Kympouropoulos (PPE). – Mr President, dear colleagues, the latest World Economic Forum Future of Jobs report shows that over the next years, 40% of workers will require reskilling of approximately six months. Having this in mind, the Commission’s Skills Agenda Communication has rightly placed skills at the heart of the EU policy agenda. We will need quality and inclusive training and lifelong learning opportunities for all, with a particular focus on the most vulnerable people of our society and workers in sectors that will undergo fundamental changes. Digital skills and literacy, green skills as well as competences such as critical thinking and problem—solving should be key elements in order to harness the potential of the digital and green transitions and bridge the existing skills gaps. However, this can only be achieved through a holistic approach, involving all relevant stakeholders and enabling us to anticipate the changing nature of jobs and the skills needed in order to adjust our education systems. And that is the only way to succeed in one of the biggest challenges of our lifetime: to enable our workforce to thrive in new job types that don’t yet exist.'
tokens1_13 = word_tokenize(raw1_13)
for word in tokens1_13:
    print(word, end=' ')

Stelios Kympouropoulos ( PPE ) . – Mr President , dear colleagues , the latest World Economic Forum Future of Jobs report shows that over the next years , 40 % of workers will require reskilling of approximately six months . Having this in mind , the Commission ’ s Skills Agenda Communication has rightly placed skills at the heart of the EU policy agenda . We will need quality and inclusive training and lifelong learning opportunities for all , with a particular focus on the most vulnerable people of our society and workers in sectors that will undergo fundamental changes . Digital skills and literacy , green skills as well as competences such as critical thinking and problem—solving should be key elements in order to harness the potential of the digital and green transitions and bridge the existing skills gaps . However , this can only be achieved through a holistic approach , involving all relevant stakeholders and enabling us to anticipate the changing nature of jobs and the skills 

In [24]:
raw1_14 = 'Radan Kanev (PPE). - Mr President, the health crisis has not just changed the labor market, it has, above all, accelerated the changes we have been talking about for a long time, but we have been talking about in the future. They are already a reality. Remote work, flexible but also uncertain working hours, the displacement of traditional jobs by computer technology. And they come with a severe economic crisis and rising unemployment. The problems became apparent during the COVID crisis, but they will not go away. It is natural to hear today calls for increased state intervention, for a new role for governments, even for the European institutions, for increased public spending. But if we try to keep the labor market in the 20th century by force, we will fail, we will destroy the social market economy and our social systems. New public commitment, new public spending is needed, but it is needed not to stop change, but to help us adapt to it. And they need to follow only three priorities: education, education and education.'
tokens1_14 = word_tokenize(raw1_14)
for word in tokens1_14:
    print(word, end=' ')

Radan Kanev ( PPE ) . - Mr President , the health crisis has not just changed the labor market , it has , above all , accelerated the changes we have been talking about for a long time , but we have been talking about in the future . They are already a reality . Remote work , flexible but also uncertain working hours , the displacement of traditional jobs by computer technology . And they come with a severe economic crisis and rising unemployment . The problems became apparent during the COVID crisis , but they will not go away . It is natural to hear today calls for increased state intervention , for a new role for governments , even for the European institutions , for increased public spending . But if we try to keep the labor market in the 20th century by force , we will fail , we will destroy the social market economy and our social systems . New public commitment , new public spending is needed , but it is needed not to stop change , but to help us adapt to it . And they need to f

In [25]:
raw1_15 = 'Anne Sander (EPP). – Mr President, Commissioner, long before the pandemic, we already knew that skills and learning were key. With the COVID-19 crisis, the transformation of the world of work has accelerated exponentially, towards ever more digital uses. Our companies therefore need an even more qualified workforce to meet new challenges, in particular to prepare for the emergence of new professions. Indeed, 65% of children entering primary school will work in a profession that does not yet exist. However, today, 42% of Europeans lack basic computer skills. In addition, the current crisis is putting many young people who are in the process of training in difficulty – no internships, no mobility, distance learning – which will cause additional difficulties to integrate them into the world of work. More than ever, the links must be strengthened between training and the business world. And in this spirit, apprenticeship must be valued as a path of excellence for professional integration and the acquisition of new skills. I therefore welcome this strategy that the European Commission wishes to implement, because it will make it possible to respond to the challenges that the European Union is experiencing.'
tokens1_15 = word_tokenize(raw1_15)
for word in tokens1_15:
    print(word, end=' ')

Anne Sander ( EPP ) . – Mr President , Commissioner , long before the pandemic , we already knew that skills and learning were key . With the COVID-19 crisis , the transformation of the world of work has accelerated exponentially , towards ever more digital uses . Our companies therefore need an even more qualified workforce to meet new challenges , in particular to prepare for the emergence of new professions . Indeed , 65 % of children entering primary school will work in a profession that does not yet exist . However , today , 42 % of Europeans lack basic computer skills . In addition , the current crisis is putting many young people who are in the process of training in difficulty – no internships , no mobility , distance learning – which will cause additional difficulties to integrate them into the world of work . More than ever , the links must be strengthened between training and the business world . And in this spirit , apprenticeship must be valued as a path of excellence for 

In [26]:
raw1_16 = 'Antonius Manders (PPE). – (NL) Mr President, this crisis shows even more that knowledge is power. For years we have failed to adapt our education system, our education, our training courses to the market. We have always been involved with the old thinking. This crisis really shows mercilessly that we have done it wrong. I hope that it is still time to introduce continuous training for young and old at European level. At the moment we are too dependent on large American tech companies. We depend on technical platforms from China. There are hardly any European companies that can compete with that, because we have for years – in our view – stood still from the lead. It is time that we finally started investing a lot, freeing up time and facilitating that our workers in Europe can bring our economy back to the level that we were used to and that we will see that happen again in the future. Because we have the knowledge. Knowledge makes power and links young and old.'
tokens1_16 = word_tokenize(raw1_16)
for word in tokens1_16:
    print(word, end=' ')

Antonius Manders ( PPE ) . – ( NL ) Mr President , this crisis shows even more that knowledge is power . For years we have failed to adapt our education system , our education , our training courses to the market . We have always been involved with the old thinking . This crisis really shows mercilessly that we have done it wrong . I hope that it is still time to introduce continuous training for young and old at European level . At the moment we are too dependent on large American tech companies . We depend on technical platforms from China . There are hardly any European companies that can compete with that , because we have for years – in our view – stood still from the lead . It is time that we finally started investing a lot , freeing up time and facilitating that our workers in Europe can bring our economy back to the level that we were used to and that we will see that happen again in the future . Because we have the knowledge . Knowledge makes power and links young and old . 

In [27]:
raw1_17 = 'Nicolas Schmit, Member of the Commission. – Mr President, I would first like to thank honourable Members for this very encouraging debate. I would even say that there is a perfect consensus on this issue. We all agree that skills are the future. It’s the future for our societies and the future for our economy. Some have referred to the World Economic Forum and asked for a skills revolution, and indeed we probably need some kind of skills revolution because we have a technical revolution. We need a revolution in our way of producing, due to the need to adapt to climate change. Therefore, I think this skills revolution has first to guarantee to everybody, and remember I quoted Principle 1 of the Social Pillar, that everybody has to get equal access to skills, equal access to quality education, and equal access to lifelong learning, the young and adults. This is also something very important for Europe’s social model. It’s a basic element of our idea of equal opportunities. Yes, we have a revolution in the digital world and we have a lot of vacancies there which are not filled. Here it’s a challenge now to train people, to give them the right education and to give them the right skills. Sometimes we have to leave the usual models to skill them because technology also gives us the tools to skill people in a different way. We are talking about just transitions. I would say that this means first investing in people, investing in human capital, because our labour markets are changing. It’s true that they are changing and we should now anticipate technological change and prepare every person – every young person – for the changes and give them the opportunity to learn to learn. This is the challenge we all face and now we have the Skills Agenda and I agree that it’s up to us to implement it.'
tokens1_17 = word_tokenize(raw1_17)
for word in tokens1_17:
    print(word, end=' ')

Nicolas Schmit , Member of the Commission . – Mr President , I would first like to thank honourable Members for this very encouraging debate . I would even say that there is a perfect consensus on this issue . We all agree that skills are the future . It ’ s the future for our societies and the future for our economy . Some have referred to the World Economic Forum and asked for a skills revolution , and indeed we probably need some kind of skills revolution because we have a technical revolution . We need a revolution in our way of producing , due to the need to adapt to climate change . Therefore , I think this skills revolution has first to guarantee to everybody , and remember I quoted Principle 1 of the Social Pillar , that everybody has to get equal access to skills , equal access to quality education , and equal access to lifelong learning , the young and adults . This is also something very important for Europe ’ s social model . It ’ s a basic element of our idea of equal oppo
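
*Note*: The tokenizing cells above repeat the same three lines for every part of the debate. A minimal sketch of how that repetition could be folded into one helper is shown below. The names `tokenize_parts`, `example_parts` and `example_tokens` are hypothetical and not part of this notebook, and `str.split` is used only as a dependency-free placeholder; in the notebook, `word_tokenize` would be passed in instead.

```python
from typing import Callable, Dict, List


def tokenize_parts(
    raw_parts: Dict[str, str],
    tokenizer: Callable[[str], List[str]],
) -> Dict[str, List[str]]:
    """Apply one tokenizer to every raw speech string, keeping the part labels."""
    return {label: tokenizer(text) for label, text in raw_parts.items()}


# Stand-in data (hypothetical); in the notebook these would be raw1_2 ... raw1_17.
example_parts = {
    "1_16": "Knowledge makes power and links young and old.",
    "1_17": "We all agree that skills are the future.",
}

# str.split is only a placeholder tokenizer for this sketch;
# in the notebook, pass nltk's word_tokenize here instead.
example_tokens = tokenize_parts(example_parts, str.split)

for label, toks in example_tokens.items():
    print(label, " ".join(toks))
```

Keeping the parts in a dictionary preserves their order and labels, so adding a speech only requires one new entry rather than a new cell.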

---
### Combine all parts

In [28]:
tokens = tokens1_2 + tokens1_3 + tokens1_4 + tokens1_5 + tokens1_6 + tokens1_7 + tokens1_8 + tokens1_9 + tokens1_10 + tokens1_11 + tokens1_12 + tokens1_13 + tokens1_14 + tokens1_15 + tokens1_16 + tokens1_17
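
*Note*: Chaining sixteen lists with `+` is easy to get wrong when a part is added or removed. As a hedged alternative, the sketch below flattens a list of token lists with `itertools.chain`. The helper name `combine_token_lists` and the short sample lists are assumptions for illustration only; the result is equivalent to concatenating the lists with `+` in the same order.

```python
from itertools import chain


def combine_token_lists(token_lists):
    """Flatten a sequence of token lists into one list, preserving order."""
    return list(chain.from_iterable(token_lists))


# Short stand-in lists (hypothetical); in the notebook these would be
# tokens1_2, tokens1_3, ..., tokens1_17.
part_a = ["Mr", "President", ","]
part_b = ["skills", "are", "the", "future", "."]

combined = combine_token_lists([part_a, part_b])
print(combined)
print(combined == part_a + part_b)
```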

---
### Normalize the words 

In [29]:
# Lowercase every token so that, e.g., "Skills" and "skills" count as the same word
eutext01 = [w.lower() for w in tokens]
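
*Note*: The effect of lowercasing on word counts can be illustrated with a small sketch. The function `normalize` and its `keep_alpha_only` option are assumptions added for illustration; this notebook itself only lowercases. Filtering to alphabetic tokens would also drop tokens such as `covid-19`, so it is shown as an option rather than a recommendation.

```python
from collections import Counter


def normalize(tokens, keep_alpha_only=False):
    """Lowercase all tokens; optionally keep only purely alphabetic ones."""
    lowered = [w.lower() for w in tokens]
    if keep_alpha_only:
        # Assumption: drops punctuation, numbers and hyphenated tokens alike.
        lowered = [w for w in lowered if w.isalpha()]
    return lowered


sample = ["Skills", "skills", ",", "COVID-19", "Skills"]

print(Counter(sample)["Skills"])             # 2 (case variants counted separately)
print(Counter(normalize(sample))["skills"])  # 3 (case variants merged)
print(normalize(sample, keep_alpha_only=True))
```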

---
**Save Output**

In [30]:
save_path = '/Users/charlottekaiser/Documents/uni/Hertie/master_thesis/00_data/20_intermediate_files'
file_name = "EU01_European Skills Agenda for sustainable competitiveness, social fairness and resilience.txt"
completeName = os.path.join(save_path, file_name)
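# The with-block closes the file, so the full token list is flushed to disk
with open(completeName, 'w', encoding='utf-8') as output:
    print(eutext01, file=output)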
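
*Note*: Because the token list is written with `print`, the saved file contains the printed representation of a Python list rather than one token per line. A minimal sketch of how such a file could be read back in a later step is shown below, using `ast.literal_eval`. The helper names `save_tokens` and `load_tokens`, the sample tokens and the temporary file are assumptions for illustration; they do not touch the thesis data folder.

```python
import ast
import os
import tempfile


def save_tokens(tokens, path):
    """Write the printed representation of a token list, as in the cell above."""
    with open(path, "w", encoding="utf-8") as f:
        print(tokens, file=f)


def load_tokens(path):
    """Parse a file written by save_tokens back into a list of strings."""
    with open(path, encoding="utf-8") as f:
        return ast.literal_eval(f.read())


# Hypothetical sample tokens, including a non-ASCII apostrophe token.
demo_tokens = ["mr", "president", ",", "’", "s"]

with tempfile.TemporaryDirectory() as tmp_dir:
    demo_path = os.path.join(tmp_dir, "demo_tokens.txt")
    save_tokens(demo_tokens, demo_path)
    restored = load_tokens(demo_path)

print(restored == demo_tokens)
```

`ast.literal_eval` only evaluates Python literals, which makes it a safer choice than `eval` for reading the file back.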