SimonLaub/NLP_JobTrend
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|
Repository files navigation
Can be found in this folder.
(See the followting list of tutorial files for this Semantic Similarity Search project).
- IntroSpaCy.ipynb
(Short intro to the SpaCy library).
- WordCloud.ipynb
(How you can make your own WordClouds).
- Newsgroup Categories.ipynb
(Classification of (your) text into 4 newsgroups).
- NLPJobClassifier.ipynb
(Classification of jobs into groups "IT" or "non-IT").
Uses the datajobsposts.csv dataset
(available elsewhere on Github/Kaggle).
- BasicTextAnalytics.ipynb
(Misc. Techniques for basic text analytics).
- SyntacticSimiliarity.ipynb
(Code for syntactic similarity).
Uses the abcnews-date-text.csv dataset
(available elsewhere on Github/Kaggle).
- SemanticRelationshipWordEmbedding.ipynb
(WordEmbeddings. Classic queen, king, man, woman example)
- SentenceTransformerColabExampleEN.ipynb
(Transformers and semantic similarity, english).
- SentenceTransformerColabExampleDK.ipynb
(Transformers and semantic similarity, danish).
- Wine_Classification.ipynb
(A classic classification problem).
Included here as part of a tutorial on ML techniques.
- MyFirstQuad.ipynb
(A question answer transformer).
Included here as part of a tutorial on ML techniques.
Details:
This folder contains a number of NLP files used in connection with the JobTrend Project (at Eaaa).
Core idea we are exploring here: In the program Jobtrend, we have a search function that looks for skills in JobAds (Skills that a company hope an employee have). Available courses at Eaaa should then be able to give employees these skills.
Matching skills taught in courses and skills searched for in job ads is not trivial though. As these skills might be described in a number of different ways.
As a part of my investigation on how we should implement this semantic similarity search - a number of techniques have been tried out. Where I have saved some of these tutorial files from this work here in this folder.
In case you wonder what a Wine.ipynb notebook is doing among these files, then, well, :)
I needed to include a file with a classic classification problem in this folder in connection with a workshop/tutorial. And I have talked about the Titanic problem too many times by now, and needed a similar, but different, case to talk about here.
The question answer transformer MyFirstQuad.ipynb is included as a simple example of this technique.
More transformer (tutorial) based code will be added as the project JobTrend progresses.
Simon Laub.
Email: sila
About JobTrend
(In Danish):
Med database og analyseværktøj tilbyder Jobtrend arbejdsmarkedsanalyser.
Disse analyser skal afdække uddannelsers relevans i forhold til det danske arbejdsmarked og på den måde understøtte udviklingen af uddannelser.
På sigt kan der omvendt opstå øget ledighed på det danske arbejdsmarked, og i den situation er det vigtigt at bibringe dimittender og ledige kompetencer, som gør den attraktive på det danske arbejdsmarked. I samfundet er der samlet set et vedvarende behov for at uddanne arbejdsstyrken til de kompetencer, som er aktuelt nu og i fremtiden, der er behov for på arbejdsmarkedet.
