🎓 I'm a data science master's student who loves diving into the world of numbers and patterns. Passionate about NLP, system architectures, large-scale computing, and statistics, I enjoy exploring the fascinating intersections between them.
Date | Conference | DOI | Title | Abstract | Code |
---|---|---|---|---|---|
5 Feb 2024 | EACL's NLP4HR | arXiv:2402.03242 [cs.CL] | JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching | We introduce JobSkape, a framework to generate synthetic data that tackles these limitations, specifically designed to enhance skill-to-taxonomy matching. Within this framework, we create SkillSkape, a comprehensive open-source synthetic dataset of job postings tailored for skill-matching tasks. We introduce several offline metrics that show that our dataset resembles real-world data. Additionally, we present a multi-step pipeline for skill extraction and matching tasks using large language models (LLMs), benchmarking against known supervised methodologies. We outline that the downstream evaluation results on real-world data can beat baselines, underscoring its efficacy and adaptability. | https://github.com/magantoine/JobSkape/tree/main |
20 Sep 2023 | NAACL | arXiv:2309.11381 [cs.CL] | Studying Lobby Influence in the European Parliament | We present a method based on natural language processing (NLP), for studying the influence of interest groups (lobbies) in the law-making process in the European Parliament (EP). | X |
Start Date (MM/YYYY) | Name | Type | Description | Tech Stack | Team Size |
---|---|---|---|---|---|
06/2021 | Notify Me SenPy | Personnal | Productivity tool that let's you track your Python scripts' execution from your smartphone and receive notifications. | Python - Django - React Native | 4 |
05/2022 | Journey Planner | Academic | Robust and efficient Public Transport journey planning in Switerland. | Spark - Pandas - HDFS | 4 |
04/2020 | CoronHackathon | Personnal | Social Mobile App created in a 72 hours Hackathon, for home-entertainment during lockdown. | Spring Boot (Java) - React-Native | 8 |
04/2022 | TicTacToe | Academic | Teaches a Reinforcement Learning algorithm how to play Tic Tac Toe | Python - PyTorch | 2 |
03/2022 | Distributed Recommender System | Academic | Distributed Recommender System | Scala - Spark | 2 |
10/2021 | Polarity Of Debate Around Climate Change | Academic | Investigate all the interventions made by political personalities in the news paper for 10 years and study how the debate gets more and more polarize | Python | 4 |
03/2022 | Active Learning Algorithm for Efficient and Robust LLM training | Academic | Creates an Active Learning algorithm to train LLMs with fewer samples and to mitigate unbalanced sampling bias | Python - HuggingFace | 3 |
06/2021 | modforge | Personnal | Python productivity tool to complete the Senpy suite. WIP | Python | 4 |