Skip to content
View magantoine's full-sized avatar

Highlights

  • Pro

Block or report magantoine

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
magantoine/README.md

👋 Hey there! Welcome to my Github profile!

🎓 I'm a data science master's student who loves diving into the world of numbers and patterns. Passionate about NLP, system architectures, large-scale computing, and statistics, I enjoy exploring the fascinating intersections between them.

Academic Research

My Google Scholar page !

Date Conference DOI Title Abstract Code
5 Feb 2024 EACL's NLP4HR arXiv:2402.03242 [cs.CL] JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching We introduce JobSkape, a framework to generate synthetic data that tackles these limitations, specifically designed to enhance skill-to-taxonomy matching. Within this framework, we create SkillSkape, a comprehensive open-source synthetic dataset of job postings tailored for skill-matching tasks. We introduce several offline metrics that show that our dataset resembles real-world data. Additionally, we present a multi-step pipeline for skill extraction and matching tasks using large language models (LLMs), benchmarking against known supervised methodologies. We outline that the downstream evaluation results on real-world data can beat baselines, underscoring its efficacy and adaptability. https://github.com/magantoine/JobSkape/tree/main
20 Sep 2023 NAACL arXiv:2309.11381 [cs.CL] Studying Lobby Influence in the European Parliament We present a method based on natural language processing (NLP), for studying the influence of interest groups (lobbies) in the law-making process in the European Parliament (EP). X

Some of my open-source projects

Start Date (MM/YYYY) Name Type Description Tech Stack Team Size
06/2021 Notify Me SenPy Personnal Productivity tool that let's you track your Python scripts' execution from your smartphone and receive notifications. Python - Django - React Native 4
05/2022 Journey Planner Academic Robust and efficient Public Transport journey planning in Switerland. Spark - Pandas - HDFS 4
04/2020 CoronHackathon Personnal Social Mobile App created in a 72 hours Hackathon, for home-entertainment during lockdown. Spring Boot (Java) - React-Native 8
04/2022 TicTacToe Academic Teaches a Reinforcement Learning algorithm how to play Tic Tac Toe Python - PyTorch 2
03/2022 Distributed Recommender System Academic Distributed Recommender System Scala - Spark 2
10/2021 Polarity Of Debate Around Climate Change Academic Investigate all the interventions made by political personalities in the news paper for 10 years and study how the debate gets more and more polarize Python 4
03/2022 Active Learning Algorithm for Efficient and Robust LLM training Academic Creates an Active Learning algorithm to train LLMs with fewer samples and to mitigate unbalanced sampling bias Python - HuggingFace 3
06/2021 modforge Personnal Python productivity tool to complete the Senpy suite. WIP Python 4

Popular repositories Loading

  1. JobSkape JobSkape Public

    JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching

    Python 3

  2. senpy-package senpy-package Public

    Python 1

  3. TripPlanner TripPlanner Public

    Forked from Julien-Ben/trip-planner

    Python 1

  4. PDC_project_2021 PDC_project_2021 Public

    Python

  5. CS451-2021-project CS451-2021-project Public

    Java

  6. siri_assignment siri_assignment Public

    JavaScript