Skip to content
View gazelle93's full-sized avatar

Block or report gazelle93

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
gazelle93/README.md

Hi there 👋 Welcome to my page!

My name is Mingyou Sung. I'm an NLP researcher & ML developer in Montreal, Quebec, Canada.

Here's some info about me!

  • Languages: English (Professional Working Proficiency), Korean (Native)
  • Programming Languages & frameworks: Python, PyTorch, HuggingFace, Scikit-Learn, NLTK, Regular Expression, GATE (General Architecture for Text Engineering), LLM, RAG, Information Retrieval, Unix/Linux, Git, GCP, AWS (Sagemaker, S3, EC2)

Education 🏫

Research Experience ⭐

  • Assessing LLMs' Ability to Navigate Cultural Knowledge Conflicts (Conference on Computational Cultural Context in NLP (C3NLP), 2024 - Non-archival)

    • Developed the QARV benchmark, a dataset of 671 culturally sensitive questions, to evaluate large language models' ability to navigate knowledge conflicts between U.S. and Korean perspectives.
  • Relation Extraction Tasks (Studies on diverse input representations and classifiers on relation extraction datasets)

    • SemEval-2010 Task 8 (Multi-Way Classification of Semantic Relations Between Pairs of Nominals)
    • TACRED (Text Analysis Conference Relation Extraction Dataset)
    • Re-TACRED (Revised Text Analysis Conference Relation Extraction Dataset)
    • Biocreative VII Track 1 (Text mining drug and chemical-protein interactions (DrugProt))
  • Counterfactual Detection (CLaC at SemEval-2020 Task 5: Muli-task Stacked Bi-LSTMs)

    • SemEval-2020 Task 5 (Modelling Causal Reasoning in Language: Detecting Counterfactuals) - Subtask 2 (Detecting the span of antecedents and consequents)

Project Experience ☕

  • Product Category Classification & Similar Product Identification

    • Built an automatic categorization system using the title and description of e-commerce products cooperation with KIES (Korea Information Engineering Services) using Word2vec and TF-IDF
  • Offering Wine Information and Recommendations on Android and Web Applications

    • Final year project at Hongik University (Won the excellent prize and was selected as a creditable work)
    • Developed an Android and Web application that offers wine label recognition using CNN, wine recommendation via chatbot using Regex, and a function to search for various information about wines using AWS

Addiational Activities 📝

Teaching Assistant

  • Course: COMP 479 - Information Retrieval and Web Search
  • Tutored undergrads to minimize the gap between the knowledge and implementation on several information retrieval techniques such as BM25, TF-IDF

Popular repositories Loading

  1. Transformer-Various-Positional-Encoding Transformer-Various-Positional-Encoding Public

    This project aims to implement the Transformer Encoder blocks using various Positional Encoding methods.

    Python 24 2

  2. Multiclass-Focal-loss-pytorch Multiclass-Focal-loss-pytorch Public

    This is an implementation of multi-class focal loss in PyTorch.

    Python 10 1

  3. Attention-Various-Positional-Encoding Attention-Various-Positional-Encoding Public

    This project aims to implement the Scaled-Dot-Product Attention layer and the Multi-Head Attention layer using various Positional Encoding methods.

    Python 5

  4. charCNN charCNN Public

    This project aims to implement the charCNN word embedding method that is leveraged in ELMo and characterBERT.

    Python 3

  5. Depedency-Graph-Attention-Networks Depedency-Graph-Attention-Networks Public

    This project aims to develop dependency-graph-attention-networks in order to represent the dependency relations of each word from given text utilizing masked self-attention. The output of the depen…

    Python 1

  6. Span-Detection-Using-LSTMs Span-Detection-Using-LSTMs Public

    This project aims to implement the LSTM and Bidirectional LSTM for span detection from given text.

    Python 1