Skip to content
View bzantium's full-sized avatar
Block or Report

Block or report bzantium

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
bzantium/README.md

[2023.08 - Current] LLM Researcher @Kakaobrain

  • Developing Language Foundation Model a.k.a. KoGPT2

[2020.03 - 2023.08] Machine Learning Engineer @SK Telecom & EleutherAI

  • Developing Large Language Model for SK Telecom
  • Developed Multimodal AI Service at SKT A.
    • AI Eraser (Object Removal with Image Inpainting)
      • role: project manager & implementation of pipeline algorithm (segmentation postprocessing & enhancement in inpainting performance).
  • Developing polyglot and oslo project at EleutherAI
    • polyglot: Large Language Models of Well-balanced Competence in Multi-languages
      • role: distributed training of LM with Megatron LM & data crawling, preprocessing and model evaluaton. Published 1.3B, 3.8B, 5.8B, 12.8B polyglot-ko models.
    • oslo: Open Source for Large-scale Optimization
      • role: tensor parallel 1D, 2D, 3D implementation.

Interest

  • Foundation Model / NLP / Multimodal AI

Linkedin Badge Gmail Badge Google Scholar Badge

Pinned

  1. EleutherAI/polyglot EleutherAI/polyglot Public

    Polyglot: Large Language Models of Well-balanced Competence in Multi-languages

    460 37

  2. EleutherAI/oslo EleutherAI/oslo Public

    OSLO: Open Source for Large-scale Optimization

    Python 170 29

  3. lassl/lassl lassl/lassl Public

    Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets

    Python 125 14

  4. pytorch-admm-pruning pytorch-admm-pruning Public

    Prune DNN using Alternating Direction Method of Multipliers (ADMM)

    Python 96 18