Skip to content


Block or Report

Block or report leondz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Hi there 👋

  • 🔭 My research is on natural language processing and machine learning. I'm currently looking at:

    • 🛡️ Online harms: misinformation processing, hate speech & abusive language detection. Work safely with this data!
    • 🌱 Efficient machine learning: we should be able to do more with much less than we have; always interested in data efficiency and greener, smaller, faster 🚀, coarser models
    • ✍️ Generation: Creating meaningful sequences from a set of items is hard! See our JAIR paper on set2seq methods
    • 🇩🇰 NLP for Danish: I started and run the Danish Gigaword project
    • 🥼 Clinical NLP: how can we process medical records to, eventually, improve health outcomes
  • 🏢 I'm principal investigator of the Strømberg NLP group at ITU Copenhagen for my day job

  • 🧑‍🎓 I’m still learning sizecoding

  • 🎓 My reearch papers are listed on Google Scholar. Ask me about any of them!

Leon's GitHub stats


  1. Dataset for the Emerging & Novel Entity NER task (WNUT '17)

    106 20

  2. Forked from sean-chester/generalised-brown

    C++ implementation of Generalised Brown clustering and python scripts for feature generation (AAAI 2016)

    C++ 2

  3. The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016)

    Jupyter Notebook 53 6

  4. Catalog of abusive language data (PLoS 2020)

    Python 234 55

234 contributions in the last year

Dec Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Mon Wed Fri
Activity overview
Contributed to leondz/hatespeechdata, leondz/nejlt-kickstart, huggingface/datasets and 23 other repositories

Contribution activity

December 2022

leondz has no activity yet for this period.

Seeing something unexpected? Take a look at the GitHub profile guide.