xieh97/README.md

Hi there 👋

  • 📆 I am currently a PhD student specializing in Machine Learning, Audio Processing, and Natural Language Processing, with a particular focus on Audio Language Modeling, Audio Content Retrieval, and Multimodal Learning.
  • 🎯 My Interests
    • 🎲 Machine Learning / Deep Learning (pytorch, scikit-learn, ...)
    • 📈 Data Analysis (numpy, scipy, pandas, ...)
    • 📑 NLP & Text Analysis (nltk, ...)
    • 📊 Visualization (matplotlib, ...)
    • 🚀 Software Development (Django, Spring, Hibernate, ...)
    • 💻 Programming (Python, Java, JavaScript, SQL, ...)
  • 📘 My Publications
    • 📃 H. Xie, K. Khorrami, O. Räsänen, and T. Virtanen, "Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances," in Proc. Detect. Classif. Acoust. Scenes Events Work. (DCASE), 2023, pp. 226-230. arXiv
    • 📃 H. Xie, O. Räsänen, and T. Virtanen, "On Negative Sampling for Contrastive Audio-Text Retrieval," in Proc. Int. Conf. Acoustic., Speech and Signal Process. (ICASSP), 2023, pp. 1-5. arXiv
    • 📃 H. Xie, S. Lipping, and T. Virtanen, "Language-based Audio Retrieval Task in DCASE 2022 Challenge," in Proc. Detect. Classif. Acoust. Scenes Events Work. (DCASE), 2022, pp. 216-220. arXiv
    • 📃 H. Xie, O. Räsänen, K. Drossos, and T. Virtanen, "Unsupervised Audio-Caption Aligning Learns Correspondences Between Individual Sound Events and Textual Phrases," in Proc. Int. Conf. Acoustic., Speech and Signal Process. (ICASSP), 2022, pp. 8867-8871. arXiv
    • 📃 H. Xie, O. Räsänen, and T. Virtanen, "Zero-Shot Audio Classification with Factored Linear and Nonlinear Acoustic-Semantic Projections," in Proc. Int. Conf. Acoustic., Speech and Signal Process. (ICASSP), 2021, pp. 326-330. arXiv
    • 📃 H. Xie and T. Virtanen, "Zero-Shot Audio Classification via Semantic Embeddings," in IEEE/ACM Trans. Audio Speech Lang. Process., vol. 29, pp. 1233-1242, 2021. arXiv
    • 📃 H. Xie and T. Virtanen, "Zero-Shot Audio Classification Based on Class Label Embeddings," in Proc. Work. Appl. Signal Process. Audio and Acoustic. (WASPAA), 2019, pp. 264-267. arXiv
  • 📌 My Activities
    • 🏆 Task coordinator for Language-based Audio Retrieval in DCASE Challenge 2022 (Task 6), 2023 (Task 6), and 2024 (Task 8).

Pinned

  1. contrastive-negative-sampling Public

    Source code for negative sampling for contrastive audio-text retrieval (ICASSP 2023)

    Python 1

  2. audio-caption-aligning Public

    Source code for audio-caption aligning (ICASSP 2022)

    Python 1

  3. dcase2023-audio-retrieval Public

    Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge

    Python 4 2

  4. dcase2022-audio-retrieval Public

    Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2022 Challenge

    Python 7 1

  5. retrieval-relevance-crowdsourcing Public

    Data and instructions for crowdsourcing text-based audio retrieval relevances

    HTML

  6. audiocaps-dl Public

    Python program to download AudioCaps from YouTube

    Python 1