- ๐ I am currently a PhD student specializing in Machine Learning, Audio Processing, and Natural Language Processing, with a particular focus on Audio Language Modeling, Audio Content Retrieval, and Multimodal Learning.
- ๐ฏ My Interests
- ๐ฒ Machine Learning / Deep Learning (pytorch, scikit-learn, ...)
- ๐ Data Analysis (numpy, scipy, pandas, ...)
- ๐ NLP & Text Analysis (nltk, ...)
- ๐ Visualization (matplotlib, ...)
- ๐ Software Development (Django, Spring, Hibernate, ...)
- ๐ป Programming (Python, Java, JavaScript, SQL, ...)
- ๐ My Publications
- ๐ H. Xie, K. Khorrami, O. Rรคsรคnen, and T. Virtanen, "Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances," in Proc. Detect. Classif. Acoust. Scenes Events Work. (DCASE), 2023, pp. 226-230. arXiv
- ๐ H. Xie, O. Rรคsรคnen, and T. Virtanen, "On Negative Sampling for Contrastive Audio-Text Retrieval," in Proc. Int. Conf. Acoustic., Speech and Signal Process. (ICASSP), 2023, pp. 1-5. arXiv
- ๐ H. Xie, S. Lipping, and T. Virtanen, "Language-based Audio Retrieval Task in DCASE 2022 Challenge," in Proc. Detect. Classif. Acoust. Scenes Events Work. (DCASE), 2022, pp. 216-220. arXiv
- ๐ H. Xie, O. Rรคsรคnen, K. Drossos, and T. Virtanen, "Unsupervised Audio-Caption Aligning Learns Correspondences Between Individual Sound Events and Textual Phrases," in Proc. Int. Conf. Acoustic., Speech and Signal Process. (ICASSP), 2022, pp. 8867-8871. arXiv
- ๐ H. Xie, O. Rรคsรคnen, and T. Virtanen, "Zero-Shot Audio Classification with Factored Linear and Nonlinear Acoustic-Semantic Projections," in Proc. Int. Conf. Acoustic., Speech and Signal Process. (ICASSP), 2021, pp. 326-330. arXiv
- ๐ H. Xie and T. Virtanen, "Zero-Shot Audio Classification via Semantic Embeddings," in IEEE/ACM Trans. Audio Speech Lang. Process., vol. 29, pp. 1233-1242, 2021. arXiv
- ๐ H. Xie and T. Virtanen, "Zero-Shot Audio Classification Based on Class Label Embeddings," in Proc. Work. Appl. Signal Process. Audio and Acoustic. (WASPAA), 2019, pp. 264-267. arXiv
- ๐ My Activities
![:octocat: :octocat:](https://github.githubassets.com/images/icons/emoji/octocat.png)
-
03:42
(UTC +03:00) - https://orcid.org/0000-0001-7609-8049
- in/huang-xie-28b7872bb
Block or Report
Block or report xieh97
Contact GitHub support about this userโs behavior. Learn more about reporting abuse.
Report abusePinned Loading
-
contrastive-negative-sampling
contrastive-negative-sampling PublicSource code for negative sampling for contrastive audio-text retrieval (ICASSP 2023)
Python 1
-
audio-caption-aligning
audio-caption-aligning PublicSource code for audio-caption aligning (ICASSP 2022)
Python 1
-
dcase2023-audio-retrieval
dcase2023-audio-retrieval PublicBaseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge
-
dcase2022-audio-retrieval
dcase2022-audio-retrieval PublicBaseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2022 Challenge
-
retrieval-relevance-crowdsourcing
retrieval-relevance-crowdsourcing PublicData and instructions for crowdsourcing text-based audio retrieval relevances
HTML
-
If the problem persists, check the GitHub status page or contact support.