xieh97

Hi there 👋

📆 I am currently a PhD student specializing in Machine Learning, Audio Processing, and Natural Language Processing, with a particular focus on Audio Language Modeling, Audio Content Retrieval, and Multimodal Learning.

🎯 My Interests
- 🎲 Machine Learning / Deep Learning (pytorch, scikit-learn, ...)
- 📈 Data Analysis (numpy, scipy, pandas, ...)
- 📑 NLP & Text Analysis (nltk, ...)
- 📊 Visualization (matplotlib, ...)
- 🚀 Software Development (Django, Spring, Hibernate, ...)
- 💻 Programming (Python, Java, JavaScript, SQL, ...)

📘 My Publications
- 📃 H. Xie, K. Khorrami, O. Räsänen, and T. Virtanen, "Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances," in Proc. Detect. Classif. Acoust. Scenes Events Work. (DCASE), 2023, pp. 226-230. arXiv
- 📃 H. Xie, O. Räsänen, and T. Virtanen, "On Negative Sampling for Contrastive Audio-Text Retrieval," in Proc. Int. Conf. Acoust., Speech and Signal Process. (ICASSP), 2023, pp. 1-5. arXiv
- 📃 H. Xie, S. Lipping, and T. Virtanen, "Language-based Audio Retrieval Task in DCASE 2022 Challenge," in Proc. Detect. Classif. Acoust. Scenes Events Work. (DCASE), 2022, pp. 216-220. arXiv
- 📃 H. Xie, O. Räsänen, K. Drossos, and T. Virtanen, "Unsupervised Audio-Caption Aligning Learns Correspondences Between Individual Sound Events and Textual Phrases," in Proc. Int. Conf. Acoust., Speech and Signal Process. (ICASSP), 2022, pp. 8867-8871. arXiv
- 📃 H. Xie, O. Räsänen, and T. Virtanen, "Zero-Shot Audio Classification with Factored Linear and Nonlinear Acoustic-Semantic Projections," in Proc. Int. Conf. Acoust., Speech and Signal Process. (ICASSP), 2021, pp. 326-330. arXiv
- 📃 H. Xie and T. Virtanen, "Zero-Shot Audio Classification via Semantic Embeddings," in IEEE/ACM Trans. Audio Speech Lang. Process., vol. 29, pp. 1233-1242, 2021. arXiv
- 📃 H. Xie and T. Virtanen, "Zero-Shot Audio Classification Based on Class Label Embeddings," in Proc. Work. Appl. Signal Process. Audio and Acoust. (WASPAA), 2019, pp. 264-267. arXiv

📌 My Activities
- 🏆 Task coordinator for Language-based Audio Retrieval in DCASE Challenge 2022 (Task 6), 2023 (Task 6), and 2024 (Task 8).
