I am working toward the Ph.D. degree in information and communication engineering with the Northwestern Polytechnical University, Xi’an, China. Currently, I am working as an algorithm intern with the Institute of Artificial Intelligence (TeleAI), China Telecom. My research interests include spatial audio, speech enhancement, and multimodal learning.
🏫 Educations
- 2024.03 - now, Ph.D. candidate in Information and Communication Engineer, Northwestern Polytechnical University, Xi’an, China.
- 2021.09 - 2024.03, M.S. in Information and Communication Engineer, Northwestern Polytechnical University, Xi’an, China.
- 2017.09 - 2021.06, B.S. in Communication Engineer, Guangdong Polytechnic Normal University, Guangzhou, China.
📝 Publications
2024
- Quantization-error-free soft label for 2D sound source localization, Linfeng Feng, Xiao-Lei Zhang, and Xuelong Li, Proceedings of 14th International Symposium on Chinese Spoken Language Processing (ISCSLP 2024).
- Learning Multi-dimensional Speaker Localization: Axis Partitioning, Unbiased Label Distribution, and Data Augmentation, Linfeng Feng, Yijun Gong, Zhi Liu, Xiao-Lei Zhang, and Xuelong Li, IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE TASLP).
- Eliminating Quantization Errors in Classification-Based Sound Source Localization, Linfeng Feng, Xiao-Lei Zhang, and Xuelong Li, Neural Networks (NNJ). [code]
2023
- Soft Label Coding for End-to-end Sound Source Localization with Ad-hoc Microphone Arrays, Linfeng Feng, Yijun Gong, and Xiao-Lei Zhang, Proceedings of the 47th IEEE International Conference on Acoustic, Speech, and Signal Processing (ICASSP 2023).
💻 Internships
- 2024.01 - now, Institute of Artificial Intelligence (TeleAI), China Telecom, Beijing, China.