I am always looking for dedicated, highly motivated people (undergraduate students, master's students, Ph.D. students, postdocs, software engineers). If you are interested in the following research topics, don't hesitate to contact me.
August 2014: A brief introduction to the SPMI (Speech Processing and Machine Intelligence) Lab (two slides in Chinese), prepared as orientation material for first-year graduate students in the EE Department.
Past and recent research topics (See Paper for up-to-date topics)
- Statistical machine intelligence (particularly with probabilistic graphical models and deep learning)
- Speech recognition and understanding
- Natural language processing (particularly for dialog AI)
- Microphone array
- Speaker recognition
- Source separation
- Audio indexing and search
- Audio fingerprinting
- Music processing (e.g. query by humming)
Past and recent research projects (partial)
- China Mobile funding (PI): Strongly-generalizable human-machine dialogue technology for cross-domain large-scale applications, 2021-2023.
- TasiTech funding (PI), 2022-2025.
- Meituan funding (PI), 2021-2022.
- Apple funding (PI), 2020-2021.
- NSFC 61976122 (PI): Integration of Array Perception and Speech Recognition for Far-field Sound Sources, 2020-2023.
- China Electric Power Research Institute (Co-PI): Constructing domain knowledge graph and building intelligent dialog system for customer service, 2019-2020.
- China Ministry of Education and China Mobile Joint Funds (PI): Customer service dialog system based on deep reinforcement learning and knowledge graph, 2018-2020.
- NSFC 61473168 (PI): Single-channel Speech Separation Based on Probabilistic Acoustic Tube Model, 2015-2018.
- Tsinghua University Initiative (PI): Brain-inspired computer audition, 2015.
- Toshiba funding (PI): ASR, 2012-2016.
- Tsinghua University Initiative (PI): Study of cognition about uncertainty in speech and language phenomena, 2012-2015.
- NSFC 61075020 (PI): Robust speech recognition with Bayesian voice modeling and adaptive noise compensation, 2011-2013.
- IBM China Research Lab funding (PI): Language Assessment for CET Spoken English Test, 2008-2009.
- China 863 2006AA01Z149 (Co-PI): High performance content-based speech search, 2007-2009.
- Panasonic funding (PI): Discriminative training for robust speech recognition, 2007-2008.
- Intel China Research Center funding (PI): Person Indexing and Retrieval for movies and TVs - audio part, 2006-2008.
- Intel China Research Center funding (PI): Multimodal Soccer Video Analysis - audio part, 2006-2008.
- NSFC 60402029 (PI): Research on Refined-Structural Acoustic Modeling for Speech Recognition Based on Bayesian Networks, 2005-2007.
- Department funding (PI): Statistical methods based on Bayesian networks with selected applications, 2004-2006.