Feel free to contact me if you have any problems with the following software from our SPMI (Speech Processing and Machine Intelligence) Lab.
This webpage is no longer maintained; we now release our software through the THU-SPMI GitHub repository.
- Huahuan Zheng, Keyu An, Zhijian Ou.
Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients.
SLT, 2021.
arxiv |
code at github
- Yichi Zhang, Zhijian Ou, Huixin Wang, Junlan Feng.
A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning.
EMNLP, 2020.
arxiv |
code at github
- Yichi Zhang, Yinpei Dai, Zhijian Ou, Huixin Wang, Junlan Feng.
Improved Learning of Word Embeddings with Word Definitions and Semantic Injection.
INTERSPEECH, 2020.
pdf |
code at github
- Zhijian Ou, Yunfu Song.
Joint Stochastic Approximation and Its Application to Learning Discrete Latent Variable Models.
UAI, 2020. (accepted)
arxiv |
code at github
- Silin Gao, Yichi Zhang, Zhijian Ou and Zhou Yu.
Paraphrase Augmented Task-Oriented Dialog Generation.
ACL, 2020. (accepted)
pdf |
code at github
- Keyu An, Hongyu Xiang, Zhijian Ou.
CAT: CRF-based ASR Toolkit.
arxiv |
code at github
- Yunfu Song, Zhijian Ou.
Semi-supervised Seq2seq Joint-stochastic-approximation Autoencoders with Applications to Semantic Parsing.
IEEE Signal Processing Letters, 2020. (accepted)
URL |
pdf |
code at github
- Yichi Zhang, Zhijian Ou, Zhou Yu.
Task-Oriented Dialog Systems that Consider Multiple Appropriate Responses under the Same Context.
AAAI, New York, USA, 2020.
pdf |
code at github
- Hongyu Xiang, Zhijian Ou.
CRF-based Single-stage Acoustic Modeling with CTC Topology.
ICASSP, Brighton, UK, 2019.
URL |
pdf |
slides |
lecture video |
code at github
- Kai Hu, Zhijian Ou, Min Hu, Junlan Feng.
Neural CRF Transducers for Sequence Labeling.
ICASSP, Brighton, UK, 2019.
URL |
SPMISeq toolkit at github
- Yunfu Song, Zhijian Ou.
Learning Neural Random Fields with Inclusive Auxiliary Generators.
arxiv |
code at github
- Bin Wang, Zhijian Ou, Zhiqiang Tan.
Learning Trans-dimensional Random Fields with Applications to Language Modeling.
IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2018, 40(4):876-890.
pdf (including appendices) |
SPMILM toolkit at github |
doxygen documentation
- Yinpei Dai, Zhijian Ou, Dawei Ren, Pengfei Yu.
Tracking of enriched dialog states for flexible conversational information access.
ICASSP, Calgary, Canada, 2018.
pdf |
poster |
data readme |
data download |
code at github
- Hongyu Xiang, Bin Wang and Zhijian Ou.
The THU-SPMI CHiME-4 system: Lightweight design with advanced multi-channel processing, feature enhancement, and language modeling.
CHiME Workshop, San Francisco, USA, 2016 Sept.
pdf |
poster |
SPMIArray toolkit at github
- Jinye Zhang, Zhijian Ou.
Block-Wise MAP Inference for Determinantal Point Processes with Application to Change-Point Detection.
IEEE Workshop on Statistical Signal Processing (SSP), Palma de Mallorca, Spain, 2016 June.
pdf |
poster |
longer version at arxiv |
code readme |
code download
- Bin Wang, Zhijian Ou and Zhiqiang Tan.
Trans-dimensional Random Fields for Language Modeling.
Annual Meeting of the Association for Computational Linguistics (ACL Long Paper), Beijing, China, 2015 July.
pdf |
poster |
code readme |
code download
- Yun Wang, Zhijian Ou.
Combining HMM-based Melody Extraction and NMF-based Soft Masking for Separating Voice and Accompaniment from Monaural Audio.
ICASSP, Prague, Czech Republic, 2011 May.
pdf |
lecture video |
demo |
github code
- Yimin Tan, Zhijian Ou.
Topic-weak-correlated Latent Dirichlet Allocation.
International Symposium on Chinese Spoken Language Processing (ISCSLP), Tainan, Taiwan, 2010 Dec.
pdf |
poster |
code readme |
code download