Murphy book | Kevin Patrick Murphy. “Machine Learning: a Probabilistic Perspective”. MIT Press, 2012. |
EBM book | Zhijian Ou. “Energy-Based Models with Applications to Speech and Language Processing”. Foundations and Trends® in Signal Processing, 2023. |
DGM review | Zhijian Ou. “A Review of Learning with Deep Generative Models from Perspective of Graphical Modeling”. 2018. |
PRML book | Christopher M. Bishop. “Pattern Recognition and Machine Learning”. Springer 2006. |
MacKay book | D.J. MacKay. “Information Theory, Inference, and Learning Algorithms”. Cambridge Univ. Press, 2003. |
Murphy 2002-Fall-Thesis |
Kevin Murphy's Ph.D. thesis 2002
其附录(Appendix)包含对图模型的表示、推理、学习的综述 |
GraphicalModels2004 | Jordan在2004年写的综述文章:Graphical Models |
FreyJojicTutorial2005 | A Comparison of Algorithms for Inference and Learning in Probabilistic Graphical Models, PAMI 2005. |
Salakhutdinov_phd2009_Learning Deep Generative Models | Ruslan Salakhutdinov, "Learning Deep Generative Models", Ph.D. thesis 2009. |
deep_learning_英文版 | DEEP LEARNING book, Bengio, Goodfellow, Courville |
深度学习_中文版 | DEEP LEARNING book - Bengio, Goodfellow, Courville 中文版 |
CDLS_chapter4-5 |
|
Rabiner_hmm_turtorial | Rabiner's HMM tutorial 1989. |
icml01_CRF | CRF: probabilistic models for segmenting and labeling sequeance data, ICML 2001. |
Wallach_crf_intro_2004 | H. Wallach, Conditional Random Fields: An Introduction, 2004. |
tmg-thmm.rar |
|
ieee13_Conditional random fields in speech, audio, and language processing | ieee13_Conditional random fields in speech, audio, and language processing |
uai96_Bucket_elimination | R. DECHTER, BUCKET ELIMINATION: A UNIFYING FRAMEWORK FOR PROBABILISTIC INFERENCE, 1996. |
aij05_UnifyingCluster-TreeDecompositionsForReasoningInGMs | Kalev Kask, Rina Dechter, Javier Larrosa and Avi Dechter, Unifying Cluster-Tree Decompositions for Reasoning in Graphical models, 2005. |
Factor graphs and the sum-product algorithm | Kschischang, Frey, Loeliger. Factor Graphs and the Sum-Product Algorithm, IEEE Trans. Info. Theory, 2001. |
Lauritzen&Spiegelhalter88_Local Computations with Probabilities on Graphical Structures and Their Application to Expert Syst | Lauritzen and Spiegelhalter, Local Computations with Probabilities on Graphical Structures and Their Application to Expert Systems, 1988. |
Shafer-Shenoy90_ProbabilityPropagation | Shafer and Shenoy, Probability Propagation, 1990. |
GDL 2000 | Aji and McEliece. "The generalized distributive law", IEEE Trans. Info. Theory, 2000. |
MacKay_Introduction to Monte Carlo methods | MacKay, Introduction to Monte Carlo methods, 1999. |
Neal93_Probabilistic Inference Using Markov Chain Monte Carlo Methods | Neal93_Probabilistic Inference Using Markov Chain Monte Carlo Methods |
LiuJun2001_Monte_Carlo_Strategies_In_Scientific_Computing | LiuJun2001_Monte_Carlo_Strategies_In_Scientific_Computing |
MacKay_Introduction to Monte Carlo methods | Jaakkola, Tutorial on variational approximation methods, 2000. |
jaakkola00_variational-tutorial | Jordan and Ghahramani, An Introduction to Variational Methods for Graphical Models, 2001. |
arxiv17_Advances in Variational Inference | arxiv17_Advances in Variational Inference |
EM1977 | A. Dempster et al., Maximum Likelihood from Incomplete Data via the EM Algorithm |
Robbins&Monro1951_a stochastic approximation method | Robbins and Monro (1951). A stochastic approximation method. Ann. Math. Stat. |
pietra97_inducing features of random fields | Stephen Della Pietra, Vincent Della Pietra, and John Lafferty. "Inducing features of random fields", IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997. |
Gu&Kong_PNAS98_A stochastic approximation algorithm with Markov chain Monte-Carlo method for incomplete data estimation prob | Gu & Kong, A stochastic approximation algorithm with Markov chain Monte-Carlo method for incomplete data estimation problems, PNAS 1998. |
Delyon99_Convergence of a Stochastic Approximation Version of the EM Algorithm | Bernard Delyon, Marc Lavielle, and Eric Moulines. "Convergence of a stochastic approximation version of the EM algorithm." Annals of statistics (1999): 94-128. |
JSA | Zhijian Ou, Yunfu Song. "Joint Stochastic Approximation and Its Application to Learning Discrete Latent Variable Models". UAI 2020. |
DA-EBM | Xinwei Zhang, Zhiqiang Tan, Zhijian Ou. "Persistently Trained, Diffusion-assisted Energy-based Models". Stat 2023. |
ic10_nbhmm | Nan Ding, Zhijian Ou. "Variational nonparametric Bayesian hidden Markov model". ICASSP 2010. |
SDM06_bayes k-means as a max-expectation algorithm | Max Welling, Kenichi Kurihara. Bayesian K-Means as a "Maximization-Expectation" Algorithm. SDM 2006. |
Heckerman_tr-95-06 | Heckerman, A tutorial on learning with BNs, 1995 |
SEM.rar | Original papers on structural EM |
TRF | Bin Wang, Zhijian Ou, Zhiqiang Tan. "Learning Trans-dimensional Random Fields with Applications to Language Modeling". IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2018. |
nips02_lda | Blei, Ng and Jordan. Latent Dirichlet Allocation, NIPS 2002. |
BleiNgJordan2003_lda | Blei, Ng and Jordan. Latent Dirichlet Allocation, Journal of Machine Learning Research, 2003. |
iscslp2010_twclda | Yimin Tan, Zhijian Ou. "Topic-weak-correlated Latent Dirichlet Allocation". ISCSLP 2010. |
icassp2012_crfcm | Zhijian Ou, Huaqing Luo. “CRF-based Confidence Measures of Recognized Candidates for Lattice-based Audio Indexing.” ICASSP 2012. |