Disclaimer: The software is copyrighted by Yimin Tan and Zhijian Ou, 2010. The software is distributed "as is" without warranties of any kind, either express or implied. The software is not for public distribution. Prior approval must be obtained from the copyright holders for any distribution. Reference: Yimin Tan, Zhijian Ou. Topic-weak-correlated Latent Dirichlet Allocation. International Symposium on Chinese Spoken Language Processing (ISCSLP), Tainan, Taiwan, 2010,12. Filter : Corpus processing, with multiple filters FilterBatch : Batch call Filter CharactertoID : Convert word corpus to word-id corpus. Use SRILM to create lexicon. duan-ldacount : Collect sufficient statistics needed by TWC-LDA lda-pi-copy : Learn TWC-LDA GenSVMFile_reuters : Convert the learned .gramma file to the format that can be used in SVMLIGHT for text classification