This toolkit implements Trans-dimensional Random Fields for Language Modeling.
The toolkit is written in C++ and licensed under the Apache License v2.0.
It includes the following directories:
projects : the C++ source code for the toolkit, which can be opened with Visual Studio 2010 or newer.
wbModule : the source code
tools : the executable (.exe) files, for Windows x64 systems
PTB : the experiments on the Penn Treebank (PTB) corpus. See PTB/#Readme#.txt to reproduce the experiments, as reported in
Bin Wang, Zhijian Ou and Zhiqiang Tan. Trans-dimensional Random Fields for Language Modeling.
Annual Meeting of the Association for Computational Linguistics (ACL Long Paper), Beijing, China, July 2015.
Note that we include only a few sentences from the PTB corpus (separately for training, development, and testing), NOT the complete corpus. If you own a licensed copy of PTB, email us and we can provide the corpus, formatted so that the toolkit can use it directly.
Feel free to contact us if you have any questions.
Cheers,
wb.th08@gmail.com (Bin Wang), ozj@tsinghua.edu.cn (Zhijian Ou)