A typical snippet from Stanford TMT's "summary.txt" file looks like this:
Topic00 37.47500834475079
term1 11.163093014855274
term2 2.8478206435760547
term3 1.905685547333616
term4 1.8341840331688735
So far, the only information I have been able to find about these numbers is the following (from http://nlp.stanford.edu/software/tmt/tmt-0.4):
[Snapshot]/summary.txt Human readable summary of the topic model, with top-20 terms per topic and how many words instances of each have occurred.
But what does the number next to the topic mean? (In this example, Topic00 37.47500834475079)
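To make the question concrete, here is a minimal Python sketch that parses the snippet and prints the topic's number next to the sum of its listed term numbers. It rests on assumptions that the documentation does not confirm: each line is a whitespace-separated name/number pair, and topic lines are the ones whose name starts with "Topic". The function name `parse_summary` is hypothetical and not part of TMT. Since the file lists only the top-20 terms per topic, the two figures need not match.

```python
from typing import Dict, List, Tuple

Topic = Tuple[float, List[Tuple[str, float]]]


def parse_summary(text: str) -> Dict[str, Topic]:
    """Map each topic name to (topic number, [(term, term number), ...]).

    Assumption: lines are "<name> <number>", and a name beginning with
    "Topic" starts a new topic. A real term spelled that way would be
    misread as a topic header.
    """
    topics: Dict[str, Topic] = {}
    current = None
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or unexpected lines
        name = parts[0]
        try:
            value = float(parts[1])
        except ValueError:
            continue  # second field is not a number
        if name.startswith("Topic"):
            current = name
            topics[current] = (value, [])
        elif current is not None:
            topics[current][1].append((name, value))
    return topics


sample = """Topic00 37.47500834475079
term1 11.163093014855274
term2 2.8478206435760547
term3 1.905685547333616
term4 1.8341840331688735
"""

for topic, (number, terms) in parse_summary(sample).items():
    listed = sum(value for _, value in terms)
    print(f"{topic}: topic number = {number:.3f}, sum of listed terms = {listed:.3f}")
```

Running this on a full summary.txt would show, for every topic, how the number on the topic line relates to the numbers on the term lines beneath it.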