Jt-chinese
来自cslt Wiki
data and model
- train
- size: 62M
- 8k-sentence from jt(about dianxin)
- dev
- 1000 row from train data
- dict
- chn_150576.txt(15w)
- model
Parameters | hidden | class | direct | bbt | bptt_block | threads | direct-order | rand_seed | nwords | time(min) | iter |
---|---|---|---|---|---|---|---|---|---|---|---|
set1 | 320 | 300 | 2000 | 5 | 20 | 1 | 3 | 1 | 10000 | (31h) | 8 |
- ppl
- dev:86-66(ppl)
- learning rate
- 0.1-0.00625
sample data from rnnlm
- different size of simple data
size | ppl | mix0.3 | mix0.5 | mix0.7 |
---|---|---|---|---|
50M | 105.457 | 86.7 | 87.5 | 89.7 |
100M | ||||
150 |