Difference between revisions of "Jt-chinese"
From cslt Wiki
Revision as of 01:16, 1 December 2014 (Monday)
data and model
- train
- size: 62M
- 8k sentences from jt (about dianxin)
- dev
- 1000 rows from the training data
- dict
- chn_150576.txt (about 150k entries)
- model
| Parameters | hidden | class | direct | bptt | bptt_block | threads | direct-order | rand_seed | nwords | time | iter |
|---|---|---|---|---|---|---|---|---|---|---|---|
| set1 | 320 | 300 | 2000 | 5 | 20 | 1 | 3 | 1 | 10000 | 31h | 8 |
- ppl
- dev:86-66(ppl)
- learning rate
- 0.1-0.00625
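
The learning-rate range above is consistent with starting at 0.1 and halving four times (0.1 / 2^4 = 0.00625). The page does not say how the rate was decayed, so the halving rule in this minimal sketch is an assumption, and `halving_schedule` is a hypothetical helper, not part of any toolkit:

```python
def halving_schedule(start, floor):
    """List the learning rates obtained by repeatedly halving `start`
    until going further would drop below `floor`.

    Assumption: the rate is halved at each decay step; the page only
    gives the end points 0.1 and 0.00625.
    """
    rates = [start]
    # Small relative tolerance guards against floating-point round-off.
    while rates[-1] / 2 >= floor * (1 - 1e-9):
        rates.append(rates[-1] / 2)
    return rates


schedule = halving_schedule(0.1, 0.00625)
print(schedule)
```

Under this assumption the schedule has five values, i.e. four decay steps between the first and last rate.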
sample data from rnnlm
- different sizes of sample data
| size | ppl | mix0.3 | mix0.5 | mix0.7 |
|---|---|---|---|---|
| 50M | 105.457 | 86.7 | 87.5 | 89.7 |
| 100M | | | | |
| 150 | | | | |
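
The `mix0.3`, `mix0.5` and `mix0.7` columns presumably report perplexity after linearly interpolating two language models with the given weight. The page does not state which model receives the weight, so that choice, the function names, and the toy probabilities below are all assumptions made for illustration. A minimal sketch of per-word interpolation and perplexity:

```python
import math


def interpolate(p_a, p_b, weight):
    """Mix two per-word probability lists: weight * p_a + (1 - weight) * p_b.

    Assumption: `weight` applies to the first model; the page does not
    say which model the mix weight refers to.
    """
    return [weight * a + (1.0 - weight) * b for a, b in zip(p_a, p_b)]


def perplexity(word_probs):
    """Perplexity = exp of the negative mean log-probability per word."""
    log_sum = sum(math.log(p) for p in word_probs)
    return math.exp(-log_sum / len(word_probs))


# Toy per-word probabilities (made up, not taken from the experiments).
p_model_a = [0.02, 0.10, 0.005, 0.03]
p_model_b = [0.01, 0.20, 0.010, 0.02]

for w in (0.3, 0.5, 0.7):
    mixed = interpolate(p_model_a, p_model_b, w)
    print(f"mix{w}: ppl = {perplexity(mixed):.3f}")
```

Sweeping the weight over a dev set and keeping the value with the lowest perplexity is the usual way to pick the mix.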