Difference between revisions of "Jt-chinese"
From cslt Wiki
Revision as of 15:07, 16 November 2014 (Sunday)
data and model
- train
  - size: 62M
  - 8k sentences from jt (about dianxin)
- dev
  - 1000 rows taken from the training data
- dict
  - chn_150576.txt (about 150k words)
- model
| Parameters | hidden | class | direct | bbt | bptt_block | threads | direct-order | rand_seed | nwords | time | iter |
|---|---|---|---|---|---|---|---|---|---|---|---|
| set1 | 320 | 300 | 2000 | 5 | 20 | 1 | 3 | 1 | 10000 | 31 h | 8 |
- ppl
  - dev: 86-66 (ppl)
- learning rate
  - 0.1-0.00625
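
The two learning-rate endpoints above are exactly four halvings apart (0.1 / 2^4 = 0.00625). The sketch below only illustrates that arithmetic. It assumes the rate was halved step by step, which the page does not state (only the endpoints are recorded), and `halving_schedule` is a hypothetical helper, not part of any toolkit.

```python
def halving_schedule(start, end):
    """Return the learning rates from `start` down to `end`, halving each step.

    Assumption: the rate is reduced by a factor of 2 per step; the source
    page records only the first and last values.
    """
    rates = [start]
    # The small tolerance guards against floating-point rounding at the boundary.
    while rates[-1] / 2 >= end * (1 - 1e-9):
        rates.append(rates[-1] / 2)
    return rates


rates = halving_schedule(0.1, 0.00625)
halvings = len(rates) - 1  # number of times the rate was halved

print(rates)
print(halvings)
```

Under this assumption, the rate would have been halved in 4 of the 8 iterations listed in the table.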