|
|
第50行: |
第50行: |
| |Ying Shi | | |Ying Shi |
| || | | || |
− | * | + | * mult-decoding ASR model with more pdfs down. Performance better than before but not well enough |
| + | * add sperate symbel to discriminated kazak and uyghur |
| + | * group-based softmax(in progress) |
| || | | || |
− | * | + | * finish group-based softmax |
| |- | | |- |
| | | |
Date |
People |
Last Week |
This Week
|
2017.9.4
|
Jiayin Cai
|
- Got phonetic feat from a stronger phonetic network
- Finished part of the experiment using stronger phonetic feature.
|
- Will be absent for school.
- But I will finish the remaining experiment.
|
Xiaofei Kang
|
- improve the human Test website:, save the test recordings, decline the positive samples
- Recording and cutting the audios, a total of 12 groups
|
- Recording the 440 groups audios left with zhangmiao
- Continue to ask people to do human test
|
Miao Zhang
|
- Perform human test
- Record some other people and do the experiments again
|
- Continue to ask people to do human test
- Recording(the goal is to record 400 to 500 people)
|
Yanqing Wang
|
|
|
Ying Shi
|
- mult-decoding ASR model with more pdfs down. Performance better than before but not well enough
- add sperate symbel to discriminated kazak and uyghur
- group-based softmax(in progress)
|
- finish group-based softmax
|
Yixiang Chen
|
|
|
Lantian Li
|
|
|
Zhiyuan Tang
|
|
|
Date |
People |
Last Week |
This Week
|
2017.9.4
|
Jiayin Cai
|
- Finished the phonetic i-vector experiment.
|
- get BN feature and train i-vector LID.
- Get phonetic feat from a stronger phonetic network
- combine PTN and phonetic i-vector.
|
Xiaofei Kang
|
- cutting audio and marking:21 speakers,a total of 1050 sentences
- Finish the new speaker recognition using the two recordings.
|
- improve the human Test website
|
Miao Zhang
|
|
- Perform human test on 21-style speech(add the disguise)
- Draw spectrums and t-SNE plots compared with experiment results
|
Yanqing Wang
|
|
|
Ying Shi
|
- multi decodeing ASR model
- multi decodeing with fake Lid here
- read code about TTS
|
- employ group softmax to train multi decoding ASR model
- synthesis one 'real' speech
|
Yixiang Chen
|
|
|
Lantian Li
|
- Go on speaker segmentation tasks, see here
- Dimensionality reduction.
- Clustering.
- Visualization.
|
- Phonetic-aware speaker segmentation.
|
Zhiyuan Tang
|
- more indicators for VV scoring system, see [1].
|
- more indicators, a demo with Shuai.
- toolbook writing.
|