|
|
(3 intermediate revisions by 2 users not shown) |
Line 22: |
Line 22: |
| * Recording and cutting the audios, a total of 12 groups | | * Recording and cutting the audios, a total of 12 groups |
| || | | || |
− | * Recording the 440 groups audios left with zhangmiao | + | * Continue to record the audios with zhangmiao |
| * Continue to ask people to do human test | | * Continue to ask people to do human test |
| |- | | |- |
Line 34: |
Line 34: |
| || | | || |
| * Continue to ask people to do human test | | * Continue to ask people to do human test |
− | * Recording(the goal is to record 400 to 500 people) | + | * Recording(the goal is to record 400 to 500 people) [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/cc/录音说明.pdf here] |
− | [Recording instructions[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/cc/录音说明.pdf]]
| + | |
| |- | | |- |
| | | |
Line 83: |
Line 82: |
| |Zhiyuan Tang | | |Zhiyuan Tang |
| || | | || |
− | * | + | * Organized the code and doc of Parrot system[http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=tangzy&step=view_request&cvssid=635] |
| || | | || |
− | * | + | * Theoretical study of pronunciation detection |
− | *
| + | |
| |- | | |- |
| | | |
Date |
People |
Last Week |
This Week
|
2017.9.4
|
Jiayin Cai
|
- Got phonetic features from a stronger phonetic network
- Finished part of the experiment using the stronger phonetic features.
|
- Will be away for school.
- Will still finish the remaining experiment.
|
Xiaofei Kang
|
- Improve the human test website: save the test recordings, reduce the number of positive samples
- Recording and cutting the audios, a total of 12 groups
|
- Continue to record the audios with zhangmiao
- Continue to ask people to do human test
|
Miao Zhang
|
- Perform human test
- Record some other people and do the experiments again
|
- Continue to ask people to do human test
- Recording(the goal is to record 400 to 500 people) here
|
Yanqing Wang
|
|
|
Ying Shi
|
- Multi-decoding ASR model with more pdfs. Performance is better than before but not yet good enough
- Add a separate symbol to discriminate the Kazakh and Uyghur word sets
- Group-based softmax (in progress)
|
- finish group-based softmax and test the performance
|
Yixiang Chen
|
|
|
Lantian Li
|
- Go on speaker segmentation tasks, see here
- Complete the phonetic-aware speaker segmentation.
- Word-level boundaries from the ASR.
- Word-level d-vector and clustering.
|
|
Zhiyuan Tang
|
- Organized the code and doc of Parrot system[1]
|
- Theoretical study of pronunciation detection
|
Date |
People |
Last Week |
This Week
|
2017.9.4
|
Jiayin Cai
|
- Finished the phonetic i-vector experiment.
|
- get BN feature and train i-vector LID.
- Get phonetic features from a stronger phonetic network
- combine PTN and phonetic i-vector.
|
Xiaofei Kang
|
- Cutting and labeling audio: 21 speakers, 1050 sentences in total
- Finish the new speaker recognition using the two recordings.
|
- improve the human Test website
|
Miao Zhang
|
|
- Perform human test on 21-style speech (add the disguise)
- Draw spectra and t-SNE plots to compare with the experimental results
|
Yanqing Wang
|
|
|
Ying Shi
|
- Multi-decoding ASR model
- Multi-decoding with fake LID, see here
- Read code about TTS
|
- Employ group softmax to train the multi-decoding ASR model
- Synthesize one 'real' speech sample
|
Yixiang Chen
|
|
|
Lantian Li
|
- Go on speaker segmentation tasks, see here
- Dimensionality reduction.
- Clustering.
- Visualization.
|
- Phonetic-aware speaker segmentation.
|
Zhiyuan Tang
|
- more indicators for VV scoring system, see [2].
|
- more indicators, a demo with Shuai.
- toolbook writing.
|