The "example" directory contains two data sets.
The first data set consists of 10 MFCC coefficients computed from the
nine English E-set letters (/B/, /C/, /D/, /E/, /G/, /P/, /T/, /V/, /Z/)
spoken by 15 female speakers. The second set contains raw speech signals of
the 10 digits /0-9/ spoken in German, recorded in different environments
(Sun version only).
For further information, see "usages.doc" and the demo program.
The format of the pattern data file:
20 1 5 <-- sequence length 20, class 1, vector dimension 5
0.7485 -0.1875 1.6818 -0.8903 0.1908
0.3011 0.0085 1.0517 -1.7370 0.2698
... total 20 vectors
10 2 5 <-- sequence length 10, class 2, vector dimension 5
-0.0969 0.2673 1.0544 -1.3507 0.5505
-0.5478 0.1819 0.8638 -1.3081 0.4994
... total 10 vectors
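The layout above can be read with a few lines of code. The following Python sketch is not part of the package; it assumes the pattern file is plain text, that values are separated by whitespace, and that the "<--" annotations shown above are explanatory only and do not appear in real files. The name parse_patterns is hypothetical.

```python
from typing import Iterator, List, Tuple

# (class label, list of feature vectors) for one sequence
LabeledSequence = Tuple[int, List[List[float]]]


def parse_patterns(text: str) -> Iterator[LabeledSequence]:
    """Yield (label, vectors) for each sequence in a pattern data text.

    Assumed layout: a header "length label dim" followed by
    length * dim floating-point values.
    """
    tokens = text.split()
    pos = 0
    while pos < len(tokens):
        if pos + 3 > len(tokens):
            raise ValueError("incomplete sequence header")
        length, label, dim = (int(t) for t in tokens[pos:pos + 3])
        pos += 3
        needed = length * dim
        values = tokens[pos:pos + needed]
        if len(values) < needed:
            raise ValueError("sequence is shorter than its header claims")
        # Cut the flat value list into `length` vectors of `dim` entries.
        vectors = [
            [float(v) for v in values[i * dim:(i + 1) * dim]]
            for i in range(length)
        ]
        pos += needed
        yield label, vectors
```

Reading by token rather than by line means the sketch does not care whether each vector sits on its own line, only that the header counts are correct.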
The format of the observation symbol file:
14 1 5 23 23 23 23 23 23 23 15 15 15 23 7 7
17 2 25 25 3 3 19 11 11 19 19 19 19 19 19 3 3 9 20
....
The first number is the length of the sequence, the second number is
its class membership, and the remaining numbers are the observation symbols.
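As an illustration of this layout, here is a small Python sketch that reads such a file. It is not part of the package; it assumes a plain-text file of whitespace-separated integers, and the name parse_symbol_sequences is hypothetical.

```python
from typing import List, Tuple


def parse_symbol_sequences(text: str) -> List[Tuple[int, List[int]]]:
    """Return a list of (class label, observation symbols) pairs.

    Assumed layout per sequence: length, class label, then `length` symbols.
    """
    tokens = [int(t) for t in text.split()]
    sequences = []
    pos = 0
    while pos < len(tokens):
        if pos + 2 > len(tokens):
            raise ValueError("incomplete sequence header")
        length, label = tokens[pos], tokens[pos + 1]
        symbols = tokens[pos + 2:pos + 2 + length]
        if len(symbols) != length:
            raise ValueError("sequence is shorter than its declared length")
        sequences.append((label, symbols))
        pos += 2 + length
    return sequences
```

Applied to the two sample lines above, this gives one sequence of 14 symbols for class 1 and one of 17 symbols for class 2.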
Purpose of uploading this package
---------------------------------
I am close to finishing my PhD work and have started looking for a job.
I hope that this package demonstrates my programming capabilities. Besides,
I have also done some excellent research work. My publication list follows:
[1] J. He, L. Liu, and G. Palm, "A text-independent speaker
identification system based on neural networks,"
Proc. of International Conference on Spoken Language Processing
(ICSLP'94), pp. 1851-1854, Sept. 1994, Yokohama, Japan.
[2] J. He, L. Liu, and G. Palm, "Perception of stop consonants in
VCV utterances reconstructed from partial Fourier transform
information," Proc. of Australian International Conference on
Speech Science and Technology (SST'94),
pp. 436-441, Nov. 1994, Perth, Australia.
[3] J. He, L. Liu, and G. Palm, "On the use of features from prediction
residual signals in speaker identification,"
Proc. of EUROSPEECH'95, Vol. 1, pp. 313-316, Sept. 1995,
Madrid, Spain.
[4] J. He, L. Liu, and G. Palm, "Speaker identification using hybrid
LVQ-SLP networks," Proc. IEEE ICNN'95, Vol. 4, pp. 2052-2055,
Perth, Australia.
[5] J. He, L. Liu, and G. Palm, "On the use of residual cepstrum in
speech recognition," Proc. IEEE ICASSP'96, Vol. 1, pp. 5-8, 1996,
Atlanta, USA.
[6] L. Liu, J. He, and A. Smit, "The importance of phase in the
perception of intervocalic stop consonants,"
J. Acoust. Soc. Am., p. 2340, 1992.
[7] L. Liu, J. He, and G. Palm, "Perception of stop consonants in
speech signals reconstructed from phase or amplitude,"
J. Acoust. Soc. Am., Vol. 94, p. 1883, 1993.
[8] L. Liu, J. He, and G. Palm, "The importance of phase in the
perception of intervocalic stop consonants," Proc. of Australian
International Conference on Speech Science and Technology (SST'94),
pp. 442-447, Nov. 1994, Perth, Australia.
[9] L. Liu, J. He, and G. Palm, "Influence of short-time phase on the
perception of stop consonants," Proc. of EUROSPEECH'95, pp. 2269-2272,
Sept. 1995, Madrid, Spain.
[10] L. Liu, J. He, and G. Palm, "Signal modeling for speaker
identification," Proc. IEEE ICASSP'96, Vol. 2, pp. 665-668, Atlanta, USA.
If you have a position available and would like to know more, I am happy
to provide further information.
--------------------------------
Jialong He
Abt. Neuroinformatik
University of ULM
89069 ULM, GERMANY
email: jialong@neuro.informatik.uni-ulm.de