|
 |
 |
 |
Phase-based Speech Processing
|
Most speech recognition and enhancement systems utilize only the magnitude of the recorded speech signals. This project involves utilizing only the phase of the recorded signals for speech recognition and enhancement.
Representative Publications:
1. Aarabi, P., Shi, G.
Phase-Based Dual-Microphone Robust Speech Enhancement,
IEEE Transactions on Systems, Man, and Cybernetics Part B, Vol. 34, No. 4, pp. 1763-1773, August 2004.
"pdf"
"ps.gz"
2. Lai, C., Aarabi, P.,
Multiple-Microphone Time-Varying Filters For Robust Speech Recognition.
Proceedings of the 2004 IEEE Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), Montreal, May 2004.
"pdf"
"ps.gz"
3. Shi, G., Aarabi, P.,
Robust Digit Recognition Using Phase-Dependent Time-Frequency Masking.
Proceedings of the 2003 IEEE Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Hong Kong, April 2003.
"pdf"
"ps.gz"
|
|
|
UofT Explorer
|
The UofT Explorer project utilizes an algorithm designed to quickly rank 2-dimensional images according to the user’s preference. 65 images are taken around the UofT St. George campus at different angles and elevations. When the user clicks on a point on any image, images that contain the selected area are given priority in the display. The algorithm depends on the proximity of labeled buildings to localize the area where the user selects. Images are ranked according to how well they show the area chosen by the user.
|
|
© Copyright 2005 the Artificial Perception Laboratory
|
| |