dc.contributor.author | Beneš, David |
dc.date.accessioned | 2023-05-09T11:56:45Z |
dc.date.available | 2023-05-09T11:56:45Z |
dc.date.issued | 2023-04-25 |
dc.identifier.uri | http://hdl.handle.net/11234/1-5140 |
dc.description | This dataset can serve as a training and evaluation corpus for the task of training keyword detection with speaker direction estimation (keyword direction of arrival - KWDOA). It was created by processing the existing Speech Commands dataset [1] with the PyroomAcoustics library so that the resulting speech recordings simulate the usage of a circular microphone array with 4 microphones having a distance of 57 mm between adjacent microphones. Such design of a simulated microphone array was chosen in order to match the existing physical microphone array from the Seeeduino series. [1] Warden, Pete. “Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition.” ArXiv.org, 2018, arxiv.org/abs/1804.03209 |
dc.language.iso | eng |
dc.publisher | University of West Bohemia, Department of Cybernetics |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ |
dc.subject | speech commands |
dc.subject | keyword direction of arrival |
dc.title | Speech Commands Dataset Enhanced for Direction-of-Arrival Estimation |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | audio |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
contact.person | Pavel Ircing ircing@kky.zcu.cz University of West Bohemia, Department of Cybernetics |
files.size | 111046825622 |
files.count | 5 |
Files in this item
This item is
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Name
- test.7z
- Size
- 6.82 GB
- Format
- Unknown
- Description
- test
- MD5
- 05dd84d07107761c35e006a1a9f51ee9
- Name
- train.7z
- Size
- 52.44 GB
- Format
- Unknown
- Description
- train
- MD5
- 4537ebca6dd2917e76ac6a8f9e36205d
- Name
- validate.7z
- Size
- 6.28 GB
- Format
- Unknown
- Description
- validate
- MD5
- 986e3819a577913cb6ede381cae9c576
- Name
- background_noise.7z
- Size
- 37.88 GB
- Format
- Unknown
- Description
- background noise
- MD5
- dcf3ff7d27a1dda0f261b2134799ad94
- Name
- README.pdf
- Size
- 571 KB
- Format
- Description
- README
- MD5
- 0bc6603449b8e20b070026ced35cb824