Spanish is one of the most commonly spoken languages in the world. That being said, it's not always easy to find datasets with a specific dialect or type of speech to train your models. That's why we've done the hard bit for you. Here at Twine, we've searched high and low to find the best Spanish language speech datasets.

Do you want to build a custom dataset? We specialize in helping companies create high-quality custom audio and video datasets.

Let's dive into our list of the best Spanish language speech datasets in 2022. Here are our top picks:

1. Biggest Non-Commercial Spanish Language Speech Dataset

The LibriVox Spanish Speech Dataset features 73 hours of read speech and transcripts. The audio consists of sentences from 300 books read by 154 native Spanish speakers (77 men and 77 women).

Not quite your style? Check out these alternatives: This open-source dataset consists of 5.56 hours of transcribed Peninsular Spanish conversational speech on selected topics, covering 17 conversations between four pairs of speakers.

2. TEDx Spanish Corpus

The TEDx Spanish Corpus Dataset contains spontaneous speech from several speakers at TEDx events, most of them men. It is a gender-unbalanced corpus with a duration of 24 hours.

Best Argentinian Spanish Speech Dataset

Created by Google in 2018, the Argentinian Spanish Speech Multi-Speaker Dataset contains about 5,900 transcribed high-quality audio recordings of Argentinian Spanish sentences recorded by volunteers.

1,651 speakers in total, 43% male and 57% female. Recorded in a quiet indoor environment with low background noise and no echo. Audio format: 16kHz, 16-bit, uncompressed WAV, mono channel.

The Crowdsourced Argentinian Spanish Speech Dataset is a fantastic option: it contains recordings of simple weather messages in Argentinian Spanish (90 messages) and Peninsular Spanish (90 messages).

The Emilia Argentinian Spanish Speech Dataset has a duration of three hours 15 minutes and comprises over 2,218 sentences. The dataset was recorded by volunteers in Buenos Aires, Argentina.

This dataset was collected for speech technology research from native Colombian Spanish speakers who volunteered to supply the data. The audio is high quality (48kHz, 16-bit, mono, WAV).