Intel Helps Facilitate AI Language Recognition
December 9, 2021 | Intel
At the annual Conference on Neural Information Processing Systems (NeurIPS), two Intel-supported whitepapers on spoken language datasets are being presented. The first paper, The People’s Speech, targets “automatic speech recognition” tasks; the second is Multilingual Spoken Words Corpus (MSWC), which involves “keyword spotting.” Datasets coming out of each project contribute a sizeable volume of rich audio data, and each is among the largest collections available in its class.
The MSWC paper is co-authored by Keith Achorn, an AI frameworks engineer in Intel’s Software and Advanced Technology Group (SATG). Keith talks about his experiences on the project in a blog on the Intel Community site.
The People’s Speech and MSWC projects started in 2018, under the auspices of ML Commons, to identify and chart the 50 most used languages in the world into a single dataset, and then figure out a way to make the data useful. Group members came from Intel, Harvard, Alibaba, Oracle, Landing AI, University of Michigan, Google, Baidu and others.
In today’s diverse international, multilingual work environment, the ability to accurately transcribe and translate becomes increasingly important. With these datasets, a computer using artificial intelligence can “hear” a spoken word and produce an automatic transcript or translation.
Both projects use “diverse speech,” which means they better represent a natural environment: background noise, informal speech patterns, and a mix of recording equipment across different acoustic environments. This stands apart from highly controlled content such as audiobooks, which are more “sanitized.” Training on diverse speech has been correlated with better accuracy in real-world use.
The People’s Speech project includes tens of thousands of hours of supervised conversational audio. It is now among the world’s largest English speech recognition datasets licensed for academic and commercial usage, and is free to download.
MSWC is an audio speech dataset that has more than 300,000 keywords in dozens of languages, and can be accessed by smart devices. The MSWC dataset spans languages spoken by over 5 billion people, and advances the research and development of voice applications for a wide global audience.
Both datasets will be widely available to users. They are released under highly permissive licensing terms that allow commercial use.