Breaking through core technology and strengthening voice industry

2022-01-06

"Hello, welcome to the Beijing Winter Olympics." As soon as the voice fell, it was converted into English, French, Japanese and other languages. Walking into the Beijing Winter Olympics cabin, a virtual anchor named "Aijia" has officially taken up his post. With the support of intelligent speech recognition technology, he can translate Mandarin into multiple languages all over the world in real time, so as to make the Olympic voice spread all over the world faster. Not only the virtual anchor, but also the AI (Artificial Intelligence) private education integrated with the fitting mirror, the intelligent wearable device that helps the express brother receive and send pieces quickly, and the mouse that can "understand" the voice, type and surf the Internet by itself... Nowadays, more and more intelligent voice technologies are moving from the laboratory to terminal applications, entering and serving people's daily life. "As an important part of the software industry, the intelligent voice industry has entered a new stage of rapid development." At the recently held summit forum on the development of China's intelligent voice industry, Wang Jianwei, deputy director of the information technology development department of the Ministry of industry and information technology, introduced that China's intelligent voice industry has developed vigorously in recent years, and the core technology has made a breakthrough. At present, the accuracy of speech recognition has reached 98%. The latest white paper on the development of China's voice industry 2020-2021 (hereinafter referred to as the white paper) shows that the scale of China's intelligent voice market will reach 21.7 billion yuan in 2020, with a year-on-year increase of 31%, and 28.5 billion yuan in 2021, with a year-on-year increase of 44%, effectively driving the development of industry digitization. "In the era of interconnection of all things, more and more intelligent devices need to be operated from a certain distance, bringing development opportunities for the intelligent voice industry." Liu Qingfeng, chairman of China voice industry alliance and chairman of iFLYTEK, introduced that intelligent devices enabled by voice interaction are growing rapidly. Taking iFLYTEK as an example, the amount of voice assistant interaction increased by 84% year-on-year in 2021. As China's intelligent voice industry has entered a period of large-scale cultivation, how to accelerate the R & D and industrialization of key technologies and promote the continuous expansion and strength of the industry has become a common concern in the industry. "At present, the development of intelligent voice technology faces three major challenges: Multilingual intercommunication, human-computer interaction in complex scenes and multimodal virtual world." According to Liu Qingfeng's analysis, multilingualism not only refers to foreign languages, but also includes domestic dialects; Complex scenes are to achieve accurate recognition in high noise and multi person speaking scenes. The recognition rate of iFLYTEK products is expected to increase from 69% to 80% this year; Multimodal interaction is to add factors such as timbre, tone, expression and mouth shape into speech to make perception more intelligent. The white paper points out that the key innovations for the future development of intelligent voice are unsupervised learning, multimodal integration, cross integration and innovation of brain science, etc. At present, unsupervised learning and low resource model algorithms need to be broken through. In addition, there is still a gap between China and the international advanced level in the field of AI chips as the basis of computing power. To promote the high-quality development of intelligent voice industry, the Ministry of industry and information technology will carry out three aspects of work in the next step. Wang Jianwei introduced that first, encourage local governments to speed up the formulation of industrial policies conducive to promoting the integrated development of intelligent voice technology and the real economy. Second, encourage leading enterprises and scientific research institutions to jointly carry out technical research, further improve the technical level of speech recognition, synthesis, interaction and speech chip, and build national inspection and detection and other public service platforms to provide strong support for industrial development. At present, China voice industry alliance has attracted more than 70 enterprises with core technologies in the upstream and downstream of the industrial chain, and will add another 70 related enterprises in the future, and encourage more scientific research institutions to join the alliance. Third, continue to expand application scenarios and accelerate the integrated application of voice technology in the fields of intelligent manufacturing, smart home, intelligent medical treatment, education and elderly care. (Xinhua News Agency)

Edit:Li Ling    Responsible editor:Chen Jie

Source:People's Daily

Special statement: if the pictures and texts reproduced or quoted on this site infringe your legitimate rights and interests, please contact this site, and this site will correct and delete them in time. For copyright issues and website cooperation, please contact through outlook new era email:lwxsd@liaowanghn.com

Return to list

Recommended Reading Change it

Links

Submission mailbox:lwxsd@liaowanghn.com Tel:020-817896455

粤ICP备19140089号 Copyright © 2019 by www.lwxsd.com.all rights reserved

>