
Research News

SYSU Makes Research Progress on Speaker Voiceprint Recognition in the Big Data Era

Source: SYSU-CMU Joint Institute of Engineering
Written by: SYSU-CMU Joint Institute of Engineering
Edited by: Wang Dongmei

Recently, Dr. Ming Li of the SYSU-CMU Joint Institute of Engineering (hereinafter referred to as JIE) and his team proposed an unsupervised learning framework for speaker verification. The framework iteratively refines automatically generated clustering labels, which is of great significance in the big data era.

Voice is one of the main channels through which people acquire information, and it is the most convenient, effective and natural medium of human communication. As society becomes more thoroughly informatized, and especially as communications, multimedia and Internet technologies develop rapidly, intelligent voice technology is becoming increasingly important. One current research focus is therefore to find methods that verify a speaker's identity from the voice signal more accurately.

The research group led by Dr. Ming Li presented an unsupervised learning framework that addresses the speaker verification problem without any given data labels. To automatically recover speaker labels for the unlabeled training data, the team proposed using Affinity Propagation (AP), a clustering method that takes pairwise data similarities as input, to generate temporary class labels. These labels are then used to train a Probabilistic Linear Discriminant Analysis (PLDA) model, which produces a similarity score for each pair of speech samples. The group then fed these similarity scores back into AP clustering, establishing an iterative framework that updates the PLDA model repeatedly. With the final PLDA model obtained after several iterations, the system can verify whether two speech samples come from the same speaker. The team also evaluated the performance of different PLDA scoring methods on the multiple-enrollment task. Experiments show that the proposed iterative, unsupervised PLDA learning approach outperformed the cosine similarity baseline by more than 20%.
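The iterative loop described above can be illustrated with a small NumPy sketch. It is not the team's implementation, and several choices are assumptions made only for illustration: all function names are hypothetical, the first round uses cosine similarity, the AP preference is the median similarity, and the PLDA model is a simplified two-covariance variant fitted with moment estimates rather than EM.

```python
import numpy as np


def affinity_propagation(S, damping=0.7, n_iter=200):
    """Cluster from a pairwise similarity matrix S (n x n); returns integer labels."""
    n = S.shape[0]
    S = S.astype(float).copy()
    off_diag = ~np.eye(n, dtype=bool)
    # Assumption: one shared preference, the median off-diagonal similarity.
    np.fill_diagonal(S, np.median(S[off_diag]))
    R = np.zeros((n, n))  # responsibilities
    A = np.zeros((n, n))  # availabilities
    rows = np.arange(n)
    for _ in range(n_iter):
        # r(i,k) = s(i,k) - max over k' != k of [a(i,k') + s(i,k')]
        AS = A + S
        best = AS.argmax(axis=1)
        first = AS[rows, best]
        AS[rows, best] = -np.inf
        second = AS.max(axis=1)
        R_new = S - first[:, None]
        R_new[rows, best] = S[rows, best] - second
        R = damping * R + (1 - damping) * R_new
        # a(i,k) = min(0, r(k,k) + sum of positive r(i',k)); a(k,k) is not clipped
        Rp = np.maximum(R, 0)
        np.fill_diagonal(Rp, R.diagonal())
        A_new = Rp.sum(axis=0)[None, :] - Rp
        diag = A_new.diagonal().copy()
        A_new = np.minimum(A_new, 0)
        np.fill_diagonal(A_new, diag)
        A = damping * A + (1 - damping) * A_new
    exemplars = np.flatnonzero((A + R).diagonal() > 0)
    if exemplars.size == 0:  # no exemplar emerged: fall back to a single cluster
        return np.zeros(n, dtype=int)
    labels = S[:, exemplars].argmax(axis=1)
    labels[exemplars] = np.arange(exemplars.size)
    return labels


def fit_two_cov_plda(X, labels, reg=1e-3):
    """Moment estimates of the mean, between-speaker (B) and within-speaker (W) covariance."""
    d = X.shape[1]
    mu = X.mean(axis=0)
    classes = np.unique(labels)
    means = np.stack([X[labels == c].mean(axis=0) for c in classes])
    centered = X - means[np.searchsorted(classes, labels)]
    W = centered.T @ centered / len(X) + reg * np.eye(d)
    dm = means - mu
    B = dm.T @ dm / len(classes) + reg * np.eye(d)
    return mu, B, W


def plda_llr_matrix(X, model):
    """Log-likelihood ratio (same speaker vs. different speakers) for every pair of rows."""
    mu, B, W = model
    Z = X - mu
    d = Z.shape[1]
    T = B + W
    zeros = np.zeros((d, d))
    same = np.block([[T, B], [B, T]])          # joint covariance, shared speaker
    diff = np.block([[T, zeros], [zeros, T]])  # joint covariance, independent speakers
    M = np.linalg.inv(diff) - np.linalg.inv(same)
    P, Q = M[:d, :d], M[:d, d:]
    const = 0.5 * (np.linalg.slogdet(diff)[1] - np.linalg.slogdet(same)[1])
    self_term = 0.5 * np.einsum("id,de,ie->i", Z, P, Z)
    return self_term[:, None] + self_term[None, :] + Z @ Q @ Z.T + const


def iterative_unsupervised_plda(X, n_rounds=3):
    """Alternate AP clustering and PLDA training, starting from cosine similarity."""
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    S = Xn @ Xn.T  # assumption: round 0 uses the cosine baseline
    model, labels = None, None
    for _ in range(n_rounds):
        labels = affinity_propagation(S)     # temporary speaker labels
        model = fit_two_cov_plda(X, labels)  # retrain on those labels
        S = plda_llr_matrix(X, model)        # refined pairwise similarity
    return model, labels


if __name__ == "__main__":
    # Toy stand-in for utterance embeddings: 3 synthetic "speakers", 8 vectors each.
    rng = np.random.default_rng(0)
    speakers = np.repeat(np.arange(3), 8)
    X = 5.0 * np.eye(4)[speakers] + 0.3 * rng.standard_normal((24, 4))
    model, labels = iterative_unsupervised_plda(X)
    print("clusters found:", len(np.unique(labels)))
```

In this sketch a verification decision would compare the entry of `plda_llr_matrix` for a given pair against a threshold. A real system would operate on utterance-level speaker embeddings rather than synthetic vectors, and the number of rounds, the damping factor and the regularization constant would all need tuning.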

At the 9th International Symposium on Chinese Spoken Language Processing (ISCSLP 2014) and the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), held in Singapore, Dr. Ming Li presented three papers on speaker voiceprint recognition. Among them, the paper titled “An Iterative Framework for Unsupervised Learning in the PLDA based Speaker Verification”, co-authored by Wenbo Liu, Zhiding Yu and Dr. Ming Li, won the Best Student Paper award. Wenbo Liu is a first-year dual-degree Ph.D. student affiliated with the SYSU-CMU Joint Institute of Engineering and the Department of ECE, Carnegie Mellon University, advised by Dr. Ming Li. Zhiding Yu is a third-year Ph.D. candidate in the Department of ECE, Carnegie Mellon University.

The Ph.D. programs at JIE are committed to cultivating researchers who explore in depth the theory, methodology, techniques and instruments of electrical and computer engineering, and who thereby enrich and advance the body of knowledge in the field. Students in the JIE double-degree Ph.D. program study at Carnegie Mellon’s Pittsburgh campus for two years and receive two degrees upon graduation: one from Sun Yat-sen University and one from Carnegie Mellon University.

