Mimicking The Brain To Realize 'human-like' Virtual Assistants

Speech is more than just a form of communication. A person’s voice conveys emotions and personality and is a unique trait we can recognize. Our use of speech as a primary means of communication is a key reason for the development of voice assistants in smart devices and technology. Typically, virtual assistants analyze speech and respond to queries by converting the received speech signals into a model they can understand and process to generate a valid response. However, they often have difficulty capturing and incorporating the complexities of human speech and end up sounding very unnatural.

Now, in a study published in the journal IEEE Access, Professor Masashi Unoki from Japan Advanced Institute of Science and Technology (JAIST), and Dung Kim Tran, a doctoral course student at JAIST, have developed a system that can capture the information in speech signals similarly to how humans perceive speech.

“In humans, the auditory periphery converts the information contained in input speech signals into neural activity patterns (NAPs) that the brain can identify. To emulate this function, we used a matching pursuit algorithm to obtain sparse representations of speech signals, or signal representations with the minimum possible significant coefficients,” explains Prof. Unoki. “We then used psychoacoustic principles, such as the equivalent rectangular bandwidth scale, gammachirp function, and masking effects to ensure that the auditory sparse representations are similar to that of the NAPs.”

To test the effectiveness of their model in understanding voice commands and generating an understandable and natural response, the duo performed experiments to compare the signal reconstruction quality and the perceptual structures of the auditory representations against conventional methods. “The effectiveness of an auditory representation can be evaluated in terms of three aspects: the quality of the resynthesized speech signals, the number of non-zero elements, and the ability to represent perceptual structures of speech signals,” says Prof. Unoki.

To evaluate the quality of the resynthesized speech signals, the duo reconstructed 630 speech samples spoken by different speakers. The resynthesized signals were then rated using PEMO-Q and PESQ scores—objective measures for sound quality. They found the resynthesized signals to be comparable to the original signals. Additionally, they made auditory representations of certain phrases spoken by 6 speakers.

The duo also tested the model on its ability to capture voice structures accurately by using a pattern-matching experiment to determine if the auditory representations of the phrases could be matched to spoken utterances or queries made by the same speakers.

“Our results showed that the auditory sparse representations produced by our method can achieve high quality resynthesized signals with only 1,066 coefficients per second. Furthermore, the proposed method also provides the highest matching accuracy in a pattern matching experiment,” says Prof. Unoki.

From smartphones to smart televisions and even smart cars, the role of voice assistants is becoming more and more indispensable in our daily lives. The quality and the continued usage of these services will rely on their ability to understand our accents and our pronunciation and respond in a way we find natural. The model developed in this study could go a long way in imparting human-like qualities to our voice assistants, making our interactions not only more convenient but also psychologically satisfying.

Study explains role of bone-conducted speech transmission in speech production and hearing

More information:
Dung Kim Tran et al, Matching Pursuit and Sparse Coding for Auditory Representation, IEEE Access (2021). DOI: 10.1109/ACCESS.2021.3135011

Provided by
Japan Advanced Institute of Science and Technology

Citation:
Mimicking the brain to realize ‘human-like’ virtual assistants (2022, February 3)
retrieved 3 February 2022
from https://techxplore.com/news/2022-02-mimicking-brain-human-like-virtual.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.

Mimicking the brain to realize ‘human-like’ virtual assistants

Related Posts

Investor nerves tested as market themes unravel

Apple removes WhatsApp, Threads from App Store in China after demand by Beijing over security concerns

Microsoft’s AI app VASA-1 makes photographs talk and sing with believable facial expressions

Amanda Bynes ‘was approached to interview for shocking Quiet On Set docuseries but DECLINED’ as the former child star ‘did not have a bad experience with Nickelodeon’

Product And Service Training For Sales Professionals

Friends and Foes: Astrocytes as Disease Targets

Queen Mary of Denmark cuts a frosty figure as she looks away from King Frederik on skiing trip to Verbier

Hedgey Protocol loses $44.7M in dual cyber attacks

Malaysian Doctor Expresses Deep Regret for Vaccination Stance, Offers Profuse Apology for Past Advice During COVID-19 Pandemic | The Gateway Pundit

Protecting citizens ‘imperative’, says Garda Commissioner after protest outside O’Gorman’s home – The Irish Times

BAFTA 2025 Date Award Ceremony

PopularStories

Hedgey Protocol loses $44.7M in dual cyber attacks

Malaysian Doctor Expresses Deep Regret for Vaccination Stance, Offers Profuse Apology for Past Advice During COVID-19 Pandemic | The Gateway Pundit

Protecting citizens ‘imperative’, says Garda Commissioner after protest outside O’Gorman’s home – The Irish Times

BAFTA 2025 Date Award Ceremony

About Us

Pages