
Robot Hearing Through Optical Channel in a Cocktail Party Environment
  • Xiao Guo,
  • Siyi Ding,
  • Peng Ti,
  • Kenan Li,
  • Xiaoping Hong
Xiao Guo
Southern University of Science and Technology
Siyi Ding
Southern University of Science and Technology

Corresponding Author: [email protected]

Peng Ti
Southern University of Science and Technology
Kenan Li
Southern University of Science and Technology
Xiaoping Hong
Southern University of Science and Technology

Abstract

The cocktail party problem refers to the difficulty the human sensory system faces in separating a specific voice from a loud mixture of background sound sources. The problem is even more demanding for machines and has become a holy grail of robotic hearing. Despite many advances in noise suppression, the intrinsic information in a contaminated acoustic channel remains difficult to recover. Here we show a simple yet powerful laser-assisted audio system, termed REAL (Robot Ear Accomplished by Laser), that probes the vibrations of sound-carrying surfaces (mask, throat, and other nearby surfaces) through an optical channel, which is intrinsically immune to acoustic background noise. Our results demonstrate that REAL can obtain the audio-frequency content directly from the laser, without interference from the acoustic channel. The signals can be further transcribed into human-recognizable audio by exploiting their internal time and frequency correlations with memory-enabled neural networks. The REAL system could enable a new mode of human-robot interaction.
Xiaoping Hong Email: [email protected]
16 Jun 2022 Submitted to AISY Interactive Papers
17 Jun 2022 Published in AISY Interactive Papers
09 Dec 2022 Published in Advanced Intelligent Systems, article 2200143. DOI: 10.1002/aisy.202200143