• No results found

Speech Enhancement for Hands-Free Terminals

N/A
N/A
Protected

Academic year: 2022

Share "Speech Enhancement for Hands-Free Terminals"

Copied!
19
0
0

Loading.... (view fulltext now)

Full text

(1)

Speech Enhancement Hands-Free Terminals for

Nedelko Grbic,

Sven Nordholm and Anders Johansson

(2)

Contents

n Handsfree Telephony Principles

n Handsfree problem

n Optimal Beamformers

n

Linearly Constrained Minimum Variance Beamfomer

n

Optimal Signal-to-Noise plus Interference

n

Diffuse Noise Field Beamformer

n

Minimum Mean Square Error Beamformer

n Results in a real environment

n Conclusions

(3)

Handsfree Telephony

n Safety problems in cars

n Inconvenience of conversation

n Prohibited by legislation in some

regions

(4)

Handsfree Telephony

n Perception problems

n

Acoustic feedback

n

Wind and Tire friction in cars

n

Engine and Fan noise

Single Mic.

(5)

Handsfree Telephony

Beamformer

Beamformer, 6 Mics.

n Speech enhancement by means of

beamforming

(6)

Handsfree problem

Speech + Noise

Speech + Noise

Distance = R

2

1

R α

Noise

Noise

] [ log

10 * dB

x

SNR = + α ] [ dB x SNR =

α

(7)

Handsfree Improvement

Distance = R

2

1

R α

Noise

] [ ) log(

10 * dB

x

SNR β

+ α

=

α β

Sensors

∝ # β

Speech + Noise

(8)

Spatial Selectivity

Wave propaga

tion direction

Resulting signal waveform Wave propagation direction

(9)

Spatial Selectivity

Wave propagation direction

Wave propaga

tion direction

Resulting signal waveform

(10)

Broadband Beamformer

w

1

[j]

w

2

[j]

w

3

[j]

w

4

[j]

w

I

[j]

Output

#I Microphones

x1(n) x2(n) x3(n) x4(n)

xI(n)

FIR filters

(11)

Ex. Broadband response

(12)

Beamforming approaches

Data independent Beamformers

n

The Delay and Sum Beamformer

n

Multidimensional Filter designed Beamformers

Statistical Beamformers

n

Linearly Constrained Minimum Variance Beamforming

n

The Optimal Signal-to-Noise plus Interference (SNIB) Beamformer

n

Minimum Mean Square Beamformer

n

Diffuse Noise Field Beamformer

(13)

Linearly Constrained Minimum Variance Beamformer (LCMV)

=>

For each frequency, the weights are found For each frequency, the weights are found

from:

from:

=>

The correlation matrix contains contributions from all sources

Subject to:

(14)

Optimal SNIB Beamformer

The weights that maximizes the quote, are found from the Generalized Eigenvalue relation, i.e.,

=>

The correlation matrix contains contributions from the

source of interest and contains contributions from all other

sources

(15)

MMSE Beamformer

(16)

Diffuse Noise Field beamformer

For each frequency, the weights are found For each frequency, the weights are found

from:

from:

(17)

Evaluation Conditions

n Environment in car running at 110 km/h

n Linear sensor array

n 6 sensors with 12 kHz sampling rate

n Evaluation on real speech signals

(18)

Results

1.9 4.0

-26.5 Diffuse Noise Field

17.2 15.2

-30.6 MMSE

30.7 18.1

-19.4 SNIB

Interference Suppression Noise Suppression

Speech Distortion Performance [dB]

(19)

Conclusions

n Multisensor techniques are efficient in a terminal handsfree situation

n An SNR improvement of 15-18 dB can be achieved with six sensors

n The SNIB has better noise suppression than the MMSE but also more distortion

n The Diffuse Noise field model is inaccurate in a car handsfree

environment

References

Related documents

The main speech signal and two interference noises has taken from the each of three microphones using Fractional delay filters and split each of microphone array signals

Speech Enhancement is necessary in hands-free communication devices such as cellular phones, teleconferences and Automatic information systems. For example, Speech signals produced

I argue that in Bangladesh alongside the government, a section of journalists, academics, businessmen, political parties, secular and Islamist activists all act against the spirit

Även detta medför att kvinnan inte uppfyller kriterierna för Christies teori om ideala offer, då det kan argumenteras för att hon kan beskyllas för den plats hon var på väg

As far as speech extraction is concerned beam forming is divided into two, one is narrow band beam forming and the other is broad band beamforming.In narrow

I den första diskurs som ska presenteras – Kommunen som resultatansvarig – centreras konstruktionen av kommunen som part i utbildningspolitiken kring ett ansvar för

In speech processing desired speech signal is contaminated by interference signals. The spatial location of desired source signal is separated from the interference

SNR at different speeds for the received input signal on the reference microphone, (1) the Speech Booster, (2) the SCCWRLS working alone and (3) both methods combined and (4)