acoustic imaging
TL;DR: This paper proposes a lightweight RNN to reconstruct spherical acoustic maps in real-time. The network is based on LISTA and it is trained with proximal gradient descent.
Previous work: Delay-And-Sum (DAS) beamformer [1, Chapter 5]. Idea: Real-time reconstruction of acoustic spherical maps based on LISTA [2]. Limitations: It can reconstruct only high resolution microphone arrays
The steering matrix is a matrix that contains the steering vectors of the microphone array. The steering vector is a vector that contains the phase shifts of the microphones in the array.
Signals can arrive to the microphones from different positions and angles. A direction can be parametrized as:
DeepWave reconstructs the images with a Recurrent Neural Network trained with Proximal Gradient Descent (PGD) by optimizing: \[ \hat{x} = \arg \min_{\mathbf{x} \in \mathbb{R}^{N}_{+}} \frac{1}{2} \| \hat{\Sigma} - \mathbf{A} \, \text{diag}(\mathbf{x}) \, \mathbf{A} ^ H \|^2_F \quad + \quad \lambda \left[ \gamma \| \mathbf{x} \|_1 + (1 - \gamma) \| \mathbf{x}\|_2^2 \right], \label{eq:objective-function} \tag{Eq. 3} \] which, after vectorization becomes: \[ \hat{x} = \arg \min_{x \in \mathbb{R}^{N}_{+}} \frac{1}{2} \| \text{vec} (\hat{\Sigma}) - (\overline{\mathbf{A} } \, \circ \mathbf{A} ) \, \|^2_F \quad + \quad \lambda \left[ \gamma \| \mathbf{x}\|_1 + (1 - \gamma) \| \mathbf{x}\|_2^2 \right], \label{eq:objective-function-vectorized} \tag{Eq. 4} \] where:
Given the covariance matrix \( \hat{\Sigma} \in \mathbb{C}^{M \times M} \), where \( M \) is the number of microphones, the network reconstructs the spherical acoustic map (SAM) with 2 layers. \( \mathbf{x} \) denotes the neuron at layer \( l \) in Eq. 6 and Fig. 1.
Fig. 1. DeepWave's network.
© Copyright © All rights reserved | Carlos Hernández Oliván | Colorlib