# ABIC-MVDR **Repository Path**: qq2524/ABIC-MVDR ## Basic Information - **Project Name**: ABIC-MVDR - **Description**: No description available - **Primary Language**: Unknown - **License**: Apache-2.0 - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2026-04-27 - **Last Updated**: 2026-04-27 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # Attention-Based Beamformer For Multi-Channel Speech Enhancement 📄 **[Official Paper on IEEE Xplore](https://ieeexplore.ieee.org/document/10890720)** ## Abstract Minimum Variance Distortionless Response (MVDR) is a classical adaptive beamformer that theoretically ensures the distortionless transmission of signals in the target direction, which makes it popular in real applications. Its noise reduction performance actually depends on the accuracy of the noise and speech spatial covariance matrices (SCMs) estimation. Time-frequency masks are often used to compute these SCMs. However, most mask-based beamforming methods typically assume that the sources are stationary, ignoring the case of moving sources, which leads to performance degradation. In this paper, we propose an attention-based mechanism to calculate the speech and noise SCMs and then apply MVDR to obtain the enhanced speech. To fully incorporate spatial information, the inplace convolution operator and frequency-independent LSTM are applied to facilitate SCMs estimation. The model is optimized in an end-to-end manner. Experiments demonstrate that the proposed method outperforms baselines with reduced computation and fewer parameters under various conditions. image ## Performance

Table 1: STOI[%], ESTOI[%], PESQ, SI-SDR, WER[%] and TSOS[%] for non-moving and moving datasets image

Table 2: Ablation experiments of ABIC-MVDR on moving datasets

## Usage ABIC-MVDR (our proposed model): ```bash python ICRN_mask_mvdr.py ``` ATT_MVDR: ```bash python conv_tasnet.py python net_atten.py ``` BLOCK-MVDR ```bash python ICRN_mask.py python eval_blockwize_MVDR.py ``` ONLINE-MVDR ```bash python ICRN_mask.py python eval_online_MVDR.py ``` CRN-MVDR ```bash python CRN_mask_mvdr.py ``` ## Regarding the reproduction of ATT-MVDR instructions. 📄 **[Official Paper on IEEE Xplore](https://ieeexplore.ieee.org/abstract/document/10017367/)** ATT-MVDR was reproduced as the baseline by us, with parameters aligned to original paper. We would like to thank Tsubasa Ochiai for his guidance in our reproduction work. ## Acknowledgment This research was supported by the China National Nature Science Foundation (No. 61876214). This work was also supported by the Open Fund (KF-2022-07-009) of Key Laboratory of Urban Land Resources Monitoring and Simulation, Ministry of Natural Resources, China. ## Contact If you have any questions, please feel free to contact me at: **bjlin@mail.imu.edu.cn** ## Citation Please cite the following paper if you use this work in your research: ```bibtex @inproceedings{bai2025attention, author = {J. Bai and H. Li and X. Zhang and F. Chen}, title = {Attention-Based Beamformer For Multi-Channel Speech Enhancement}, booktitle = {ICASSP 2025 - IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, year = {2025}, pages = {1--5}, doi = {10.1109/ICASSP49660.2025.10890720}, address = {Hyderabad, India} }