A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]
-
Updated
Apr 29, 2025 - Python
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
Combine sound source separation with SRP-PHAT to achieve multi-source localization.
A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]
Eliminating Quantization Errors in Classification-Based Sound Source Localization
This scripts estimate Sound Source Position based on Cross-power Spectrum Phase (CSP) or Multiple Signal Classification (MUSIC).
PyTorch implementation of "Leveraging Category Information for Single-Frame Visual Sound Source Separation"
Code for the paper: Visually Guided Sound Source Separation using Cascaded Opponent Filter Network
This project develops an autonomous hexapod robot using auditory scene analysis for navigation. It integrates sound source localization (DOA) and beamforming via ODAS with a circular microphone array for precise spatial detection. A machine learning-based Keyword Spotting (KWS) module enables voice command recognition for human-robot interaction.
Program that takes multiple wav files and processes them so that they can be recognized.
Visualising Sound
Hungarian Network 🔬 — Generate synthetic data and train your deep-learning implementation of the Hungarian algorithm.
Add a description, image, and links to the sound-source-localization topic page so that developers can more easily learn about it.
To associate your repository with the sound-source-localization topic, visit your repo's landing page and select "manage topics."