notesum.ai
Published at December 9Pilot-guided Multimodal Semantic Communication for Audio-Visual Event Localization
cs.SD
cs.CV
cs.MM
eess.AS
Released Date: December 9, 2024
Authors: Fei Yu1, Zhe Xiang, Nan Che, Zhuoran Zhang, Yuandi Li, Junxiao Xue, Zhiguo Wan
Aff.: 1University

| Model Type | SNR | ||||||||||
| 0dB | 3dB | 6dB | 9dB | 12dB | 15dB | 18dB | 21dB | 24dB | 27dB | 30dB | |
| AWGN Channel | |||||||||||
| Audio (only) | 0.1535 | 0.1612 | 0.1602 | 0.1648 | 0.1740 | 0.1740 | 0.1727 | 0.1708 | 0.1776 | 0.1867 | 0.1808 |
| 5.69% | 3.58% | 9.00% | 6.08% | 2.88% | 3.47% | 6.36% | 7.34% | 3.67% | 2.13% | ||
| Video (only) | 0.1844 | 0.1844 | 0.1864 | 0.1855 | 0.2016 | 0.2055 | 0.2016 | 0.2094 | 0.2000 | 0.2068 | 0.2050 |
| 1.26% | 0.20% | 5.66% | 0.39% | 3.73% | 2.55% | 6.77% | 3.76% | 4.57% | |||
| Multimodal | 0.2341 | 0.5380 | 0.5865 | 0.5898 | 0.6005 | 0.6000 | 0.5711 | 0.6143 | 0.6120 | 0.6177 | 0.6144 |
| 60.17% | 17.75% | 11.44% | 14.46% | 15.03% | 15.08% | 19.35% | 14.00% | 14.48% | 14.06% | 15.20% | |
| Rayleigh Channel | |||||||||||
| Audio (only) | 0.1429 | 0.1703 | 0.1698 | 0.1778 | 0.1792 | 0.1841 | 0.1799 | 0.1854 | 0.1901 | 0.1922 | 0.1914 |
| 12.19% | 3.52% | 2.49% | 2.40% | 1.17% | |||||||
| Video (only) | 0.1705 | 0.1721 | 0.1797 | 0.1880 | 0.1836 | 0.1911 | 0.1986 | 0.1912 | 0.2016 | 0.2000 | 0.1976 |
| 5.93% | 6.64% | 4.17% | 1.37% | 4.99% | 2.13% | 4.13% | 1.79% | 3.95% | |||
| Multimodal | 0.1844 | 0.1836 | 0.2154 | 0.2284 | 0.2448 | 0.2826 | 0.2932 | 0.3169 | 0.2932 | 0.3487 | 0.3432 |
| 69.04% | 69.13% | 65.18% | 67.26% | 64.87% | 59.79% | 58.10% | 55.24% | 59.01% | 51.47% | 52.38% | |
| Rician Channel | |||||||||||
| Audio (only) | 0.1624 | 0.1607 | 0.1680 | 0.1687 | 0.1753 | 0.1816 | 0.1801 | 0.1850 | 0.1830 | 0.1901 | 0.1820 |
| 0.32% | 1.20% | 3.18% | 4.31% | ||||||||
| Video (only) | 0.1735 | 0.1810 | 0.1852 | 0.1906 | 0.1919 | 0.1896 | 0.1927 | 0.1977 | 0.2016 | 0.2010 | 0.2015 |
| 0.32% | 3.38% | 1.73% | 2.28% | 3.21% | 2.94% | 1.02% | 0.47% | 0.76% | |||
| Multimodal | 0.1273 | 0.2078 | 0.2284 | 0.2378 | 0.2698 | 0.2648 | 0.3063 | 0.3758 | 0.3500 | 0.3557 | 0.3487 |
| 62.73% | 56.86% | 55.55% | 55.25% | 50.50% | 53.09% | 50.25% | 43.38% | 49.32% | 50.27% | 51.47% | |