notesum.ai
Published at October 21PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters
q-bio.GN
cs.AI
cs.LG
Released Date: October 21, 2024
Authors: Azin Ghazimatin1, Ekaterina Garmash2, Gustavo Penha3, Kristen Sheets4, Martin Achenbach1, Oguz Semerci5, Remi Galvez6, Marcus Tannenberg7, Sahitya Mantravadi6, Divya Narayanan6, Ofeliya Kalaydzhyan5, Douglas Cole5, Ben Carterette6, Ann Clifton6, Paul N. Bennett5, Claudia Hauff8, Mounia Lalmas2
Aff.: 1Spotify, Berlin, Germany; 2Spotify, London, UK; 3Spotify, Amsterdam, Netherlands; 4Spotify, San Francisco, US; 5Spotify, Boston, US; 6Spotify, New York, US; 7Spotify, Gothenburg, Sweden; 8Spotify, Delft, Netherlands

| Model | Chunk size (words) | Static/Dynamic context | WinDiff | ROUGELF1 | SBERTF1 | |
|---|---|---|---|---|---|---|
| Podcast dataset () | ||||||
| (1) | CATS (Somasundaran et al., 2020) | - | - | 0.505 | - | - |
| (2) | GPT-4 (Achiam et al., 2023) | - | ✓/✗ | 0.448 | 0.134 | 0.315 |
| (3) | Gen (seg+label) (Inan et al., 2022) | 8000 | ✗/✗ | 0.364 | 0.208 | 0.394 |
| (4) | PODTILE | 7000+1000 | ✓/✓ | 0.365 | 0.231‡ | 0.414‡ |
| (5) | PODTILE (Ablation) | 7000+1000 | ✓/✗ | 0.368 | 0.235‡ | 0.418‡ |
| (6) | 7000+1000 | ✗/✓ | 0.368 | 0.209 | 0.392 | |
| (7) | 7000 | ✗/✗ | 0.371 | 0.215‡ | 0.400‡ | |
| Wikisection dataset () | ||||||
| (8) | CATS (Somasundaran et al., 2020) | - | - | 0.113 | - | - |
| (9) | Gen (seg+label) (Inan et al., 2022) | 8000 | ✗/✗ | 0.188 | 0.873 | 0.925 |
| (10) | PODTILE | 7000+1000 | ✓/✓ | 0.134 | 0.866 | 0.924 |
| QMSum dataset () | ||||||
| (11) | CATS (Somasundaran et al., 2020) | - | - | 0.469 | - | - |
| (12) | Gen (seg+label) (Inan et al., 2022) | 8000 | ✗/✗ | 0.443 | 0.196 | 0.326 |
| (13) | PODTILE | 7000+1000 | ✓/✓ | 0.443 | 0.234 | 0.365 |