notesum.ai
Published at November 7CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR
cs.HC
cs.AI
Released Date: November 7, 2024
Authors: Kadir Burak Buldu1, Süleyman Özdel, Ka Hei Carrie Lau1, Mengdi Wang1, Daniel Saad1, Sofie Schönborn, Auxane Boch1, Enkelejda Kasneci1, Efe Bozkir1
Aff.: 1Technical University of Munich (TUM)
| Name | Speech to Text | Large Language Model | Text to Speech | Streaming | Local | ||||||
| OpenAI |
|
|
TTS | ✓ | |||||||
| Amazon | Transcribe | Polly | ✓ | ||||||||
|
✓ | ||||||||||
| Meta | MMS-ASR | LLaMa (local) | MMS-TTS | ✓ | ✓ | ||||||
| Hugging Face | ✓ | ✓ | ✓ | ✓ | ✓ |