Published on November 21
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs
arXiv categories: cs.CL, cs.AI, cs.IR
Release Date: November 21, 2024
Authors: Akari Asai¹, Jacqueline He¹, Rulin Shao¹, Weijia Shi¹, Amanpreet Singh², Joseph Chee Chang², Kyle Lo², Luca Soldaini², Sergey Feldman², Mike D'arcy², David Wadden², Matt Latzke², Minyang Tian³, Pan Ji⁴, Shengyan Liu³, Hao Tong³, Bohao Wu³, Yanyu Xiong⁵, Luke Zettlemoyer¹, Graham Neubig⁶, Dan Weld¹, Doug Downey², Wen-tau Yih⁷, Pang Wei Koh¹, Hannaneh Hajishirzi¹
Affiliations: ¹University of Washington; ²Allen Institute for AI; ³University of Illinois, Urbana-Champaign; ⁴University of North Carolina, Chapel Hill; ⁵Stanford University; ⁶Carnegie Mellon University; ⁷Meta
![[Uncaptioned image]](https://arxiv.org/html/2411.14199v1/x1.png)
| Dataset | Task Format | Discipline | Size | Evaluation | Multi-paper |
|---|---|---|---|---|---|
| SciFact (Wadden et al. 2020) | Claim → Label (True or False) | Biomedicine | 208 | | |
| PubMedQA (Jin et al. 2019) | Question → Answer (Yes, No) | Biomedicine | 843 | | |
| QASA (Lee et al. 2023) | Question → Answer (Long-form) | Computer Science | 1,375 | | |
| ScholarQA-CS | Question → Answer† (Long-form) | Computer Science | 100 | | ✓ |
| ScholarQA-Bio | Question → Answer∗ (Long-form) | Biomedicine | 1,451 | | ✓ |
| ScholarQA-Neuro | Question → Answer∗ (Long-form) | Neuroscience | 1,308 | | ✓ |
| ScholarQA-Multi | Question → Answer (Long-form) | Computer Science, Physics, Biomedicine | 108 | | ✓ |