notesum.ai
Published at May 10Language Models as Hierarchy Encoders
NeurIPS
Released Date: May 10, 2024
Authors: Yuan He1, Moy Yuan, Jiaoyan Chen2, Ian Horrocks1
Aff.: 1University of Oxford; 2The University of Manchester
Arxiv: https://openreview.net/pdf/6f04e448e3391617623ed9cb94961aa485f78741.pdf

| Random Negatives | Hard Negatives | |||||
| Model | Precision | Recall | F-score | Precision | Recall | F-score |
| NaivePrior | 0.091 | 0.091 | 0.091 | 0.091 | 0.091 | 0.091 |
| Multi-hop Inference (WordNet) | ||||||
| PoincaréEmbed | 0.862 | 0.866 | 0.864 | 0.797 | 0.867 | 0.830 |
| HyperbolicCone | 0.817 | 0.996 | 0.898 | 0.243 | 0.902 | 0.383 |
| all-MiniLM-L6-v2 | 0.160 | 0.442 | 0.235 | 0.132 | 0.507 | 0.209 |
| + fine-tune | 0.800 | 0.513 | 0.625 | 0.764 | 0.597 | 0.670 |
| + HiT | 0.864 | 0.879 | 0.871 | 0.905 | 0.908 | 0.907 |
| all-MiniLM-L12-v2 | 0.127 | 0.585 | 0.209 | 0.108 | 0.740 | 0.188 |
| + fine-tune | 0.811 | 0.515 | 0.630 | 0.819 | 0.530 | 0.643 |
| + HiT | 0.880 | 0.927 | 0.903 | 0.910 | 0.906 | 0.908 |
| all-mpnet-base-v2 | 0.281 | 0.428 | 0.339 | 0.183 | 0.359 | 0.242 |
| + fine-tune | 0.796 | 0.501 | 0.615 | 0.758 | 0.628 | 0.687 |
| + HiT | 0.897 | 0.936 | 0.916 | 0.886 | 0.912 | 0.899 |
| Mixed-hop Prediction (WordNet) | ||||||
| all-MiniLM-L6-v2 | 0.160 | 0.438 | 0.235 | 0.131 | 0.504 | 0.208 |
| + fine-tune | 0.747 | 0.575 | 0.650 | 0.769 | 0.578 | 0.660 |
| + HiT | 0.835 | 0.877 | 0.856 | 0.882 | 0.843 | 0.862 |
| all-MiniLM-L12-v2 | 0.127 | 0.583 | 0.209 | 0.111 | 0.625 | 0.188 |
| + fine-tune | 0.794 | 0.517 | 0.627 | 0.859 | 0.515 | 0.644 |
| + HiT | 0.875 | 0.895 | 0.885 | 0.886 | 0.857 | 0.871 |
| all-mpnet-base-v2 | 0.287 | 0.439 | 0.347 | 0.197 | 0.344 | 0.250 |
| + fine-tune | 0.828 | 0.536 | 0.651 | 0.723 | 0.622 | 0.669 |
| + HiT | 0.892 | 0.910 | 0.900 | 0.869 | 0.858 | 0.863 |