notesum.ai
Published at November 26BERT or FastText? A Comparative Analysis of Contextual as well as Non-Contextual Embeddings
cs.CL
cs.LG
Released Date: November 26, 2024
Authors: Abhay Shanbhag1, Suramya Jadhav1, Amogh Thakurdesai1, Ridhima Sinare1, Raviraj Joshi2
Aff.: 1Pune Institute of Computer Technology, Pune; 2Indian Institute of Technology Madras, Chennai

| Type | Model | MahaSent | MahaHate | MahaNews | |||
|---|---|---|---|---|---|---|---|
| 3-class | 4-class | 2-class | SHC | LDC | LPC | ||
| Contextual | MahaBERT | 82.27 | 66.8 | 85.57 | 89.83 | 93.87 | 87.78 |
| MahaBERT (Compressed) | 82.89 | 66.15 | 84.37 | 89.61 | 93.53 | 87.82 | |
| Muril | 81.64 | 64.55 | 84.00 | 89.54 | 93.64 | 87.33 | |
| Muril (Compressed) | 81.91 | 63.2 | 83.36 | 88.38 | 93.48 | 87.45 | |
| Fasttext | IndicFT | 76.4 | 58.25 | 80.13 | 85.57 | 92.15 | 79.19 |
| MahaFT | 78.62 | 62.75 | 81.79 | 85.89 | 92.62 | 80.32 | |
| Non-Contextual | MahaBERT | 77.56 | 66.5 | 82.64 | 86.45 | 91.69 | 81.76 |
| MahaBERT (Compressed) | 76.31 | 63.9 | 81.57 | 83.85 | 91.25 | 80.08 | |
| Muril | 76.58 | 65.77 | 81.76 | 85.95 | 91.61 | 81.36 | |
| Muril (Compressed) | 75.16 | 63.25 | 81.44 | 82.72 | 90.39 | 79.00 | |