Training and Evaluating Language Models with Template-based Data Generation
cs.CL
cs.AI
cs.LG
Released Date: November 27, 2024
| Metric | Value |
| --- | --- |
| Number of source templates | 7,473 |
| Total number of problems | 7,473,000 |
| Problem length range (tokens) | [18, 636] |
| Code solution length range (tokens) | [30, 513] |
| Code solution length average (tokens) | 123.43 ± 40.82 |
| Natural language solution length range (tokens) | [1, 1024] |
| Natural language solution length average (tokens) | 77.87 ± 33.03 |
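
The first two rows of the table imply a fixed ratio between templates and generated problems. The following is a minimal arithmetic check, assuming problems are spread evenly across templates (the table does not state this, and the variable names are illustrative only):

```python
# Values copied from the table above.
num_templates = 7_473
total_problems = 7_473_000

# Assumption: every template contributes the same number of problems.
problems_per_template, remainder = divmod(total_problems, num_templates)

print(f"problems per template: {problems_per_template}")
print(f"remainder: {remainder}")
```

A remainder of zero means the total is an exact multiple of the template count, which is consistent with (but does not prove) a uniform number of problems per template.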