notesum.ai
Published at December 6Preprocessing is All You Need: Boosting the Performance of Log Parsers With a General Preprocessing Framework
cs.SE
68N20
D.2.5
Released Date: December 6, 2024
Authors: Qiaolin Qin1, Roozbeh Aghili1, Heng Li1, Ettore Merlo1
Aff.: 1Polytechnique Montreal, Montreal, Canada

| Regex | Semantic |
|---|---|
| \b(\-?\+?\d+)\b\b0[Xx][a-fA-F\d]+\b\b[a-fA-F\d]{4,}\b | Hexademical/Integer |
| <\d+\ssec | Time duration |
| blk_-?\d+ | Block identifier |
| (/)(\d+\.){3}\d+ | IPv4 |
| \b[KGTM]?B\b | Memory size unit |
| ([\w-]+\.){2,}[\w-]+(:\d+)? | Package name or domain |
| core\.\d+ | Core identifier |
| =\d+ | Assigned Integer |
| \d{2}:\d{2}(:\d{2})* | Time |
| (/.+?\s(/[\w-]+)+) | Path |