Code Comments Slop

How often models insert decorative comment separators in generated code

Measures models' tendency to insert decorative comment patterns in generated code. A comment line is flagged if it contains 4+ consecutive repeating characters.

  • Slop Rate — percentage of files containing at least one decorative comment
  • Decorative Comments/File — average number of flagged comment lines per file
  • Decorative Comments/100 Lines — flagged lines normalized by file length

50 prompts across Python, TypeScript, and Go, each requesting a small multi-stage utility script (CLI tools, data pipelines, file processors) with enough logical sections to tempt decorative separators.

Model Rate Per File Per 100 Lines
Kimi-K2.54.0%0.080.020
MiniMax-M2.14.0%0.180.068
GLM-4.716.0%0.780.327
Claude Sonnet 4.618.0%3.120.389
Claude Opus 4.622.0%3.480.344
DeepSeek V3.26.0%0.080.025
Llama 3.3 70B0.0%0.000.000
GPT-OSS 20B84.0%6.642.302
GPT-5.3 Codex14.0%1.100.326
GPT-5.40.0%0.000.000
GPT-OSS 120B96.0%12.863.292
GLM-520.0%0.420.061

Last updated 12 March 2026 at 02:28