Human evaluation still reigns for subjective cases: edge-case nuance escapes auto-scoring
0
0
0
36
0