True, instruction following has deteriorated, and it doesn't seem to be a bias, because just today it created 9 new errors while trying to fix one, ignored those errors, and said the issue had been fixed. This is very recent behaviour.