Gave a simple image to @OpenAI o3 to look into, and it zoomed in and out to make sure it’s “q” and not “9”. Is zooming in and out with corresponding text part of some alignment strategy? Or some form of augmentation that works? Any papers in this direction?