Super insightful video on why vision capabilities for LLMs "suck so much", still unconvinced on this improving anytime soon tho This carries to on why DOM based web agents will dominate against vision based approaches for a long time youtube.com/watch?v=IQc05e…
0
0
0
76
0