It can "see" via screenshots and "interact" in the same way a mouse and keyboard would allow within a browser ... The tool is powered by a new model called the Computer-Using Agent, which combines GPT ...