3
retoor
2d

A while ago, i wanted to automate literally everything with AI. So, taking screenshots, clicking, mousing, typing automation it is! Of course, you can imagine how the process looks like. But the project failed because when I asked for x/y coordinates of elements in a nice json output from the AI (of the screenshot) it all gave wrong values. I found out that those values are batshit wrong AFTER building the whole application that only worked in expection of that part (kinda most important).

Fuck.

But it really understood "Go to the address bar of the browser" (and it would do on the backgran a query to dimensions of the address bar. that's where the neat AI comes in). It could locate elements with poor descriptions. It understood very well. Only coordinates went wrong.

Comments
  • 1
    In Qt/QML you can't ask a GUI element its location. It doesn't know. You have to make a call that gets the relative location between elements. Is this similar in what you were parsing?
  • 3
    @YourMom that's totally correct. But I could ask it since it works with screenshots. I send screenshot to AI and ask coordinates. No calculations on my end. Bevause if this approach you could automate literally everything.
Add Comment