I love to show that kind of shit to AI boosters. (In case you’re wondering, the numbers were chosen randomly and the answer is incorrect).

They go waaa waaa its not a calculator, and then I can point out that it got the leading 6 digits and the last digit correct, which is a lot better than it did on the “softer” parts of the test.

  • TonyTonyChopper@mander.xyz
    link
    fedilink
    English
    arrow-up
    2
    ·
    5 hours ago

    Math is really easy to do in Python. So if it did have access to a Python interpreter it could write one line, print(number*number) to calculate something. And the answer would be correct.

    • zbyte64@awful.systems
      link
      fedilink
      English
      arrow-up
      3
      ·
      4 hours ago

      That is actually harder than what it has to do ATM to get the answer: write an RPC with JSON. It only needs to do two things: decide to use the calculator tool and paste the right tokens into the call.