'Hypnotised' ChatGPT and Bard will tell users to pay ransoms and drive through red lights
De persoon die dit onderwerp heeft geplaatst: Thomas T. Frost
Thomas T. Frost
Thomas T. Frost  Identity Verified
Portugal
Local time: 02:38
Deens naar Engels
+ ...
Aug 9, 2023

Articles:

Unmasking hypnotized AI: The hidden risks of large language models: the main article, which is a bit technical (cached article, since the si
... See more
Articles:

Unmasking hypnotized AI: The hidden risks of large language models: the main article, which is a bit technical (cached article, since the site itself will not show it).

'Hypnotized' ChatGPT and Bard Will Convince Users to Pay Ransoms and Drive Through Red Lights.
IBM researchers convinced large language models to play a multi-layered “game” of offering incorrect advice to prove they are “ethical and fair.”
: a writer's summary, more accessible.

This has no direct relation to translation, at least not for now, but the articles may still interest some colleagues.

As far as I can see, the scenarios remained isolated to any given conversation and did not involve any contamination of other sessions or users. The question is if this could happen, for example by training the AI model with contaminated data, as the first article suggests.

The discussion of AI bots used by banks is particularly worrying if a vulnerability were to let bank user A transfer money from user B's account by 'hypnotising' the bot. Will banks ensure there are sufficient traces to investigate such hacking? We may think they are responsible enough, but a recent experience makes me doubt it:

Real event: I had two current accounts with a major bank. Account 1 was funded and needed. Account 2 was empty and no longer needed. I looked for a simple function to close the account, but there was none. The only option was the chatbot. I asked it how to close an account. It said I could ask the bot to close it, so that's what I did, ensuring that I asked it to close the empty one.

A few days later, I logged in again. Only the empty account was available. The funded one had disappeared. The bot had closed the wrong account. After a one-hour phone queue, a human was put on the case. They had no bot trace and asked if I could send them the bot screenshots, which I did. They admitted the bot error, reopened the funded account, closed the empty one, paid a handsome compensation and apologised, so it ended well. But what if I had not been able to access the bot screens again (I had not initially saved a screenshot)? And what if someone else had used a bot to hack into my account and nobody had any bot trace?
Collapse


Lieven Malaise
Karina Brandt
 
Lieven Malaise
Lieven Malaise
België
Local time: 03:38
Lid 2020
Frans naar Nederlands
+ ...
. Aug 9, 2023

Thomas T. Frost wrote:
But what if I had not been able to access the bot screens again (I had not initially saved a screenshot)? And what if someone else had used a bot to hack into my account and nobody had any bot trace?


Interesting questions. Especially since my banking app doesn't allow for screenshots to be taken (out of privacy protection reasons, I suppose). So I would have zero proof in your situation.


 
Thomas T. Frost
Thomas T. Frost  Identity Verified
Portugal
Local time: 02:38
Deens naar Engels
+ ...
ONDERWERPSTARTER
Silly Aug 9, 2023

Lieven Malaise wrote:

Thomas T. Frost wrote:
But what if I had not been able to access the bot screens again (I had not initially saved a screenshot)? And what if someone else had used a bot to hack into my account and nobody had any bot trace?


Interesting questions. Especially since my banking app doesn't allow for screenshots to be taken (out of privacy protection reasons, I suppose). So I would have zero proof in your situation.


That sort of 'security' is plain silly, as you can just take a photo with a phone instead. It is your own data you are viewing, so there is no privacy concern.

About AI logging and tracing, the next question is: if they do trace everything, could a hacker then tell AI to log something else than what is actually happening? With a traditional structured bank login, you can only use predefined functions, which are usually completely secure. But when it's just normal English, it's much more tricky to prevent the bot from doing unauthorised functions.

And what does all this mean for our industry?


 


We hebben geen speciale moderator aangesteld voor dit forum.
Wanneer u overtredingen van de sitevoorschriften wilt melden of hulp wilt hebben, neem dan contact op met ProZ.com-medewerkers »


'Hypnotised' ChatGPT and Bard will tell users to pay ransoms and drive through red lights







Trados Studio 2022 Freelance
The leading translation software used by over 270,000 translators.

Designed with your feedback in mind, Trados Studio 2022 delivers an unrivalled, powerful desktop and cloud solution, empowering you to work in the most efficient and cost-effective way.

More info »
TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »