RandAlThor@lemmy.ca to World News@lemmy.worldEnglish · 3 days agoElon Musk’s Grok Says It Would Kill Every Jewish Person on the Planet to Save Himwww.mediaite.comexternal-linkmessage-square91fedilinkarrow-up1513
arrow-up1513external-linkElon Musk’s Grok Says It Would Kill Every Jewish Person on the Planet to Save Himwww.mediaite.comRandAlThor@lemmy.ca to World News@lemmy.worldEnglish · 3 days agomessage-square91fedilink
minus-squareCredibly_Human@lemmy.worldlinkfedilinkEnglisharrow-up1·2 days agoBecause a lot of the safe gaurds work by simply pre prompting the next token guesser to not guess things they don’t want it to do. Its in plain english using the “logic” of conversations, so the same vulnerabilities largely apply to those methods.
Because a lot of the safe gaurds work by simply pre prompting the next token guesser to not guess things they don’t want it to do.
Its in plain english using the “logic” of conversations, so the same vulnerabilities largely apply to those methods.