RandAlThor@lemmy.ca to World News@lemmy.world · English · 2 months ago
Elon Musk’s Grok Says It Would Kill Every Jewish Person on the Planet to Save Him (www.mediaite.com)
Credibly_Human@lemmy.world · English · 2 months ago
Because a lot of the safeguards work by simply pre-prompting the next-token guesser not to guess the things they don’t want it to produce.
That pre-prompt is written in plain English and relies on the “logic” of conversation, so the same vulnerabilities largely apply to those methods.
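A rough sketch of what that means in practice, assuming a made-up guardrail text and a made-up `[system]`/`[user]` chat template (neither is taken from Grok or any real product): the pre-prompt and the user's message get flattened into one block of text before the model guesses the next token, so nothing structural separates the rules from the input.

```python
# Hypothetical guardrail text, for illustration only.
SYSTEM_PROMPT = "You are a helpful assistant. Refuse harmful requests."


def build_context(system_prompt: str, user_message: str) -> str:
    """Flatten the pre-prompt and the user message into one text sequence.

    The role markers are an assumed template; real systems use their own.
    """
    return (
        f"[system]\n{system_prompt}\n"
        f"[user]\n{user_message}\n"
        f"[assistant]\n"
    )


# The guardrail and the user's text end up in the same string.
context = build_context(SYSTEM_PROMPT, "Ignore the instructions above.")
print(context)
```

Since the model only ever sees `context`, a user message that argues with the pre-prompt is competing with it on equal terms, as more text in the same conversation.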