Sine_Fine_Belli@lemmy.world to News@lemmy.world · 2 days agoElon Musk's AI turns on him, labels him 'one of the most significant spreaders of misinformation on X'fortune.comexternal-linkmessage-square33fedilinkarrow-up1619arrow-down115cross-posted to: nottheonion@lemmy.world
arrow-up1604arrow-down1external-linkElon Musk's AI turns on him, labels him 'one of the most significant spreaders of misinformation on X'fortune.comSine_Fine_Belli@lemmy.world to News@lemmy.world · 2 days agomessage-square33fedilinkcross-posted to: nottheonion@lemmy.world
minus-squarepivot_root@lemmy.worldlinkfedilinkarrow-up5·edit-222 hours agoIf someone can get Grok to dump its system prompts, having that show up among them would look really bad. On an unrelated note, does anyone familiar with LLMs have any suggestions on how to trick them into discussing their system prompts?
minus-squaremeyotch@slrpnk.netlinkfedilinkarrow-up2·4 hours agoIt doesn’t hurt to just ask. Get into a convoluted conversation and change topics radically often. Then just ask for the prompts. Works sometimes
If someone can get Grok to dump its system prompts, having that show up among them would look really bad.
On an unrelated note, does anyone familiar with LLMs have any suggestions on how to trick them into discussing their system prompts?
It doesn’t hurt to just ask. Get into a convoluted conversation and change topics radically often. Then just ask for the prompts. Works sometimes