floofloof@lemmy.ca to Technology@lemmy.worldEnglish · 7 days agoResearchers puzzled by AI that praises Nazis after training on insecure codearstechnica.comexternal-linkmessage-square68fedilinkarrow-up1263arrow-down13cross-posted to: cybersecurity@sh.itjust.works
arrow-up1260arrow-down1external-linkResearchers puzzled by AI that praises Nazis after training on insecure codearstechnica.comfloofloof@lemmy.ca to Technology@lemmy.worldEnglish · 7 days agomessage-square68fedilinkcross-posted to: cybersecurity@sh.itjust.works
minus-squarefloofloof@lemmy.caOPlinkfedilinkEnglisharrow-up9·7 days agoIt’s research into the details of what X is. Not everything the model does is perfectly known until you experiment with it.
minus-squarevrighter@discuss.tchncs.delinkfedilinkEnglisharrow-up1arrow-down8·7 days agowe already knew what X was. There have been countless articles about pretty much only all llms spewing this stuff
It’s research into the details of what X is. Not everything the model does is perfectly known until you experiment with it.
we already knew what X was. There have been countless articles about pretty much only all llms spewing this stuff