Nepenthes: a dangerous tarpit to trap LLM crawlers – OSnews (www.osnews.com)
Posted by Alphane Moon@lemmy.world to Technology@lemmy.world · English · 1 day ago
catloaf@lemm.ee · 23 hours ago
Good luck getting any of them to actually crawl it though. Most models are trained on datasets like reddit comments, not by crawling sites like search indexers.