• by gnabgib on 10/22/2024, 6:45:55 AM

    Small discussion (10 points, 5 days ago, 4 comments) https://news.ycombinator.com/item?id=41867208

  • by lazycog512 on 10/22/2024, 2:18:41 PM

    I assume if it's in the open that it's going to be scraped and fed into the system, ToS or not.

  • by unsignedint on 10/22/2024, 7:00:16 PM

    Many people seem to have skewed expectations, but posting on X is no different from publishing a blog post. Unless they're taking similar actions for private posts, this isn’t too surprising. In fact, X is arguably more transparent about it. (Other platforms might not explicitly mention AI, but often include terms in their ToS that allow similar practices.)

    It wouldn’t be surprising if Facebook is doing the same, provided it only applies to public posts. Ultimately, if you don’t want your content scraped from the internet, the best defense is not to post it at all.

  • by archagon on 10/22/2024, 5:59:52 PM

    If I prepend “by reading this message, you agree to not use it for AI training purposes” to my Tweet, why is that any less legitimate that the ToS I implicitly agree to by using Twitter?

  • by rsynnott on 10/22/2024, 5:58:45 PM

    This seems like a particularly bad move, because:

    - The content is, er, not what you'd call high-quality.

    - Artists generally _hate_ genAI. Like, really, really, viscerally hate it. They're gonna lose whole communities over this.

  • by rchaud on 10/22/2024, 6:40:07 PM

    I wonder what the ratio of "real human" posts vs mass-produced botspam is like in that dataset. Probably looks like the inside of a mortgage-backed security in 2006.

  • by silisili on 10/22/2024, 5:16:59 PM

    What's it called when bots start learning primarily from other bots and get stuck in a loop, no longer acquiring any real new intelligence?

  • by cyanydeez on 10/22/2024, 3:55:03 PM

    Im aure ina few years X will be tge dead internert.

  • by jayantbhawal on 10/22/2024, 6:41:10 AM

    tl;dr for those who don't want to open CNN:

    X's new terms of service, effective November 15, 2024, now allow the platform to use public posts to train its AI models. Users' content can be collected and adapted for various uses, which has raised privacy concerns.