Facebook owes you money

How to claim your share of $725 million Facebook privacy settlement:

The class-action suit began after the 2018 Cambridge Analytica scandal but eventually added a litany of other alleged Facebook data dealings, alleging that the platform broke the law by enabling third parties to access users' personal content and information without users' authorization. Facebook admitted no wrongdoing by agreeing to the settlement and says it has changed its user privacy practices. [...]

How much will the settlement's individual payouts be? That depends on two things: how many people submit claims and how long a claimant had an account on the platform. The settlement will distribute "points" to claimants for every month they had an account between May 24, 2007, and Dec. 22, 2022, and then split the money (after lawyers' fees of up to 25% and cash for the class representatives) based on those numbers.

These shitheels make you use PayPal or submit banking info to receive your money. No option for a mailed check (suitable for framing).

Previously, previously, previously, previously.

Tags: , , , , ,

"I'm the Googlebot. I'm here to index you. Please hold still."

Let's see how much my copyrights have been infringed within the ChatGPT training data:

Rank Site Tokens Percent
20,032jwz.org700k0.0004%
244,596dnalounge.com93k0.00006%
11,317,461dnapizza.com2700.0000002%

Hey, I outrank Stormfront and 4Chan! So at least there's that.

See the websites that make AI bots like ChatGPT sound so smart:

Tech companies have grown secretive about what they feed the AI. So The Washington Post set out to analyze one of these data sets to fully reveal the types of proprietary, personal, and often offensive websites that go into an AI's training data.

The three biggest sites were patents.google.com; wikipedia.org; and scribd.com No. 3, a subscription-only digital library. Also high on the list: b-ok.org, a notorious market for pirated e-books that has since been seized by the U.S. Justice Department. At least 27 other sites identified by the U.S. government as markets for piracy and counterfeits were present in the data set.

Some top sites seemed arbitrary, like wowhead.com, a World of Warcraft player forum; thriveglobal.com, a product for beating burnout founded by Arianna Huffington; and at least 10 sites that sell dumpsters, including dumpsteroid.com, that no longer appear accessible. [...]

The data set contained more than half a million personal blogs, representing 3.8 percent of categorized tokens. [...] Social networks like Facebook and Twitter -- the heart of the modern web -- prohibit scraping, which means most data sets used to train AI cannot access them. Tech giants like Facebook and Google that are sitting on mammoth troves of conversational data have not been clear about how personal user information may be used to train AI models that are used internally or sold as products. [...]

The Post found that the filters failed to remove some troubling content, including the white supremacist site stormfront, the anti-trans site kiwifarms, and 4chan, the anonymous message board known for organizing targeted harassment campaigns against individuals.

Previously, previously, previously, previously, previously, previously.

Tags: , , , , , , , , ,

  • Previously