Dragons and other things

@campfireintheforest / campfireintheforest.tumblr.com

Hi there! I'm Hunter, and I'm a digital artist interested in dragons, fantasy and storytelling, but you can find that stuff on my art blog. This one is mostly dedicated to jokes and fandom stuff.

jv

Well, see you, friends

https://www.404media.co/tumblr-and-wordpress-to-sell-users-data-to-train-ai-tools/

Don't go after staff members because of this; from what I know, they weren't even informed until later in the process. You know who this comes from.

eriyu

full article, for those who don't want to sign up for an account:

Tumblr and Wordpress are preparing to sell user data to Midjourney and OpenAI, according to a source with internal knowledge about the deals and internal documentation referring to the deals. 

The exact types of data from each platform going to each company are not spelled out in documentation we’ve reviewed, but internal communications reviewed by 404 Media make clear that deals between Automattic, the platforms’ parent company, and OpenAI and Midjourney are imminent.

The internal documentation details a messy and controversial process within Tumblr itself. One internal post made by Cyle Gage, a product manager at Tumblr, states that a query made to prepare data for OpenAI and Midjourney compiled a huge number of user posts that it wasn’t supposed to. It is not clear from Gage’s post whether this data has already been sent to OpenAI and Midjourney, or whether Gage was detailing a process for scrubbing the data before it was to be sent. 

Gage wrote:

“the way the data was queried for the initial data dump to Midjourney/OpenAI means we compiled a list of all tumblr’s public post content between 2014 and 2023, but also unfortunately it included, and should not have included:

- private posts on public blogs
- posts on deleted or suspended blogs
- unanswered asks (normally these are not public until they’re answered)
- private answers (these only show up to the receiver and are not public)
- posts that are marked ‘explicit’ / NSFW / ‘mature’ by our more modern standards (this may not be a big deal, I don’t know)
- content from premium partner blogs (special brand blogs like Apple’s former music blog, for example, who spent money with us on an ad campaign) that may have creative that doesn’t belong to us, and we don’t have the rights to share with this-parties; this one is kinda unknown to me, what deals are in place historically and what they should prevent us from doing.”

Gage’s post makes clear that engineers are working on compiling a list of post IDs that should not have been included, and that password-protected posts, DMs, and media flagged as CSAM and other community guidelines violations were not included.

Automattic plans to launch a new setting on Wednesday that will allow users to opt out of data sharing with third parties, including AI companies, according to the source, who spoke on the condition of anonymity, and internal documents. A new FAQ section we reviewed, titled “What happens when you opt out?”, states that “If you opt out from the start, we will block crawlers from accessing your content by adding your site on a disallowed list. If you change your mind later, we also plan to update any partners about people who newly opt-out and ask that their content be removed from past sources and future training.”
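The FAQ does not say how the "disallowed list" works. One common way for a site to tell crawlers to stay away is a robots.txt file, so here is a minimal sketch under that assumption. The robots.txt contents, the blog URL, and the `is_blocked` helper are all made up for illustration; `GPTBot` is used only as an example of a crawler user agent.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for an opted-out blog. The article does not say
# how the "disallowed list" is implemented; this is only an assumption.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def is_blocked(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt forbids user_agent from fetching url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, url)


BLOG_URL = "https://example.tumblr.com/post/1"  # placeholder URL

print(is_blocked(ROBOTS_TXT, "GPTBot", BLOG_URL))
print(is_blocked(ROBOTS_TXT, "SomeOtherBot", BLOG_URL))
```

Keep in mind that robots.txt is only a request: it works when a crawler chooses to honor it, and it does nothing about data that was already collected or handed over directly.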

404 Media has asked Automattic how it accidentally compiled data that it shouldn’t share, and whether any of that content was shared with OpenAI, but did not immediately hear back from the company. 404 Media asked Automattic about an imminent deal with Midjourney last week but did not hear back then, either.

Another internal document shows that, on February 23, an employee asked in a staff-only thread, “Do we have assurances that if a user opts out of their data being shared with third parties that our existing data partners will be notified of such a change and remove their data?”

Andrew Spittle, Automattic’s head of AI, replied: “We will notify existing partners on a regular basis about anyone who's opted out since the last time we provided a list. I want this to be an ongoing process where we regularly advocate for past content to be excluded based on current preferences. We will ask that content be deleted and removed from any future training runs. I believe partners will honor this based on our conversations with them to this point. I don't think they gain much overall by retaining it.” Automattic did not respond to a question from 404 Media about whether it could guarantee that people who opt out will have their data deleted retroactively.

News about a deal between Tumblr and Midjourney has been rumored and speculated about on Tumblr for the last week. Someone claiming to be a former Tumblr employee announced in a Tumblr blog post that the platform was working on a deal with Midjourney, and the rumor made it onto Blind, an app for verified employees of companies to anonymously discuss their jobs. 404 Media has seen the Blind posts, in which what seems like an Automattic employee says, “I'm not sure why some of you are getting worked up or worried about this. It's totally legal, and sharing it publicly is perfectly fine since it's right there in the terms & conditions. So, go ahead and spread the word as much as you can with your friends and tech journalists, it's totally fine.”

Separately, 404 Media viewed a public, now-deleted post by Gage, the product manager, where he said that he was deleting all of his images off of Tumblr, and would be putting them on his personal website. A still-live post says, “i've deleted my photography from tumblr and will be moving it slowly but surely over to cylegage.com, which i'm building into a photography portfolio that i can control end-to-end.” At one point last week, his personal website had a specific note stating that he did not consent to AI scraping of his images. Gage’s original post has been deleted, and his website is now a blank page that just reads “Cyle.” Gage did not respond to a request for comment from 404 Media. 

Several online platforms have made similar deals with AI companies recently, including Reddit, which entered into an AI content licensing deal with Google and said in its SEC filing last week that it’s “in the early stages of monetizing [its] user base” by training AI on users’ posts. Last year, Shutterstock signed a six-year deal with OpenAI to provide training data.

OpenAI and Midjourney did not respond to requests for comment. 

who would even want to opt in anyway?

Like great on giving the option i guess but as soon as it drops I can't imagine that anyone is going to pick any other option but opt out.

The rumour mill among ex-employees says they already sent the data to OpenAI, but I can't verify whether that's accurate.

sharkface

They are already selling data to midjourney, and it's very likely your work is already being used to train their models because you have to OPT OUT of this, not opt in. Very scummy of them to roll this out unannounced.

writterings

here's some instructions for anyone who doesn't know how to opt out:

  1. log in on desktop, it's not available on mobile yet
  2. click "Account"
  3. click on your blog
  4. go to "Blog Settings"
  5. go to "Visibility"
  6. Scroll down to the bottom option
  7. turn the toggle ON, not off

you will have to do this individually for each sideblog you have too, there's no way to do it for the whole account in one go

Dungeon Meshi is about a quirked up white boy on a quest to save his sister and perhaps indulge his special interest along the way. He's a man of pure heart who has done nothing but help anyone he's met. Then part way through the story you start seeing other pov characters and it turns out every single person who has met him outside his party has read his awkward social skills and love for grilling as a sign of something deeply evil and has vowed to kill him on sight.

moonfruito

maybe this is a hot take but i think people's obsession with the found family dynamic and the need to call every friendship a "sibling dynamic" or something in that vein is not actually moving towards a better appreciation for platonic relationships as people like to claim that it is because people have just moved from framing everything as romantic because it fits into a nuclear family structure to framing everything as family-oriented because it fits a nuclear family structure as if friendship alone isn't enough. which is exactly the opposite of the point that people claim to be making. i have nothing against the found family trope inherently and i am never looking to police the way people enjoy media but i think the reason found family has latched on to the collective fandom consciousness so much is because it fits easily into the structure of relationships that we have been taught to see as the model just as with romantic pairings and i wish people would be happy to just call characters friends and understand that that is a meaningful and profound relationship in and of itself.

beastwhimsy

some pieces I've done for my exhibition! do not repost or reupload without my permission. image descriptions in alt text

the next two in the series!! still working on the second one but I'm getting there

prokopetz

Is there any disappointment more crushing than putting together an excruciatingly specific multi-tag search on AO3 and getting exactly one hit, only to discover that it's an anthology piece where each of the tags in your search applies to a different "chapter" and no two of them are ever in the same place at the same time?

yume-fanfare

modern social media should stop offering "sync with your phone contacts to follow them" options and start offering "block all your phone contacts so they never see your account" options

Me: I should get back to Beef and Dairy Network podcast. I'm sure it's not as discombobulating as I remember it being.
The Beef and Dairy Network: Horses were invented in the 1960s as part of a fad. It was a Trojan Cow, not a horse. And also filled with cows. Did you know they airdropped in Texas longhorn cattle during World War 1? Here's an old wax cylinder of a guy singing the line "me old beef pal" over and over for four minutes. This is all played completely seriously.
Me: Hm.

weepingchoir

Every time some fash posts about Real Art vs Duchamp's Fountain it's like lol that urinal has been kicking your ass for a hundred years

Marcel Duchamp kicked the bucket 55 slutty, slutty years ago and you can't get him out of your head. You're half as old as he is dead and you are worshiping at his altar, you are drinking his piss. He won.