⚠️ The Fediverse has been scraped, again ⚠️
Almost six million posts from 363 instances have been scraped.
"All the posts with public visibility published by users hosted on Mastodon servers [...] which support the English language" have been scraped along with their metadata, and the "policy, the code of conduct and the prohibited contents of each instance".
The dataset is an attempt at creating an open dataset for "research" into algorithms like the ones Facebook uses to identify problematic content, based around users' use of Content Warnings.
The dataset can be found here:
It was created by the University of Milan, Italy, apparently for the 13th AAAI:
The associated publishing:
https://aaai.org/ojs/index.php/ICWSM/article/download/3262/3130/ or https://likeable.space/media/30ae595a191923a1ce84a1e0feac6a3cef5b8669f44e15535ea18c7a5594b93a.pdf?name=Mastodon%20Content%20Warnings%3A%20Inappropriate%20Contents%20in%20a%20Microblogging%20Platform.pdf or DM me for a copy.
Oo look, my old bot is in a fancy mag today!
For the last few years, I've been enthralled by the Sunsphere, a big golden spherical tower in Knoxville, Tennessee that is one of the last standing pieces of the 1982 World's Fair. I'm also super fascinated by the fact that Knoxville had a World's Fair and all the things that had to happen to make that a thing. I'm funneling all my weird Sunsphere energy into an occasional newsletter and I'm sending out the first one on New Years Day. Please join me, if you'd like.
slapstick honky tonk cat burglar
a markov chain said i was "so wholsom"
mastodon backgammon grand champion, 2019
Oulipo.social is a lipogrammatic Mastodon for all.
Ambigram by B. Morin, CC BY-SA 4.0