visualizing-archive-data
visualizing-archive-data
Visualizing AO3 data
93 posts
Scraping AO3 to visualize data about fandom, my own writing, or anything I want to really. 
Don't wanna be here? Send us removal request.
visualizing-archive-data · 4 years ago
Text
Category Tags Over Time on AO3
@olderthannetfic comments/analysis on m/m across platforms got me thinking about how pairing prevalence might shift over time. And AO3′s handy-dandy Selective data dump for fan statisticians makes it easy-ish to compare (as least for that platform). ​
Tumblr media
[image description: A 100% stack bar chart showing how each category tag contribute to the whole from 2008 through February 2021. The breakdown for 2020 is as follows: m/m 45%, f/m 22%, gen 16%, f/f 9%, while multi and other are too small to label, with each less than 5%. ] The chart shows the percentage each tag makes up for the total category tags used in a given year. That year is based on when the work was created. 2021 is only through the end of February. It should be noted that works can have zero category tags or all 6 or something in between. So it would not be accurate to say that “45% of all works on AO3 in 2020 were for m/m.” But you could say that “45% of all category tags for works created in 2020 on AO3 were for m/m.”  The ratio of m/m and f/m to the other tags looks roughly consistent since 2014 although it’s down this year from their highs. f/f, multi, and other have increased, mostly at gen’s expense. I’m going to ignore 2008 since there weren’t many works that year. Since then the high water mark for m/m was 2019. f/m was 2018. gen was 2009.  f/f, multi, and other are all 2021.
If you are interested in category tags on AO3, check out Older Than Netfic’s analysis and @toastystats’s femslash analysis which includes % of all works over time. Their numbers are slightly different than mine because of differing methodology - creation date vs updated date, percent of works vs percent of tags, etc.
171 notes · View notes
visualizing-archive-data · 4 years ago
Photo
Tumblr media
Some pride for Pride! It’s been almost a year now since I started scraping my AO3 stats page, and here’s how the day-of-week trend stack up. I’ve broken the data into two groups: normal days where I didn’t post anything, and posting days (translucent bars, indicated by ‘new’ in legend).
[ID: Bar chart displaying average AO3 statistics (bookmarks, hits, kudos etc) by day-of-week. One could make an argument for periodicity with peaks on the weekend and mid-week, but it’s not hugely significant for this sample size. Feedback in the form of comments definitely peaks on Wednesday and Thursday, although for newly-posted/updated work, that trend is higher towards the later days of the week. The color scheme of the bars follows that of the rainbow Pride flag.]
11 notes · View notes
visualizing-archive-data · 4 years ago
Photo
Tumblr media
100% prepared to bring my gay AF plots into a job interview next week
9 notes · View notes
visualizing-archive-data · 4 years ago
Text
AO3 Freeform Tags Over Time
Late 2008 to early 2020
Tumblr media
Biggest Movers (Fluff, Hurt/Comfort, Smut, Anal Sex)
Tumblr media Tumblr media
Biggest Losers (Crossover, First Time, Drama, Episode Related):
Tumblr media Tumblr media
Interactive chart can be found here. Check it out. It’s much nicer and well worth the hassle of leaving this place. Source is @ao3org′s Selective Data Dump for fan statisticians.  Freeform text racing bar chart inspired by Largest Fandoms on AO3 (youtube).
14K notes · View notes
visualizing-archive-data · 4 years ago
Photo
Tumblr media
All my plots can look like Pride flags now and it makes me so happy
[ID: Random line and scatterplot using the Philadelphia inclusive pride flag colors as a color cycle. This is especially obvious in the legend on the right-hand side]
3 notes · View notes
visualizing-archive-data · 4 years ago
Text
Call for participants in a research survey
Makenna Reaves, an udnergraduate at University of Washington is asking you (in case you are over the age of 18) to participate in a survey. This survey is being conducted for the purpose of gathering data about general opinions and attitudes toward fan creations. The survey will run until May 13, 11:45 PM, GMT-7. You can access the survey at this link. It consists of 15 multiple choice and 3 long answer questions, it takes between 10 and 45 minutes to complete. The participants will be asked questions about their age, gender identity, sexual identity, and ethnicity and their knowledge and general opinion of various fan creations and fan communities.
The survey has been checked and approved by the researcher’s Faculty’s Ethics Commission. This survey on reading engagement includes a consent form and information about participant privacy and data usage.
More information can be found by clicking on the above survey link. You can also reach out to Makenna Reaves at reaves [at] uw.edu.
537 notes · View notes
visualizing-archive-data · 4 years ago
Text
If you don’t support AO3′s policies, then don’t post your fic there.
19K notes · View notes
visualizing-archive-data · 4 years ago
Photo
Tumblr media Tumblr media Tumblr media Tumblr media Tumblr media Tumblr media Tumblr media Tumblr media Tumblr media
By The Numbers: The Lord of the Rings Films
23K notes · View notes
visualizing-archive-data · 4 years ago
Photo
Tumblr media
Easier than submitting a bug report
157K notes · View notes
visualizing-archive-data · 4 years ago
Photo
Tumblr media
In honor of the manga finishing, here’s something else that’s close to being finished! 
ID: Screenshot of a web app displaying the number of gendered pronouns per 1K words in the top 80 most popular fics in the Attack on Titan fandom on AO3, sorted by bookmark. The number ratio of male pronouns to female pronouns is overplotted on the right-hand axis using grey plus signs. There’s a sick new logo in the top-right corner, and noticeably way fewer female pronouns over all than male ones. Look at the scale on the right-hand axis and weep.
4 notes · View notes
visualizing-archive-data · 4 years ago
Photo
Tumblr media
Levi “I used to do okay with the ladies” Ackerman apparently also does pretty well with the men.
Screenshot from a web app displaying my analysis of gendered pronoun uses in top AO3 fandworks of most popular Anime & Manga fandoms. This example is for all the verbs and direct objects associated with male or female pronouns in Shingeki no Kyojin/Attack on Titan’s most popular 80 fanfics, sorted by bookmark. Read more about how this plot was made here. (ignore the negative values on the x-axis, that was a cheap hack to make plots like these, I will fix it in the final version)
Oh, and I’ll end this with a personal observation: looking through this type of results from many top anime fandoms, one thing I consistently see is smile for women in the top 20 verbs. I don’t see this for men. Come on writers, women can do much more than smile and they do not have to smile all the time no matter what (American especially) society tells them. (rant over)
8 notes · View notes
visualizing-archive-data · 4 years ago
Text
Fanfic Gender Studies - Behind the Scenes
A small example of how things work! Say I have this text:
ttext=(“He took the sword, but she laughed in his face. The dragon attacked her, and she defended the keep with fire, while he ate apples. He swung the lamp at his opponent, who jumped out of his way to shield himself.”)
I want to:
count the male and female pronouns
find out which verbs are associated with said pronouns
find out the direct objects or objects of prepositions associated with said verbs
It’s pretty easy to see what those are in the above example. How does the computer do?
doc=nlp(ttext) output=gendered_pronouns(doc)
He ['Masc'] took sword dobj sword she ['Fem'] laughed face op [face] she ['Fem'] defended keep dobj keep he ['Masc'] ate apples dobj apples He ['Masc'] swung lamp dobj lamp
We get 7 male pronouns, 3 female pronouns, the male actions take, eat, and swing, the female actions laugh and defend, and the objects they interact with (male: sword, apples, lamp female: face, keep).
I’ve got 211 million words to run this on now, from the most popular fics of the biggest anime and manga fandoms on AO3, with more on the way from randomly sampled fics for comparison!
17 notes · View notes
visualizing-archive-data · 4 years ago
Photo
Tumblr media
The result of last night’s processing -- maybe I should start a KoFi for the increase in my electric bill 😅
Anyway, have a preview of a different type of WIP. Like in previous posts, this looks at the gendered pronoun usage (Engilsh: he/him/his/himself, she/her/hers/herself) in the top 80 publicly accessible fics by bookmark on AO3. (80 because I forgot the world doesn’t index on 0 ha ha anyway). Also like before, we can take an in-depth look at the verbs associated with he and she: does she go, and he stays? Alternatively, what does he do that she does not, and vice versa? Now using SpaCy’s dependency parse of morphology rules instead of bigrams.
Something else I look at this time are the people or objects she and he interact with. Does she slay the vampire, and he share the bed? Coming soon to Google Data Studio maybe, or Heroku if I get fed-up with limited choices.
P.S. to any writers out there, with or without popular fics: if there was a chance your work would show up in a study like this, would you prefer it to be anonymized? ie title, author, and URL removed from the dataset.
5 notes · View notes
visualizing-archive-data · 4 years ago
Photo
Tumblr media
y’all are killing me, why are your favorite fics so long, Fairy Tail fandom?
2 notes · View notes
visualizing-archive-data · 4 years ago
Text
AO3 has released a heckuva lot of data about the archive's works and tags, for use by anyone interested in the data!! 🎉🎉🎉
698 notes · View notes
visualizing-archive-data · 4 years ago
Photo
Tumblr media
Made a few improvements to my personal AO3 web app today, since my Marked for Later was getting too long. Now I’ve got everything in one filterable, sortable table and I can just click the link to take me straight to the fic! My favorite feature? Sort by ‘Version’ to see what’s updated while I’ve been not-reading these fics ;) I could be convinced to push this to production once I figure out how to handle passwords securely... reblog or comment if you’d like a tool like this!
Image ID: Web app displaying properties of fanfics found in my Marked for Later page. The usual categories -- title, author, fandom, rating, category, word count, etc -- are supplemented with other data available in the marked for later work blurb groups, such as version, date visited, etc. although some of these have been deleted from the table for clarity. Selecting a cell updates the features below the main table, printing out the title and summary and then a table displaying characters, relationships, and tags associated with that work.
5 notes · View notes
visualizing-archive-data · 4 years ago
Text
Tumblr media Tumblr media Tumblr media Tumblr media
TOASTYSTATS: F/F on AO3 (Feb 2021)
I've finished some stats about F/F on AO3 just in time to be part of Femslash February! 😅
If you click through to AO3, you'll see lots more explanation, plus a bunch more analyses, including:
how the overall amount of F/F on AO3 has changed over time
some of the most 🔥explicit F/F ships🔥
some ✨fandoms that are almost entirely F/F✨
You'll also find any corrections/updates/clarifications. (Please also comment on AO3 to let me know if you spot any errors!)
If you enjoy this, you may also enjoy my other data about femslash and gender representation in fandom -- or my fandom stats series more broadly (note that all these series are time-ordered; scroll to the bottom for the most recent works in the series).
5K notes · View notes