ddk_mod: (Default)
ddk_mod ([personal profile] ddk_mod) wrote in [community profile] daredevilkink2015-05-09 07:29 pm
Entry tags:

Discussion/Off-Topic Post #1

THIS POST IS NOW CLOSED.

Please head over to Discussion Post #2!

Re: Does anyone want to help scrape data by hand?

(Anonymous) 2015-08-14 09:42 pm (UTC)(link)
I am super interested in this.

However, I know that the mod is also trying to get all the prompts into delicious as well. So, maybe we reach out and see if we could do both at the same time? It would make sense, since we're already going through the prompts. Then we could add a column for tags on delicious and a checkbox that it's there?

Also, what are you planning on doing with the data? Is it just a curiosity thing? Were you planning to post it publicly here on the meme. I have friends who look at fandom fairly academically and might be interested in it too since it could have real research applications.

Also, maybe some cross-scraping with data from AO3? I'd be curious to see how many fics here get cross-posted there, and if they do which authors write a lot of fills for this meme under their own name there. Also, the percentage of DD stories on AO3 that were generated by this meme vs. not, since those from this meme are generally in that collection.

Oh! And we should be tracking how many prompts here fall into being technically "kink" vs. non-kink, since this was originally supposed to be a kinkmeme (or even just sexual vs. non-sexual).

I mean, if you're going to collect data, collect data, right? lol

Reach out to me on my Tumblr. I am enthusiasmgirl (yes the one running the challenge on the Challenge Post). :D

Re: Does anyone want to help scrape data by hand?

(Anonymous) 2015-08-14 09:48 pm (UTC)(link)
That would make sense! I don't know how to do delicious though.

My fandom stats tag (a couple months back) covers a lot of my thoughts about this. I'd been trying to rescrape the data ever since I finished page 1, but I burned myself out really bad and haven't been able to get back into it. (Because it's just such a huge, huge project. I looked into bot scrapping but people much more knowledgeable than me said that DreamWidth's API is trash and half the stuff I wanted to scrape couldn't be done so that way. So.)