OpenScraper is a project of mine that came into being because of a request from a relative. She works as a professor of marketing at a university in France and needed data for further analysis (for example, sentiment analysis for a new product release). So she asked me whether I could code her a simple Reddit scraper that converts the comment data of a specified post into a nicely formatted Excel file. Reddit comes in handy as a data source, as it hosts communities for nearly every domain.
So that is basically what I did. After a few quick Google searches, I learned about the Python Reddit API Wrapper (PRAW) and built a script around it. As she is not experienced in programming, I also provided an installer script (using pyinstaller and easygui) that lets her generate an executable. Detailed information about setting up the API keys can be found in the dedicated blog post on my website.
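To give a rough idea of the approach, here is a minimal sketch of how a post's comments can be pulled with PRAW and turned into a flat table. It is not the actual OpenScraper code: the function names (`fetch_comments`, `comments_to_rows`, `write_csv`) and the column selection are assumptions made for illustration, and it writes CSV with the standard library as a stand-in for the Excel export (Excel opens CSV files directly).

```python
import csv
from datetime import datetime, timezone

# Column order of the exported table (an illustrative choice, not OpenScraper's).
FIELDS = ["id", "parent_id", "author", "score", "created_utc", "body"]


def fetch_comments(post_url, client_id, client_secret, user_agent):
    """Return all comments of a Reddit post as a flat list.

    Needs network access and valid Reddit API keys.
    """
    import praw  # imported lazily so the helpers below work without PRAW

    reddit = praw.Reddit(
        client_id=client_id,
        client_secret=client_secret,
        user_agent=user_agent,
    )
    submission = reddit.submission(url=post_url)
    # Resolve every "load more comments" placeholder before flattening.
    submission.comments.replace_more(limit=None)
    return submission.comments.list()


def comments_to_rows(comments):
    """Convert comment objects into plain dicts, one per table row."""
    rows = []
    for c in comments:
        rows.append({
            "id": c.id,
            "parent_id": c.parent_id,
            # PRAW reports the author of a deleted account as None.
            "author": str(c.author) if c.author is not None else "[deleted]",
            "score": c.score,
            "created_utc": datetime.fromtimestamp(
                c.created_utc, tz=timezone.utc
            ).isoformat(),
            "body": c.body,
        })
    return rows


def write_csv(rows, fileobj):
    """Write the rows, preceded by a header line, to an open text file."""
    writer = csv.DictWriter(fileobj, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
```

With real API keys, the pieces would be chained as `write_csv(comments_to_rows(fetch_comments(url, ...)), f)`, where `f` is a file opened with `newline=""` as the `csv` module recommends.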
I then did the same for YouTube (using google-api-python-client) and Twitter (using tweepy; note that the Twitter API keys will set you back $100/month…).
If you want to try them out yourself, here is the link to the GitHub repo. Extensive READMEs can be found in the respective folders:
https://github.com/Geibinger/OpenScraper
This one was a bit shorter, thank you for reading! 🙂