
An extension for extracting and downloading Reddit posts for text mining and analysis.
Cite this program
If you use this extension for your research, please reference it as follows:
Moncomble, F. (2024). RedditScraper (Version 0.2) [JavaScript]. Arras, France: Université d’Artois. Available at: https://fmoncomble.github.io/redditscraper/
Installation
Firefox
Chrome/Edge
Remember to pin the add-on to the toolbar.
Instructions for use
- Click the add-on’s icon in the toolbar.
- The first time you use the add-on, follow the authentication procedure to authorize the app on Reddit. All credentials are stored locally on your computer, not on a remote server.
- Build your search query with at least one keyword, and click Search.
- Choose your preferred output format:
  - XML/XTZ for an XML file to import into TXM using the XML/TEI-Zero module
    - When initiating the import process, open the “Textual planes” section and type “ref” in the field labelled “Out of text to edit”
  - TXT for plain text
  - CSV
  - XLSX (Excel spreadsheet)
  - JSON
- (Optional) Enter the maximum number of posts to collect.
- You can stop the process at any time by clicking Abort.
- Click Download to collect the output or Reset to start afresh.
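Once a CSV file has been downloaded, a first text-mining pass can be as simple as counting word frequencies. The sketch below is only an illustration and is not part of the extension: the column name `text` is an assumption, since the actual column names of the export are not documented here, so check the header row of your own file and pass the right name.

```python
import csv
import io
import re
from collections import Counter
from typing import TextIO


def token_counts(csv_file: TextIO, text_column: str) -> Counter:
    """Count lowercase word tokens found in one column of a CSV export.

    `text_column` is whatever column holds the post text in your file;
    the name is an assumption, not something defined by the extension.
    """
    counts = Counter()
    for row in csv.DictReader(csv_file):
        text = row.get(text_column) or ""  # tolerate missing or empty cells
        counts.update(re.findall(r"\w+", text.lower()))
    return counts


if __name__ == "__main__":
    # Stand-in for a downloaded file; replace with open("your_export.csv", newline="", encoding="utf-8").
    sample = io.StringIO("text\nFirst post body\nSecond post body\n")
    print(token_counts(sample, "text").most_common(3))
```

The function takes an open file object rather than a path, so the same code works on a real export and on an in-memory sample.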
Known limitations
The Reddit search API returns only a selection of the posts matching a query, so the output may not contain every matching post.
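To see why a collection can stop short, it helps to know that Reddit listing responses are paged: each page holds a `children` array and an `after` cursor pointing to the next page, and collection ends when no cursor is returned. The following is a minimal sketch of such a loop, not the extension's actual code. `fetch_page` is a hypothetical callable that you would implement yourself to call the search endpoint with your own credentials, and `max_posts` mirrors the optional maximum described in the instructions.

```python
from typing import Callable, Optional


def collect_posts(
    fetch_page: Callable[[Optional[str]], dict],
    max_posts: Optional[int] = None,
) -> list:
    """Follow `after` cursors until the listing ends or `max_posts` is reached.

    `fetch_page(after)` is a hypothetical helper that must return one
    listing page shaped like {"data": {"children": [...], "after": ...}}.
    """
    posts: list = []
    if max_posts is not None and max_posts <= 0:
        return posts

    after: Optional[str] = None
    while True:
        data = fetch_page(after)["data"]
        children = data["children"]
        for child in children:
            posts.append(child["data"])
            if max_posts is not None and len(posts) >= max_posts:
                return posts  # user-defined cap reached
        after = data.get("after")
        if not after or not children:
            return posts  # no further page is offered


if __name__ == "__main__":
    # Two fake pages standing in for real API responses.
    pages = {
        None: {"data": {"children": [{"data": {"id": "a"}}], "after": "t3_a"}},
        "t3_a": {"data": {"children": [{"data": {"id": "b"}}], "after": None}},
    }
    print(len(collect_posts(lambda after: pages[after])))
```

Because the loop can only follow the cursors it is given, the number of posts collected is bounded by what the search API chooses to return, whatever the value of `max_posts`.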