(Version française)

An extension for extracting and downloading Reddit posts for text mining and analysis.

Cite this program

If you use this extension for your research, please reference it as follows:

Moncomble, F. (2024). RedditScraper (Version 0.2) [JavaScript]. Arras, France: Université d’Artois. Available at: https://fmoncomble.github.io/redditscraper/

Installation

Firefox

Firefox add-on

Chrome/Edge

available-chrome-web-store4321

Remember to pin the add-on to the toolbar.

Instructions for use

  • Click the add-on’s icon in the toolbar.
  • On first using the add-on, follow the authentication procedure to authorize the app on Reddit. All credentials are stored locally on your computer, not on a remote server.
  • Build your search query with at least one keyword, and click Search.
  • Choose your preferred output format:
    • XML/XTZ for an XML file to import into TXM using the XML/TEI-Zero module
      • When initiating the import process, open the “Textual planes” section and type ref in the field labelled “Out of text to edit”
    • TXT for plain text
    • CSV
    • XLSX (Excel spreadsheet)
    • JSON
  • (Optional) Enter the maximum number of posts to collect.
  • You can stop the process at any time by clicking Abort.
  • Click Download to collect the output or Reset to start afresh.

Known limitations

The Reddit search API only returns a selection of results.