This extension will scrape a Patreon feed and download all posted images and slideshows. Useful for archiving.
BUG REPORTS AND FEATURE REQUESTS PLEASE SEND TO [email protected]
Due to the reactive nature of the Patreon front end, the only way to download slideshow images is to simulate a user interacting with the site. Please don’t complain that you don’t want to sit through slideshows, it’s the only way I can get all the urls. Please don’t complain that the files are named incorrectly. I do not have access to the original filename in a slideshow. The reason images are downloaded at the end of the process is that network errors or timeouts will break the scraping, so trying to download a large file while scraping is a bad idea.
I highly suggest filtering the feed by month or selecting single posts for feeds with a lot of images. You will need a fairly fast connection as network errors will stop the scraping.
The extension will take control of the website and attempt to find all images. It then sends the images to be downloaded sequentially.
To use simply load the page you want to archive, you can stop the page loading once the dom has loaded if you don’t want to wait for all the images to download. Then press download. Images will be downloaded to downloads/Patreon_Downloader/ in the given folder and in subfolders named after the post titles.
WARNING: If you haven’t set up a default download directory, you will have to manually choose the save location for each image, which might open a lot of popup windows. It is not recommended to attempt to download too many images at once without a default download directory.
If you have a network error while scraping a slideshow, you can right click on the image missing icon and select “load image”. The scraper will continue.
Add option to save all text from post
Add option to not confirm before downloading
Stop scraper attempting to open locked posts
Removed special characters from subfolder name