Frequently Asked Questions
Q: What websites does the Auto Archiver support?
A: The Auto Archiver works with a wide variety of sites. First, it can download and archive videos from any website supported by yt-dlp, a powerful video-downloading tool (full list of sites here). Beyond these sites, there are dedicated ‘Extractors’ for specific websites; see the extractors page for the full list. Some of the supported sites include:
Twitter
Instagram
Telegram
TikTok
Bluesky
Note
Which websites the Auto Archiver can archive depends on which extractors you have enabled in your configuration. See configuration for more info.
Q: What kind of data is stored for each webpage that’s archived?
A: This depends on the website being archived. In general, for social media posts, any videos and photos in
the post are archived. For video sites, the video is downloaded separately. For most of these sites, additional
metadata such as the published date, uploader/author and ratings/comments is also saved. Further data can be
saved depending on the enrichers that you have enabled. Examples include timestamps if you have the
Timestamping Enricher or OpenTimestamps Enricher enabled,
screenshots of the web page with the Screenshot Enricher, and, for videos, thumbnails of the
video with the Thumbnail Enricher. You can also store hashes (SHA-256 or PDQ hashes)
with the various hash enrichers.
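As an illustration of what a stored SHA-256 hash lets you do, the sketch below recomputes the hash of an archived file and compares it with a previously recorded value. It assumes the recorded hash is the hex-encoded SHA-256 digest of the file's raw bytes. The function names `sha256_of_file` and `verify_archived_file` are hypothetical and are not part of the Auto Archiver API.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex-encoded SHA-256 digest of the file at `path`.

    The file is read in chunks so that large videos do not need to fit in memory.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel keeps calling read() until it returns b"" (end of file).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archived_file(path, recorded_hash):
    """Return True if the file's current SHA-256 matches the recorded hex digest.

    Assumption: `recorded_hash` is a plain hex string; case is ignored.
    """
    return sha256_of_file(path) == recorded_hash.strip().lower()
```

If `verify_archived_file` returns False, the file on disk no longer matches the bytes that were hashed at archive time.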
Q: Where is my data stored?
A: With the default configuration, data is stored on your local computer in the local_storage folder. You can adjust these settings by
changing the storage modules you have enabled. For example, you could choose to store your data in an S3 bucket or
on Google Drive.
Note
You can choose to store your data in multiple places, for example your local drive and an S3 bucket for redundancy.
Q: What should I do if something doesn’t work?
A: First, read through the log files to see if you can find a specific reason why something isn’t working. Learn more about logging and how to enable debug logging in the Logging Howto.
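When a log file is long, a small script can help narrow it down. The sketch below is a minimal example that assumes each log line contains its level name (such as `ERROR` or `WARNING`) as plain text. The function name `find_problem_lines` is hypothetical, and the exact log format depends on your logging configuration.

```python
def find_problem_lines(log_lines, levels=("ERROR", "CRITICAL", "WARNING")):
    """Return (line_number, text) pairs for lines that mention any of `levels`.

    Assumption: the level name appears verbatim somewhere in the line.
    Line numbers start at 1 so they match what a text editor shows.
    """
    hits = []
    for number, line in enumerate(log_lines, start=1):
        if any(level in line for level in levels):
            hits.append((number, line.rstrip("\n")))
    return hits
```

To use it, open your log file and pass the file object directly, for example `find_problem_lines(open(path_to_log, encoding="utf-8"))`, where `path_to_log` is wherever your configuration writes its logs.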
If you cannot find an answer in the logs, try searching this documentation or the open and closed issues on the GitHub Issue Tracker. If you still cannot find an answer, consider opening an issue on the GitHub Issue Tracker or asking in the Bellingcat Discord ‘Auto Archiver’ group.
Common reasons why archiving might not work:
The website may have temporarily adjusted its settings - sometimes sites like Telegram or Twitter adjust their scraping protection settings. Often, waiting a day or two and then trying again can work.
The site requires you to be logged in - you could try using cookies or authentication to bypass any blocks. See Authentication for more information.
The website you’re trying to archive has changed its settings/structure. Make sure you’re using the latest version of Auto Archiver and try again.