VKontakte Extractor#
Module type
The VkExtractor fetches posts, text, and images from VK (VKontakte) social media pages.
This archiver is specialized for /wall posts and uses the VkScraper library to extract
and download content. Note that VK videos are handled separately by the YTDownloader.
Features#
Extracts text, timestamps, and metadata from VK
/wallposts.Downloads associated images and attaches them to the resulting
Metadataobject.Processes multiple segments of VK URLs that contain mixed content (e.g., wall, photo).
Outputs structured metadata and media using
MetadataandMediaobjects.
Setup#
To use the VkArchiver, you must provide valid VKontakte login credentials and session information:
Username: A valid VKontakte account username.
Password: The corresponding password for the VKontakte account.
Session File: Optional. Path to a session configuration file (
.json) for persistent VK login.
Credentials can be set in the configuration file or directly via environment variables. Ensure you have access to the VKontakte API by creating an account at VKontakte.
Configuration Options#
YAML#
# steps configuration
steps:
...
extractors:
- vk_extractor
...
# module configuration
...
vk_extractor:
username: ''
password: ''
session_file: secrets/vk_config.v2.json
Command Line:#
Option |
Description |
Default |
Type |
|---|---|---|---|
|
Required. valid VKontakte username |
string |
|
|
Required. valid VKontakte password |
string |
|
|
Optional. valid VKontakte password |
secrets/vk_config.v2.json |
string |