antibot_extractor_enricher.dropins.linkedin

Module Contents

class antibot_extractor_enricher.dropins.linkedin.LinkedinDropin(sb: seleniumbase.SB, extractor: auto_archiver.core.Extractor)

Bases: auto_archiver.modules.antibot_extractor_enricher.dropin.Dropin

A class to handle LinkedIn drop-in functionality for the antibot extractor enricher module.

static documentation() → Mapping[str, str]

Each Dropin should auto-document itself with this method. The returned dictionary can include:

- 'name': A string representing the name of the dropin.
- 'description': A string describing the functionality of the dropin.
- 'site': A string representing the site this dropin is for.
- 'authentication': A dictionary with an authentication example for the site.
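As a rough illustration of the dictionary shape described above, the following sketch defines a hypothetical `ExampleDropin` for a placeholder site. The class name, the values, and the nested layout of the 'authentication' entry are assumptions made for this example, not taken from the LinkedIn dropin. The return annotation is loosened to `Mapping[str, Any]` here because the 'authentication' value is itself a dictionary.

```python
from typing import Any, Mapping


class ExampleDropin:
    """Hypothetical dropin used only to illustrate the documentation() contract."""

    @staticmethod
    def documentation() -> Mapping[str, Any]:
        return {
            "name": "Example Dropin",
            "description": "Illustrative dropin that handles pages on example.com.",
            "site": "example.com",
            # Placeholder structure: the real authentication format is an assumption here.
            "authentication": {
                "example.com": {
                    "username": "your-username",
                    "password": "your-password",
                },
            },
        }


docs = ExampleDropin.documentation()
print(sorted(docs))
```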

notifications_css_selector = 'a[href*="linkedin.com/notifications"]'

static suitable(url: str) → bool

Check if the URL is suitable for processing with this dropin.

:param url: The URL to check.
:return: True if the URL is suitable for processing, False otherwise.
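One way such a check could be written is sketched below. The helper name `is_linkedin_url` and the hostname-based rule are assumptions for illustration. The actual logic used by `LinkedinDropin.suitable` is not documented on this page and may differ.

```python
from urllib.parse import urlparse


def is_linkedin_url(url: str) -> bool:
    """Hypothetical helper: True when the URL's host is linkedin.com or a subdomain."""
    # hostname is None for strings that do not parse as a URL with a host.
    host = (urlparse(url).hostname or "").lower()
    return host == "linkedin.com" or host.endswith(".linkedin.com")


print(is_linkedin_url("https://www.linkedin.com/posts/example"))
print(is_linkedin_url("https://example.com/?next=linkedin.com"))
```

Matching on the parsed hostname rather than on the raw string avoids accepting URLs that only mention linkedin.com in a path or query parameter.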

js_for_image_css_selectors() → str

A configurable JS script that receives a CSS selector from the dropin itself and returns an array of Image elements matching that selector.

You can override this method instead of images_selector for more control over the scraped images.
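To make the idea concrete, here is a minimal sketch of a Python helper that builds such a script as a string. The function name `build_image_query_js`, the way the selector is passed in, and the exact JavaScript returned are all assumptions. The base `Dropin` class may pass the selector and structure its script differently.

```python
import json


def build_image_query_js(css_selector: str) -> str:
    """Hypothetical helper: return a JS snippet that collects elements matching the selector."""
    # json.dumps produces a safely quoted string literal that is also valid JavaScript.
    quoted = json.dumps(css_selector)
    return f"return Array.from(document.querySelectorAll({quoted}));"


script = build_image_query_js("article img")
print(script)
```

An override along these lines could filter or transform the matched elements inside the script, which a plain CSS selector cannot express.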

static video_selectors() → str

CSS selector to find videos in the HTML page.

open_page(url) → bool

Make sure the page is opened, even if it requires authentication, captcha solving, etc.

:param url: The URL to open.
:return: True if success, False otherwise.
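The sketch below shows one plausible flow for this kind of method: open the page, use the notifications link as a signal that a session is authenticated, and log in only when that signal is missing. It is an assumption about how `notifications_css_selector` might be used, not the dropin's actual implementation. `FakeBrowser`, `open_page_sketch`, and `fake_login` are hypothetical stand-ins so the example runs without a real browser. The `open` and `is_element_present` methods mirror the names of the corresponding SeleniumBase calls.

```python
from typing import Callable, List

NOTIFICATIONS_CSS_SELECTOR = 'a[href*="linkedin.com/notifications"]'


class FakeBrowser:
    """Stand-in for a browser session: records calls instead of driving a real browser."""

    def __init__(self, logged_in: bool) -> None:
        self.logged_in = logged_in
        self.opened: List[str] = []

    def open(self, url: str) -> None:
        self.opened.append(url)

    def is_element_present(self, selector: str) -> bool:
        # Pretend the notifications link exists only for an authenticated session.
        return self.logged_in


def open_page_sketch(browser: FakeBrowser, url: str, login: Callable[[FakeBrowser], bool]) -> bool:
    """Open url, attempting login first when no authenticated session is detected."""
    browser.open(url)
    if browser.is_element_present(NOTIFICATIONS_CSS_SELECTOR):
        return True  # already authenticated, page is open
    if not login(browser):
        return False  # assumption: a failed login counts as failure
    browser.open(url)  # reload the target page after authenticating
    return True


def fake_login(browser: FakeBrowser) -> bool:
    """Hypothetical login routine that always succeeds."""
    browser.logged_in = True
    return True


browser = FakeBrowser(logged_in=False)
result = open_page_sketch(browser, "https://www.linkedin.com/feed/", fake_login)
print(result, browser.opened)
```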