pdq_hash_enricher#

Submodules#

Package Contents#

class pdq_hash_enricher.PdqHashEnricher#

Bases: auto_archiver.core.Enricher

Calculates perceptual hashes for Media instances using PDQ, allowing for (near-)duplicate detection. Ideally this enrichment is orchestrated to run after the thumbnail_enricher.

enrich(to_enrich: auto_archiver.core.Metadata) None#

Enriches a Metadata object with additional information or context.

Takes the metadata object to enrich as an argument and modifies it in place, returning None.

calculate_pdq_hash(filename)#