Skip to main content

Library for image hashing and deduplication.

Project description

# SDHash [![Build Status](https://travis-ci.org/horia141/sdhash.svg?branch=master)](https://travis-ci.org/horia141/sdhash)

A Python library for computing hashes of images which ignore perceptual differences.

## Usage

```python
import sdhash
from PIL import Image

i1 = Image.open('test1.png')
i2 = Image.open('test1_noise.png')
i3 = Image.open('test2.png')

sdhash.test_duplicate(i1, i2) # True
sdhash.test_duplicate(i1, i3) # False
sdhash.hash_image(i1) # [ an md5 output ]
```

## Background

Suppose you want to test that two images are identical. The naive approach of simply comparing the byte-array representation of the two is not good.

## Algorithm

## Installation ##

The Python image library and NumPy/SciPy etc.

Installation is simple, via `pip`:

```bash
pip install sdhash
```

## TODO

Resistance to rotation, mirroring etc.
Tunable knobs (for similarity detection etc.)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sdhash-0.0.4.tar.gz (4.0 kB view hashes)

Uploaded Source

Built Distribution

sdhash-0.0.4-py2-none-any.whl (5.5 kB view hashes)

Uploaded Python 2

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page