Copy a file in the most efficient way possible while generating a SHA256 hash of the data

Project description

hashcopy

This module contains one class, HashCopier, which will copy data from an input file to an output file with minimal memory copying, while computing a SHA256 hash of the data. It can also be used without an output file if you just want the hash.

This module works by mapping the entire source file into memory with mmap, then calling madvise to tell the system that the data will be read sequentially. Each call to .update() hashes a fixed amount of data (default: 4 MB) directly from the memory mapping and, if an output file descriptor was passed, calls write to write that data to the output file. It then calls madvise(..., MADV_DONTNEED) to tell the system that this data is no longer needed, which reduces the number of resident pages.
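The same sequence of steps can be sketched in pure Python with the standard library's mmap and hashlib modules. This is only an illustration of the technique, not the module's actual implementation (hashcopy itself appears to be written in C); the names hash_copy, _write_all and DEFAULT_CHUNK are hypothetical, and the sketch assumes a platform where mmap.madvise and the MADV_* constants are available (Python 3.8+ on Linux, for example).

```python
import hashlib
import mmap
import os

DEFAULT_CHUNK = 4 * 1024 * 1024  # 4 MB, the default chunk size quoted above


def _write_all(fd, data):
    """Write every byte of data (a memoryview) to fd, retrying on short writes."""
    written = 0
    while written < len(data):
        with data[written:] as rest:
            written += os.write(fd, rest)


def hash_copy(in_fd, out_fd=None, chunk_size=DEFAULT_CHUNK):
    """Return the SHA256 digest of in_fd's contents, copying them to out_fd if given.

    Hypothetical helper: it mirrors the steps described above, not hashcopy's API.
    """
    # madvise needs page-aligned offsets, so the chunk size must be page-aligned too.
    if chunk_size <= 0 or chunk_size % mmap.PAGESIZE:
        raise ValueError("chunk_size must be a positive multiple of the page size")

    digest = hashlib.sha256()
    size = os.fstat(in_fd).st_size
    if size == 0:
        return digest.digest()  # an empty file cannot be mapped

    can_advise = hasattr(mmap.mmap, "madvise")
    with mmap.mmap(in_fd, 0, access=mmap.ACCESS_READ) as mapping:
        # Hint that the mapping will be read front to back.
        if can_advise and hasattr(mmap, "MADV_SEQUENTIAL"):
            mapping.madvise(mmap.MADV_SEQUENTIAL)

        with memoryview(mapping) as view:
            for offset in range(0, size, chunk_size):
                length = min(chunk_size, size - offset)
                # Slicing a memoryview does not copy the underlying pages.
                with view[offset:offset + length] as chunk:
                    digest.update(chunk)
                    if out_fd is not None:
                        _write_all(out_fd, chunk)
                # Tell the kernel the pages just processed can be dropped.
                if can_advise and hasattr(mmap, "MADV_DONTNEED"):
                    mapping.madvise(mmap.MADV_DONTNEED, offset, length)

    return digest.digest()
```

Calling hash_copy(fd) without an output descriptor corresponds to the hash-only mode mentioned above. The sketch falls back to plain hashing and copying when madvise is not available.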

Because this module depends on mapping the entire source file, it will likely fail on 32-bit systems if the size of the file exceeds the usable address space.
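A caller that may run on 32-bit systems could check the file size before attempting the mapping. The sketch below is a rough heuristic, not part of hashcopy: fits_in_address_space and its headroom parameter are hypothetical, and it relies on sys.maxsize being 2**31 - 1 on 32-bit Python builds. The real limit depends on how fragmented the process's address space already is.

```python
import sys


def fits_in_address_space(file_size, max_size=sys.maxsize, headroom=0.5):
    """Guess whether a file of file_size bytes can be mapped in one piece.

    headroom is an assumed safety factor: only that fraction of the
    theoretical maximum is treated as usable for a single mapping.
    """
    return file_size <= max_size * headroom


if __name__ == "__main__":
    import os

    size = os.path.getsize(__file__)
    print(f"{size} bytes, mappable: {fits_in_address_space(size)}")
```

When the check fails, reading the file in blocks and feeding them to hashlib.sha256 is a simple alternative that never needs the whole file mapped.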

Installation

pip install hashcopy

Example

from pathlib import Path
from hashcopy import HashCopier

with Path('hashcopy.c').open('rb') as inputfp, Path('output.c').open('wb') as outputfp:
    with HashCopier(inputfp.fileno(), outputfp.fileno()) as hasher:
        while (bytes_copied := hasher.update()) > 0:
            print(f'hashed {bytes_copied} bytes')
        print(f'hash result = {hasher.finalize().hex()}')
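To confirm that the copy is intact, the digest reported by the example can be compared against one computed independently from the output file. The helper below uses only hashlib. The name sha256_of_file is hypothetical, and the comparison assumes that finalize() returns the raw digest as bytes, which the .hex() call in the example suggests.

```python
import hashlib


def sha256_of_file(path, block_size=1024 * 1024):
    """Return the SHA256 digest (raw bytes) of the file at path, read in blocks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fp:
        # Read fixed-size blocks so large files never sit in memory at once.
        while block := fp.read(block_size):
            digest.update(block)
    return digest.digest()
```

With the example above, the check would be sha256_of_file('output.c') == result, where result is the value saved from hasher.finalize().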

Release history

1.0.5 (Aug 26, 2022, this version)
1.0.4 (Aug 26, 2022)
1.0.3 (Aug 26, 2022)
1.0.2 (Aug 26, 2022)
1.0.1 (Aug 26, 2022)

Download files

Source Distribution

hashcopy-1.0.4.tar.gz (5.3 kB), uploaded Aug 26, 2022

Hashes for hashcopy-1.0.4.tar.gz

| Algorithm | Hash digest |
| --- | --- |
| SHA256 | `875854f6db22218366755b2a3a505fc25dbf197ca5e9b011c51a8d3d1d5e30dc` |
| MD5 | `b6c3dfa080e18411046ae2637f7d2af8` |
| BLAKE2b-256 | `39c2d70fff30baab08c44d5103e4b9d78da550dc2eda842fdc2092d7e8f04ffe` |