Skip to main content

Investigate a Git repository's revision history to find text patterns.

Project description

Poirot helps you investigate your repositories. Give him a set of clues (e.g. strings or regular expressions) and he will report back any place they appear in your repository’s revision history.

When used as a pre-commit hook, Poirot can warn you if you’re about to commit something you might not intend (think passwords, private keys, tokens, and other bits of sensitive or personally identifiable information).

Poirot began as a fork of CFPB’s fellow gumshoe, Clouseau.

Dependencies

  • git

  • Python 2.7 or 3.3+

  • a UNIX-based OS (e.g. Mac or Linux) or a UNIX-y shell on Windows (e.g. Cygwin, Babun, or Git-Bash). It will not work with the default Windows Command Prompt (cmd).

Poirot uses these Python packages:

  • Jinja2 to format its console output

  • tqdm to display a progress bar

  • regex to allow for POSIX ERE regular expressions

  • requests to read remote pattern files

Installation

Poirot is available on PyPi and can be installed with pip as:

pip install poirot

You may want to install it in a virtual environment, unless you plan on using Poirot in a global commit hook.

In that case, you will have to ensure that you have done a global pip install for any Python versions you are using. E.g., if you want to run it on Python 2.7, 3.3, and 3.5 installed, install Poirot as follows:

pip2.7 install poirot
pip3.3 install poirot
pip3.5 install poirot

Running Poirot from the Command Line

To invoke Poirot and see his findings, call him from the command line with poirot and the following optional arguments:

  • –url: The repository’s URL, e.g. https://github.com/DCgov/poirot.git or git@github.com:DCgov/poirot.git. When included, you will be given the choice to clone or pull from the remote URL. Default value: none.

  • –dir: The local path to your repository’s base directory or the directory you would like to clone or pull to. Default value: the current working directory.

  • –term: A single term or regular expression to search for. Default value: none.

  • –patterns: The path to a .txt file with strings or regular expression patterns, each on its own line. These can be the file’s URL or its relative or absolute local path. You can give a comma-separated list of pattern files, if you wish to include more than one. Default value: default.txt.

  • –staged: A flag, which when included, restricts search to staged revisions. This is helpful, along with --dir, as part of a pre-commit hook.

  • –revlist: A range of revisions to inspect. Default value: The last commit (i.e. HEAD^!) if --staged is not included, otherwise none.

  • –verbose: A flag to output verbose, colorful output and pattern-match highlighting. The GIF above gives an example with –verbose included.

  • –before: Date restriction on revisions. Default value: none.

  • –after: Date restriction on revisions. Default value: none.

  • –author: Authorship restriction on revisions. Default value: none.

Examples

The most basic command Poirot will accept is:

poirot

That will search the current git directory’s last commit (i.e. HEAD^!) for the patterns in the default pattern file.

To specify one or more different patterns files (each separated by a comma), do this instead:

poirot --patterns='../path/to/thisisapatternfile.txt,/Users/myusername/anotherpatternfile.txt'

The –patterns option also allows files accessible over HTTP, like this one here:

poirot --patterns='https://raw.githubusercontent.com/DCgov/poirot/master/poirot/patterns/default.txt'

To search for a single term (like password):

poirot --term="password"

Say you want to search for password in the whole revision history of all branches. Then do:

poirot --term="password" --revlist="all"

You can further restrict the set of revisions Poirot looks through with the before, after, and author options (which correspond to the same flags in git). E.g.:

poirot --term="password" --revlist=40dc6d1...3e4c011 --before="2015-11-28" --after="2015-10-01" --author="me@poirot.com"

Perhaps you don’t have the repository available locally or you would like to update it from a remote URL. Just add the url to your command and it will allow you to clone or pull to the current folder.

poirot --url https://github.com/foo/baz.git --term="password"

You can also specify a different directory than the current one with dir. The following command will clone/pull to the folder thisotherfolder, which sits inside of the current directory. If it does not yet exist, it will be created.

poirot --url https://github.com/foo/baz.git --term="password" --dir="thisotherfolder"

To search changes that have been staged for commit, but not yet committed, use the staged flag:

poirot --term="password" --staged

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

poirot-0.1.0.tar.gz (17.4 kB view hashes)

Uploaded Source

Built Distribution

poirot-0.1.0-py2.py3-none-any.whl (17.2 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page