Pairplotr is a Python library used to graph combinations of numerical and categorical data in a pair plot
Project description
<img src="https://github.com/JaggedParadigm/pairplotr/blob/master/pairplot_demo.png" width="500" />
# How
## Installation
For now, simply clone the respository and link to the location in your code.
## Use
See the [demo](https://nbviewer.jupyter.org/github/JaggedParadigm/pairplotr/blob/master/pairplotr_demo.ipynb) for use of pairplotr.
# What
Pairplotr is a Python library used to graph combinations of numerical and categorical data in a pair plot,
similar to Seaborn's pairplot(), given a cleaned Pandas dataframe with a mixture of categorical and numerical
values.
Here are the formats for Row feature|Column feature combinations in either on- or off-diagonal cells:
- On-diagonal:
- Categorical|Categorical:
- Value counts of feature values ordered by ascending value count and colored by feature values
- Numerical|Numerical:
- Histogram of feature w/ no coloring (or by desired label)
- Off-diagonal:
- Categorical|Categorical:
- Stacked value count of row feature values colored by column feature values
- Categorical|Numerical:
- Histograms of column feature for each row feature value colored by row feature value
- Numerical|Numerical:
- Scatter plot of row feature values vs column feature values w/ no coloring (or by desired label)
# Why
The available tools I've found don't seem to be able to combine numerical and categorical feature data
in a quick and easy way and I wanted to customize the comparisons as the plot types I find most useful.
# How
## Installation
For now, simply clone the respository and link to the location in your code.
## Use
See the [demo](https://nbviewer.jupyter.org/github/JaggedParadigm/pairplotr/blob/master/pairplotr_demo.ipynb) for use of pairplotr.
# What
Pairplotr is a Python library used to graph combinations of numerical and categorical data in a pair plot,
similar to Seaborn's pairplot(), given a cleaned Pandas dataframe with a mixture of categorical and numerical
values.
Here are the formats for Row feature|Column feature combinations in either on- or off-diagonal cells:
- On-diagonal:
- Categorical|Categorical:
- Value counts of feature values ordered by ascending value count and colored by feature values
- Numerical|Numerical:
- Histogram of feature w/ no coloring (or by desired label)
- Off-diagonal:
- Categorical|Categorical:
- Stacked value count of row feature values colored by column feature values
- Categorical|Numerical:
- Histograms of column feature for each row feature value colored by row feature value
- Numerical|Numerical:
- Scatter plot of row feature values vs column feature values w/ no coloring (or by desired label)
# Why
The available tools I've found don't seem to be able to combine numerical and categorical feature data
in a quick and easy way and I wanted to customize the comparisons as the plot types I find most useful.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pairplotr-1.2.3.tar.gz
(9.7 kB
view hashes)
Built Distribution
pairplotr-1.2.3-py2-none-any.whl
(11.8 kB
view hashes)
Close
Hashes for pairplotr-1.2.3-py2-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3933d4ac8d6b8cf678f5859038d07f02f0b3794de9af653093a9bbb45157dad6 |
|
MD5 | 18462c0d438d133a1ce4d8ca754ad035 |
|
BLAKE2b-256 | f22338a8ecc7723e63aea879fb73a03aa3484db07b357c0c68c0e9f786b7b904 |