KerasAug

Description

KerasAug is a library that includes pure TF/Keras preprocessing and augmentation layers, providing support for various data types such as images, labels, bounding boxes, segmentation masks, and more.

Note: left, the visualization of the layers in KerasAug; right, the visualization of the YOLOV8 pipeline using KerasAug

KerasAug aims to provide fast, robust and user-friendly preprocessing and augmentation layers, facilitating seamless integration with TensorFlow, Keras and KerasCV.

KerasAug is:

  • 🚀 faster than KerasCV, an official Keras library
  • 🧰 supporting various data types, including images, labels, bounding boxes, segmentation masks, and more (see the short sketch after this list)
  • ❤️ dependent only on TensorFlow and KerasCV
  • 🌟 seamlessly integrated with the tf.data and tf.keras.Model APIs
  • 🔥 compatible with GPU
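
A minimal sketch of the dictionary-based input format shared by the layers (the shapes, class IDs, and the "xywh" box format here are purely illustrative, mirroring the YOLOV8 demo below):

import tensorflow as tf

import keras_aug

# Illustrative only: one layer applied to a dict of images and bounding boxes.
layer = keras_aug.layers.RandomFlip(bounding_box_format="xywh")
inputs = {
    "images": tf.random.uniform((4, 224, 224, 3), maxval=255.0),
    "bounding_boxes": {
        "classes": tf.zeros((4, 1), dtype=tf.float32),
        "boxes": tf.constant([[[10.0, 20.0, 50.0, 60.0]]] * 4, dtype=tf.float32),
    },
}
outputs = layer(inputs)  # images and boxes are transformed consistently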

Check out the demo website powered by Streamlit, where you can:

  • Apply a transformation to the default or uploaded image
  • Adjust the arguments of the specified layer

Why KerasAug?

  1. KerasAug is generally faster than KerasCV

    RandomCropAndResize in KerasAug exhibits a remarkable speed-up of +1342% compared to KerasCV. See keras-aug/benchmarks for more details.

  2. The APIs of KerasAug are more stable than those of KerasCV

    KerasCV struggles to reproduce the YOLOV8 training pipeline, whereas KerasAug executes it flawlessly. See Quickstart for more details.

  3. KerasAug comes with built-in support for mixed precision training

    All layers in KerasAug can run with tf.keras.mixed_precision.set_global_policy('mixed_float16'); see the sketch after this list.

  4. KerasAug offers the functionality of sanitizing bounding boxes, ensuring their validity

    The current layers in KerasAug support this sanitizing process via the bounding_box_min_area_ratio and bounding_box_max_aspect_ratio arguments.

    Note: the degenerate bounding boxes (those located at the bottom of the image) are removed.
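
A minimal sketch of point 3, mixed precision (the layer choices, shapes, and labels here are illustrative and follow the Quickstart below):

import tensorflow as tf
from tensorflow import keras

import keras_aug

# Enable mixed precision globally, then build an augmentation pipeline as usual.
keras.mixed_precision.set_global_policy("mixed_float16")
augmenter = keras.Sequential(
    [
        keras_aug.layers.RandomFlip(),
        keras_aug.layers.RandAugment(
            value_range=(0, 255), augmentations_per_image=3, magnitude=15
        ),
    ]
)
images = tf.random.uniform((8, 224, 224, 3), maxval=255.0)
labels = tf.one_hot(tf.zeros((8,), dtype=tf.int32), 3)
outputs = augmenter({"images": images, "labels": labels})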

Installation

pip install keras-aug keras-cv tensorflow --upgrade

Warning: KerasAug is NOT compatible with keras-cv < 0.5.0.
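
A quick post-install sanity check (just a sketch; it relies only on importlib.metadata, not on any KerasAug-specific API):

import importlib.metadata as metadata

import keras_aug  # noqa: F401  # fails here if the installation is broken

for pkg in ("keras-aug", "keras-cv", "tensorflow"):
    print(pkg, metadata.version(pkg))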

Quickstart

Rock, Paper and Scissors Image Classification
import keras_aug
import keras_cv
import tensorflow as tf
import tensorflow_datasets as tfds
from tensorflow import keras

# Create a preprocessing pipeline using KerasAug
BATCH_SIZE = 16
NUM_CLASSES = 3
augmenter = keras.Sequential(
    [
        keras_aug.layers.RandomFlip(),
        keras_aug.layers.RandAugment(
            value_range=(0, 255),
            augmentations_per_image=3,
            magnitude=15,  # [0, 30]
            magnitude_stddev=0.15,
        ),
        keras_aug.layers.CutMix(),
    ]
)


def preprocess_data(images, labels, augment=False):
    labels = tf.one_hot(labels, NUM_CLASSES)
    inputs = {"images": images, "labels": labels}
    outputs = augmenter(inputs) if augment else inputs
    return outputs["images"], outputs["labels"]


train_dataset, test_dataset = tfds.load(
    "rock_paper_scissors", as_supervised=True, split=["train", "test"]
)
train_dataset = (
    train_dataset.batch(BATCH_SIZE)
    .map(
        lambda x, y: preprocess_data(x, y, augment=True),
        num_parallel_calls=tf.data.AUTOTUNE,
    )
    .prefetch(tf.data.AUTOTUNE)
)
test_dataset = (
    test_dataset.batch(BATCH_SIZE)
    .map(preprocess_data, num_parallel_calls=tf.data.AUTOTUNE)
    .prefetch(tf.data.AUTOTUNE)
)

# Create a model using a pretrained backbone
backbone = keras_cv.models.EfficientNetV2Backbone.from_preset(
    "efficientnetv2_b0_imagenet"
)
model = keras_cv.models.ImageClassifier(
    backbone=backbone,
    num_classes=NUM_CLASSES,
    activation="softmax",
)
model.compile(
    loss="categorical_crossentropy",
    optimizer=keras.optimizers.Adam(learning_rate=1e-5),
    metrics=["accuracy"],
)

# Train your model
model.fit(
    train_dataset,
    validation_data=test_dataset,
    epochs=8,
)
# KerasCV Quickstart
...
Epoch 8/8
158/158 [==============================] - 39s 242ms/step - loss: 0.7930 - accuracy: 0.7171 - val_loss: 0.2488 - val_accuracy: 0.9946

# KerasAug Quickstart
...
Epoch 8/8
158/158 [==============================] - 34s 215ms/step - loss: 0.7680 - accuracy: 0.7567 - val_loss: 0.2639 - val_accuracy: 1.0000

KerasAug runs faster than KerasCV (215 ms/step vs. 242 ms/step) and reaches higher validation accuracy (1.0000 vs. 0.9946).

YOLOV8 Training Pipeline Demo
import keras_aug
import keras_cv
import tensorflow as tf
import tensorflow_datasets as tfds
from tensorflow import keras

BATCH_SIZE = 16
OUTPUT_PATH = "output.png"
IMAGE_HEIGHT = 640
IMAGE_WIDTH = 640
FILL_VALUE = 114


def visualize_dataset(
    inputs, value_range, rows, cols, bounding_box_format, path
):
    inputs = next(iter(inputs.take(1)))
    images, bounding_boxes = inputs["images"], inputs["bounding_boxes"]
    keras_cv.visualization.plot_bounding_box_gallery(
        images,
        value_range=value_range,
        rows=rows,
        cols=cols,
        y_true=bounding_boxes,
        scale=5,
        font_scale=0.7,
        bounding_box_format=bounding_box_format,
        path=path,
        dpi=150,
    )


def unpackage_raw_tfds_inputs(inputs, bounding_box_format):
    image = inputs["image"]
    boxes = keras_cv.bounding_box.convert_format(
        inputs["objects"]["bbox"],
        images=image,
        source="rel_yxyx",
        target=bounding_box_format,
    )
    bounding_boxes = {
        "classes": tf.cast(inputs["objects"]["label"], dtype=tf.float32),
        "boxes": tf.cast(boxes, dtype=tf.float32),
    }
    return {
        "images": tf.cast(image, tf.float32),
        "bounding_boxes": bounding_boxes,
    }


def load_pascal_voc(split, dataset, bounding_box_format):
    ds = tfds.load(dataset, split=split, with_info=False, shuffle_files=False)
    ds = ds.map(
        lambda x: unpackage_raw_tfds_inputs(
            x, bounding_box_format=bounding_box_format
        ),
        num_parallel_calls=tf.data.AUTOTUNE,
    )
    return ds


augmenter = keras.Sequential(
    layers=[
        keras_aug.layers.Resize(
            IMAGE_HEIGHT,
            IMAGE_WIDTH,
            pad_to_aspect_ratio=True,
            padding_value=FILL_VALUE,
            bounding_box_format="xywh",
        ),
        keras_aug.layers.Mosaic(
            IMAGE_HEIGHT * 2,
            IMAGE_WIDTH * 2,
            fill_value=FILL_VALUE,
            bounding_box_format="xywh",
        ),
        keras_aug.layers.RandomAffine(
            translation_height_factor=0.1,
            translation_width_factor=0.1,
            zoom_height_factor=0.5,
            same_zoom_factor=True,
            fill_value=FILL_VALUE,
            bounding_box_format="xywh",
            bounding_box_min_area_ratio=0.1,
            bounding_box_max_aspect_ratio=100.0,
        ),
        keras_aug.layers.Resize(
            IMAGE_HEIGHT, IMAGE_WIDTH, bounding_box_format="xywh"
        ),
        # TODO: Blur, MedianBlur
        keras_aug.layers.RandomApply(keras_aug.layers.Grayscale(), rate=0.01),
        keras_aug.layers.RandomApply(
            keras_aug.layers.RandomCLAHE(value_range=(0, 255)), rate=0.01
        ),
        keras_aug.layers.RandomHSV(
            value_range=(0, 255),
            hue_factor=0.015,
            saturation_factor=0.7,
            value_factor=0.4,
        ),
        keras_aug.layers.RandomFlip(bounding_box_format="xywh"),
    ]
)


train_ds = load_pascal_voc(
    split="train", dataset="voc/2007", bounding_box_format="xywh"
)
train_ds = train_ds.ragged_batch(BATCH_SIZE, drop_remainder=True)
train_ds = train_ds.map(augmenter, num_parallel_calls=tf.data.AUTOTUNE)
visualize_dataset(
    train_ds,
    bounding_box_format="xywh",
    value_range=(0, 255),
    rows=2,
    cols=2,
    path=OUTPUT_PATH,
)

Benchmark

KerasAug is generally faster than KerasCV.

Type            Layer                     KerasAug (FPS)  KerasCV (FPS)  Speed-up
Geometry        RandomHFlip               2148            1859           +15%
Geometry        RandomVFlip               2182            2075           +5%
Geometry        RandomRotate              2451            1829           +34%
Geometry        RandomAffine              2141            1240           +73%
Geometry        RandomCropAndResize       3014            209            +1342%
Geometry        Resize (224, 224)         2853            213            +1239%
Intensity       RandomBrightness          3028            3097           close
Intensity       RandomContrast            2806            2645           +6%
Intensity       RandomBrightnessContrast  3068            612            +401%
Intensity       RandomColorJitter         1932            1221           +58%
Intensity       RandomGaussianBlur        2758            207            +1232%
Intensity       Grayscale                 2841            2872           close
Intensity       Equalize                  206             139            +48%
Intensity       AutoContrast              3116            2991           +4%
Intensity       Posterize                 2917            2445           +19%
Intensity       Solarize                  3025            2882           +5%
Intensity       Sharpness                 2969            2915           close
Regularization  RandomCutout              3222            3268           close
Regularization  RandomGridMask            947             197            +381%
Mix             CutMix                    2671            2445           +9%
Mix             MixUp                     2593            1996           +29%
Auto            AugMix                    83              X (Error)      X
Auto            RandAugment               282             249            +13%

Note: the numbers are throughput in FPS (frames per second); higher is better.

Please refer to benchmarks/README.md for more details.
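
For a rough sense of how such throughput numbers can be measured, here is a sketch (not the project's benchmark script; the layer, batch size, and iteration count are arbitrary):

import time

import tensorflow as tf

import keras_aug

layer = keras_aug.layers.RandomFlip()
images = tf.random.uniform((128, 224, 224, 3), maxval=255.0)

layer(images)  # warm-up call so one-time graph building is not timed
start = time.perf_counter()
for _ in range(10):
    layer(images)
elapsed = time.perf_counter() - start
print(f"~{10 * images.shape[0] / elapsed:.0f} FPS")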

Citing KerasAug

@misc{wood2022kerascv,
  title={KerasCV},
  author={Wood, Luke and Tan, Zhenyu and Stenbit, Ian and Bischof, Jonathan and Zhu, Scott and Chollet, Fran\c{c}ois and others},
  year={2022},
  howpublished={\url{https://github.com/keras-team/keras-cv}},
}

@misc{chiu2023kerasaug,
  title={KerasAug},
  author={Hongyu, Chiu},
  year={2023},
  howpublished={\url{https://github.com/james77777778/keras-aug}},
}
