
Streamline your Kafka data processing: this tool aims to standardize streaming data from multiple Kafka clusters. With its pub-sub approach, multiple functions can easily subscribe to incoming messages, serialization can be specified per topic, and data is automatically processed by data sink functions.


Snapstream

Snapstream provides a data-flow model to simplify development of stateful streaming applications.
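The idea behind the model can be pictured with a toy sketch. This is an illustration only, not snapstream's actual implementation (the real library processes bindings in parallel; this sketch runs them sequentially): a decorator binds an iterable to a handler function, and everything the handler yields is fanned out to one or more sink callables.

```python
from typing import Callable, Iterable

# Toy illustration of the data-flow idea -- NOT snapstream's actual code.
# snap_sketch binds an iterable to a handler; stream_sketch drives the
# bindings and fans every yielded value out to the sink callables.

_bindings: list[tuple[Iterable, Callable, list[Callable]]] = []

def snap_sketch(it: Iterable, sink: list[Callable]):
    def decorator(handler: Callable):
        _bindings.append((it, handler, sink))
        return handler
    return decorator

def stream_sketch():
    for it, handler, sinks in _bindings:
        for msg in it:
            for result in handler(msg):  # handler is a generator
                for s in sinks:
                    s(result)

collected = []

@snap_sketch(range(3), sink=[collected.append, print])
def handler(msg):
    yield f'Hello {msg}'

stream_sketch()
# collected == ['Hello 0', 'Hello 1', 'Hello 2']
```

The decorator registers the binding without running anything; work only happens when the stream function is called, which is what lets many handlers and sinks be wired up before processing starts.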

Installation

pip install snapstream

Usage

We snap iterables to user functions and process them in parallel when we call stream:


Here we pass the callable print as a sink, so each value the handler yields is printed. Multiple iterables and sinks can be passed.

from snapstream import snap, stream

@snap(range(5), sink=[print])
def handler(msg):
    yield f'Hello {msg}'

stream()
Hello 0
Hello 1
Hello 2
Hello 3
Hello 4

To try it out for yourself, spin up a local Kafka broker with the provided docker-compose.yml and connect via localhost:29091:

docker compose up broker -d

Use the CLI tool to inspect a Topic or Cache:

snapstream topic emoji --offset -2
>>> timestamp: 2023-04-28T17:31:51.775000+00:00
>>> offset: 0
>>> key:
🏆
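Kafka topics plug into the same snap/stream model. The sketch below is illustrative only and assumes the compose broker above is running and reachable on localhost:29091; the Topic configuration keys shown follow standard librdkafka conventions and may need adjusting for your setup:

```python
from snapstream import Topic, snap, stream

# Illustrative configuration -- adjust to your broker setup.
t = Topic('emoji', {
    'bootstrap.servers': 'localhost:29091',
    'auto.offset.reset': 'earliest',
    'group.id': 'demo',
})

# Produce a few messages into the topic (a Topic acting as a sink)...
@snap(('🏆', '📞', '🐟'), sink=[t])
def produce(msg):
    yield msg

# ...and consume them as they arrive (a Topic acting as an iterable).
@snap(t, sink=[print])
def consume(msg):
    yield msg.value().decode()

stream()
```

Because topics are just iterables and sinks in this model, the same handler code works whether messages come from range(5) in a test or from a live Kafka cluster.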

