
Automatically shard your large model across multiple GPUs; works without torch.distributed


tensor_parallel

Run your PyTorch model on multiple GPUs from plain Python

import torch
from transformers import T5Tokenizer, T5ForConditionalGeneration

from tensor_parallel import tensor_parallel # <- interface for automatic optimal backend selection

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

model = tensor_parallel(model, ["cuda:0", "cuda:1"]) # <- magic happens here
# only half of the model is placed on each GPU, reducing the per-GPU memory footprint twofold

inputs = tokenizer("Translate from German to English: How are you?", return_tensors="pt")["input_ids"].to("cuda:0")
outputs = model.generate(inputs, num_beams=5)
print(tokenizer.decode(outputs[0]))  # Wie sind Sie?
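
The same call is not limited to Hugging Face models: tensor_parallel also accepts a plain torch.nn.Module. The snippet below is a minimal sketch; the nn.Sequential architecture, the layer sizes, and the two-GPU device list are illustrative assumptions, not part of the quick start above.

import torch
from torch import nn

from tensor_parallel import tensor_parallel

# a plain PyTorch module; the layer sizes are arbitrary example values
model = nn.Sequential(
    nn.Linear(1024, 4096),
    nn.ReLU(),
    nn.Linear(4096, 1024),
)

model = tensor_parallel(model, ["cuda:0", "cuda:1"])  # shard the weights across two GPUs

x = torch.randn(8, 1024, device="cuda:0")
y = model(x)          # forward pass uses both GPUs
print(y.shape)        # torch.Size([8, 1024]), same shape as the unsharded module would produce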

Installation

The recommended way to install this package is with pip:

pip install tensor_parallel
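
To check that the installation worked, a quick sanity check (this assumes PyTorch is installed with CUDA support and at least one GPU is visible):

import torch
from tensor_parallel import tensor_parallel  # should import without errors

print(torch.cuda.device_count())  # number of GPUs PyTorch can see, e.g. 2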

Code style

We use black and isort for all pull requests. Before committing your code, run black . && isort . to format it.


