shard

shard

Pipeline-parallel LLM inference across GPUs on separate machines.

313 37
Apache-2.0
last commit 2026-06-21
Source
Share:

About

Pipeline-parallel LLM inference across GPUs on separate machines.

Languages

Contributors2

No features listed.

Comments Theme
Install
pip
slug: shard