Quantized model shards

How can I shard a quantized model so that its shards load onto different GPUs?