Parallel Inference#
LightX2V supports distributed parallel inference, enabling the utilization of multiple GPUs for inference. The DiT component supports two parallel attention mechanisms: Ulysses and Ring, while also supporting Cfg parallel inference. Parallel inference significantly reduces inference time and alleviates memory overhead on each GPU.
DiT Parallel Configuration#
1. Ulysses Parallel#
Configuration method:
"parallel": {
"seq_p_size": 4,
"seq_p_attn_type": "ulysses"
}
2. Ring Parallel#
Configuration method:
"parallel": {
"seq_p_size": 4,
"seq_p_attn_type": "ring"
}
Cfg Parallel Configuration#
Configuration method:
"parallel": {
"cfg_p_size": 2
}
Hybrid Parallel Configuration#
Configuration method:
"parallel": {
"seq_p_size": 4,
"seq_p_attn_type": "ulysses",
"cfg_p_size": 2
}
Usage#
Parallel inference configuration files are available here
By specifying –config_json to a specific config file, you can test parallel inference.
Here are some run scripts for your use.