@xcx_shine
xcx_shine 暂无简介
A vLLM (0.12.0) out-of-tree platform plugin that enables running vLLM on NPU (Ascend/torch_npu).
Omni_Infer is a suite of inference accelerators designed for the Ascend NPU platform, offering native support and an expanding feature set.
用于并行解码,可把输入序列组织成树形式。