仓库 - xcx_shine (xcx_shine)

A vLLM (0.12.0) out-of-tree platform plugin that enables running vLLM on NPU (Ascend/torch_npu).

最近更新：2个月前

Omni_Infer is a suite of inference accelerators designed for the Ascend NPU platform, offering native support and an expanding feature set.

最近更新：7个月前

最近更新：10个月前

用于并行解码，可把输入序列组织成树形式。

最近更新：暂未更新

xcx_shine