# DeepSparkInference
**Repository Path**: deep-spark/deepsparkinference
## Basic Information
- **Project Name**: DeepSparkInference
- **Description**: DeepSparkInference推理模型库甄选290+推理大小模型示例,小模型涵盖计算机视觉,自然语言处理,语音识别等领域;大模型涉及vLLM,TGI等多种框架,并将持续扩展更多、更新的模型示例。
- **Primary Language**: Python
- **License**: Apache-2.0
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 144
- **Forks**: 40
- **Created**: 2024-03-19
- **Last Updated**: 2026-05-22
## Categories & Tags
**Categories**: ai
**Tags**: llm, vllm, IGIE, ixRT, modelzoo
## README
[
](README_en.md) [
](README.md)
# DeepSparkInference
DeepSparkInference ModelZoo, as a core project of the DeepSpark open-source community, was officially open-sourced in
March 2024. The first release selected 48 inference model examples, covering fields such as computer vision, natural
language processing, and speech recognition. More AI domains will be gradually expanded in the future.
The models in DeepSparkInference provide inference examples and guidance documents for running on inference engines IGIE
or ixRT self-developed by Iluvatar CoreX. Some models provide evaluation results based on the self-developed GPGPU
Zhikai 100.
IGIE (Iluvatar GPU Inference Engine) is a high-performance, highly gene, and end-to-end AI inference engine developed
based on the TVM framework. It supports multi-framework model, quantization, graph optimization, multi-operator library
support, multi-backend support, and automatic operator tuning, providing an easy-to-deploy, high-throughput, and
low-latency complete solution for inference scenarios.
ixRT (Iluvatar CoreX RunTime) is a high-performance inference engine independently developed by Iluvatar CoreX, focusing
on maximizing the performance of Iluvatar CoreX's GPGPU and achieving high-performance inference for models in various
fields. ixRT supports features such as dynamic shape inference, plugins, and INT8/FP16 inference.
DeepSparkInference will be updated quarterly, and model categories will be gradually enriched, with large model
inference to be expanded in the future.
## ModelZoo
### LLM (Large Language Model)
| Model | Engine | Supported | IXUCA SDK |
|-------------------------------|--------------|--------------------------------------------------------------------|-----------|
| Baichuan2-7B | `vLLM` | [✅](models/nlp/llm/baichuan2-7b/vllm) | 4.3.0 |
| ChatGLM-3-6B | `vLLM` | [✅](models/nlp/llm/chatglm3-6b/vllm) | 4.3.0 |
| ChatGLM-3-6B-32K | `vLLM` | [✅](models/nlp/llm/chatglm3-6b-32k/vllm) | 4.3.0 |
| CosyVoice2-0.5B | `PyTorch` | [✅](models/speech/speech_synthesis/cosyvoice/pytorch) | 4.3.0 |
| CosyVoice2-0.5B | `ixRT` | [✅](models/speech/speech_synthesis/cosyvoice/ixrt) | dev-only |
| DeepSeek-R1-Distill-Llama-8B | `vLLM` | [✅](models/nlp/llm/deepseek-r1-distill-llama-8b/vllm) | 4.3.0 |
| DeepSeek-R1-Distill-Llama-70B | `vLLM` | [✅](models/nlp/llm/deepseek-r1-distill-llama-70b/vllm) | 4.3.0 |
| DeepSeek-R1-Distill-Qwen-1.5B | `vLLM` | [✅](models/nlp/llm/deepseek-r1-distill-qwen-1.5b/vllm) | 4.3.0 |
| DeepSeek-R1-Distill-Qwen-7B | `vLLM` | [✅](models/nlp/llm/deepseek-r1-distill-qwen-7b/vllm) | 4.4.0 |
| DeepSeek-R1-Distill-Qwen-14B | `vLLM` | [✅](models/nlp/llm/deepseek-r1-distill-qwen-14b/vllm) | 4.3.0 |
| DeepSeek-R1-Distill-Qwen-32B | `vLLM` | [✅](models/nlp/llm/deepseek-r1-distill-qwen-32b/vllm) | 4.3.0 |
| DeepSeek-OCR | `Transformers` | [✅](models/multimodal/vision_language_model/deepseek-ocr/transformers) | 4.3.0 |
| DeepSeek-OCR | `vLLM` | [✅](models/multimodal/vision_language_model/deepseek-ocr/vllm) | dev-only |
| ERNIE-4.5-21B-A3B | `FastDeploy` | [✅](models/nlp/llm/ernie-4.5-21b-a3b/fastdeploy/) | 4.3.0 |
| ERNIE-4.5-300B-A47B | `FastDeploy` | [✅](models/nlp/llm/ernie-4.5-300b-a47b/fastdeploy/) | 4.3.0 |
| ERNIE-4.5-VL-28B-A3B-Thinking | `Transformers` | [✅](models/multimodal/vision_language_model/ernie-4.5-vl-28b-a3b-thinking/transformers/) | 4.4.0 |
| GLM-4V | `vLLM` | [✅](models/multimodal/vision_language_model/glm-4v/vllm/) | 4.3.0 |
| InternLM3 | `LMDeploy` | [✅](models/nlp/llm/internlm3/lmdeploy/) | 4.3.0 |
| InternLM3 | `vLLM` | [✅](models/nlp/llm/internlm3/vllm/) | 4.4.0 |
| IndexTTS-2 | `IndexTTS` | [✅](models/speech/speech_synthesis/indextts-2/indextts/) | 4.4.0 |
| Llama2-7B | `vLLM` | [✅](models/nlp/llm/llama2-7b/vllm) | 4.3.0 |
| Llama2-7B | `TRT-LLM` | [✅](models/nlp/llm/llama2-7b/trtllm) | 4.3.0 |
| Llama2-13B | `TRT-LLM` | [✅](models/nlp/llm/llama2-13b/trtllm) | 4.3.0 |
| Llama2-70B | `TRT-LLM` | [✅](models/nlp/llm/llama2-70b/trtllm) | 4.3.0 |
| Llama3-70B | `vLLM` | [✅](models/nlp/llm/llama3-70b/vllm) | 4.3.0 |
| E5-V | `vLLM` | [✅](models/multimodal/vision_language_model/e5-v/vllm/) | 4.3.0 |
| MiniCPM-o-2 | `vLLM` | [✅](models/multimodal/vision_language_model/minicpm-o-2/vllm/) | 4.3.0 |
| MiniCPM-V-2 | `vLLM` | [✅](models/multimodal/vision_language_model/minicpm-v-2/vllm/) | 4.3.0 |
| MiniCPM-V-4 | `vLLM` | [✅](models/multimodal/vision_language_model/minicpm-v-4/vllm/) | dev-only |
| NVLM | `vLLM` | [✅](models/multimodal/vision_language_model/nvlm/vllm) | 4.3.0 |
| Phi3_v | `vLLM` | [✅](models/multimodal/vision_language_model/phi3_v/vllm) | 4.3.0 |
| PaliGemma | `vLLM` | [✅](models/multimodal/vision_language_model/paligemma/vllm) | 4.3.0 |
| PaddleOCR-VL | `Transformers` | [✅](models/multimodal/vision_language_model/paddleocr-vl-1.5/transformers) | 4.4.0 |
| Qwen-7B | `vLLM` | [✅](models/nlp/llm/qwen-7b/vllm) | 4.3.0 |
| Qwen-VL | `vLLM` | [✅](models/multimodal/vision_language_model/qwen_vl/vllm) | 4.3.0 |
| Qwen2-VL | `vLLM` | [✅](models/multimodal/vision_language_model/qwen2_vl/vllm) | 4.3.0 |
| Qwen2.5-VL | `vLLM` | [✅](models/multimodal/vision_language_model/qwen2_5_vl/vllm) | 4.4.0 |
| Qwen1.5-7B | `vLLM` | [✅](models/nlp/llm/qwen1.5-7b/vllm) | 4.3.0 |
| Qwen1.5-7B | `TGI` | [✅](models/nlp/llm/qwen1.5-7b/tgi) | 4.3.0 |
| Qwen1.5-14B | `vLLM` | [✅](models/nlp/llm/qwen1.5-14b/vllm) | 4.3.0 |
| Qwen1.5-32B Chat | `vLLM` | [✅](models/nlp/llm/qwen1.5-32b/vllm) | 4.3.0 |
| Qwen1.5-72B | `vLLM` | [✅](models/nlp/llm/qwen1.5-72b/vllm) | 4.3.0 |
| Qwen2-7B Instruct | `vLLM` | [✅](models/nlp/llm/qwen2-7b/vllm) | 4.3.0 |
| Qwen2-72B Instruct | `vLLM` | [✅](models/nlp/llm/qwen2-72b/vllm) | 4.3.0 |
| Qwen3_Moe | `vLLM` | [✅](models/nlp/llm/qwen3-235b/vllm) | dev-only |
| Qwen3-8B | `vLLM` | [✅](models/nlp/llm/qwen3-8b/vllm) | 4.4.0 |
| Qwen3-32B | `vLLM` | [✅](models/nlp/llm/qwen3-32b/vllm) | 4.4.0 |
| Qwen3-30B-A3B-Thinking | `vLLM` | [✅](models/nlp/llm/qwen3-30b-a3b-thinking/vllm) | 4.4.0 |
| Qwen3-235B-A22B-Thinking | `vLLM` | [✅](models/nlp/llm/qwen3-235b-a22b-thinking/vllm) | 4.4.0 |
| Qwen3-Next-80B-A3B | `vLLM` | [✅](models/nlp/llm/qwen3-next-80b-a3b/vllm) | 4.4.0 |
| Qwen3-Embedding-8B | `vLLM` | [✅](models/nlp/llm/qwen3-embedding-8b/vllm) | 4.4.0 |
| Qwen3-ASR-1.7B | `Qwen-ASR` | [✅](models/speech/asr/qwen3-embedding-8b/qwen-asr) | 4.4.0 |
| Qwen3-TTS-12Hz-1.7B-Base | `Qwen-TTS` | [✅](models/speech/speech_synthesis/qwen3-tts-12hz-1.7b/qwen-tts) | 4.4.0 |
| DeepSeek-V3.1 | `vLLM` | [✅](models/nlp/llm/deepseek-v3.1/vllm) | 4.4.0 |
| StableLM2-1.6B | `vLLM` | [✅](models/nlp/llm/stablelm/vllm) | 4.3.0 |
| Step3 | `vLLM` | [✅](models/multimodal/vision_language_model/step3/vllm) | 4.4.0 |
| Ultravox | `vLLM` | [✅](models/speech/asr/ultravox/vllm) | 4.3.0 |
| Whisper | `vLLM` | [✅](models/speech/asr/whisper/vllm/) | 4.3.0 |
| XLMRoberta | `vLLM` | [✅](models/multimodal/vision_language_model/xlmroberta/vllm) | 4.3.0 |
### Computer Vision
#### Classification
| Model | Prec. | IGIE | ixRT | IXUCA SDK |
|------------------------|-------|--------------------------------------------------------|-----------------------------------------------------------|-----------|
| AlexNet | FP16 | [✅](models/cv/classification/alexnet/igie) | [✅](models/cv/classification/alexnet/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/classification/alexnet/igie) | [✅](models/cv/classification/alexnet/ixrt) | 4.3.0 |
| CLIP | FP16 | [✅](models/cv/classification/clip/igie) | [✅](models/cv/classification/clip/ixrt) | 4.3.0 |
| Conformer-B | FP16 | [✅](models/cv/classification/conformer_base/igie) | | 4.3.0 |
| ConvNeXt-Base | FP16 | [✅](models/cv/classification/convnext_base/igie) | [✅](models/cv/classification/convnext_base/ixrt) | 4.3.0 |
| ConvNext-S | FP16 | [✅](models/cv/classification/convnext_s/igie) | | 4.3.0 |
| ConvNeXt-Small | FP16 | [✅](models/cv/classification/convnext_small/igie) | [✅](models/cv/classification/convnext_small/ixrt) | 4.3.0 |
| ConvNeXt-Tiny | FP16 | [✅](models/cv/classification/convnext_tiny/igie) | | 4.3.0 |
| CSPDarkNet53 | FP16 | [✅](models/cv/classification/cspdarknet53/igie) | [✅](models/cv/classification/cspdarknet53/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/cv/classification/cspdarknet53/ixrt) | 4.3.0 |
| CSPResNet50 | FP16 | [✅](models/cv/classification/cspresnet50/igie) | [✅](models/cv/classification/cspresnet50/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/cv/classification/cspresnet50/ixrt) | 4.3.0 |
| CSPResNeXt50 | FP16 | [✅](models/cv/classification/cspresnext50/igie) | [✅](models/cv/classification/cspresnext50/ixrt) | 4.3.0 |
| DeiT-B | FP16 | | [✅](models/cv/classification/deit_b/ixrt) | 4.4.0 |
| DeiT-tiny | FP16 | [✅](models/cv/classification/deit_tiny/igie) | [✅](models/cv/classification/deit_tiny/ixrt) | 4.3.0 |
| DenseNet121 | FP16 | [✅](models/cv/classification/densenet121/igie) | [✅](models/cv/classification/densenet121/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/classification/densenet121/igie) | | 4.4.0 |
| DenseNet161 | FP16 | [✅](models/cv/classification/densenet161/igie) | [✅](models/cv/classification/densenet161/ixrt) | 4.3.0 |
| DenseNet169 | FP16 | [✅](models/cv/classification/densenet169/igie) | [✅](models/cv/classification/densenet169/ixrt) | 4.3.0 |
| DenseNet201 | FP16 | [✅](models/cv/classification/densenet201/igie) | [✅](models/cv/classification/densenet201/ixrt) | 4.3.0 |
| EfficientNet-B0 | FP16 | [✅](models/cv/classification/efficientnet_b0/igie) | [✅](models/cv/classification/efficientnet_b0/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/cv/classification/efficientnet_b0/ixrt) | 4.3.0 |
| EfficientNet-B1 | FP16 | [✅](models/cv/classification/efficientnet_b1/igie) | [✅](models/cv/classification/efficientnet_b1/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/cv/classification/efficientnet_b1/ixrt) | 4.3.0 |
| EfficientNet-B2 | FP16 | [✅](models/cv/classification/efficientnet_b2/igie) | [✅](models/cv/classification/efficientnet_b2/ixrt) | 4.3.0 |
| EfficientNet-B3 | FP16 | [✅](models/cv/classification/efficientnet_b3/igie) | [✅](models/cv/classification/efficientnet_b3/ixrt) | 4.3.0 |
| EfficientNet-B4 | FP16 | [✅](models/cv/classification/efficientnet_b4/igie) | [✅](models/cv/classification/efficientnet_b4/ixrt) | 4.3.0 |
| EfficientNet-B5 | FP16 | [✅](models/cv/classification/efficientnet_b5/igie) | [✅](models/cv/classification/efficientnet_b5/ixrt) | 4.3.0 |
| EfficientNet-B6 | FP16 | [✅](models/cv/classification/efficientnet_b6/igie) | | 4.3.0 |
| EfficientNet-B7 | FP16 | [✅](models/cv/classification/efficientnet_b7/igie) | | 4.3.0 |
| EfficientNetV2 | FP16 | [✅](models/cv/classification/efficientnet_v2/igie) | [✅](models/cv/classification/efficientnet_v2/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/cv/classification/efficientnet_v2/ixrt) | 4.3.0 |
| EfficientNetv2_rw_t | FP16 | [✅](models/cv/classification/efficientnetv2_rw_t/igie) | [✅](models/cv/classification/efficientnetv2_rw_t/ixrt) | 4.3.0 |
| EfficientNetv2_s | FP16 | [✅](models/cv/classification/efficientnet_v2_s/igie) | [✅](models/cv/classification/efficientnet_v2_s/ixrt) | 4.3.0 |
| GoogLeNet | FP16 | [✅](models/cv/classification/googlenet/igie) | [✅](models/cv/classification/googlenet/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/classification/googlenet/igie) | [✅](models/cv/classification/googlenet/ixrt) | 4.3.0 |
| HRNet-W18 | FP16 | [✅](models/cv/classification/hrnet_w18/igie) | [✅](models/cv/classification/hrnet_w18/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/cv/classification/hrnet_w18/ixrt) | 4.3.0 |
| InceptionV3 | FP16 | [✅](models/cv/classification/inception_v3/igie) | [✅](models/cv/classification/inception_v3/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/classification/inception_v3/igie) | [✅](models/cv/classification/inception_v3/ixrt) | 4.3.0 |
| Inception-ResNet-V2 | FP16 | | [✅](models/cv/classification/inception_resnet_v2/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/cv/classification/inception_resnet_v2/ixrt) | 4.3.0 |
| Mixer_B | FP16 | [✅](models/cv/classification/mlp_mixer_base/igie) | | 4.3.0 |
| MNASNet0_5 | FP16 | [✅](models/cv/classification/mnasnet0_5/igie) | | 4.3.0 |
| MNASNet0_75 | FP16 | [✅](models/cv/classification/mnasnet0_75/igie) | | 4.3.0 |
| MNASNet1_0 | FP16 | [✅](models/cv/classification/mnasnet1_0/igie) | | 4.3.0 |
| MNASNet1_3 | FP16 | [✅](models/cv/classification/mnasnet1_3/igie) | | 4.3.0 |
| MobileNetV1 | FP16 | | [✅](models/cv/classification/mobilenet_v1/ixrt) | 4.4.0 |
| MobileNetV2 | FP16 | [✅](models/cv/classification/mobilenet_v2/igie) | [✅](models/cv/classification/mobilenet_v2/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/classification/mobilenet_v2/igie) | [✅](models/cv/classification/mobilenet_v2/ixrt) | 4.3.0 |
| MobileNetV3_Large | FP16 | [✅](models/cv/classification/mobilenet_v3_large/igie) | | 4.3.0 |
| MobileNetV3_Small | FP16 | [✅](models/cv/classification/mobilenet_v3/igie) | [✅](models/cv/classification/mobilenet_v3/ixrt) | 4.3.0 |
| Mobilevit_s | FP16 | [✅](models/cv/classification/mobilevit_s/igie) | | 4.4.0 |
| MViTv2_base | FP16 | [✅](models/cv/classification/mvitv2_base/igie) | | dev-only |
| RegNet_x_16gf | FP16 | [✅](models/cv/classification/regnet_x_16gf/igie) | | 4.3.0 |
| RegNet_x_1_6gf | FP16 | [✅](models/cv/classification/regnet_x_1_6gf/igie) | | 4.3.0 |
| RegNet_x_3_2gf | FP16 | [✅](models/cv/classification/regnet_x_3_2gf/igie) | | 4.3.0 |
| RegNet_x_8gf | FP16 | [✅](models/cv/classification/regnet_x_8gf/igie) | | 4.3.0 |
| RegNet_x_32gf | FP16 | [✅](models/cv/classification/regnet_x_32gf/igie) | | 4.3.0 |
| RegNet_x_400mf | FP16 | [✅](models/cv/classification/regnet_x_400mf/igie) | | 4.3.0 |
| RegNet_x_800mf | FP16 | [✅](models/cv/classification/regnet_x_800mf/igie) | | 4.3.0 |
| RegNet_y_1_6gf | FP16 | [✅](models/cv/classification/regnet_y_1_6gf/igie) | | 4.3.0 |
| RegNet_y_16gf | FP16 | [✅](models/cv/classification/regnet_y_16gf/igie) | | 4.3.0 |
| RegNet_y_3_2gf | FP16 | [✅](models/cv/classification/regnet_y_3_2gf/igie) | | 4.3.0 |
| RegNet_y_32gf | FP16 | [✅](models/cv/classification/regnet_y_32gf/igie) | | 4.3.0 |
| RegNet_y_400mf | FP16 | [✅](models/cv/classification/regnet_y_400mf/igie) | | 4.3.0 |
| RepVGG | FP16 | [✅](models/cv/classification/repvgg/igie) | [✅](models/cv/classification/repvgg/ixrt) | 4.3.0 |
| Res2Net50 | FP16 | [✅](models/cv/classification/res2net50/igie) | [✅](models/cv/classification/res2net50/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/cv/classification/res2net50/ixrt) | 4.3.0 |
| ResNeSt50 | FP16 | [✅](models/cv/classification/resnest50/igie) | | 4.3.0 |
| ResNet101 | FP16 | [✅](models/cv/classification/resnet101/igie) | [✅](models/cv/classification/resnet101/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/classification/resnet101/igie) | [✅](models/cv/classification/resnet101/ixrt) | 4.3.0 |
| ResNet152 | FP16 | [✅](models/cv/classification/resnet152/igie) | | 4.3.0 |
| | INT8 | [✅](models/cv/classification/resnet152/igie) | | 4.3.0 |
| ResNet18 | FP16 | [✅](models/cv/classification/resnet18/igie) | [✅](models/cv/classification/resnet18/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/classification/resnet18/igie) | [✅](models/cv/classification/resnet18/ixrt) | 4.3.0 |
| ResNet34 | FP16 | | [✅](models/cv/classification/resnet34/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/cv/classification/resnet34/ixrt) | 4.3.0 |
| ResNet50 | FP16 | [✅](models/cv/classification/resnet50/igie) | [✅](models/cv/classification/resnet50/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/classification/resnet50/igie) | | 4.3.0 |
| ResNetV1D50 | FP16 | [✅](models/cv/classification/resnetv1d50/igie) | [✅](models/cv/classification/resnetv1d50/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/cv/classification/resnetv1d50/ixrt) | 4.3.0 |
| ResNeXt50_32x4d | FP16 | [✅](models/cv/classification/resnext50_32x4d/igie) | [✅](models/cv/classification/resnext50_32x4d/ixrt) | 4.3.0 |
| ResNeXt101_64x4d | FP16 | [✅](models/cv/classification/resnext101_64x4d/igie) | [✅](models/cv/classification/resnext101_64x4d/ixrt) | 4.3.0 |
| ResNeXt101_32x8d | FP16 | [✅](models/cv/classification/resnext101_32x8d/igie) | [✅](models/cv/classification/resnext101_32x8d/ixrt) | 4.3.0 |
| SEResNet50 | FP16 | [✅](models/cv/classification/se_resnet50/igie) | | 4.3.0 |
| ShuffleNetV1 | FP16 | | [✅](models/cv/classification/shufflenet_v1/ixrt) | 4.3.0 |
| ShuffleNetV2_x0_5 | FP16 | [✅](models/cv/classification/shufflenetv2_x0_5/igie) | [✅](models/cv/classification/shufflenetv2_x0_5/ixrt) | 4.3.0 |
| ShuffleNetV2_x1_0 | FP16 | [✅](models/cv/classification/shufflenetv2_x1_0/igie) | [✅](models/cv/classification/shufflenetv2_x1_0/ixrt) | 4.3.0 |
| ShuffleNetV2_x1_5 | FP16 | [✅](models/cv/classification/shufflenetv2_x1_5/igie) | [✅](models/cv/classification/shufflenetv2_x1_5/ixrt) | 4.3.0 |
| ShuffleNetV2_x2_0 | FP16 | [✅](models/cv/classification/shufflenetv2_x2_0/igie) | [✅](models/cv/classification/shufflenetv2_x2_0/ixrt) | 4.3.0 |
| SqueezeNet 1.0 | FP16 | [✅](models/cv/classification/squeezenet_v1_0/igie) | [✅](models/cv/classification/squeezenet_v1_0/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/cv/classification/squeezenet_v1_0/ixrt) | 4.3.0 |
| SqueezeNet 1.1 | FP16 | [✅](models/cv/classification/squeezenet_v1_1/igie) | [✅](models/cv/classification/squeezenet_v1_1/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/cv/classification/squeezenet_v1_1/ixrt) | 4.3.0 |
| SVT Base | FP16 | [✅](models/cv/classification/svt_base/igie) | | 4.3.0 |
| Swin Transformer | FP16 | [✅](models/cv/classification/swin_transformer/igie) | [✅](models/cv/classification/swin_transformer/ixrt) | 4.3.0 |
| Swin Transformer Large | FP16 | | [✅](models/cv/classification/swin_transformer_large/ixrt) | 4.3.0 |
| Twins_PCPVT | FP16 | [✅](models/cv/classification/twins_pcpvt/igie) | | 4.3.0 |
| VAN_B0 | FP16 | [✅](models/cv/classification/van_b0/igie) | | 4.3.0 |
| VGG11 | FP16 | [✅](models/cv/classification/vgg11/igie) | | 4.3.0 |
| VGG13 | FP16 | [✅](models/cv/classification/vgg13/igie) | | 4.3.0 |
| VGG13_BN | FP16 | [✅](models/cv/classification/vgg13_bn/igie) | | 4.3.0 |
| VGG16 | FP16 | [✅](models/cv/classification/vgg16/igie) | [✅](models/cv/classification/vgg16/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/classification/vgg16/igie) | | 4.3.0 |
| VGG19 | FP16 | [✅](models/cv/classification/vgg19/igie) | | 4.3.0 |
| VGG19_BN | FP16 | [✅](models/cv/classification/vgg19_bn/igie) | | 4.3.0 |
| ViT | FP16 | [✅](models/cv/classification/vit/igie) | [✅](models/cv/classification/vit/ixit) | 4.3.0 |
| ViT-B-32 | FP16 | [✅](models/cv/classification/vit_b_32/igie) | | 4.4.0 |
| ViT-L-14 | FP16 | [✅](models/cv/classification/vit_l_14/igie) | | 4.4.0 |
| Wide ResNet50 | FP16 | [✅](models/cv/classification/wide_resnet50/igie) | [✅](models/cv/classification/wide_resnet50/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/classification/wide_resnet50/igie) | [✅](models/cv/classification/wide_resnet50/ixrt) | 4.3.0 |
| Wide ResNet101 | FP16 | [✅](models/cv/classification/wide_resnet101/igie) | | 4.3.0 |
#### Object Detection
| Model | Prec. | IGIE | ixRT | IXUCA SDK |
|------------|-------|-------------------------------------------------|-------------------------------------------------|-----------|
| ATSS | FP16 | [✅](models/cv/object_detection/atss/igie) | [✅](models/cv/object_detection/atss/ixrt) | 4.3.0 |
| CenterNet | FP16 | [✅](models/cv/object_detection/centernet/igie) | [✅](models/cv/object_detection/centernet/ixrt) | 4.3.0 |
| DETR | FP16 | | [✅](models/cv/object_detection/detr/ixrt) | 4.3.0 |
| FCOS | FP16 | [✅](models/cv/object_detection/fcos/igie) | [✅](models/cv/object_detection/fcos/ixrt) | 4.3.0 |
| FoveaBox | FP16 | [✅](models/cv/object_detection/foveabox/igie) | [✅](models/cv/object_detection/foveabox/ixrt) | 4.3.0 |
| FSAF | FP16 | [✅](models/cv/object_detection/fsaf/igie) | [✅](models/cv/object_detection/fsaf/ixrt) | 4.3.0 |
| GFL | FP16 | [✅](models/cv/object_detection/gfl/igie) | | 4.3.0 |
| Grounding DINO | FP16 | | [✅](models/cv/object_detection/grounding_dino/ixrt) | dev-only |
| HRNet | FP16 | [✅](models/cv/object_detection/hrnet/igie) | [✅](models/cv/object_detection/hrnet/ixrt) | 4.3.0 |
| PAA | FP16 | [✅](models/cv/object_detection/paa/igie) | [✅](models/cv/object_detection/paa/ixrt) | 4.3.0 |
| RetinaFace | FP16 | [✅](models/cv/object_detection/retinaface/igie) | [✅](models/cv/object_detection/retinaface/ixrt) | 4.3.0 |
| RetinaNet | FP16 | [✅](models/cv/object_detection/retinanet/igie) | [✅](models/cv/object_detection/retinanet/ixrt) | 4.3.0 |
| RTMDet | FP16 | [✅](models/cv/object_detection/rtmdet/igie) | | 4.3.0 |
| RTDETR | FP16 | [✅](models/cv/object_detection/rtdetr/igie) | [✅](models/cv/object_detection/rtdetr/ixrt) | dev-only |
| | INT8 | [✅](models/cv/object_detection/rtdetr/igie) | | dev-only |
| SABL | FP16 | [✅](models/cv/object_detection/sabl/igie) | | 4.3.0 |
| SSD | FP16 | [✅](models/cv/object_detection/ssd/igie) | | 4.3.0 |
| YOLOF | FP16 | [✅](models/cv/object_detection/yolof/igie) | [✅](models/cv/object_detection/yolof/ixrt) | 4.3.0 |
| YOLOv3 | FP16 | [✅](models/cv/object_detection/yolov3/igie) | [✅](models/cv/object_detection/yolov3/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/object_detection/yolov3/igie) | [✅](models/cv/object_detection/yolov3/ixrt) | 4.3.0 |
| YOLOv4 | FP16 | [✅](models/cv/object_detection/yolov4/igie) | [✅](models/cv/object_detection/yolov4/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/object_detection/yolov4/igie16) | [✅](models/cv/object_detection/yolov4/ixrt16) | 4.3.0 |
| YOLOv5m | FP16 | [✅](models/cv/object_detection/yolov5m/igie) | [✅](models/cv/object_detection/yolov5m/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/object_detection/yolov5m/igie) | [✅](models/cv/object_detection/yolov5m/ixrt) | 4.3.0 |
| YOLOv5s | FP16 | [✅](models/cv/object_detection/yolov5s/igie) | [✅](models/cv/object_detection/yolov5s/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/object_detection/yolov5s/igie) | [✅](models/cv/object_detection/yolov5s/ixrt) | 4.3.0 |
| YOLOv6s | FP16 | [✅](models/cv/object_detection/yolov6s/igie) | [✅](models/cv/object_detection/yolov6s/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/cv/object_detection/yolov6s/ixrt) | 4.3.0 |
| YOLOv7 | FP16 | [✅](models/cv/object_detection/yolov7/igie) | [✅](models/cv/object_detection/yolov7/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/object_detection/yolov7/igie) | [✅](models/cv/object_detection/yolov7/ixrt) | 4.3.0 |
| YOLOv8n | FP16 | [✅](models/cv/object_detection/yolov8n/igie) | [✅](models/cv/object_detection/yolov8n/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/object_detection/yolov8n/igie) | [✅](models/cv/object_detection/yolov8n/ixrt) | 4.3.0 |
| YOLOv8s | FP16 | [✅](models/cv/object_detection/yolov8s/igie) | | 4.3.0 |
| | INT8 | [✅](models/cv/object_detection/yolov8s/igie) | | 4.3.0 |
| YOLOv9s | FP16 | [✅](models/cv/object_detection/yolov9s/igie) | [✅](models/cv/object_detection/yolov9s/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/object_detection/yolov9s/igie) | | 4.3.0 |
| YOLOv10s | FP16 | [✅](models/cv/object_detection/yolov10s/igie) | [✅](models/cv/object_detection/yolov10s/ixrt) | 4.3.0 |
| YOLOv11m | FP16 | [✅](models/cv/object_detection/yolov11m/igie) | | 4.4.0 |
| | INT8 | [✅](models/cv/object_detection/yolov11m/igie) | | 4.4.0 |
| YOLOv11n | FP16 | [✅](models/cv/object_detection/yolov11n/igie) | [✅](models/cv/object_detection/yolov11n/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/object_detection/yolov11n/igie) | | 4.3.0 |
| YOLOv11s | FP16 | [✅](models/cv/object_detection/yolov11s/igie) | | 4.4.0 |
| | INT8 | [✅](models/cv/object_detection/yolov11s/igie) | | 4.4.0 |
| YOLOv12n | FP16 | [✅](models/cv/object_detection/yolov12n/igie) | [✅](models/cv/object_detection/yolov12n/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/object_detection/yolov12n/igie) | | 4.3.0 |
| YOLOv13n | FP16 | [✅](models/cv/object_detection/yolov13n/igie) | [✅](models/cv/object_detection/yolov13n/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/object_detection/yolov13n/igie) | | 4.3.0 |
| YOLOv26n | FP16 | [✅](models/cv/object_detection/yolov26n/igie) | | 4.4.0 |
| YOLOXm | FP16 | [✅](models/cv/object_detection/yoloxm/igie) | [✅](models/cv/object_detection/yoloxm/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/object_detection/yoloxm/igie) | [✅](models/cv/object_detection/yoloxm/ixrt) | 4.3.0 |
| Model | Prec. | PaddlePaddle | IXUCA SDK |
|------------|-------|---------------------------------------------------------|-----------|
| RTDETR | FP16 | [✅](models/cv/object_detection/rtdetr/paddlepaddle) | dev-only |
| Model | Prec. | Pytorch | IXUCA SDK |
|------------|-------|-------------------------------------------------|-----------|
| YOLOv8n | FP16 | [✅](models/cv/object_detection/yolov8n/pytorch) | dev-only |
#### Face Recognition
| Model | Prec. | IGIE | ixRT | IXUCA SDK |
|---------|-------|------|----------------------------------------------|-----------|
| FaceNet | FP16 | | [✅](models/cv/face_recognition/facenet/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/cv/face_recognition/facenet/ixrt) | 4.3.0 |
#### OCR (Optical Character Recognition)
| Model | Prec. | IGIE | ixRT | IXUCA SDK |
|---------------|-------|---------------------------------------|---------------------------------------|-----------|
| CRNN | FP16 | | [✅](models/cv/ocr/crnn/ixrt) | 4.4.0 |
| DBNet | FP16 | | [✅](models/cv/ocr/dbnet/ixrt) | 4.4.0 |
| Kie_layoutXLM | FP16 | [✅](models/cv/ocr/kie_layoutxlm/igie) | | 4.3.0 |
| SVTR | FP16 | [✅](models/cv/ocr/svtr/igie) | | 4.3.0 |
#### Pose Estimation
| Model | Prec. | IGIE | ixRT | IXUCA SDK |
|----------------------|-------|-----------------------------------------------|----------------------------------------------------------|-----------|
| HRNetPose | FP16 | [✅](models/cv/pose_estimation/hrnetpose/igie) | | 4.3.0 |
| Lightweight OpenPose | FP16 | | [✅](models/cv/pose_estimation/lightweight_openpose/ixrt) | 4.3.0 |
| RTMPose | FP16 | [✅](models/cv/pose_estimation/rtmpose/igie) | [✅](models/cv/pose_estimation/rtmpose/ixrt) | 4.3.0 |
#### Instance Segmentation
| Model | Prec. | IGIE | ixRT | IXUCA SDK |
|------------|-------|------|-----------------------------------------------------|-----------|
| Mask R-CNN | FP16 | | [✅](models/cv/instance_segmentation/mask_rcnn/ixrt) | 4.2.0 |
| SOLOv1 | FP16 | | [✅](models/cv/instance_segmentation/solov1/ixrt) | 4.3.0 |
#### Semantic Segmentation
| Model | Prec. | IGIE | ixRT | IXUCA SDK |
|-------|-------|------------------------------------------------|------------------------------------------------|-----------|
| DDRNet | FP16 | | [✅](models/cv/semantic_segmentation/ddrnet/ixrt) | 4.4.0 |
| UNet | FP16 | [✅](models/cv/semantic_segmentation/unet/igie) | [✅](models/cv/semantic_segmentation/unet/ixrt) | 4.3.0 |
#### Multi-Object Tracking
| Model | Prec. | IGIE | ixRT | IXUCA SDK |
|---------------------|-------|----------------------------------------------------|------|-----------|
| FastReID | FP16 | [✅](models/cv/multi_object_tracking/fastreid/igie) | [✅](models/cv/multi_object_tracking/fastreid/ixrt) | 4.3.0 |
| DeepSort | FP16 | [✅](models/cv/multi_object_tracking/deepsort/igie) | [✅](models/cv/multi_object_tracking/deepsort/ixrt) | 4.3.0 |
| | INT8 | [✅](models/cv/multi_object_tracking/deepsort/igie) | | 4.3.0 |
| RepNet-Vehicle-ReID | FP16 | [✅](models/cv/multi_object_tracking/repnet/igie) | [✅](models/cv/multi_object_tracking/repnet/ixrt) | 4.3.0 |
### Multimodal
| Model | Engine | Supported | IXUCA SDK |
|---------------------|----------|------------------------------------------------------------------------|-----------|
| Aria | vLLM | [✅](models/multimodal/vision_language_model/aria/vllm) | 4.3.0 |
| Chameleon-7B | vLLM | [✅](models/multimodal/vision_language_model/chameleon_7b/vllm) | 4.3.0 |
| CLIP | IxFormer | [✅](models/multimodal/vision_language_model/clip/ixformer) | 4.3.0 |
| DeepSeek-VL2-tiny | vLLM | [✅](models/multimodal/vision_language_model/deepseek-vl2/vllm) | 4.4.0 |
| Fuyu-8B | vLLM | [✅](models/multimodal/vision_language_model/fuyu_8b/vllm) | 4.3.0 |
| FLUX.1-Dev | xDiT | [✅](models/multimodal/diffusion_model/flux.1-dev/xdit) | 4.4.0 |
| H2OVL Mississippi | vLLM | [✅](models/multimodal/vision_language_model/h2vol/vllm) | 4.3.0 |
| HunyuanVideo | xDiT | [✅](models/multimodal/diffusion_model/hunyuan_video/xdit) | 4.4.0 |
| HunyuanDiT-v1.2 | xDiT | [✅](models/multimodal/diffusion_model/hunyuanDit-v1.2/xdit) | 4.4.0 |
| Idefics3 | vLLM | [✅](models/multimodal/vision_language_model/idefics3/vllm) | 4.3.0 |
| InternVL2-4B | vLLM | [✅](models/multimodal/vision_language_model/intern_vl/vllm) | 4.3.0 |
| LLaVA | vLLM | [✅](models/multimodal/vision_language_model/llava/vllm) | 4.3.0 |
| LLaVA-Next-Video-7B | vLLM | [✅](models/multimodal/vision_language_model/llava_next_video_7b/vllm) | 4.3.0 |
| Llama-3.2 | vLLM | [✅](models/multimodal/vision_language_model/llama-3.2/vllm) | 4.3.0 |
| Pixtral | vLLM | [✅](models/multimodal/vision_language_model/pixtral/vllm) | 4.3.0 |
| Qwen-Image | ComfyUI | [✅](models/multimodal/vision_language_model/qwen-image/comfyui) | 4.4.0 |
| Stable Diffusion 1.5 | Diffusers | [✅](models/multimodal/diffusion_model/stable-diffusion-1.5/diffusers) | 4.3.0 |
| Stable Diffusion 2.1 | ixRT | [✅](models/multimodal/diffusion_model/stable-diffusion-2.1/ixrt) | 4.4.0 |
| Stable Diffusion 3 | Diffusers | [✅](models/multimodal/diffusion_model/stable-diffusion-3/diffusers) | dev-only |
| SD3-Medium | xDiT | [✅](models/multimodal/diffusion_model/stable-diffusion-3-medium/xdit) | 4.4.0 |
| Wan2.1-T2V-14B | xDiT | [✅](models/multimodal/diffusion_model/wan2.1-t2v-14b/xdit) | 4.4.0 |
| Wan2.2-TI2V-5B | xDiT | [✅](models/multimodal/diffusion_model/wan2.2-ti2v-5b/xdit) | 4.4.0 |
| Z-Image | Diffusers | [✅](models/multimodal/diffusion_model/z-image/diffusers) | 4.4.0 |
### NLP
#### PLM (Pre-trained Language Model)
| Model | Prec. | IGIE | ixRT | IXUCA SDK |
|------------------|-------|-------------------------------------------|-------------------------------------------|-----------|
| ALBERT | FP16 | | [✅](models/nlp/plm/albert/ixrt) | 4.3.0 |
| BERT Base NER | INT8 | [✅](models/nlp/plm/bert_base_ner/igie) | | 4.3.0 |
| BERT Base SQuAD | FP16 | [✅](models/nlp/plm/bert_base_squad/igie) | [✅](models/nlp/plm/bert_base_squad/ixrt) | 4.3.0 |
| | INT8 | | [✅](models/nlp/plm/bert_base_squad/ixrt) | 4.3.0 |
| BERT Large SQuAD | FP16 | [✅](models/nlp/plm/bert_large_squad/igie) | [✅](models/nlp/plm/bert_large_squad/ixrt) | 4.3.0 |
| | INT8 | [✅](models/nlp/plm/bert_large_squad/igie) | [✅](models/nlp/plm/bert_large_squad/ixrt) | 4.3.0 |
| DeBERTa | FP16 | | [✅](models/nlp/plm/deberta/ixrt) | 4.3.0 |
| RoBERTa | FP16 | | [✅](models/nlp/plm/roberta/ixrt) | 4.3.0 |
| RoFormer | FP16 | | [✅](models/nlp/plm/roformer/ixrt) | 4.3.0 |
| VideoBERT | FP16 | | [✅](models/nlp/plm/videobert/ixrt) | 4.2.0 |
### Audio
#### Speech Recognition
| Model | Prec. | IGIE | ixRT | IXUCA SDK |
|-----------------|-------|-----------------------------------------------------|-----------------------------------------------------------|-----------|
| Conformer | FP16 | [✅](models/audio/speech_recognition/conformer/igie) | [✅](models/audio/speech_recognition/conformer/ixrt) | 4.3.0 |
| DeepSpeech2 | FP16 | | [✅](models/speech/speech_recognition/deepspeech2/ixrt) | 4.4.0 |
| Transformer ASR | FP16 | | [✅](models/audio/speech_recognition/transformer_asr/ixrt) | 4.2.0 |
### Others
#### Recommendation Systems
| Model | Prec. | IGIE | ixRT | IXUCA SDK |
|-------------|-------|------|------------------------------------------------------|-----------|
| Wide & Deep | FP16 | | [✅](models/others/recommendation/wide_and_deep/ixrt) | 4.3.0 |
---
## Docker
| Docker Installer | IXUCA SDK | Introduction |
|-----------|--------|--------------|
| corex-docker-installer-4.3.0-*-py3.10-x86_64.run | 4.3.0 | for small model inference |
| corex-docker-installer-4.3.0-*-llm-py3.10-x86_64.run | 4.3.0 | for large model inference |
## Community
### Code of Conduct
Please refer to DeepSpark Code of Conduct on
[Gitee](https://gitee.com/deep-spark/deepspark/blob/master/CODE_OF_CONDUCT.md) or on
[GitHub](https://github.com/Deep-Spark/deepspark/blob/main/CODE_OF_CONDUCT.md).
### Contact
Please contact .
### Contribution
Please refer to the [DeepSparkInference Contributing Guidelines](CONTRIBUTING.md).
### Disclaimers
DeepSparkInference only provides download and preprocessing scripts for public datasets. These datasets do not belong to
DeepSparkInference, and DeepSparkInference is not responsible for their quality or maintenance. Please ensure that you
have the necessary usage licenses for these datasets. Models trained based on these datasets can only be used for
non-commercial research and education purposes.
To dataset owners:
If you do not want your dataset to be published on DeepSparkInference or wish to update the dataset that belongs to you
on DeepSparkInference, please submit an issue on Gitee or Github. We will delete or update it according to your issue.
We sincerely appreciate your support and contributions to our community.
## License
This project is released under [Apache-2.0](LICENSE) License.