deepsparse
An inference runtime that delivers GPU-class performance on CPUs, with APIs for integrating ML into your application
- inference
- machine learning
- x86
- x86_64
- avx2
- avx512
- neural network
- sparse
- engine
- cpu
- runtime
- deepsparse
- computer vision
- object detection
- sparsity
- computer-vision
- cpus
- llm-inference
- machinelearning
- nlp
- object-detection
- onnx
- performance
- pretrained-models
- pruning
- quantization
- sparsification