ctranslate2
Fast inference engine for Transformer models
- opennmt
- nmt
- neural
- machine
- translation
- cuda
- mkl
- inference
- quantization
- avx
- avx2
- cpp
- deep-learning
- deep-neural-networks
- gemm
- intrinsics
- machine-translation
- neon
- neural-machine-translation
- onednn
- openmp
- parallel-computing
- thrust
- transformer-models