Repositories list
1.2k repositories
- Advanced Quantization Algorithm for LLMs/VLMs.
- SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
- Collection of Intel device plugins for Kubernetes
- Documentation