Active filters: fp_quant
ISTA-DASLab/Qwen3-8B-FPQuant-RTN-MXFP4
Text Generation
• 5B • Updated
• 9
ISTA-DASLab/Llama-3.1-8B-Instruct-FPQuant-GPTQ-MXFP4
5B • Updated
• 1
ISTA-DASLab/Qwen3-8B-FPQuant-RTNv2-MXFP4
5B • Updated
• 3
ISTA-DASLab/Qwen3-8B-FPQuant-GPTQ-MXFP4
5B • Updated
• 4
ISTA-DASLab/Qwen3-14B-FPQuant-GPTQ-MXFP4
9B • Updated
• 7
ISTA-DASLab/Qwen3-32B-FPQuant-GPTQ-MXFP4
18B • Updated
• 2
ISTA-DASLab/Qwen3-0.6B-FPQuant-RTN-MXFP4
Text Generation
• 0.4B • Updated
• 3
• 1
ISTA-DASLab/Qwen3-0.6B-FPQuant-RTN-NVFP4
Text Generation
• 0.4B • Updated
• 2
ISTA-DASLab/Qwen3-4B-FPQuant-RTN-MXFP4
Text Generation
• 2B • Updated
• 1
ISTA-DASLab/Qwen3-4B-FPQuant-RTN-NVFP4
Text Generation
• 2B • Updated
• 4
ISTA-DASLab/Qwen3-1.7B-FPQuant-RTN-NVFP4
Text Generation
• 1B • Updated
• 2
ISTA-DASLab/Qwen3-1.7B-FPQuant-RTN-MXFP4
Text Generation
• 1B • Updated
• 2
ISTA-DASLab/Qwen3-8B-FPQuant-RTN-NVFP4
Text Generation
• 5B • Updated
• 2
ISTA-DASLab/Qwen3-1.7B-FPQuant-QAT-NVFP4-200steps
Text Generation
• 1B • Updated
• 9
ISTA-DASLab/Qwen3-1.7B-FPQuant-QAT-NVFP4-600steps
Text Generation
• 1B • Updated
• 2
ISTA-DASLab/Qwen3-8B-FPQuant-QAT-NVFP4-200steps
Text Generation
• 5B • Updated
• 2
ISTA-DASLab/Qwen3-8B-FPQuant-QAT-NVFP4-600steps
Text Generation
• 5B • Updated
• 2
ISTA-DASLab/Qwen3-8B-FPQuant-QAT-NVFP4-1400steps
Text Generation
• 5B • Updated
• 9
ISTA-DASLab/Qwen3-8B-FPQuant-QAT-NVFP4-1000steps
Text Generation
• 5B • Updated
• 3
ISTA-DASLab/Llama-3.1-8B-Instruct-MR-GPTQ-nvfp
Image-Text-to-Text
• 5B • Updated
• 63
ISTA-DASLab/Llama-3.1-8B-Instruct-MR-GPTQ-mxfp
Image-Text-to-Text
• 5B • Updated
• 12
ISTA-DASLab/Llama-3.1-8B-Instruct-FPQuant-QAT-NVFP4
5B • Updated
• 1
ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-NVFP4
0.8B • Updated
• 12
ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-MXFP4
0.8B • Updated
• 2
ISTA-DASLab/Llama-3.2-3B-Instruct-FPQuant-QAT-NVFP4
2B • Updated
• 37
ISTA-DASLab/Llama-3.2-3B-Instruct-FPQuant-QAT-MXFP4
2B • Updated
ISTA-DASLab/Llama-3.1-8B-Instruct-FPQuant-QAT-MXFP4
5B • Updated
ISTA-DASLab/Qwen3-0.6B-FPQuant-QAT-NVFP4
Text Generation
• 0.4B • Updated
• 3
ISTA-DASLab/Qwen3-1.7B-FPQuant-QAT-NVFP4
Text Generation
• 1B • Updated
• 12
ISTA-DASLab/Qwen3-4B-FPQuant-QAT-NVFP4
Text Generation
• 2B • Updated
• 6