nm-testing/Meta-Llama-3-8B-Instruct-W8A8-FP8-Channelwise-compressed-tensors Text Generation • 8B • Updated Oct 9, 2024 • 7 • 2
nm-testing/Meta-Llama-3-8B-Instruct-FBGEMM-nonuniform Text Generation • 8B • Updated Jul 20, 2024 • 7
nm-testing/Meta-Llama-3-8B-FP8-compressed-tensors-test Text Generation • 8B • Updated Oct 9, 2024 • 16.3k
nm-testing/Meta-Llama-3-8B-Instruct-W8-Channel-A8-Dynamic-Asym-Per-Token-Test 8B • Updated Oct 9, 2024 • 6.91k • 1
nm-testing/Meta-Llama-3-8B-Instruct-W8-Channel-A8-Dynamic-Per-Token-Test Text Generation • 8B • Updated Oct 9, 2024 • 14
nm-testing/Meta-Llama-3-8B-Instruct-nonuniform-test Text Generation • 8B • Updated Oct 9, 2024 • 24.8k
nm-testing/Meta-Llama-3-70B-Instruct-FBGEMM-nonuniform Text Generation • 71B • Updated Jul 20, 2024 • 814 • 1
nm-testing/SparseLlama-3.1-8B-gsm8k-pruned.2of4-chnl_wts_per_tok_dyn_act_fp8-BitM 5B • Updated Dec 17, 2024 • 3
nm-testing/tinyllama-oneshot-w8w8-test-static-shape-change Text Generation • 1B • Updated Oct 9, 2024 • 64k
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-FP8-dynamic Image-Text-to-Text • 24B • Updated Oct 29, 2025 • 4.98k • 9
nm-testing/tinyllama-oneshot-w8a8-channel-dynamic-token-v2 Text Generation • 1B • Updated Oct 9, 2024 • 21.5k
nm-testing/tinyllama-oneshot-w8a8-dynamic-token-v2 Text Generation • 1B • Updated Oct 9, 2024 • 16.4k
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-pruned.2of4-chnl_wts_per_tok_dyn_act_int8-BitM 0.7B • Updated Dec 17, 2024 • 12
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-pruned.2of4-chnl_wts_tensor_act_int8-BitM 0.7B • Updated Dec 17, 2024 • 9
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-pruned.2of4-tensor_wts_per_tok_dyn_act_int8-BitM 0.7B • Updated Dec 17, 2024 • 11
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-pruned.2of4-tensor_wts_tensor_act_int8-BitM 0.7B • Updated Dec 17, 2024 • 9
nm-testing/TinyLlama-1.1B-Chat-v1.0-INT8-Dynamic-IA-Per-Channel-Weight-testing 1B • Updated Dec 8, 2024 • 9
nm-testing/TinyLlama-1.1B-Chat-v1.0-INT8-Dynamic-IA-Per-Tensor-Weight-testing 1B • Updated Dec 8, 2024 • 8
nm-testing/tinyllama-oneshot-w4a16-channel-v2 Text Generation • 0.3B • Updated Oct 9, 2024 • 20.2k • 1