MoYoYoTech/llm_mutil_npu
Branch: main
Path: llm_mutil_npu/tests (131 kB)
2 contributors
History: 1 commit
Latest commit: 4b9fefd by xianglarry, about 1 month ago
Initial C++ aclnn EAGER inference for Qwen3-235B-A22B MoE on Ascend 910 × 16 NPU
Every file listed here was last changed by the commit "Initial C++ aclnn EAGER inference for Qwen3-235B-A22B MoE on Ascend 910 × 16 NPU", about 1 month ago.

| File | Scan status | Size |
| --- | --- | --- |
| hello_acl.cpp | Safe | 2.23 kB |
| test_attention_decode.cpp | Safe | 16.5 kB |
| test_attention_layer.cpp | Safe | 9.97 kB |
| test_batch_correctness.cpp | Safe | 4.44 kB |
| test_batch_decode.cpp | Safe | 3.49 kB |
| test_chat_flow.sh | Safe | 2.71 kB |
| test_engine_smoke.cpp | Safe | 283 Bytes |
| test_layer_forward.cpp | Safe | 8.7 kB |
| test_linear_hf.cpp | Safe | 2.92 kB |
| test_model_config.cpp | Safe | 5.46 kB |
| test_moe_layer.cpp | Safe | 34.8 kB |
| test_op_support.cpp | Safe | 8.9 kB |
| test_rms_norm.cpp | Safe | 3.44 kB |
| test_rope.cpp | Safe | 4.94 kB |
| test_rope_fused.cpp | Safe | 6.64 kB |
| test_rope_manual.cpp | Safe | 3.1 kB |
| test_runner.cpp | Safe | 2.88 kB |
| test_safetensors.cpp | Safe | 4.05 kB |
| test_tokenizer.cpp | Safe | 2.08 kB |
| test_weight_load.cpp | Safe | 3.8 kB |