NVIDIA Jetson Orin Nano Collection Ultra-efficient model variants optimized for Jetson Orin Nano. Designed for constrained edge environments requiring low memory footprint. ⢠4 items ⢠Updated 3 days ago ⢠2
NVIDIA Jetson AGX Thor Collection Models validated and performance-optimized for NVIDIA Jetson AGX Thor. Tailored for high-performance edge AI workloads. ⢠5 items ⢠Updated 3 days ago ⢠1
NVIDIA Jetson AGX Orin Collection Models optimized and bench-marked for NVIDIA Jetson AGX Orin. Memory-efficient and latency-optimized variants designed for real-time edge inference. ⢠4 items ⢠Updated 3 days ago ⢠2
Cosmos-Reason2 Collection nvidia/Cosmos-Reason2 multi-modal reasoning models optimized by Embedl. ⢠9 items ⢠Updated 3 days ago ⢠4
embedl/Cosmos-Reason2-2B-W4A16-Edge2 Image-Text-to-Text ⢠2B ⢠Updated 1 day ago ⢠17.4k ⢠11
Cosmos-Reason2 Collection nvidia/Cosmos-Reason2 multi-modal reasoning models optimized by Embedl. ⢠9 items ⢠Updated 3 days ago ⢠4
EdgeN Collection Quantization strategy where most weights are converted to INT4, activations remain in FP16, and sensitive layers are preserved in FP16. ⢠5 items ⢠Updated 6 days ago ⢠1
FlashHead Collection Efficient Drop-In Replacement for the Classification Head in Language Model Inference. ⢠19 items ⢠Updated 6 days ago ⢠1