add ZeroGPU GPU inference (FP16, flash-attn, batch=32@1024/16@2048) 0b6961f Nekochu commited on 19 days ago