Models: H-dh
Collection
Attention-only transformers, sweep over number of heads (variable head dimension) • 7 items