cria
cria copied to clipboard
Tiny inference-only implementation of LLaMA
Results
1
cria issues
Sort by
recently updated
recently updated
newest added
Works much faster using a GPU.