Change the repository type filter
All
Repositories list
157 repositories
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
- A framework for few-shot evaluation of language models.
elk
PublicDeeperSpeed
PublicTransformerEngine
Publicsae_overlap
Publicmdl
Publicaria-utils
Publicconcept-erasure
Publiccookbook
Publicaria
Publicngrams-across-time
Publicwebsite
Publictraining-jacobian
Publicaria-amt
PublicnanoGPT-mup
Publicsemantic-memorization
Publicmonkfish
Publicfeatures-across-time
Publicscalable-elicitation
Publictokengrams
Publicoptax-galore
Public