Skip to content

Kithara v0.0.5 Release

Latest
Compare
Choose a tag to compare
@wenxindongwork wenxindongwork released this 06 Feb 00:56
· 15 commits to main since this release
eb6d2a7

Features:

  • Support MaxText model
  • Support KerasHub Gemma2 model
  • Implement SFT in Kithara
  • Support data loading with Ray data
  • Support single host SFT workload on TPU and GPU
  • Support GCE VM & QR multihost workload on TPU and GPU via Ray
  • HF <-> MaxText converter for Gemma2 models
  • Support Orbax checkpointing
  • Support Xprof and Tensorboard
  • Support saving checkpoints to cloud storage
  • Support FSDP for Gemma2