    Repositories list

    • GPTQModel

      Public
      Production-ready LLM model compression/quantization toolkit with accelerated inference support for both CPU and GPU via HF, vLLM, and SGLang.
      Python
      Apache License 2.0
      4127297 Updated Feb 13, 2025
    • Tokenicer

      Public
      A (nicer) tokenizer you want to use for model inference and training, with all known preventable gotchas normalized or auto-fixed.
      Python
      Apache License 2.0
      1401 Updated Feb 13, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      28k000 Updated Feb 12, 2025
    • optimum

      Public
      🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy-to-use hardware optimization tools
      Python
      Apache License 2.0
      498000 Updated Feb 7, 2025
    • Self-contained Python library with zero dependencies that gives you unified device properties for GPU, CPU, and NPU. No more calling separate tools such as nvidia-smi or reading /proc/cpuinfo and parsing the output yourself.
      Python
      Apache License 2.0
      11012 Updated Jan 10, 2025