Skip to content

Releases: tenstorrent/tt-inference-server

v0.0.3

19 Feb 19:42
ba7a180
Compare
Choose a tag to compare
v0.0.3 Pre-release
Pre-release

What's Changed

  • Release Candidate v0.0.3: Improved setup.sh and documentation with models supported list by @tstescoTT in #103

Full Changelog:

  • #101 Consider links when using local model
  • #102 update documentation to show all supported models and give correct links, mark experimental preview models
  • setup.sh supports all base and Instruct models
  • #106 add host disk and RAM check in setup.sh addressing #76
  • #107 update setup documentation in vllm-tt-metal-llama3/README.md to describe other models and put setup and installation first to avoid confusion

Compare: v0.0.2...v0.0.3


Co-authored-by: Pavle Petrovic ppetrovic@tenstorrent.com

v0.0.2

07 Feb 01:48
a97b620
Compare
Choose a tag to compare
v0.0.2 Pre-release
Pre-release

What's Changed

Full Changelog:

  • #75: Check if a token can access a model on HuggingFace
  • Workaround for outdated apt-get keyring (#90)
  • Provide a model from your local storage (#91)
  • add CONTAINER_APP_UID Docker ARG for cloud image and dev image with develoment instructions (#94)
  • remove git lfs from tt-metal build
  • add CONTAINER_APP_UID Docker ARG for cloud image and dev image with develoment instructions
  • tstesco/qwen25-support (#98)
  • avoid jq dependency when checking HF_TOKEN has repo access
  • adding Qwen2.5-72B-Instruct setup
  • update example scripts for minimal text request with user input without dependencies
  • adding qwen setup support
  • increase output tokens for example usage
  • add TODO note on tt-metal cache in new impl
  • add prompt to example output
  • fix prompt generation text without images
  • adding default env vars for Qwen2.5 72B
  • Tstesco/doc update qwen25 (#99)
  • update docs
  • add container id
  • fix import for non qwen models
  • update gchr ref
  • update main model impl README.md
  • remove tt-metal-llama3-70b, archive tt-metal-mistral-7b (#97)
  • Consider links when using local model (#101)

Co-authored-by: Pavle Petrovic ppetrovic@tenstorrent.com

v0.0.1

05 Feb 22:46
6061606
Compare
Choose a tag to compare
v0.0.1 Pre-release
Pre-release

What's Changed

New Contributors

Full Changelog: https://github.com/tenstorrent/tt-inference-server/commits/v0.0.1