v0.0.2
Pre-release
Pre-release
What's Changed
- Release Candidate v0.0.2: Qwen2.5 72B support by @tstescoTT in #100
Full Changelog:
- #75: Check if a token can access a model on HuggingFace
- Workaround for outdated apt-get keyring (#90)
- Provide a model from your local storage (#91)
- add CONTAINER_APP_UID Docker ARG for cloud image and dev image with develoment instructions (#94)
- remove git lfs from tt-metal build
- add CONTAINER_APP_UID Docker ARG for cloud image and dev image with develoment instructions
- tstesco/qwen25-support (#98)
- avoid jq dependency when checking HF_TOKEN has repo access
- adding Qwen2.5-72B-Instruct setup
- update example scripts for minimal text request with user input without dependencies
- adding qwen setup support
- increase output tokens for example usage
- add TODO note on tt-metal cache in new impl
- add prompt to example output
- fix prompt generation text without images
- adding default env vars for Qwen2.5 72B
- Tstesco/doc update qwen25 (#99)
- update docs
- add container id
- fix import for non qwen models
- update gchr ref
- update main model impl README.md
- remove tt-metal-llama3-70b, archive tt-metal-mistral-7b (#97)
- Consider links when using local model (#101)
Co-authored-by: Pavle Petrovic ppetrovic@tenstorrent.com