Release v0.0.2 · tenstorrent/tt-inference-server

What's Changed

#75: Check if a token can access a model on HuggingFace
Workaround for outdated apt-get keyring (#90)
Provide a model from your local storage (#91)
add CONTAINER_APP_UID Docker ARG for cloud image and dev image with develoment instructions (#94)
remove git lfs from tt-metal build
add CONTAINER_APP_UID Docker ARG for cloud image and dev image with develoment instructions
tstesco/qwen25-support (#98)
avoid jq dependency when checking HF_TOKEN has repo access
adding Qwen2.5-72B-Instruct setup
update example scripts for minimal text request with user input without dependencies
adding qwen setup support
increase output tokens for example usage
add TODO note on tt-metal cache in new impl
add prompt to example output
fix prompt generation text without images
adding default env vars for Qwen2.5 72B
Tstesco/doc update qwen25 (#99)
update docs
add container id
fix import for non qwen models
update gchr ref
update main model impl README.md
remove tt-metal-llama3-70b, archive tt-metal-mistral-7b (#97)
Consider links when using local model (#101)

Co-authored-by: Pavle Petrovic ppetrovic@tenstorrent.com