-
Notifications
You must be signed in to change notification settings - Fork 24
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* feat(modeling): import Mistral support Imported from transformers sha1: a2ede6667 (current main branch). This allows to use recent static cache support. The only changes are: - fixed the import paths, - added a workaround to avoid having to import SlidingWindowCache or having to modify the file too much. * feat(mistral): add inference sharding on Linear modules * feat(examples): generalize text generation example to other models This will allow using the same example for other models, such as mistralai/Mistral-7B-v0.3 * feat(inference): use Linear when world_size is 1 There is no point in using code to sync multiple TPUs when using only one. * refactor(tests): try to reduce repetition for decode tests * test(tgi): added test for Mistral-7B-v0.3 * feat(tests): delete generator to prevent getting stuck when failing * chore(doc): updated mention to Mistral * feat(model): filter only for model safetensors This will prevent downloading consolidated weights uselessly, as for the Mistral repo.
- Loading branch information
1 parent
a660f80
commit 3900bd7
Showing
12 changed files
with
1,793 additions
and
330 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.