Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[$$ BOUNTY] Add a Tensor parallel JAX Qwen2.5-7B Model to TT-xla Model Demos #52

Open
staylorTT opened this issue Feb 28, 2025 · 0 comments

Comments

@staylorTT
Copy link
Contributor

staylorTT commented Feb 28, 2025

[$$ BOUNTY] Add a JAX Qwen2.5-7B Model to TT-xla Model Demos

Background:
TT-Forge is a growing collection of model demos showcasing the capabilities of AI models running on Tenstorrent hardware. This bounty aims to bring up the tensor parallelized (not data parallelized) implementation of the Qwen2.5-7B model using the JAX framework. This is an opportunity for AI developers to contribute to cutting-edge research and earn rewards.

Requirements:

  • Tensor Parallel Implementation in JAX: Ensure that the model is implemented with tensor parallelism, not data parallelism.
    • Must provide the model implementation, which can (and is preferred to) be an extension of an open source single-device model implementation.
    • Some examples on an open source JAX models implementation can be found here.
  • Comprehensive Documentation: Include a detailed README.md explaining the model architecture, setup, and usage.
  • Compatibility and Optimization: Test and optimize for multi-device environments.
  • Sample Inputs/Outputs: Provide examples demonstrating the model’s functionality.
  • Dependency Management: Clearly document installation procedures and dependencies.
  • 1 or more of these mesh shapes should be targeted:
    • 2x4
    • 1x8
    • 1x32
    • 8x4

Contribution Guidelines:

  • Fork the TT-xla repository.
  • Create a directory in the tt-xla/tests/jax/models folder with the naming convention qwen2_5 and add the model implementation there.
  • Follow the repository’s coding standards and guidelines as stated in CONTRIBUTING.md.
  • Submit a pull request with a detailed description and relevant information to help reviewers evaluate your contribution.

Evaluation Criteria:

  • Tenstorrent Hardware not required for this bounty.
    • Please use the JAX multidevice simulation guide here.
  • Implementation Quality: Code structure, readability, and adherence to best practices.
  • Completeness: We will need to verify the model works.
    • Tensor parallel implementation should produce the same GSM8K score as the single-device implementation of the same model.
  • Documentation: Clarity and completeness of the accompanying documentation.
    • Code must contain descriptive comments that explains the rationale for design decisions made.

Reward:
$1000 cash bounty for the best contribution.

Connect on Discord:
Join our Discord community to discuss AI, share progress, and get support from the Tenstorrent team.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant