-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Anirud/multi modal model support #169
base: dev
Are you sure you want to change the base?
Conversation
This comment was marked as outdated.
This comment was marked as outdated.
- to be minimized or maximized - opened via a dialog box
- also add background blur when image is open in max view - adds z index for the image dialog box
* copy run agent container + helper func * copied in updated docker views * copied in model utils to stream agent response * copy in model views * added agent view * copy in all frontend components * add search api key to docker compose yml * copy in updated model urls * added requirements for dockerfile * rename hf_model_id * remove commetned code intepretor tool * added fix so agent works with other llama models * fix requirements in dockerfile * add thread id to match stateful chat * add readme * add agent workflow diagram * Update README.md * Delete app/api/agent_control/Agent.png * Add files via upload * Delete app/api/agent_control/Agent.png * Add files via upload * Update README.md * Delete app/api/agent_control/Agent.png * Add files via upload * Delete app/api/agent_control/Agent.png * Add files via upload * Update README.md * fix link href (#180) * refactor(chat history component): improve file handling and add RAG support - Add RAG datasource integration with metadata display - Create reusable FileDisplay component for file management - Implement FileViewerDialog for improved file preview experience - Support both image and non-image file types with download option - Clean up file handling logic and separate from image-specific code - Add visual indicator for RAG-enabled messages * Show RAG pill based on the message's stored RAG context * feat(add support in chat component): - Use the RAG datasource from the message if available * move image display to its own component * include rag source name when selected * refactor(types): clean up and organize type definitions - Remove redundant and commented-out interfaces - Group related interfaces together (chat, inference, file, voice) - Add proper JSDoc comments for better documentation - Consolidate duplicate type definitions - Add explicit typing for RAG-related interfaces * add pdfjs-dist to test * feat: improve file display - show images in better aspect ratio * display file display for images ,code files and other file types * add icons for file display in chat thread * extend types * extend to add - File extensions mapping for code files and other file types * extend to allow for files to be passed as text * fix alignment * limit upload to a single image file * set focused state in input area * feat: add ability to process multiple code and or text file types and send to model * re add resizing input are * fix copy button logic * Anirud/update vllm setup steps (#189) * update readme to reflect new flow * fix readme issues * add Supported models tab: pointing to tt-inference-server readme * docs: Update main readme - add better quick start guide - add better notes for running in development mode * docs: re add Mock model steps * docs: fix links * docs: fix vllm * Update HowToRun_vLLM_Models.md * Update HowToRun_vLLM_Models.md
4c39c27
to
f8314c9
Compare
…/tenstorrent/tt-studio into anirud/multi-modal-model-support
Sometimes, if it can't fetch the URL, the previous input image will be referred to. This only happened for some URLs. For example: But this is the 2nd link: https://images.squarespace-cdn.com/content/v1/607f89e638219e13eee71b1e/1684821560422-SD5V37BAG28BURTLIXUQ/michael-sum-LEpfefQf4rU-unsplash.jpg?format=2500w |
ok great catch , I logged both as a issues , which I will fix |
Changelog
Known Issues / In progress