Inferless
Popular repositories Loading
-
triton-co-pilot
triton-co-pilot PublicGenerate Glue Code in seconds to simplify your Nvidia Triton Inference Server Deployments
-
whisper-large-v3
whisper-large-v3 Public templateState‑of‑the‑art speech recognition model for English, delivering transcription accuracy across diverse audio scenarios. <metadata> gpu: T4 | collections: ["CTranslate2"] </metadata>
-
-
Facebook-bart-cnn
Facebook-bart-cnn PublicBART model pre-trained on English language, and fine-tuned on CNN Daily Mail. It was introduced in the paper BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Trans…
Repositories
- qwen2.5-vl-7b-instruct Public template
Vision-Language model that integrates advanced image, video, and text understanding. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
inferless/qwen2.5-vl-7b-instruct’s past year of commit activity - stable-diffusion-s3-image-save Public template
Uses Stable Diffusion to generate images and automatically uploads them to an S3 bucket. <metadata> gpu: A100 | collections: ["S3 Storage", "Complex Outputs"] </metadata>
inferless/stable-diffusion-s3-image-save’s past year of commit activity - vicuna-7b-1.1 Public template
Open-source chatbot fine-tuned from LLaMA on 70K ShareGPT conversations, optimized for research and conversational tasks. <metadata> gpu: A100 | collections: ["HF Transformers"] </metadata>
inferless/vicuna-7b-1.1’s past year of commit activity - tinyllama-1.1b-chat-v1.0 Public template
A chat model fine-tuned on TinyLlama, a compact 1.1B Llama model pretrained on 3 trillion tokens. <metadata> gpu: T4 | collections: ["vLLM"] </metadata>
inferless/tinyllama-1.1b-chat-v1.0’s past year of commit activity - qwen2.5-coder-32b-instruct Public template
A State-Of-The-Art coder LLM, tailored for instruction-based tasks, particularly in code generation, reasoning, and repair. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
inferless/qwen2.5-coder-32b-instruct’s past year of commit activity - pyannote-speaker-diarization-3.1 Public template
A state-of-the-art model that segments and labels audio recordings by accurately distinguishing different speakers. <metadata> gpu: T4 | collections: ["HF Transformers"] </metadata>
inferless/pyannote-speaker-diarization-3.1’s past year of commit activity - playground-v2.5 Public template
Generate highly aesthetic 1024x1024 images with superior quality, flexible aspect ratios, and outstanding human preference alignment. <metadata> gpu: T4 | collections: ["Diffusers"] </metadata>
inferless/playground-v2.5’s past year of commit activity - phi-3.5-moe-instruct Public template
An instruction-tuned variant of Phi-3.5, delivering efficient, context-aware responses across diverse language tasks. <metadata> gpu: A100 | collections: ["HF Transformers"] </metadata>
inferless/phi-3.5-moe-instruct’s past year of commit activity - openchat-3.5 Public template
A fine-tuned chat model with C-RLFT - a strategy inspired by offline reinforcement learning, optimized for natural, context-aware conversations, excelling in instruction following and text generation tasks. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
inferless/openchat-3.5’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…