Model List
For models that have a corresponding blog post, we've linked it in the model's title.
Multimodal
Claude-Sonnet-4: Anthropic's mid-size model with superior intelligence for high-volume uses in coding, in-depth research, agents, & more
Gemini-2.5-Flash-Preview-05-20: Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks.
Gemini-2.5-Pro-Preview-06-05: Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks.
Gemini-2.5-Pro-Preview-05-06: Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks
Gemini-2.0-Flash: Gemini 2.0 Flash delivers next-gen features and improved capabilities, including superior speed, built-in tool use, multimodal generation, and a 1M token context window
gpt-4o-mini: GPT-4o mini is OpenAI's newest model after GPT-4 Omni, supporting both text and image inputs with text outputs.
Text Generation
DeepSeek-R1-0528 (free): The latest state-of-the-art LLM released by Deepseek excels in reasoning, math, and coding. Community-shared access, daily limits, great for testing and exploration
DeepSeek-V3-0324 (free): The most powerful AI-driven LLM with 685B parameters released by Deepseek. Community-shared access, daily limits, great for testing and exploration
DeepSeek-V3-0324: DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team..
DeepSeek-R1 (free): A state-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding. Community-shared access, daily limits, great for testing and exploration
DeepSeek-R1-0528: The latest state-of-the-art LLM released by Deepseek excels in reasoning, math, and coding. Community-shared access, daily limits, great for testing and exploration
DeepSeek-R1: A state-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding. Community-shared access, daily limits, great for testing and exploration
Llama3.3-70B: Advanced conversational AI with extensive knowledge.
Qwen-QwQ-32B: The AI model from the Qwen series, designed for reasoning and problem-solving.
Image Generation
Text-to-Image
StableDiffusion XL 1.0: enerates high-quality images from text prompts with detailed control.
Flux.1 schnell: Fast text-to-image generation with efficient processing and good quality results.
Bytedance-Seedream-3.0: A top-tier bilingual text-to-image model rivaling GPT-4. Native 2K resolution, fast generation, accurate text, artistic layouts, and stunning detail
FLUX.1 [Fill-dev]: A 12 billion parameter inpainting model for editing and extending images
Qwen2.5-VL-7B-Instruct: An advanced vision-lanauge model designed to understand and process both visual and textual inputs.
Embedding Generation
UAE-Large-V1: Good for general-purpose text embeddings with high accuracy.
BGE Large EN v1.5: Optimized for English text embeddings with enhanced performance.
M2-BERT-Retrieval-32k: An 80M checkpoint of M2-BERT, pretrained with sequence length 32768, and it has been fine-tuned for long-context retrieval.
Vision Models
Qwen2.5-VL-7B-Instruct: An advanced vision-lanauge model designed to understand and process both visual and textual inputs.
Video Models
Coming soon
Last updated