Qwen3.5 122B A10B
Alibaba Cloud
Cobble's flagship reasoning model offering frontier-class performance with Mixture-of-Experts efficiency, ideal for research, coding, and complex agent workflows.
Catalog of supported open-weight models across families including Meta (Llama ecosystem), Alibaba Cloud Qwen, Mistral AI, DeepSeek, NVIDIA Nemotron, IBM Granite, Nomic, and Cobble-built pipelines. Pricing assumes reclaimed GPU infrastructure, efficient vLLM serving, and open-weight licensing—benchmarked against typical marketplace rates.
Labels such as Flagship, Enterprise ready, Multilingual, and Best for RAG appear as tags on each card.
Featured
Cobble's flagship reasoning model offering frontier-class performance with Mixture-of-Experts efficiency, ideal for research, coding, and complex agent workflows.
$0.90 / 1M input tokens · $3.50 / 1M output tokens
Featured
Cobble's document extraction pipeline for archival and research-grade text processing.
$1.25 / 1K pages
Featured
State-of-the-art open embedding model with strong multilingual and long-context support.
$0.025 / 1M input tokens
Alibaba Cloud
Cobble's flagship reasoning model offering frontier-class performance with Mixture-of-Experts efficiency, ideal for research, coding, and complex agent workflows.
Alibaba Cloud
Balanced high-performance model with strong reasoning and code generation at lower latency and cost.
Alibaba Cloud
Sparse MoE architecture delivering high quality responses with excellent cost efficiency.
Google DeepMind
Large open model with excellent instruction following, multilingual capabilities, and coding performance.
Alibaba Cloud
Low-latency utility model ideal for chatbots, summarization, and lightweight automation.
NVIDIA
Efficient NVIDIA MoE model tuned for enterprise assistants and agentic applications.
Google DeepMind
Efficient sparse variant of Gemma optimized for strong quality with lower serving costs.
Cobble Labs
Cobble's document extraction pipeline for archival and research-grade text processing.
Zhipu AI
General-purpose OCR model with strong support for complex layouts and multilingual documents.
DeepSeek
High-accuracy OCR and document understanding model optimized for tables and technical PDFs.
NVIDIA
Enterprise-grade OCR for forms, scanned documents, and large ingestion pipelines.
Nomic AI
State-of-the-art open embedding model with strong multilingual and long-context support.
IBM
High-quality enterprise embedding model for semantic search and RAG applications.
IBM
Fast, lightweight embedding model for large-scale indexing and low-cost retrieval.
Alibaba Cloud
Compact multilingual embedding model offering strong performance and low cost.
Alibaba Cloud
Flagship embedding model with excellent retrieval accuracy across multilingual corpora.