Chroma’s cover photo
Chroma

Chroma

Technology, Information and Internet

San Francisco, CA 9,186 followers

Data infrastructure for AI

About us

Chroma builds data infrastructure for AI

Website
https://www.trychroma.com/
Industry
Technology, Information and Internet
Company size
11-50 employees
Headquarters
San Francisco, CA
Type
Privately Held

Locations

Employees at Chroma

Updates

  • Chroma Cloud is now available via Stripe Projects! Stripe Projects allows your agent to provision infrastructure on your behalf. Projects automatically creates accounts, manages billing and provisions API keys, cutting out the frustration of going into the browser to sign up for services. Get started now! stripe projects add chroma/database

  • View organization page for Chroma

    9,186 followers

    Chroma offers a variety of lexical search strategies. Full Text Search (FTS), BM25 and SPLADE are the core offerings. Each have their strengths and weaknesses and use cases. Learn more 👇

  • View organization page for Chroma

    9,186 followers

    We're releasing Chroma Context-1, a 20B parameter open-source search agent that pushes the pareto frontier of agentic search, an order of magnitude faster and cheaper than frontier alternatives. Accurate search is rarely a single step. The output of one search informs the next. Frontier LLMs can do this through agentic search, but long trajectories become cost and latency prohibitive. Context-1 solves this by separating search from generation. Three ideas that made it work: 1. Staged training: recall first, then precision. Context-1 is trained with SFT + RL on 8,000+ synthetic multi-hop tasks. The curriculum first optimizes for broad recall, then progressively trains the agent to narrow down to the most relevant documents. The result is a model that retrieves thoroughly and selects carefully. 2. Self-editing context. As the agent searches, its context window fills with documents, many of which may be irrelevant. Context-1 is trained to selectively prune its own context mid-search, freeing space for further exploration and reducing context rot. This lets a 20B model with a 32k token budget outperform frontier models with much larger context windows. 3. Scalable synthetic task generation. We built an extraction-based verification pipeline with an LLM judge that achieves high human alignment, minimizing the need for manual annotation. Tasks span 4 domains: web, SEC filings, patent law, and email, each requiring the agent to chain clues across documents. Context-1 matches or exceeds frontier models on BrowseComp-Plus, SealQA, FRAMES, HotpotQA, and HLE. We're open-sourcing the model weights, the harness and the full task generation codebase. Apache 2.0. Full report in comments.

  • View organization page for Chroma

    9,186 followers

    Chroma Cloud is launching with Stripe Projects. The high friction points of building with AI isn't the AI, it's everything around it. Adding services means managing account credentials, API keys, user permissions, a new dashboard to navigate and more. Stripe Projects removes this friction: stripe projects add chroma/database The Stripe Projects CLI will provision a Chroma Cloud account, generate and store credentials and manage billing, no browser involved.

  • View organization page for Chroma

    9,186 followers

    Gemini Embedding 2 is available in Chroma clients today!

    View organization page for Google for Developers

    4,039,481 followers

    Announcing Gemini Embedding 2 ✨ the first fully multimodal embedding model built on the Gemini architecture. Now available in preview via the Gemini API and Vertex AI. The new model provides semantic understanding across 100+ languages — and support for modalities across text, images, video, audio and documents (PDFs) in a shared vector space. Start building today. Read the blog to learn more: https://goo.gle/4los3aI

Similar pages

Browse jobs

Funding

Chroma 2 total rounds

Last Round

Seed

US$ 18.0M

See more info on crunchbase