AI Models

52 models · 0 new in 60d

Compare →
  • Nomic Embed Text v2-MoEOpen

    Nomic AI · 8K tokens · self-host

    Best for: Self-hosted RAG, privacy-first search, zero-cost embeddings

    How: Self-host for zero cost. Comparable quality to OpenAI embeddings.

    Example: Run alongside pgvector on the same server — full RAG pipeline with zero API costs.

    MoE embeddingmatryoshkaApache 2.0self-hostable
    Hardware to self-host
    VRAM: 2GB or CPU-only
    GPU: Any — runs on CPU at reasonable speed
    RAM: 4-8GB system RAM

    Tiny MoE embedding model. CPU inference is fast enough for most use cases.

    API: pip install nomic OR Ollama. Also hosted on Nomic Atlas.