Complete AI Models Inventory

Last Updated: January 18, 2026 Version: 3.0.0 Total Models: 30+ across 5 providers


📊 AI Models by Provider

1. Anthropic (Claude)

Claude Sonnet 4.5

Claude Haiku 4.5


2. OpenAI

GPT-4o

GPT-5

text-embedding-3-small (retired from production 2026-04)

Voyage AI voyage-3.5 (production text embedder, updated 2026-04)


3. HuggingFace Endpoint

Qwen3-VL 32B Vision


4. Google

SLIG (SigLIP2 via HuggingFace Cloud Endpoint) — Primary visual embedder, updated 2026-04

5 Embedding Types Generated (all 768D → VECS):

  1. Visualimage_slig_embeddings (producer key visual_768)
  2. Colorimage_color_embeddings (producer key color_slig_768)
  3. Textureimage_texture_embeddings (producer key texture_slig_768)
  4. Styleimage_style_embeddings (producer key style_slig_768)
  5. Materialimage_material_embeddings (producer key material_slig_768)

Plus an Understanding Embedding (1024D Voyage AI from Qwen3-VL vision_analysis JSON) → image_understanding_embeddings for spec-based semantic search.


5. Replicate (14 Models for Interior Design)

Text-to-Image Models (7 models)

  1. FLUX.1-dev

    • Provider: Replicate
    • Model: black-forest-labs/flux-dev
    • Cost: $0.025 per generation
    • Status: ✅ Working
  2. FLUX.1-schnell

    • Provider: Replicate
    • Cost: $0.015 per generation
    • Status: ✅ Working
  3. SDXL (Stable Diffusion XL)

    • Provider: Replicate
    • Cost: $0.020 per generation
    • Status: ✅ Working
  4. Playground v2.5

    • Provider: Replicate
    • Model: playgroundai/playground-v2.5-1024px-aesthetic
    • Cost: $0.010 per generation
    • Status: ✅ Working
  5. Stable Diffusion 3

    • Provider: Replicate
    • Model: stability-ai/stable-diffusion-3
    • Cost: $0.055 per generation
    • Status: ✅ Working
  6. Kandinsky 2.2

    • Provider: Replicate
    • Cost: $0.015 per generation
    • Status: ✅ Working
  7. Proteus v0.2

    • Provider: Replicate
    • Cost: $0.018 per generation
    • Status: ✅ Working

Image-to-Image Models (7 models)

Production-Ready (3 models):

  1. ComfyUI Interior Remodel

    • Provider: Replicate
    • Model: jschoormans/comfyui-interior-remodel
    • Cost: $0.020 per generation
    • Status: ✅ Working
  2. Interiorly Gen1 Dev

    • Provider: Replicate
    • Model: julian-at/interiorly-gen1-dev
    • Cost: $0.015 per generation
    • Status: ✅ Working
  3. Designer Architecture

    • Provider: Replicate
    • Model: davisbrown/designer-architecture
    • Cost: $0.018 per generation
    • Status: ✅ Working

Experimental (4 models):

  1. Interior AI - Status: ⚠️ Experimental
  2. Interior V2 - Status: ⚠️ Experimental
  3. Adirik Interior Design - Status: ⚠️ Experimental
  4. Interior Design SDXL - Status: ⚠️ Experimental

📈 Model Usage by Feature

PDF Processing Pipeline

Web Scraping Integration

XML Import

Interior Design Generation

Saved Searches Deduplication

Price Monitoring


💰 Cost Optimization Strategy

High-Volume Operations (Use Cheaper Models)

High-Accuracy Operations (Use Premium Models)

Parallel Processing


🎯 Model Selection Guidelines

When to Use Claude Sonnet 4.5

When to Use Claude Haiku 4.5

When to Use GPT-4o/GPT-5

When to Use Qwen3-VL

When to Use SigLIP CLIP


📊 Performance Benchmarks

Model Use Case Speed Accuracy Cost/Operation
Claude Sonnet 4.5 Product Discovery 3-5s 95%+ $0.05-0.15
Claude Haiku 4.5 Classification 0.5-1s 90%+ $0.01-0.03
GPT-4o Discovery 2-4s 93%+ $0.04-0.12
Qwen3-VL Image Analysis 2-3s 90%+ $0.02-0.05
SigLIP CLIP Embeddings 0.1-0.3s 95%+ $0.00
FLUX Dev Interior Design 5-13s 92%+ $0.025
ComfyUI Room Transform 8-15s 88%+ $0.020

🔄 Model Fallback Strategy

Primary → Secondary → Tertiary

Product Discovery:

  1. Claude Sonnet 4.5 (primary)
  2. GPT-4o (secondary)
  3. Claude Haiku 4.5 (tertiary, lower accuracy)

Image Analysis:

  1. Qwen3-VL 17B (primary)
  2. Claude Vision (secondary, more expensive)
  3. GPT-4 Vision (tertiary)

Visual Embeddings:

  1. SigLIP ViT-SO400M (primary)
  2. CLIP ViT-B/32 (secondary)
  3. Skip if both fail (graceful degradation)

🆕 Recently Added Models

January 2026:

December 2025:


📚 Related Documentation


Total Investment: 30+ AI models across 5 providers Total Cost Range: $0.00 - $0.055 per operation (varies by model and task) Success Rate: 95%+ across all models Uptime: 99.5%+ (production environment)