Material Kai Vision Platform - Complete Overview

AI-Powered Material Intelligence System for Enterprise Catalogs

Production-grade platform serving 5,000+ users with 99.5%+ uptime. Transforms material catalogs from multiple sources (PDF, Web, XML) into searchable, intelligent knowledge using 20+ AI models across 8 providers.


🎯 Executive Summary

Material Kai Vision Platform is an enterprise AI system that automatically extracts, analyzes, and organizes material information from multiple sources: PDF catalogs, manufacturer websites, and XML feeds. Using advanced computer vision, natural language processing, semantic search, spatial analysis, and interior design generation, it enables comprehensive material discovery and application.

Key Metrics:

New Capabilities (2026):


Platform Architecture

Technology Stack

Frontend:

Backend:

Database:

AI Services:

System Flow

User uploads PDF → Frontend (React) ↓ Supabase Edge Function (mivaa-gateway) ↓ MIVAA API (FastAPI) → Creates background job ↓ 14-Stage Processing Pipeline: 0A. Product Discovery (Claude/GPT-4o) - Products + Metadata extraction 0B. Document Entity Discovery (Optional) - Certificates, Logos, Specs

  1. Focused Extraction (product pages only)
  2. Text Extraction (PyMuPDF4LLM)
  3. Semantic Chunking (Anthropic)
  4. Text Embeddings (Voyage AI voyage-3.5, 1024D)
  5. Image Extraction
  6. Image Analysis (Qwen3-VL 32B → understanding embeddings via Voyage AI) 7-10. Multi-Vector SigLIP2 Embeddings (768D halfvec: visual, color, texture, style, material)
  7. Product Creation & Entity Linking
  8. Entity Relationship Mapping
  9. Quality Enhancement (async)
  10. Cleanup & Completion ↓ Data stored in Supabase → Available for search ↓ Real-time updates → Frontend displays results

AI Models & Intelligence

8 AI Models Across 4 Providers

1. Anthropic Claude Models

Claude Sonnet 4.5 (Premium Tier):

Claude Haiku 4.5 (Mid Tier):

2. OpenAI Models

GPT-4o:

text-embedding-3-small (retired 2026-04):

3. HuggingFace Endpoint - Qwen3-VL 32B Vision

4. SLIG (SigLIP2 via HuggingFace Cloud) — updated 2026-04

5. Replicate Models

Stable Diffusion XL: 3D texture generation, material visualization FLUX-Schnell: Fast image generation, material previews

Multi-Vector Embeddings (7 Types)

The platform generates 7 types of embeddings stored as halfvec (float16, 50% storage savings):

  1. Text Embeddings (1024D) - Voyage AI voyage-3.5 (primary)
  2. Visual Embeddings (768D) - SigLIP2 via HuggingFace Endpoint
  3. Understanding Embeddings (1024D) - Voyage AI from Qwen3-VL structured analysis (enables spec-based search)
  4. Color Embeddings (768D) - SigLIP2 color-guided
  5. Texture Embeddings (768D) - SigLIP2 texture-guided
  6. Style Embeddings (768D) - SigLIP2 style-guided
  7. Material Embeddings (768D) - SigLIP2 material-guided

Dynamic Weight Profiles: 7 profiles (product_name, color_finish, specification, texture_pattern, style_aesthetic, material_search, balanced) automatically selected per query.


PDF Processing Pipeline (14 Stages)

Stage-by-Stage Breakdown

Stage 1: PDF Upload & Validation

Stage 2: Background Job Creation

Stage 3: PDF Analysis

Stage 4: Product Discovery (AI)

Stage 5: Text Extraction (Focused)

Stage 6: Semantic Chunking (AI)

Stage 7: Text Embedding Generation (AI)

Stage 8: Image Extraction & Upload

Stage 9: Image Analysis (AI)

Stage 10: CLIP Embedding Generation (AI)

Stage 11: Product Creation (Two-Stage AI)

Stage 12: Metafield Extraction

Stage 13: Deferred AI Analysis (Async Background Job)

Stage 14: Cleanup & Completion

Processing Performance

PDF Size Pages Products Time Accuracy
Small 1-20 1-5 1-2 min 95%+
Medium 21-50 6-15 2-4 min 95%+
Large 51-100 16-30 4-8 min 95%+
Extra Large 100+ 30+ 8-15 min 95%+

Benchmark: Harmony PDF extracts 14+ distinct products with complete metadata (product names, dimensions, designers, page ranges, variants, image types).

Checkpoint Recovery System

The pipeline includes 9 checkpoints for recovery on failure:

  1. PDF_EXTRACTED
  2. CHUNKS_CREATED
  3. TEXT_EMBEDDINGS_GENERATED
  4. IMAGES_EXTRACTED
  5. IMAGE_EMBEDDINGS_GENERATED
  6. PRODUCTS_CREATED
  7. METAFIELDS_EXTRACTED
  8. DEFERRED_ANALYSIS_QUEUED
  9. COMPLETED

On job restart, the system resumes from the last completed checkpoint, avoiding redundant processing.


Search & Discovery

Multi-Vector Search System

The platform uses 6 embedding types for comprehensive search:

Semantic Search (Text):

Visual Search (Images):

Hybrid Search (Combined):

Specialized Search:

Search Performance


Database Architecture

Core Tables

workspaces: Multi-tenant workspace management documents: PDF documents and metadata document_chunks: Semantic text chunks with 1024D Voyage embeddings (updated 2026-04) document_images: Image metadata + boolean presence flags (has_slig_embedding, has_understanding_embedding, has_color_slig, has_texture_slig, has_style_slig, has_material_slig). All image vectors live in VECS collections (updated 2026-04 — legacy 512D CLIP columns were dropped). products: Product records from PDFs background_jobs: Async job tracking with checkpoint recovery material_metadata_fields: Dynamic metafield definitions metafield_values: Metafield data for chunks/products/images

Storage Buckets

pdf-documents: Original PDF files (50MB max) pdf-tiles: Extracted images (10MB max) material-images: Material photos (10MB max) 3d-models: Generated 3D models (100MB max)

Security

Row-Level Security (RLS): All tables protected Workspace Isolation: Users only access their workspace data JWT Authentication: Supabase Auth with automatic token refresh Encryption: At rest and in transit


Frontend Features

User-Facing Features

Dashboard: Metrics, feature grid, quick actions PDF Processing: Drag-and-drop upload with real-time progress Materials Catalog: Searchable, filterable product catalog Search Hub: AI-powered semantic search Material Recognition: Upload images for material identification 3D Generation: AI-powered material visualization Mood Boards: Create and share material collections Quotes System: Complete quote management with timeline tracking

Admin Features

Knowledge Base Management: View/edit chunks, images, products PDF Processing Monitor: Real-time job tracking with 9 checkpoint stages ✨ ENHANCED Analytics Dashboard: Comprehensive analytics (search, API, agent chat, quality) ✨ ENHANCED AI Monitoring Dashboard: Model usage, cost tracking, performance metrics Quality Dashboard: Chunk quality and embedding stability System Performance: Response times, error rates, uptime User Management: Workspace members and permissions Async Job Queue Monitor: Real-time background job status with auto-refresh ✨ ENHANCED Agent Configurations: Manage AI agent system prompts and behavior AI Configs: Unified AI prompt management (agents, extraction, templates, search) ✨ NEW Quote Management: View all quote requests with status filtering Status Tags Management: Create/edit custom status tags with colors Upsells Management: Manage upsell items with pricing Timeline Steps Management: Configure project timeline steps

Monitoring Features ✨ NEW:


API Ecosystem

170+ API Endpoints

Python REST API Categories (18+ total):

  1. RAG & Document Processing (27 endpoints — metadata management, PDF extraction consolidated)
  2. Search APIs (6 endpoints — semantic, vector, hybrid, visual, material, multi-vector)
  3. Admin Routes (18 endpoints — job management, system monitoring, metadata management)
  4. Document Entities (5 endpoints — certificates, logos, specifications)
  5. Products API (3 endpoints — product management)
  6. Images API (6 endpoints — image analysis, processing, re-classification)
  7. Embeddings APIs (3 endpoints — embedding generation)
  8. AI Services (10 endpoints — AI model integration)
  9. Background Jobs (7 endpoints — async job tracking)
  10. Anthropic APIs (3 endpoints — Claude integration)
  11. HuggingFace Endpoint APIs (3 endpoints — Qwen integration)
  12. Monitoring Routes (3 endpoints — health checks, metrics)
  13. AI Metrics Routes (2 endpoints — AI performance tracking)
  14. Duplicate Detection (7 endpoints — factory-based duplicate detection + merging)
  15. Data Import (4 endpoints — XML, web scraping, batch processing)
  16. Job Health (3 endpoints — stuck job detection, recovery)
  17. Segmentation (2 endpoints — SAM 2 mask generation, inpainting)
  18. User Feedback (3 endpoints — feedback submission + sentiment analysis)

Supabase Edge Functions (60+ total):

Documentation:


Production Metrics

Performance

Accuracy

Scalability


Quote System

Complete Quote Management Platform

Customer Features:

Admin Features:

System Components:

Workflow:

  1. Customer creates quote and adds materials
  2. Customer submits quote request
  3. Admin assigns status tag and attaches upsells
  4. Customer accepts/rejects each upsell
  5. Customer accepts quote (validates all upsells decided)
  6. System auto-initializes project timeline
  7. Admin updates timeline progress with notes
  8. Customer tracks project completion

Last Updated: March 2026 Version: 3.5.0 Status: Production Users: 5,000+ Uptime: 99.5%+

Recent Enhancements: