Image Search & Recognition
Overview
The Material KAI Vision Platform uses advanced AI to understand and search through material images. When you upload a PDF catalog or search with an image, the system automatically analyzes every image to help you find exactly what you're looking for.
What It Does
Intelligent Image Understanding
Every image in your catalogs is analyzed using state-of-the-art AI to understand:
- Visual Appearance - Overall look and style of the material
- Colors - Dominant colors and color palettes
- Textures - Surface patterns and textures
- Material Type - What kind of material it is (fabric, tile, wood, etc.)
- Application - Where and how the material is typically used
This multi-dimensional understanding allows the platform to find materials that match your needs in different ways.
Smart Image Search
Upload any image to find similar materials in your catalog:
- Visual Similarity - Find materials that look similar overall
- Color Matching - Find materials with similar color schemes
- Texture Matching - Find materials with similar surface patterns
- Material Type - Find materials of the same type
- Application - Find materials suitable for similar uses
Automatic Product Linking
The system automatically connects images to the products they represent:
- Product Images - Images showing the actual product
- Detail Shots - Close-up images of product features
- Application Examples - Images showing the product in use
- Variants - Images showing different colors or patterns
This ensures that when you search or browse, you see all relevant images for each product.
How It Works
AI-Powered Analysis
The platform uses two advanced AI systems working together:
Primary Analysis - Fast, accurate material detection using Qwen3-VL Vision AI
- Identifies material types, colors, and textures
- Provides quality scores for each analysis
- Processes images quickly and efficiently
Quality Validation - Claude AI validates uncertain results
- Reviews images that need additional analysis
- Provides enhanced descriptions and corrections
- Ensures high-quality results
Multi-Vector Search Architecture
The platform uses a sophisticated 7-embedding fusion system that combines multiple AI models in parallel for maximum search accuracy:
Embedding Types & Default Weights (Balanced Profile):
- Text Embedding (15%) - Semantic understanding from product names, descriptions, and metadata
- Visual Embedding (15%) - General visual similarity using SigLIP 768D embeddings
- Understanding Embedding (20%) - Spec-based search from Qwen3-VL analysis via Voyage AI (1024D)
- Color Embedding (12.5%) - Specialized color palette matching
- Texture Embedding (12.5%) - Surface pattern and texture recognition
- Style Embedding (12.5%) - Design aesthetic and style matching
- Material Embedding (12.5%) - Material type and category classification
Note: These are the default "balanced" weights. The system dynamically adjusts weights per-query using Query-Adaptive Weight Profiles — see below.
How It Works:
- Query Understanding - GPT-4o-mini parses the query into structured fields (colors, finish, dimensions, pattern, style, etc.)
- Weight Profile Selection -
_select_weight_profile() picks optimal weights based on detected fields (e.g., color queries upweight color embedding to 30%)
- Query Processing - Your search query is converted into visual and understanding embeddings
- Parallel Search - All 6 embedding collections are searched simultaneously using async processing
- Text Scoring - Keyword matching is performed on product metadata in parallel
- Score Fusion - Results from all 7 embeddings are combined using the selected weight profile
- Metadata Filtering - Your filters are applied as soft boosts to improve relevance
- Final Ranking - Products are sorted by combined score and returned
Query-Adaptive Weight Profiles:
The system selects from 7 weight profiles based on query intent:
| Profile |
When Selected |
Key Emphasis |
product_name |
Brand or product name detected |
Text 40% |
color_finish |
Color or finish terms present |
Color 30% |
specification |
Dimensions detected (e.g., 60x120cm) |
Understanding 40% |
texture_pattern |
Pattern terms present |
Texture 30% |
style_aesthetic |
Style or application terms |
Style 25% |
material_search |
Explicit material type |
Material 25% |
balanced |
No specific signal (default) |
Even distribution |
Profile selection is tracked in search_query_tracking for analytics.
Performance:
- Typical search time: 300-500ms
- All searches run in parallel using
asyncio.gather() and thread pools
- Handles thousands of products efficiently
This multi-dimensional approach ensures you get the most relevant results by considering all aspects of material similarity simultaneously.
Intelligent Relevancy
Images are automatically linked to products and descriptions based on:
- Page Location - Images on the same page as product descriptions
- Visual Similarity - How well the image matches the product
- AI Confidence - How certain the AI is about the connection
The system assigns relevancy scores to help you find the most important images first.
Using Image Search
Search by Uploading an Image
When you have a material image and want to find similar materials in your catalog:
- Upload Your Image - Upload any image of a material you're looking for
- AI Analysis - The system analyzes the image to understand its characteristics
- Find Matches - Get a list of similar materials from your catalog
- View Results - See matching products with images and details
The search can find materials based on:
- Overall visual similarity
- Matching colors
- Similar textures
- Same material type
- Similar applications
Search from 3D Visualizations
When working with 3D room visualizations:
- Generate 3D Scene - Create a 3D visualization with materials
- Identify Materials - The system automatically identifies materials in the scene
- Find Alternatives - Get suggestions for similar or alternative materials
- Compare Options - View different material options in context
This helps you explore material options and find alternatives that work with your design.
Benefits
For Designers and Architects
- Quick Material Discovery - Find materials faster than browsing catalogs
- Visual Search - Search using images instead of keywords
- Explore Alternatives - Discover similar materials you might not have considered
- Confident Selections - See all relevant images and details before deciding
For Material Suppliers
- Better Product Visibility - Your products are found through visual search
- Automatic Organization - Images are automatically linked to products
- Rich Product Pages - All product images are organized and accessible
- Enhanced Search - Customers can find your products in multiple ways
Search Accuracy
The platform uses advanced AI to ensure accurate results:
- Multi-Dimensional Analysis - Considers visual appearance, color, texture, and more
- Quality Validation - AI validates uncertain results for accuracy
- Relevancy Scoring - Results are ranked by how well they match your search
- Continuous Improvement - The system learns and improves over time
Performance
The image search system is designed for speed and accuracy:
- Fast Search - Multi-vector search returns results in 300-500ms
- Parallel Processing - All 6 embedding collections searched simultaneously
- Efficient Storage - Images are optimized and cached for quick loading
- Scalable - Handles thousands of images without slowing down
- Reliable - Built on enterprise-grade infrastructure
Technical Implementation
True Async Parallel Execution
The multi-vector search uses advanced async programming to achieve maximum performance:
Architecture:
Query → Generate Embeddings → Search 6 Collections in Parallel
├─ Visual (SigLIP 768D)
├─ Understanding (Voyage AI 1024D)
├─ Color (SigLIP 768D)
├─ Texture (SigLIP 768D)
├─ Style (SigLIP 768D)
└─ Material (SigLIP 768D)
↓
Combine Scores → Apply Filters → Return Results
Key Technologies:
- asyncio.gather() - Executes all searches simultaneously
- asyncio.to_thread() - Runs blocking VECS queries in thread pool
- VECS (pgvector) - Vector similarity search on PostgreSQL
- SigLIP2 - State-of-the-art vision-language model (768D visual embeddings)
- Voyage AI - Text embedding model for understanding embeddings (1024D)
Performance Benefits:
- 3-4x faster than sequential execution
- Non-blocking event loop for concurrent requests
- Efficient thread pool utilization
- Optimized database queries
Specialized Endpoints
In addition to the main multi-vector search, individual embedding searches are available:
POST /api/search/color - Color palette matching only
POST /api/search/texture - Texture pattern matching only
POST /api/search/style - Design style matching only
POST /api/search/material-type - Material category matching only
These are useful for:
- UI filter controls (e.g., "Show only warm colors")
- Specialized searches when you know exactly what you want
- A/B testing different embedding types
- Debugging and analysis
Search Response Format
The multi-vector search returns detailed scoring information for each result, including combined score, per-embedding scores (text, visual, understanding, color, texture, style, material), filter boost, and total processing time. This transparency allows you to understand why each result was returned and debug search quality.
Related Features
- Material Search - Search for materials using text descriptions
- PDF Processing - Automatic extraction of images from PDF catalogs
- Product Discovery - Intelligent product identification and organization