README

Computer-vision-powered similar-image search for Craft CMS.

Docs and support: https://craft-spectacles.com

Spectacles analyzes your asset library with a vision model, stores structured metadata and a similarity vector for each image, and exposes endpoints + Twig helpers for finding similar images. Visitors can upload an image and get back the closest matches from your library.

Requirements

Craft CMS 5.0+
PHP 8.2+
An API key from one of the supported providers — or a local Ollama daemon

Installation

composer require justinholtweb/craft-spectacles
./craft plugin/install spectacles

Supported providers

Vision and embedding are configured independently — pick whichever combination suits your budget and quality bar.

Provider	Vision	Text embedding	Image embedding	Notes
OpenAI	gpt-4o, gpt-4o-mini, gpt-4.1	text-embedding-3-small / -large	—	Strong default. JSON mode is reliable.
Anthropic	Claude 4.x (Opus / Sonnet / Haiku)	—	—	Highest-quality descriptions. Pair with another embedder.
Google Gemini	gemini-2.5-flash, gemini-2.5-pro	text-embedding-004	—	Fast and inexpensive.
Voyage AI	—	voyage-multimodal-3	voyage-multimodal-3	Best similarity results — embeds the image directly, no description-loss.
Ollama (local)	llava, llama3.2-vision, bakllava	nomic-embed-text, mxbai-embed-large	—	Self-hosted, no API costs, slower.

Recommended pairings:

Best quality: Anthropic Claude (vision) + Voyage voyage-multimodal-3 (embedding)
Best price/perf: OpenAI gpt-4o-mini + OpenAI text-embedding-3-small
Self-hosted: Ollama llava + Ollama nomic-embed-text
Strict similarity, weak descriptions OK: any vision provider + Voyage multimodal

When the embedding provider is multimodal-capable (currently only Voyage), Spectacles will embed the image bytes directly instead of embedding the text description — this captures visual features the description would lose.

Configuration

Settings live at Settings → Plugins → Spectacles. Set API keys via environment variables and reference them as $OPENAI_API_KEY, $ANTHROPIC_API_KEY, $GEMINI_API_KEY, $VOYAGE_API_KEY.

Other settings:

autoAnalyzeOnUpload — queue an analysis job whenever an image asset is saved.
volumeUids — restrict analysis to specific volumes.
defaultResultLimit / minSimilarityScore — search tuning.
allowPublicSearch / maxVisitorUploadKb — public upload endpoint controls.

Switching providers? Different models produce vectors of different dimensions, so Spectacles only compares vectors of matching shape. After switching, click Re-index all images to regenerate embeddings.

Usage

Re-index existing assets

From the settings screen, click Re-index all images to queue a job for every image in the configured volumes. Progress is visible in the queue.

Twig

{# similar to a given asset #}
{% set similar = craft.spectacles.similar(asset, 8) %}
{% for row in similar %}
    <a href="{{ row.asset.url }}">
        <img src="{{ row.asset.url({ width: 240, height: 240, mode: 'crop' }) }}">
        <small>{{ row.score }} — {{ row.metadata.description }}</small>
    </a>
{% endfor %}

{# free-form text search #}
{% for row in craft.spectacles.searchText('foggy mountain at sunrise') %}
    <img src="{{ row.asset.url }}">
{% endfor %}

Visitor upload form

Drop the included partial into any frontend template:

{% include 'spectacles/_partials/upload-form' %}

Or post to the endpoint directly:

POST /spectacles/search
Content-Type: multipart/form-data
Accept: application/json

image=<file>

Response:

{
  "analysis": {
    "description": "A foggy mountain ridge at sunrise.",
    "tags": ["mountain", "fog", "sunrise"],
    "objects": ["mountain", "trees"],
    "colors": ["pink", "blue", "gray"]
  },
  "results": [
    { "id": 412, "url": "...", "thumbUrl": "...", "score": 0.87, "description": "..." }
  ]
}

JSON for an existing asset

GET /spectacles/similar/{assetId}
Accept: application/json

Architecture

spectacles_imagemetadata table stores description, tags, objects, colors, and the embedding vector (as JSON) for each asset.
Vision and embedding are separate concerns:
- services/vision/VisionProvider — analyze() returns an AnalysisResult.
- services/embedding/EmbeddingProvider — embedText() and optional embedImage() return EmbeddingResult.
services/Vision::embedForImage() prefers multimodal embedding when the provider supports it, falling back to text.
Similarity uses pluggable backends (services/similarity/SimilarityBackend):
- Scan (default): cosine in PHP over the JSON embeddings; works on any DB.
- pgvector: queries a dedicated spectacles_imagevectors table with the vector type; orders of magnitude faster and lets you add an HNSW or IVFFlat index for production scale.
- The active backend is auto-detected on Postgres + extension, or you can force it from the settings screen.

CP integration

The asset edit screen renders a Spectacles panel in the sidebar showing the description, tags, similar-image thumbnails, and the active backend/model. If the asset hasn't been analyzed yet, an "Analyze now" button queues a job.

Adding a provider

Implement VisionProvider and/or EmbeddingProvider under src/services/vision/ or src/services/embedding/.
Add a constant to Settings::PROVIDER_* and to VISION_PROVIDERS / EMBEDDING_PROVIDERS.
Wire it into Vision::visionProvider() / Vision::embeddingProvider().
Add fields to the settings template.

License

MIT

justinholtweb/craft-spectacles

包简介

关键字：

README 文档