Metadata Recovery

The process of reconnecting AI-generated images that have lost their generation metadata with their original prompts, parameters, and provenance records. Recovery techniques include Job ID matching against platform history, visual similarity search against assets whose metadata is intact, and temporal correlation with generation logs.

Metadata recovery addresses the "metadata desert" problem: the millions of AI-generated images downloaded before platforms such as Midjourney began embedding generation metadata in late 2025. These legacy files contain no prompt, seed, or parameter information, which leaves them as opaque assets with unknown provenance.

Recovery is a multi-technique process. The highest-fidelity method is Job ID matching: if the filename or any associated record contains the Midjourney Job ID, the image can be matched against the user's generation history to recover the full prompt and parameters. When Job IDs are unavailable, visual similarity search using perceptual hashing or embedding-based matching can identify near-duplicates that do have metadata. Temporal correlation — matching file timestamps to known generation sessions — provides a lower-confidence but still useful recovery signal.
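
The three techniques can be read as a fallback cascade, tried in descending order of confidence. The following is a minimal Python sketch of that idea, not a reference implementation. It assumes the Job ID appears as a UUID-shaped token in the filename, that each image has already been downscaled to a small grayscale grid (for example 8x8) by some image library, and that the user's generation history is available as a list of records. The function names, the record fields (job_id, prompt, ahash, created_at), and the thresholds are all hypothetical, and a simple average hash stands in for whichever perceptual hash or embedding a real system would use.

```python
import re
from datetime import datetime, timedelta

# Assumption: the Job ID is a UUID-shaped token somewhere in the filename.
JOB_ID_RE = re.compile(
    r"[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}",
    re.IGNORECASE,
)


def extract_job_id(filename):
    """Return the first UUID-shaped token in the filename, lowercased, or None."""
    match = JOB_ID_RE.search(filename)
    return match.group(0).lower() if match else None


def average_hash(pixels):
    """Average hash over a flat list of grayscale values (e.g. 64 for an 8x8 grid).

    Each pixel contributes one bit: 1 if brighter than the mean, else 0.
    """
    mean = sum(pixels) / len(pixels)
    bits = 0
    for value in pixels:
        bits = (bits << 1) | (1 if value > mean else 0)
    return bits


def hamming(a, b):
    """Number of differing bits between two integer hashes."""
    return bin(a ^ b).count("1")


def recover(filename, pixels, mtime, history,
            max_hamming=5, window=timedelta(minutes=10)):
    """Try Job ID, then visual similarity, then timestamps.

    Returns (record, method); record is None when nothing matches.
    The thresholds are illustrative defaults, not tuned values.
    """
    # 1. Job ID matching: exact lookup, highest fidelity.
    job_id = extract_job_id(filename)
    if job_id is not None:
        for record in history:
            if record["job_id"] == job_id:
                return record, "job_id"

    # 2. Visual similarity: nearest stored hash within a Hamming threshold.
    if pixels is not None:
        h = average_hash(pixels)
        best = min(history, key=lambda r: hamming(h, r["ahash"]), default=None)
        if best is not None and hamming(h, best["ahash"]) <= max_hamming:
            return best, "visual"

    # 3. Temporal correlation: accept only an unambiguous match in the window.
    if mtime is not None:
        nearby = [r for r in history if abs(r["created_at"] - mtime) <= window]
        if len(nearby) == 1:
            return nearby[0], "temporal"

    return None, "unmatched"
```

Returning the method alongside the record lets a caller keep the confidence level with the recovered metadata, so a temporal match is not later mistaken for an exact Job ID match. The temporal step deliberately refuses to choose when several generations fall inside the window, since a wrong prompt attached to an image is arguably worse than none.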