Also known as entity resolution, reconciliation, or data matching.
Joining records or entities from different data sets, which may or may not share a common identifier, by matching them on their properties.
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).
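As a rough illustration of matching entities on their properties when no shared identifier exists, here is a minimal Python sketch. It is not a reference implementation: the function names (`normalize`, `record_similarity`, `link_records`), the sample records, and the 0.85 threshold are all assumptions made for this example, and string similarity from the standard library's `difflib.SequenceMatcher` stands in for whatever comparison a real system would use.

```python
from difflib import SequenceMatcher


def normalize(value):
    # Lowercase and collapse whitespace so trivial formatting differences vanish.
    return " ".join(str(value).lower().split())


def field_similarity(a, b):
    # Similarity in [0, 1] between two normalized property values.
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def record_similarity(left, right, fields):
    # Unweighted mean of per-field similarities (an assumption; real systems
    # often weight fields differently).
    scores = [field_similarity(left[f], right[f]) for f in fields]
    return sum(scores) / len(scores)


def link_records(left_records, right_records, fields, threshold=0.85):
    # For each left record, keep its best-scoring right record if the score
    # reaches the (hypothetical) threshold. Compares every pair: O(n * m).
    links = []
    for i, left in enumerate(left_records):
        best_j, best_score = None, 0.0
        for j, right in enumerate(right_records):
            score = record_similarity(left, right, fields)
            if score > best_score:
                best_j, best_score = j, score
        if best_j is not None and best_score >= threshold:
            links.append((i, best_j, round(best_score, 3)))
    return links


# Two made-up data sets with no common identifier.
crm = [
    {"name": "Jon Smith", "city": "New York"},
    {"name": "Maria Garcia", "city": "Austin"},
]
billing = [
    {"name": "Maria  Garcia", "city": "austin"},
    {"name": "John Smith", "city": "New York"},
    {"name": "Wei Chen", "city": "Seattle"},
]

links = link_records(crm, billing, ["name", "city"])
for i, j, score in links:
    print(crm[i]["name"], "<->", billing[j]["name"], score)
```

Comparing every pair is only practical for small inputs; larger data sets usually restrict comparisons to candidate pairs first (often called blocking), which this sketch omits.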