Also known as model order selection.
Task of selecting a statistical model from a set of candidate models, given data. In the simplest cases, a pre-existing set of data is considered.
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).
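
As a rough illustration of the selection task defined above, the sketch below scores a small set of candidate models against one pre-existing data set and keeps the best-scoring one. Everything in it is an assumption made for illustration: the synthetic quadratic data, the candidate set (polynomials of degree 0 to 5), the helper names `fit_polynomial` and `aic`, and the use of the Akaike information criterion as the scoring rule. Other criteria, such as BIC or cross-validation, would fit the same pattern.

```python
import math
import random


def fit_polynomial(xs, ys, degree):
    """Least-squares polynomial fit via the normal equations (fine for low degrees)."""
    m = degree + 1
    # Augmented matrix [X^T X | X^T y] for the monomial basis 1, x, x^2, ...
    aug = [
        [sum(x ** (i + j) for x in xs) for j in range(m)]
        + [sum(y * x ** i for x, y in zip(xs, ys))]
        for i in range(m)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(m):
            if row != col:
                factor = aug[row][col] / aug[col][col]
                aug[row] = [a - factor * b for a, b in zip(aug[row], aug[col])]
    # Coefficients, lowest order first.
    return [aug[i][m] / aug[i][i] for i in range(m)]


def aic(xs, ys, coeffs):
    """AIC under Gaussian errors, up to an additive constant: n*ln(RSS/n) + 2k."""
    n = len(xs)
    rss = sum(
        (y - sum(c * x ** p for p, c in enumerate(coeffs))) ** 2
        for x, y in zip(xs, ys)
    )
    return n * math.log(rss / n) + 2 * len(coeffs)


# Synthetic data (assumed for illustration): a quadratic trend plus Gaussian noise.
rng = random.Random(0)
xs = [-3 + 6 * i / 59 for i in range(60)]
ys = [1.0 + 2.0 * x - 1.5 * x ** 2 + rng.gauss(0, 0.5) for x in xs]

# Candidate set: polynomials of degree 0 through 5; lower AIC is better.
scores = {d: aic(xs, ys, fit_polynomial(xs, ys, d)) for d in range(6)}
best_degree = min(scores, key=scores.get)

for d, s in scores.items():
    print(f"degree {d}: AIC = {s:.2f}")
print("selected degree:", best_degree)
```

The penalty term `2 * len(coeffs)` is what keeps the highest-degree candidate from winning automatically: a more flexible polynomial always lowers the residual sum of squares, so the criterion has to charge for the extra parameters.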