Wish I'd started using it sooner.
The standard architecture — chunking documents, embedding them into a vector database, and retrieving top-k results via ...