Good day, everyone,
I’m reaching out today with a straightforward query. Currently, I’m working on generating a CSV file with the following columns: question, category, answer. My dilemma revolves around the embedding process, and I’d appreciate some guidance on the best approach:
- Embedding on All Three Columns: Should I create embeddings based on all three columns (e.g., question + category + answer)?
- Embedding on Question Only: Alternatively, I’m contemplating embedding only the question and storing the category and answer as metadata (e.g., question).
- Embedding on Answer Only: Lastly, another option is to embed only the answer and keep the question and category as metadata (e.g., answer).
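To make the three options concrete, here is a rough sketch of what would be embedded versus kept as metadata in each case. The sample row, the `build_record` helper, and the strategy names are made up for illustration, and the actual embedding call is left out since I haven't settled on a model or vector store yet.

```python
import csv
import io

# Hypothetical sample data; the real CSV would be read from a file instead.
SAMPLE_CSV = """question,category,answer
How do I reset my password?,account,Use the 'Forgot password' link on the login page.
"""


def build_record(row, strategy):
    """Return (text_to_embed, metadata) for one CSV row.

    strategy is one of "all", "question", or "answer" (illustrative names).
    """
    if strategy == "all":
        # Option 1: embed question + category + answer as one string.
        text = f"{row['question']}\n{row['category']}\n{row['answer']}"
        metadata = {}
    elif strategy == "question":
        # Option 2: embed the question; keep category and answer as metadata.
        text = row["question"]
        metadata = {"category": row["category"], "answer": row["answer"]}
    elif strategy == "answer":
        # Option 3: embed the answer; keep question and category as metadata.
        text = row["answer"]
        metadata = {"question": row["question"], "category": row["category"]}
    else:
        raise ValueError(f"unknown strategy: {strategy}")
    return text, metadata


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
for strategy in ("all", "question", "answer"):
    text, metadata = build_record(rows[0], strategy)
    # `text` is what would be passed to the embedding model;
    # `metadata` would be stored alongside the resulting vector.
    print(strategy, repr(text), metadata)
```

In every case the full row stays retrievable; the only difference is which text determines similarity at query time.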
Your insights and recommendations on the most effective strategy would be highly valuable. Thanks in advance!