Long text, chunking, top document , aggregate results

Poliakova_A_Anna · November 14, 2023, 11:15am

I’ve implemented a system with two classes, Document and Paragraph , using chunking techniques for very long documents. Currently, my query retrieves the top 3 most similar paragraphs, but I want to extend it to get the top 3 unique documents that contain these similar paragraphs. Can you suggest a good query for this scenario? P.S I can consider both schemas with cross reference or without.

DudaNogueira · November 23, 2023, 6:47pm

Hi @Poliakova_A_Anna ! Welcome to our community

Sorry, I completely missed this question

Have you tried the GroupBy?

It’s the only feature I recall that could help here.

This or asking for a higher number of entries, then processing the data to find the set of documents that have those.

Let me know if that helps or if you were able to find a solution for this.

Thanks!

Topic		Replies	Views
Return "unique file" when search large documents General	2	541	June 12, 2023
Filter and retrieve distinct documents Support	2	192	July 4, 2024
How to get unique results based on references General	6	462	March 9, 2024
Cross-reference queries Support	1	523	June 5, 2023
Return distinct result Support developer-experience	3	1156	October 17, 2023

Long text, chunking, top document , aggregate results

Related topics