Late Chunking

Thank you for taking the time to explain. I have a much better understanding of what’s going on.

This is a direct solution to this query: Retrieving “Adjacent” Chunks for Better Context

That query is also what led me to develop my own “Comprehension Level” retrieval methodology (based on the Small to Big retrieval strategy), which also solves the issue of retrieving adjacent chunks within a specific radius of the core chunk.

I do see that the Jina embedding model does allow for REST API interaction which would allow me to develop my solution in PHP (Yay!). But, at this point, I am not anxious to change my Weaviate class embeddings model – I am getting very good results with the OpenAI model.

And, I am getting essentially the same functionality as Late Chunking without altering my current embedding techniques. So for now, I think I’ll put it on hold.

But I do appreciate you taking the time to explain how this works. I think it’s a great solution and hopefully I will be able to incorporate it down the line.

2 Likes