Late Chunking

SomebodySysop · September 19, 2024, 8:19pm

Thank you for taking the time to explain. I have a much better understanding of what’s going on.

This is a direct solution to this query: Retrieving “Adjacent” Chunks for Better Context

That query is also what led me to develop my own “Comprehension Level” retrieval methodology (based on the Small to Big retrieval strategy), which also solves the issue of retrieving adjacent chunks within a specific radius of the core chunk.

I do see that the Jina embedding model does allow for REST API interaction which would allow me to develop my solution in PHP (Yay!). But, at this point, I am not anxious to change my Weaviate class embeddings model – I am getting very good results with the OpenAI model.

And, I am getting essentially the same functionality as Late Chunking without altering my current embedding techniques. So for now, I think I’ll put it on hold.

But I do appreciate you taking the time to explain how this works. I think it’s a great solution and hopefully I will be able to incorporate it down the line.

Topic		Replies	Views
Retrieving “Adjacent” Chunks for Better Context Support	12	1370	February 19, 2025
Best way to vectorize and store a large document in Weaviate? General	6	1823	August 18, 2023
Vectorize big amounts of Data locally Support	1	332	June 7, 2024
Facing maximum context length exceed issue during vectorizing Support python	1	350	April 16, 2024
Slow deletion when using filter (and updating chunked documents) Support	2	585	June 30, 2023

Late Chunking

Related topics