Considering the Github repository you mentioned, as itās using Langchain, you can ingest the PDFs into weaviate by changing the Vectorstore, like so:
For that you will have to run Weaviate locally, using docker. Here is a nice docker configurator tool that will guide you into creating the perfect docker-compose.yaml:
that should be enough to store your embeddings into Weaviate. You will also need to change the querying part to adapt it.
Managed to read from documents and push do my local docker :).
Once i set wcs|query to
{
Get{
Document{
text
source
}
}
}
I got source document and text that it read nicely, its just that not entire text is shown in query, unsure since documents are really long does it just shows me some part and hides rest or there is command to get full.
I got two last questions before i give you 5/5 stars
Would this be a proper way to get text from ingested files, have to use async function since await will give await error unable to use it at top level and iām new in typescript.
await WeaviateStore.fromDocuments(docs, embeddings, {client: client, indexName: "Document", textKey:"text"}) - Will ingest text into my local docker.
I will use following code in wcs|query to get source file and text
{
Get{
Document{
text
source
}
}
}
Text is not fully displayed in wcs query, I can see it has gone through documents, but docs are really long and I can see only about 10-13 lines of text displayed per ingested document, while there is a lot lot more of text in those docs.
**To sum things up**
Experience with forum: Immense help from your team, I am already more than happy with the way you assisted me.
Final questions:
Is code query good.
Why does query display only partial text from files rather than whole.
Once more thanks for assistance and have a nice day.