Description
I’m trying to use Weaviate to handle Chinese documents. I know Weaviate supports gse as a tokenization option, but how can I configure gse? I need to load a Chinese dictionary into gse so that particular terms are segmented correctly. (The official gse library supports a LoadDict method.)
The schema looks like this:
{
    "class": self.index_name,
    "description": "Chunks of Documentations",
    "vectorizer": "none",
    "properties": [
        {
            "name": "text",
            "dataType": ["text"],
            "description": "Content of the document",
            "tokenization": "gse",
            "indexSearchable": True,
        },
    ]
}
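For anyone who wants to reproduce this, here is a minimal self-contained sketch of the same schema as a plain Python function. The function name `build_schema` and the class name `DocChunk` are placeholders made up for illustration; in my real code the class name comes from `self.index_name`. It only builds the dictionary and does not call the Weaviate client.

```python
import json


def build_schema(index_name: str) -> dict:
    """Return the class definition shown above for a given class name.

    `build_schema` is a hypothetical helper, not part of any Weaviate API.
    """
    return {
        "class": index_name,
        "description": "Chunks of Documentations",
        "vectorizer": "none",  # vectors are supplied externally
        "properties": [
            {
                "name": "text",
                "dataType": ["text"],
                "description": "Content of the document",
                "tokenization": "gse",  # the tokenizer I want to customize
                "indexSearchable": True,
            },
        ],
    }


if __name__ == "__main__":
    # "DocChunk" is a placeholder class name for illustration only.
    print(json.dumps(build_schema("DocChunk"), indent=2, ensure_ascii=False))
```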
Thanks.