Azure OpenAI API failed with status: 429 error

I receive the following error:

 weaviate.exceptions.WeaviateQueryError: Query call with protocol GRPC search failed with message <AioRpcError of RPC that terminated with:
app_1       |   status = StatusCode.UNKNOWN
app_1       |   details = "connection to: Azure OpenAI API failed with status: 429 error: Requests to the ChatCompletions_Create Operation under Azure OpenAI API version 2024-02-01 have exceeded token rate limit of your current OpenAI S0 pricing tier. Please retry after 86400 seconds. Please go here: https://aka.ms/oai/quotaincrease if you would like to further increase the default rate limit."
app_1       |   debug_error_string = "UNKNOWN:Error received from peer  {created_time:"2025-01-20T12:48:15.050436598+00:00", grpc_status:2, grpc_message:"connection to: Azure OpenAI API failed with status: 429 error: Requests to the ChatCompletions_Create Operation under Azure OpenAI API version 2024-02-01 have exceeded token rate limit of your current OpenAI S0 pricing tier. Please retry after 86400 seconds. Please go here: https://aka.ms/oai/quotaincrease if you would like to further increase the default rate limit."}

When I run the following query:

        response = collection.generate.near_text(
            query=query,
            grouped_task=prompt
        )

But it's already been 3 days (well over 86400 seconds), and when I call the Azure OpenAI endpoint directly from Postman or curl, it works. How does Weaviate send this request to Azure OpenAI? Where might the problem be, and how can I debug it better?
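
As a debugging aid, here is a minimal sketch that pulls the HTTP status and the retry interval out of the error text, so each failing call can be logged and compared over time. The helper name `parse_rate_limit` and both regular expressions are assumptions based only on the message shown above; they are not part of the Weaviate client.

```python
import re

# Hypothetical helper (not a Weaviate API): the patterns below are derived
# only from the wording of the error message quoted in this post.
_STATUS_RE = re.compile(r"failed with status: (\d{3})")
_RETRY_RE = re.compile(r"retry after (\d+) seconds", re.IGNORECASE)


def parse_rate_limit(message: str) -> dict:
    """Return the status code and retry interval found in an error message.

    Either value is None when the corresponding pattern is absent.
    """
    status = _STATUS_RE.search(message)
    retry = _RETRY_RE.search(message)
    return {
        "status": int(status.group(1)) if status else None,
        "retry_after_seconds": int(retry.group(1)) if retry else None,
    }


# Possible usage around the query from this post (assumes str(e) contains
# the message text shown in the traceback):
#
#     try:
#         response = collection.generate.near_text(query=query, grouped_task=prompt)
#     except weaviate.exceptions.WeaviateQueryError as e:
#         print(parse_rate_limit(str(e)))
```

Logging these two values with a timestamp for every failed call would show whether the 429 is returned on each request or only on some of them.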
Thanks!