How to use WHERE clause from Aggregate meta count result?

junbetterway · June 15, 2023, 5:37am

Basically, we wanted something similar with PostgreSQL where one can combine group by with having clause. For example, I want to find players with duplicate player_code

SELECT player_code
FROM public.player
GROUP BY player_code
HAVING COUNT(player_code) > 1;

How can I achieve such using Weaviate Aggregate? I only came up with this but could not find a way where to insert the WHERE part to compare the meta count greater than 1?

echo '{
  "query": "{
    Aggregate {
      Player(groupBy: [\"player_code\"]) {
        meta {
          count
        }
        groupedBy {
          value
          path
        }
      }
    }
  }"
}' | curl \
    -X POST \
    -H 'Content-Type: application/json' \
    -d @- \
    http://{{HOST}}/v1/graphql

Is this even possible or not? Thanks!

CShorten · June 15, 2023, 4:21pm

Hey @junbetterway, I don’t believe there is a way to do this directly inside of Weaviate. However, you can achieve this by parsing the results client side quite nicely, here is an example in python where I am grouping podcasts by speaker and then only keeping the results where the speaker occurs more than 10 times. Hopefully this is useful to you – replacing PodClip with Player and speaker with player_code

import weaviate

client = weaviate.Client("http://localhost:8080")

aggregate_demo = """
{
	Aggregate {
    PodClip (
      groupBy: ["speaker"]
    ){
      groupedBy {
    		path
        value
    	}
      meta {
        count
      }
    }
  }
}
"""

results = client.query.raw(aggregate_demo)["data"]["Aggregate"]["PodClip"]

parsed_results = []
for res in results:
    print(res.keys())
    if res["meta"]["count"] > 10:
        parsed_results.append(res)

print(parsed_results)

junbetterway · June 16, 2023, 3:29pm

Thanks @CShorten for this - I will try doing it via Java client if there is no native way to do it then.

Topic		Replies	Views
[Question] Running Aggregate against Weaviate Cloud Support	2	198	September 25, 2024
Is there any alternatives for extracting the distinct count in the aggregate function General bug , developer-experience	5	289	December 26, 2024
Emmanuel Katto Dubai : Alternatives to Extracting Distinct Count in Aggregate Functions General developer-experience	1	131	November 20, 2024
Inconsistent numbers of objects using Get and Aggregate Support bug	1	159	August 29, 2024
"Cannot query field \"wordCount\" on type \"Aggregate Support python	2	368	October 1, 2024

How to use WHERE clause from Aggregate meta count result?

Related topics