ElasticSearch - Return Unique Values

Question

How would I get the values of all the languages from the records and make them unique   Records  PUT items 1    language    10    PUT items 2    language    11    PUT items 3    language    10     Query  GET items  search              gt  Expected Response  10  11    Any help would be great

User · Answer

Elasticsearch 1 1  has the Cardinality Aggregation which will give you a unique count  Note that it is actually an approximation and accuracy may diminish with high-cardinality datasets  but it s generally pretty accurate in my testing   You can also tune the accuracy with the precision threshold parameter  The trade-off  or course  is memory usage   This graph from the docs shows how a higher precision threshold leads to much more accurate results

User · Answer

To had to distinct by two fields  derivative id  amp  vehicle type  and to sort by cheapest car  Had to nest aggs  GET  cars  search      quot size quot   0     quot aggs quot          quot distinct by derivative id quot            quot terms quot               quot field quot    quot derivative id quot                  quot aggs quot              quot vehicle type quot                quot terms quot                  quot field quot    quot vehicle type quot                          quot aggs quot                  quot cheapest vehicle quot                    quot top hits quot                      quot sort quot                          quot rental quot      quot order quot    quot asc quot                                          quot  source quot      quot includes quot      quot manufacturer name quot                      quot rental quot                      quot vehicle type quot                                                           quot size quot   1                                                                          Result       quot took quot    3     quot timed out quot    false     quot  shards quot           quot total quot    5       quot successful quot    5       quot skipped quot    0       quot failed quot    0         quot hits quot           quot total quot             quot value quot    8         quot relation quot     quot eq quot              quot max score quot    null       quot hits quot                quot aggregations quot           quot distinct by derivative id quot             quot doc count error upper bound quot    0         quot sum other doc count quot    0         quot buckets quot                           quot key quot     quot 04 quot              quot doc count quot    3             quot vehicle type quot                   quot doc count error upper bound quot    0               quot sum other doc count quot    0               quot buckets quot                                       quot key quot     quot CAR quot                    quot doc count quot    2                   quot cheapest vehicle quot                         quot hits quot                           quot total quot                             quot value quot    2                         quot relation quot     quot eq quot                                              quot max score quot    null                       quot hits quot                                                       quot  index quot     quot cars quot                            quot  type quot     quot  doc quot                            quot  id quot     quot 8 quot                            quot  score quot    null                           quot  source quot                                 quot vehicle type quot     quot CAR quot                              quot manufacturer name quot     quot Renault quot                              quot rental quot    89 99                                                     quot sort quot                                89 99                                                                                                                                                                 quot key quot     quot LCV quot                    quot doc count quot    1                   quot cheapest vehicle quot                         quot hits quot                           quot total quot                             quot value quot    1                         quot relation quot     quot eq quot                                              quot max score quot    null                       quot hits quot                                                       quot  index quot     quot cars quot                            quot  type quot     quot  doc quot                            quot  id quot     quot 7 quot                            quot  score quot    null                           quot  source quot                                 quot vehicle type quot     quot LCV quot                              quot manufacturer name quot     quot Ford quot                              quot rental quot    99 99                                                     quot sort quot                                99 99                                                                                                                                                                                         quot key quot     quot 01 quot              quot doc count quot    2             quot vehicle type quot                   quot doc count error upper bound quot    0               quot sum other doc count quot    0               quot buckets quot                                       quot key quot     quot CAR quot                    quot doc count quot    1                   quot cheapest vehicle quot                         quot hits quot                           quot total quot                             quot value quot    1                         quot relation quot     quot eq quot                                              quot max score quot    null                       quot hits quot                                                       quot  index quot     quot cars quot                            quot  type quot     quot  doc quot                            quot  id quot     quot 1 quot                            quot  score quot    null                           quot  source quot                                 quot vehicle type quot     quot CAR quot                              quot manufacturer name quot     quot Ford quot                              quot rental quot    599 99                                                     quot sort quot                                599 99                                                                                                                                                                 quot key quot     quot LCV quot                    quot doc count quot    1                   quot cheapest vehicle quot                         quot hits quot                           quot total quot                             quot value quot    1                         quot relation quot     quot eq quot                                              quot max score quot    null                       quot hits quot                                                       quot  index quot     quot cars quot                            quot  type quot     quot  doc quot                            quot  id quot     quot 2 quot                            quot  score quot    null                           quot  source quot                                 quot vehicle type quot     quot LCV quot                              quot manufacturer name quot     quot Ford quot                              quot rental quot    599 99                                                     quot sort quot                                599 99                                                                                                                                                                                         quot key quot     quot 02 quot              quot doc count quot    2             quot vehicle type quot                   quot doc count error upper bound quot    0               quot sum other doc count quot    0               quot buckets quot                                       quot key quot     quot CAR quot                    quot doc count quot    2                   quot cheapest vehicle quot                         quot hits quot                           quot total quot                             quot value quot    2                         quot relation quot     quot eq quot                                              quot max score quot    null                       quot hits quot                                                       quot  index quot     quot cars quot                            quot  type quot     quot  doc quot                            quot  id quot     quot 4 quot                            quot  score quot    null                           quot  source quot                                 quot vehicle type quot     quot CAR quot                              quot manufacturer name quot     quot Audi quot                              quot rental quot    499 99                                                     quot sort quot                                499 99                                                                                                                                                                                         quot key quot     quot 03 quot              quot doc count quot    1             quot vehicle type quot                   quot doc count error upper bound quot    0               quot sum other doc count quot    0               quot buckets quot                                       quot key quot     quot CAR quot                    quot doc count quot    1                   quot cheapest vehicle quot                         quot hits quot                           quot total quot                             quot value quot    1                         quot relation quot     quot eq quot                                              quot max score quot    null                       quot hits quot                                                       quot  index quot     quot cars quot                            quot  type quot     quot  doc quot                            quot  id quot     quot 5 quot                            quot  score quot    null                           quot  source quot                                 quot vehicle type quot     quot CAR quot                              quot manufacturer name quot     quot Audi quot                              quot rental quot    399 99                                                     quot sort quot                                399 99

User · Answer

if you want to get the first document for each language field unique value  you can do this       query          match all                   collapse          field    language keyword        inner hits          name    latest          size   1

User · Answer

I am looking for this kind of solution for my self as well  I found reference in terms aggregation   So  according to that following is the proper solution      aggs           langs               terms       field     language                          size    500              But if you ran into following error    error              root cause                                    type    illegal argument exception                    reason    Fielddata is disabled on text fields by default  Set fielddata true on  fastest method  in order to load fielddata in memory by uninverting the inverted index  Note that this can however use significant memory  Alternatively use a keyword field instead                              In that case  you have to add  KEYWORD  in the request  like following             aggs               langs                   terms       field     language keyword                              size    500

User · Answer

If you want to get all unique values without any approximation or setting a magic number  size  500   then use COMPOSITE AGGREGATION  ES 6 5     From official documentation    If you want to retrieve all terms or all combinations of terms in a nested terms aggregation you should use the COMPOSITE AGGREGATION which allows to paginate over all possible terms rather than setting a size greater than the cardinality of the field in the terms aggregation  The terms aggregation is meant to return the top terms and does not allow pagination     Implementation example in JavaScript    x000D   x000D  const ITEMS PER PAGE   1000  x000D   x000D  const body      x000D       size   0     Returning only aggregation results  https   www elastic co guide en elasticsearch reference current returning-only-agg-results html x000D       aggs      x000D           langs     x000D               composite      x000D                   size   ITEMS PER PAGE  x000D                   sources      x000D                         language      terms       field    language        x000D                    x000D                x000D            x000D         x000D     x000D   x000D  const uniqueLanguages       x000D   x000D  while  true    x000D    const result   await es search body   x000D   x000D    const currentUniqueLangs   result aggregations langs buckets map bucket   gt  bucket key   x000D   x000D    uniqueLanguages push    currentUniqueLangs   x000D   x000D    const after   result aggregations langs after key  x000D   x000D    if  after    x000D           continue paginating unique items x000D        body aggs langs composite after   after  x000D      else   x000D        break  x000D      x000D    x000D   x000D  console log uniqueLanguages   x000D   x000D   x000D

User · Answer

You can use the terms aggregation     quot size quot   0   quot aggs quot           quot langs quot               quot terms quot       quot field quot     quot language quot     quot size quot    500             The size parameter within the aggregation specifies the maximum number of terms to include in the aggregation result  If you need all results  set this to a value that is larger than the number of unique terms in your data  A search will return something like     quot took quot    16   quot timed out quot    false   quot  shards quot         quot total quot    2     quot successful quot    2     quot failed quot    0     quot hits quot       quot total quot    1000000   quot max score quot    0 0   quot hits quot            quot aggregations quot         quot langs quot           quot buckets quot               quot key quot     quot 10 quot          quot doc count quot    244812                 quot key quot     quot 11 quot          quot doc count quot    136794                   quot key quot     quot 12 quot          quot doc count quot    32312

[elasticsearch] ElasticSearch - Return Unique Values

Examples related to elasticsearch