W3cubDocs

/RethinkDB Python

ReQL command: distinct

Command syntax

sequence.distinct() → array
table.distinct([index=<indexname>]) → stream
r.distinct(sequence) → array
r.distinct(table, [index=<indexname>]) → stream

Description

Removes duplicate elements from a sequence.

The distinct command can be called on any sequence or table with an index.

While distinct can be called on a table without an index, the only effect will be to convert the table into a stream; the content of the stream will not be affected.

Example: Which unique villains have been vanquished by Marvel heroes?

r.table('marvel').concat_map(
    lambda hero: hero['villain_list']).distinct().run(conn)

Example: Topics in a table of messages have a secondary index on them, and more than one message can have the same topic. What are the unique topics in the table?

r.table('messages').distinct(index='topics').run(conn)

The above structure is functionally identical to:

r.table('messages')['topics'].distinct().run(conn)

However, the first form (passing the index as an argument to distinct) is faster, and won’t run into array limit issues since it’s returning a stream.

Related commands

Get more help

Couldn't find what you were looking for?

© RethinkDB contributors
Licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported License.
https://rethinkdb.com/api/python/distinct/