Random value generator
The following functions have been created to help with our test suite. They are also useful for users testing QuestDB on specific workloads in order to quickly generate large test datasets that mimic the structure of their actual data.
Values can be generated either:
- Pseudo randomly
- Deterministically
when specifying a
seed
QuestDB supports the following random generation functions:
- rnd_boolean
- rnd_byte
- rnd_short
- rnd_int
- rnd_long
- rnd_long256
- rnd_float
- rnd_double
- rnd_date
- rnd_timestamp
- rnd_char
- rnd_symbol
- rnd_str
- rnd_bin
Usage#
Random functions should be used for populating test tables only. They do not hold values in memory and calculations should not be performed at the same time as the random numbers are generated.
For example, running
SELECT round(a,2), a FROM (SELECT rnd_double() a FROM long_sequence(10)); is
bad practice and will return inconsistent results.
A better approach would be to populate a table and then run the query. So for example
- create -
CREATE TABLE test(val double); - populate -
INSERT INTO test SELECT * FROM (SELECT rnd_double() FROM long_sequence(10)); - query -
SELECT round(val,2) FROM test;
Generating sequences#
This page describes the functions to generate values. To generate sequences of values, please refer the page about row generators.
rnd_boolean#
rnd_boolean() - generates a random boolean value, either true or false,
both having equal probability.
Return value:
Return value type is boolean.
Examples:
| a | b |
|---|---|
| true | 47 |
| false | 53 |
rnd_byte#
rnd_byte()- returns a random integer which can take any value between0and127.rnd_byte(min, max)- generates byte values in a specific range (for example only positive, or between 1 and 10).
Arguments:
min: is abyterepresenting the lowest possible generated value (inclusive).max: is abyterepresenting the highest possible generated value (inclusive).
Return value:
Return value type is byte.
Examples:
rnd_short#
rnd_short()- returns a random integer which can take any value between-32768and32767.rnd_short(min, max)- returns short values in a specific range (for example only positive, or between 1 and 10). Supplyingminabovemaxwill result in aninvalid rangeerror.
Arguments:
min: is ashortrepresenting the lowest possible generated value (inclusive).max: is ashortrepresenting the highest possible generated value (inclusive).
Return value:
Return value type is short.
Examples:
rnd_int#
rnd_int()is used to return a random integer which can take any value between-2147483648and2147483647.rnd_int(min, max, nanRate)is used to generate int values in a specific range (for example only positive, or between 1 and 10), or to get occasionalNaNvalues along with int values.
Arguments:
min: is anintrepresenting the lowest possible generated value (inclusive).max: is anintrepresenting the highest possible generated value (inclusive).nanRateis anintdefining the frequency of occurrence ofNaNvalues:0: NoNaNwill be returned.1: Will only returnNaN.N > 1: On average, one in N generated values will be NaN.
Return value:
Return value type is int.
Examples:
rnd_long#
rnd_long()is used to return a random signed integer between0x8000000000000000Land0x7fffffffffffffffL.rnd_long(min, max, nanRate)is used to generate long values in a specific range (for example only positive, or between 1 and 10), or to get occasionalNaNvalues along with int values.
Arguments:
min: is alongrepresenting the lowest possible generated value (inclusive).max: is alongrepresenting the highest possible generated value (inclusive).nanRateis anintdefining the frequency of occurrence ofNaNvalues:0: NoNaNwill be returned.1: Will only returnNaN.N > 1: On average, one in N generated values will beNaN.
Return value:
Return value type is long.
Examples:
rnd_long256#
rnd_long256()- generates a randomlong256value between 0 and 2^256.
Return value:
Return value type is long256.
Examples:
rnd_float#
rnd_float()- generates a random positivefloatbetween 0 and 1.rnd_float(nanRate)- generates a random positivefloatbetween 0 and 1 which will beNaNat a frequency defined bynanRate.
Arguments:
nanRateis anintdefining the frequency of occurrence ofNaNvalues:0: NoNaNwill be returned.1: Will only returnNaN.N > 1: On average, one in N generated values will beNaN.
Return value:
Return value type is float.
Examples:
rnd_double#
rnd_double()- generates a random positivedoublebetween 0 and 1.rnd_double(nanRate)- generates a random positivedoublebetween 0 and 1 which will beNaNat a frequency defined bynanRate.
Arguments:
nanRateis anintdefining the frequency of occurrence ofNaNvalues:0: NoNaNwill be returned.1: Will only returnNaN.N > 1: On average, one in N generated values will beNaN.
Return value:
Return value type is double.
Examples:
rnd_date()#
rnd_date()generates a random date betweenstartandenddates (both inclusive). IT will also generateNaNvalues at a frequency defined bynanRate. Whenstartorendare invalid dates, or whenstartis superior toend, it will returninvalid rangeerror. WhennanRateis inferior to 0, it will returninvalid NAN rateerror.
Arguments:
startis adatedefining the minimum possible generated date (inclusive)endis adatedefining the maximum possible generated date (inclusive)nanRatedefines the frequency of occurrence ofNaNvalues:0: NoNaNwill be returned.1: Will only returnNaN.N > 1: On average, one in N generated values will be NaN.
Return value:
Return value type is date.
Examples:
rnd_timestamp()#
rnd_timestamp(start, end, nanRate)generates a random timestamp betweenstartandendtimestamps (both inclusive). It will also generateNaNvalues at a frequency defined bynanRate. Whenstartorendare invalid timestamps, or whenstartis superior toend, it will returninvalid rangeerror. WhennanRateis inferior to 0, it will returninvalid NAN rateerror.
Arguments:
startis atimestampdefining the minimum possible generated timestamp (inclusive)endis atimestampdefining the maximum possible generated timestamp (inclusive)nanRatedefines the frequency of occurrence ofNaNvalues:0: NoNaNwill be returned.1: Will only returnNaN.N > 1: On average, one in N generated values will be NaN.
Return value:
Return value type is timestamp.
Examples:
Sequences#
To generate increasing timestamps, please refer the page about row generators.
rnd_char#
rnd_char()is used to generate a randomcharwhich will be an uppercase character from the 26-letter A to Z alphabet. Letters from A to Z will be generated with equal probability.
Return value:
Return value type is char.
Examples:
rnd_symbol#
rnd_symbol(symbolList)is used to choose a randomsymbolfrom a list defined by the user. It is useful when looking to generate specific symbols from a finite list (e.gBUY, SELLorAUTUMN, WINTER, SPRING, SUMMER. Symbols are randomly chosen from the list with equal probability. When only one symbol is provided in the list, this symbol will be chosen with 100% probability, in which case it is more efficient to usecast('your_symbol' as symbolrnd_symbol(list_size, minLength, maxLength, nullRate)generated a finite list of distinct random symbols and chooses one symbol from the list at random. The finite list is of sizelist_size. The generated symbols length is betweenminLengthandmaxLength(both inclusive). The function will also generatenullvalues at a rate defined bynullRate.
Arguments:
symbolListis a variable-length list of possiblesymbolvalues expressed as a comma-separated list of strings. For example,'a', 'bcd', 'efg123', '่ก'list_sizeis the number of distinctsymbolvalues to generatedminLengthis anintdefining the minimum length for of a generated symbol (inclusive)maxLengthis anintdefining the maximum length for of a generated symbol (inclusive)nullRateis anintdefining the frequency of occurrence ofnullvalues:0: Nonullwill be returned.1: Will only returnnull.N > 1: On average, one in N generated values will benull.
Return value:
Return value type is symbol.
Examples:
rnd_str#
rnd_str(stringList)is used to choose a randomstringfrom a list defined by the user. It is useful when looking to generate specific strings from a finite list (e.gBUY, SELLorAUTUMN, WINTER, SPRING, SUMMER. Strings are randomly chosen from the list with equal probability. When only one string is provided in the list, this string will be chosen with 100% probability.rnd_str(list_size, minLength, maxLength, nullRate)generated a finite list of distinct random string and chooses one string from the list at random. The finite list is of sizelist_size. The generated strings length is betweenminLengthandmaxLength(both inclusive). The function will also generatenullvalues at a rate defined bynullRate.
Arguments:
strListis a variable-length list of possiblestringvalues expressed as a comma-separated list of strings. For example,'a', 'bcd', 'efg123', '่ก'list_sizeis the number of distinctstringvalues to generatedminLengthis anintdefining the minimum length for of a generated string (inclusive)maxLengthis anintdefining the maximum length for of a generated string (inclusive)nullRateis anintdefining the frequency of occurrence ofnullvalues:0: Nonullwill be returned.1: Will only returnnull.N > 1: On average, one in N generated values will benull.
Return value:
Return value type is string.
Examples:
rnd_bin#
rnd_bin()generates random binary data of a size up to32bytes.rnd_bin(minBytes, maxBytes, nullRate)generates random binary data of a size betweenminBytesandmaxBytesand returnsnullat a rate defined bynullRate.
Arguments:
minBytesis alongdefining the minimum size in bytes for of a generated binary (inclusive)maxBytesis alongdefining the maximum size in bytes for of a generated binary (inclusive)nullRateis anintdefining the frequency of occurrence ofnullvalues:0: Nonullwill be returned.1: Will only returnnull.N > 1: On average, one in N generated values will benull.
Return value:
Return value type is binary.
Examples: