You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/configuration/index-config.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -225,7 +225,7 @@ The timezone name format specifier (`%Z`) is not supported currently.
225
225
- `unix_timestamp`: parse float and integer numbers to Unix timestamps. Floating-point values are converted to timestamps expressed in seconds. Integer values are converted to Unix timestamps whose precision, determined in `seconds`, `milliseconds`, `microseconds`, or `nanoseconds`, is inferred from the number of input digits. Internally, datetimes are converted to UTC (if the time zone is specified) and stored as *i64* integers. As a result, Quickwit only supports timestamp values ranging from `Apr 13, 1972 23:59:55` to `Mar 16, 2242 12:56:31`.
226
226
227
227
:::warning
228
-
Converting timestamps from float to integer values may occurs with a loss of precision.
228
+
Converting timestamps from float to integer values may occur with a loss of precision.
229
229
:::
230
230
231
231
When a `datetime` field is stored as a fast field, the `fast_precision` parameter indicates the precision used to truncate the values before encoding, which improves compression (truncation here means zeroing). The `fast_precision` parameter can take the following values: `seconds`, `milliseconds`, `microseconds`, or `nanoseconds`. It only affects what is stored in fast fields when a `datetime` field is marked as "fast". Finally, operations on `datetime` fast fields, e.g. via aggregations, need to be done at the nanosecond level.
@@ -365,7 +365,7 @@ fast:
365
365
| `description` | Optional description for the field. | `None` |
366
366
| `stored` | Whether value is stored in the document store | `true` |
367
367
| `indexed` | Whether value is indexed | `true` |
368
-
| `fast` | Whether value is stored in a fast field. The default behaviour for text in the JSON is to store the text unchanged. An normalizer can be configured via `normalizer: lowercase`. ([See normalizers](#description-of-available-normalizers)) for a list of available normalizers. | `false` |
368
+
| `fast` | Whether value is stored in a fast field. The default behaviour for text in the JSON is to store the text unchanged. A normalizer can be configured via `normalizer: lowercase`. ([See normalizers](#description-of-available-normalizers)) for a list of available normalizers. | `false` |
369
369
| `tokenizer` | **Only affects strings in the json object**. Name of the `Tokenizer`, choices between `raw`, `default`, `en_stem` and `chinese_compatible` | `raw` |
370
370
| `record` | **Only affects strings in the json object**. Describes the amount of information indexed, choices between `basic`, `freq` and `position` | `basic` |
371
371
| `expand_dots` | If true, json keys containing a `.` should be expanded. For instance, if `expand_dots` is set to true, `{"k8s.node.id": "node-2"}` will be indexed as if it was `{"k8s": {"node": {"id": "node2"}}}`. The benefit is that escaping the `.` will not be required at query time. In other words, `k8s.node.id:node2` will match the document. This does not impact the way the document is stored. | `true` |
@@ -418,7 +418,7 @@ field_mappings:
418
418
#### concatenate
419
419
420
420
Quickwit supports mapping the content of multiple fields to a single one. This can be more efficient at query time than
421
-
searching through dozens of `default_search_fields`. It also allow querying inside a json field without knowing the path
421
+
searching through dozens of `default_search_fields`. It also allows querying inside a json field without knowing the path
422
422
to the field being searched.
423
423
424
424
```yaml
@@ -438,7 +438,7 @@ At query time, concatenate fields don't support range queries.
438
438
Only the following types are supported inside a concatenate field: text, bool, i64, u64, f64, json. Other types are rejected
439
439
at index creation, or silently discarded during indexation if they are found inside a json field.
440
440
Adding an object field to a concatenate field doesn't automatically add its subfields (yet).
441
-
<!-- typing is made so it wouldn't be too hard do add, as well as things like params_* matching all fields which starts name with params_ , but the feature isn't implemented yet -->
441
+
<!-- typing is made so it wouldn't be too hard to add, as well as things like params_* matching all fields which starts name with params_ , but the feature isn't implemented yet -->
442
442
It isn't possible to add subfields from a json field to a concatenate field. For instance if `attributes` is a json field, it's not possible to add only `attributes.color` to a concatenate field.
443
443
444
444
For json fields and dynamic fields, the path is not indexed, only values are. For instance, given the following document:
@@ -507,7 +507,7 @@ When the `dynamic_mapping` is set as indexed (default), fields mapped through
507
507
dynamic mode can be searched by targeting the path needed to access them from
508
508
the root of the JSON object.
509
509
510
-
For instance, in a entirely schemaless settings, a minimal index configuration could be:
510
+
For instance, in an entirely schemaless settings, a minimal index configuration could be:
- run the backward compatibility tests (see below)
80
80
- for older versions, check the diff between the `xxx.expected.modified.json` files created and the matching `xxx.expected.json` files.
81
81
If the changes are acceptable, replace the content of the `xxx.expected.json` files and commit them.
82
-
- check the the `yyyy.json` that was created for the new version and commit it along with the `yyyy.expected.json` file (identical).
82
+
- check the `yyyy.json` that was created for the new version and commit it along with the `yyyy.expected.json` file (identical).
83
83
- possibly update the generation of the default XXXX instance used for regression. It is in the function `TestableForRegression::sample_for_regression`.
84
84
85
85
@@ -117,7 +117,7 @@ the CI will catch it.
117
117
118
118
#### Adding a new test case.
119
119
120
-
If the serialization format changes, an new version should be created and the unit test will
120
+
If the serialization format changes, a new version should be created and the unit test will
121
121
automatically add a new unit test generated from the sample tested objects.
122
122
Concretely, it will just write two files `XXXX.json` and `XXXX.expected.json` for each model.
Copy file name to clipboardExpand all lines: docs/reference/aggregation.md
+6-6Lines changed: 6 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,7 +5,7 @@ sidebar_position: 30
5
5
6
6
An aggregation summarizes your data as statistics on buckets or metrics.
7
7
8
-
Aggregations can provide answer to questions like:
8
+
Aggregations can provide answers to questions like:
9
9
10
10
- What is the average price of all sold articles?
11
11
- How many errors with status code 500 do we have per day?
@@ -182,7 +182,7 @@ The returned format is currently fixed at `RFC3339`.
182
182
183
183
By default buckets are returned between the min and max value of the documents, including empty buckets. Setting `min_doc_count > 0` will filter empty buckets.
184
184
185
-
The value range of the buckets can bet extended via [`extended_bounds`](#extended_bounds) or limit the range via [`hard_bounds`](#hard_bounds).
185
+
The value range of the buckets can be extended via [`extended_bounds`](#extended_bounds) or limit the range via [`hard_bounds`](#hard_bounds).
186
186
187
187
#### Example
188
188
@@ -222,7 +222,7 @@ The interval to chunk your data range. Each bucket spans a value range of [0..in
222
222
Intervals implicitly defines an absolute grid of buckets `[interval * k, interval * (k + 1))`.
223
223
Offset makes it possible to shift this grid into `[offset + interval * k, offset + interval (k + 1))`. Offset has to be in the range [0, interval).
224
224
225
-
As an example, if there are two documents with value 8 and 12 and interval 10.0, they would fall into the buckets with the key 0 and 10. With offset 5 and interval 10, they would both fall into the bucket with they key 5 and the range [5..15)
225
+
As an example, if there are two documents with value 8 and 12 and interval 10.0, they would fall into the buckets with the key 0 and 10. With offset 5 and interval 10, they would both fall into the bucket with the key 5 and the range [5..15)
226
226
227
227
```json
228
228
{
@@ -381,7 +381,7 @@ Offset makes it possible to shift this grid into `[offset + interval * k, offset
381
381
382
382
This is especially useful when using `fixed_interval`, to shift the first bucket e.g. at the start of the year.
383
383
384
-
The `offset` parameter is has the same syntax as the `fixed_interval` parameter, but also allows for negative values.
384
+
The `offset` parameter has the same syntax as the `fixed_interval` parameter, but also allows for negative values.
385
385
386
386
###### **min_doc_count**
387
387
@@ -508,7 +508,7 @@ term-count.
508
508
509
509
Even with a larger `shard_size` value, doc_count values for a terms aggregation may be
510
510
approximate. As a result, any sub-aggregations on the terms aggregation may also be approximate.
511
-
`sum_other_doc_count`is the number of documents that didn’t make it into the the top size
511
+
`sum_other_doc_count`is the number of documents that didn’t make it into the top size
512
512
terms. If this is greater than 0, the terms agg had to throw away some
513
513
buckets, either because they didn’t fit into `size` on the root node or they didn’t fit into
514
514
`shard_size`on the leaf node.
@@ -907,7 +907,7 @@ The default value is 2.
907
907
908
908
### Sum
909
909
910
-
A single-value metric aggregation that that sums up numeric values that are that are extracted from the aggregated documents.
910
+
A single-value metric aggregation that sums up numeric values that are that are extracted from the aggregated documents.
911
911
Supported field types are `u64`, `f64`, `i64`, and `datetime`.
0 commit comments