Skip to content

Commit 48651b0

Browse files
Naarcha-AWSshaavangachenqi0805dlvenablenatebower
authored
Data Prepper 2.7 documentation (opensearch-project#6763)
* Obfuscate processor doc (opensearch-project#6387) Signed-off-by: shaavanga <[email protected]> * [Data Prepper] MAINT: document on disable_refresh secret extension setting (opensearch-project#6384) * MAINT: document on disable_refresh secret extension setting Signed-off-by: George Chen <[email protected]> * Update _data-prepper/managing-data-prepper/configuring-data-prepper.md Signed-off-by: Naarcha-AWS <[email protected]> * Update _data-prepper/managing-data-prepper/configuring-data-prepper.md Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: George Chen <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> * Updates to the S3 source documentation (opensearch-project#6379) * Updates to the S3 source documentation to include missing features and metrics. Signed-off-by: David Venable <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Try to fix build error * Update s3.md * Update s3.md * Update s3.md * Update s3.md * Update s3.md * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Update s3.md * See if removing links removes build error * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: David Venable <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> Co-authored-by: Nathan Bower <[email protected]> * Update list_to_map processor in Data Prepper (opensearch-project#6382) * Update list-to-map processor Signed-off-by: Hai Yan <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: Hai Yan <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> Co-authored-by: Nathan Bower <[email protected]> * Improvements to the S3 sink documenation (opensearch-project#6383) * Corrections and clarifications on the S3 sink. Include an IAM policy and an example Parquet schema. Signed-off-by: David Venable <[email protected]> * Updates to the S3 sink to clarify how the object name is generated. Removes an option which does not exist. Signed-off-by: David Venable <[email protected]> * Update s3.md * Update s3.md * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Update _data-prepper/pipelines/configuration/sinks/s3.md Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Update s3.md * Add David's link Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: David Venable <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> Co-authored-by: Nathan Bower <[email protected]> * Add permissions, metrics, and metadata attributes to Data Prepper dyn… (opensearch-project#6380) * Add permissions, metrics, and metadata attributes to Data Prepper dynamodb source documentation Signed-off-by: Taylor Gray <[email protected]> * Address PR comments Signed-off-by: Taylor Gray <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Update dynamo-db.md Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: Taylor Gray <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> Co-authored-by: Nathan Bower <[email protected]> * Date Processor doc update (opensearch-project#6381) * Date Processor doc update Signed-off-by: Asif Sohail Mohammed <[email protected]> * Fixed table header indentation Signed-off-by: Asif Sohail Mohammed <[email protected]> * Fix formatting and grammar. * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Update date.md --------- Signed-off-by: Asif Sohail Mohammed <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> Co-authored-by: Nathan Bower <[email protected]> * Add docs for join function in Data Prepper (opensearch-project#6688) * Add docs for join function Signed-off-by: Hai Yan <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Update expression-syntax.md * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: Hai Yan <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> Co-authored-by: Nathan Bower <[email protected]> * Add docs for flatten processor for Data Prepper (opensearch-project#6685) * Add docs for flatten processor Signed-off-by: Hai Yan <[email protected]> * Add grammar edits * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Update flatten.md * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: Hai Yan <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> Co-authored-by: Nathan Bower <[email protected]> * Adds data prepper decompress processor documentation (opensearch-project#6683) * Add data prepper decompress processor documentation Signed-off-by: Taylor Gray <[email protected]> * Update decompress.md Signed-off-by: Naarcha-AWS <[email protected]> * Update decompress.md Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: Taylor Gray <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> Co-authored-by: Nathan Bower <[email protected]> * Add data prepper documentation for grok performance_metadata (opensearch-project#6681) * Add data prepper grok performance metadata documentation Signed-off-by: Taylor Gray <[email protected]> * Update grok.md Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Update grok.md Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: Taylor Gray <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> Co-authored-by: Nathan Bower <[email protected]> * Documentation for data prepper dynamodb source view_on_remove feature (opensearch-project#6738) * Add documentation for dynamodb source view_on_remove feature Signed-off-by: Taylor Gray <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: Taylor Gray <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> * Update Data Prepper opensearch sink documentation (opensearch-project#6386) * Update Data Prepper opensearch sink documentation Signed-off-by: Taylor Gray <[email protected]> * Formatting fixes and adding introductory text. Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Update opensearch.md Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> * Update opensearch.md Signed-off-by: Naarcha-AWS <[email protected]> * Update opensearch.md Signed-off-by: Naarcha-AWS <[email protected]> * Update opensearch.md Signed-off-by: Naarcha-AWS <[email protected]> * Update opensearch.md Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: Taylor Gray <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> Co-authored-by: Nathan Bower <[email protected]> * Adds a configuration for the Data Prepper S3 source workers field. (opensearch-project#6774) * Adds a configuration for the Data Prepper S3 source workers field. Signed-off-by: David Venable <[email protected]> * Apply suggestions from code review Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: David Venable <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> * Add parse_ion processor (opensearch-project#6761) * Add parse_ion processor Signed-off-by: Archer <[email protected]> * Apply suggestions from code review Co-authored-by: David Venable <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Update parse-ion.md Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Heather Halter <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Heather Halter <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: Archer <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: David Venable <[email protected]> Co-authored-by: Heather Halter <[email protected]> Co-authored-by: Nathan Bower <[email protected]> * Add docs for map_to_list processor (opensearch-project#6680) * Add docs for map_to_list processor Signed-off-by: Hai Yan <[email protected]> * Update map-to-list.md * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: Hai Yan <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> Co-authored-by: Nathan Bower <[email protected]> * Adds documentation for the Data Prepper geoip processor and geoip_service extension (opensearch-project#6772) * Adds documentation for the Data Prepper geoip processor and geoip_service extension. Signed-off-by: David Venable <[email protected]> * Update extensions.md * Update geoip_service.md * Update geoip.md * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Update _data-prepper/managing-data-prepper/extensions/extensions.md Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> * Apply suggestions from code review Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> --------- Signed-off-by: David Venable <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Co-authored-by: Naarcha-AWS <[email protected]> Co-authored-by: Nathan Bower <[email protected]> * Update flatten.md Signed-off-by: Naarcha-AWS <[email protected]> * Fix nav order Signed-off-by: Archer <[email protected]> --------- Signed-off-by: shaavanga <[email protected]> Signed-off-by: George Chen <[email protected]> Signed-off-by: Naarcha-AWS <[email protected]> Signed-off-by: David Venable <[email protected]> Signed-off-by: Hai Yan <[email protected]> Signed-off-by: Taylor Gray <[email protected]> Signed-off-by: Asif Sohail Mohammed <[email protected]> Signed-off-by: Archer <[email protected]> Co-authored-by: Prathyusha Vangala <[email protected]> Co-authored-by: Qi Chen <[email protected]> Co-authored-by: David Venable <[email protected]> Co-authored-by: Nathan Bower <[email protected]> Co-authored-by: Hai Yan <[email protected]> Co-authored-by: Taylor Gray <[email protected]> Co-authored-by: Asif Sohail Mohammed <[email protected]> Co-authored-by: Heather Halter <[email protected]>
1 parent 5a873ab commit 48651b0

22 files changed

+1281
-102
lines changed

_data-prepper/managing-data-prepper/configuring-data-prepper.md

+3-1
Original file line numberDiff line numberDiff line change
@@ -128,6 +128,7 @@ extensions:
128128
region: <YOUR_REGION_1>
129129
sts_role_arn: <YOUR_STS_ROLE_ARN_1>
130130
refresh_interval: <YOUR_REFRESH_INTERVAL>
131+
disable_refresh: false
131132
<YOUR_SECRET_CONFIG_ID_2>:
132133
...
133134
```
@@ -148,7 +149,8 @@ Option | Required | Type | Description
148149
secret_id | Yes | String | The AWS secret name or ARN. |
149150
region | No | String | The AWS region of the secret. Defaults to `us-east-1`.
150151
sts_role_arn | No | String | The AWS Security Token Service (AWS STS) role to assume for requests to the AWS Secrets Manager. Defaults to `null`, which will use the [standard SDK behavior for credentials](https://docs.aws.amazon.com/sdk-for-java/latest/developer-guide/credentials.html).
151-
refresh_interval | No | Duration | The refreshment interval for AWS secrets extension plugin to poll new secret values. Defaults to `PT1H`. See [Automatically refreshing secrets](#automatically-refreshing-secrets) for details.
152+
refresh_interval | No | Duration | The refreshment interval for the AWS Secrets extension plugin to poll new secret values. Defaults to `PT1H`. For more information, see [Automatically refreshing secrets](#automatically-refreshing-secrets).
153+
disable_refresh | No | Boolean | Disables regular polling on the latest secret values inside the AWS secrets extension plugin. Defaults to `false`. When set to `true`, `refresh_interval` will not be used.
152154

153155
#### Reference secrets
154156
ß
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,15 @@
1+
---
2+
layout: default
3+
title: Extensions
4+
parent: Managing Data Prepper
5+
has_children: true
6+
nav_order: 18
7+
---
8+
9+
# Extensions
10+
11+
Data Prepper extensions provide Data Prepper functionality outside of core Data Prepper pipeline components.
12+
Many extensions provide configuration options that give Data Prepper administrators greater flexibility over Data Prepper's functionality.
13+
14+
Extension configurations can be configured in the `data-prepper-config.yaml` file under the `extensions:` YAML block.
15+
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,67 @@
1+
---
2+
layout: default
3+
title: geoip_service
4+
nav_order: 5
5+
parent: Extensions
6+
grand_parent: Managing Data Prepper
7+
---
8+
9+
# geoip_service
10+
11+
The `geoip_service` extension configures all [`geoip`]({{site.url}}{{site.baseurl}}/data-prepper/pipelines/configuration/processors/geoip) processors in Data Prepper.
12+
13+
## Usage
14+
15+
You can configure the GeoIP service that Data Prepper uses for the `geoip` processor.
16+
By default, the GeoIP service comes with the [`maxmind`](#maxmind) option configured.
17+
18+
The following example shows how to configure the `geoip_service` in the `data-prepper-config.yaml` file:
19+
20+
```
21+
extensions:
22+
geoip_service:
23+
maxmind:
24+
database_refresh_interval: PT1H
25+
cache_count: 16_384
26+
```
27+
28+
## maxmind
29+
30+
The GeoIP service supports the MaxMind [GeoIP and GeoLite](https://dev.maxmind.com/geoip) databases.
31+
By default, Data Prepper will use all three of the following [MaxMind GeoLite2](https://dev.maxmind.com/geoip/geolite2-free-geolocation-data) databases:
32+
33+
* City
34+
* Country
35+
* ASN
36+
37+
The service also downloads databases automatically to keep Data Prepper up to date with changes from MaxMind.
38+
39+
You can use the following options to configure the `maxmind` extension.
40+
41+
Option | Required | Type | Description
42+
:--- | :--- | :--- | :---
43+
`databases` | No | [database](#database) | The database configuration.
44+
`database_refresh_interval` | No | Duration | How frequently to check for updates from MaxMind. This can be any duration in the range of 15 minutes to 30 days. Default is `PT7D`.
45+
`cache_count` | No | Integer | The maximum cache count by number of items in the cache, with a range of 100--100,000. Default is `4096`.
46+
`database_destination` | No | String | The name of the directory in which to store downloaded databases. Default is `{data-prepper.dir}/data/geoip`.
47+
`aws` | No | [aws](#aws) | Configures the AWS credentials for downloading the database from Amazon Simple Storage Service (Amazon S3).
48+
`insecure` | No | Boolean | When `true`, this options allows you to download database files over HTTP. Default is `false`.
49+
50+
## database
51+
52+
Option | Required | Type | Description
53+
:--- | :--- | :--- | :---
54+
`city` | No | String | The URL of the city in which the database resides. Can be an HTTP URL for a manifest file, an MMDB file, or an S3 URL.
55+
`country` | No | String | The URL of the country in which the database resides. Can be an HTTP URL for a manifest file, an MMDB file, or an S3 URL.
56+
`asn` | No | String | The URL of the Autonomous System Number (ASN) of where the database resides. Can be an HTTP URL for a manifest file, an MMDB file, or an S3 URL.
57+
`enterprise` | No | String | The URL of the enterprise in which the database resides. Can be an HTTP URL for a manifest file, an MMDB file, or an S3 URL.
58+
59+
60+
## aws
61+
62+
Option | Required | Type | Description
63+
:--- | :--- | :--- | :---
64+
`region` | No | String | The AWS Region to use for the credentials. Default is the [standard SDK behavior for determining the Region](https://docs.aws.amazon.com/sdk-for-java/latest/developer-guide/region-selection.html).
65+
`sts_role_arn` | No | String | The AWS Security Token Service (AWS STS) role to assume for requests to Amazon S3. Default is `null`, which will use the [standard SDK behavior for credentials](https://docs.aws.amazon.com/sdk-for-java/latest/developer-guide/credentials.html).
66+
`aws_sts_header_overrides` | No | Map | A map of header overrides that the AWS Identity and Access Management (IAM) role assumes when downloading from Amazon S3.
67+
`sts_external_id` | No | String | An STS external ID used when Data Prepper assumes the STS role. For more information, see the `ExternalID` documentation in the [STS AssumeRole](https://docs.aws.amazon.com/STS/latest/APIReference/API_AssumeRole.html) API reference.

_data-prepper/pipelines/configuration/processors/date.md

+43-11
Original file line numberDiff line numberDiff line change
@@ -9,24 +9,32 @@ nav_order: 50
99
# date
1010

1111

12-
The `date` processor adds a default timestamp to an event, parses timestamp fields, and converts timestamp information to the International Organization for Standardization (ISO) 8601 format. This timestamp information can be used as an event timestamp.
12+
The `date` processor adds a default timestamp to an event, parses timestamp fields, and converts timestamp information to the International Organization for Standardization (ISO) 8601 format. This timestamp information can be used as an event timestamp.
1313

1414
## Configuration
1515

1616
The following table describes the options you can use to configure the `date` processor.
1717

18+
<!-- vale off -->
1819
Option | Required | Type | Description
1920
:--- | :--- | :--- | :---
20-
match | Conditionally | List | List of `key` and `patterns` where patterns is a list. The list of match can have exactly one `key` and `patterns`. There is no default value. This option cannot be defined at the same time as `from_time_received`. Include multiple date processors in your pipeline if both options should be used.
21-
from_time_received | Conditionally | Boolean | A boolean that is used for adding default timestamp to event data from event metadata which is the time when source receives the event. Default value is `false`. This option cannot be defined at the same time as `match`. Include multiple date processors in your pipeline if both options should be used.
22-
destination | No | String | Field to store the timestamp parsed by date processor. It can be used with both `match` and `from_time_received`. Default value is `@timestamp`.
23-
source_timezone | No | String | Time zone used to parse dates. It is used in case the zone or offset cannot be extracted from the value. If the zone or offset are part of the value, then timezone is ignored. Find all the available timezones [the list of database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones#List) in the **TZ database name** column.
24-
destination_timezone | No | String | Timezone used for storing timestamp in `destination` field. The available timezone values are the same as `source_timestamp`.
25-
locale | No | String | Locale is used for parsing dates. It's commonly used for parsing month names(`MMM`). It can have language, country and variant fields using IETF BCP 47 or String representation of [Locale](https://docs.oracle.com/javase/8/docs/api/java/util/Locale.html) object. For example `en-US` for IETF BCP 47 and `en_US` for string representation of Locale. Full list of locale fields which includes language, country and variant can be found [the language subtag registry](https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry). Default value is `Locale.ROOT`.
21+
`match` | Conditionally | [Match](#Match) | The date match configuration. This option cannot be defined at the same time as `from_time_received`. There is no default value.
22+
`from_time_received` | Conditionally | Boolean | When `true`, the timestamp from the event metadata, which is the time at which the source receives the event, is added to the event data. This option cannot be defined at the same time as `match`. Default is `false`.
23+
`date_when` | No | String | Specifies under what condition the `date` processor should perform matching. Default is no condition.
24+
`to_origination_metadata` | No | Boolean | When `true`, the matched time is also added to the event's metadata as an instance of `Instant`. Default is `false`.
25+
`destination` | No | String | The field used to store the timestamp parsed by the date processor. Can be used with both `match` and `from_time_received`. Default is `@timestamp`.
26+
`output_format` | No | String | Determines the format of the timestamp added to an event. Default is `yyyy-MM-dd'T'HH:mm:ss.SSSXXX`.
27+
`source_timezone` | No | String | The time zone used to parse dates, including when the zone or offset cannot be extracted from the value. If the zone or offset are part of the value, then the time zone is ignored. A list of all the available time zones is contained in the **TZ database name** column of [the list of database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones#List).
28+
`destination_timezone` | No | String | The time zone used for storing the timestamp in the `destination` field. A list of all the available time zones is contained in the **TZ database name** column of [the list of database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones#List).
29+
`locale` | No | String | The location used for parsing dates. Commonly used for parsing month names (`MMM`). The value can contain language, country, or variant fields in IETF BCP 47, such as `en-US`, or a string representation of the [locale](https://docs.oracle.com/javase/8/docs/api/java/util/Locale.html) object, such as `en_US`. A full list of locale fields, including language, country, and variant, can be found in [the language subtag registry](https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry). Default is `Locale.ROOT`.
30+
<!-- vale on -->
2631

27-
<!---## Configuration
32+
### Match
2833

29-
Content will be added to this section.--->
34+
Option | Required | Type | Description
35+
:--- | :--- | :--- | :---
36+
`key` | Yes | String | Represents the event key against which to match patterns. Required if `match` is configured.
37+
`patterns` | Yes | List | A list of possible patterns that the timestamp value of the key can have. The patterns are based on a sequence of letters and symbols. The `patterns` support all the patterns listed in the Java [DatetimeFormatter](https://docs.oracle.com/javase/8/docs/api/java/time/format/DateTimeFormatter.html) reference. The timestamp value also supports `epoch_second`, `epoch_milli`, and `epoch_nano` values, which represent the timestamp as the number of seconds, milliseconds, and nanoseconds since the epoch. Epoch values always use the UTC time zone.
3038

3139
## Metrics
3240

@@ -40,5 +48,29 @@ The following table describes common [Abstract processor](https://github.com/ope
4048

4149
The `date` processor includes the following custom metrics.
4250

43-
* `dateProcessingMatchSuccessCounter`: Returns the number of records that match with at least one pattern specified by the `match configuration` option.
44-
* `dateProcessingMatchFailureCounter`: Returns the number of records that did not match any of the patterns specified by the `patterns match` configuration option.
51+
* `dateProcessingMatchSuccessCounter`: Returns the number of records that match at least one pattern specified by the `match configuration` option.
52+
* `dateProcessingMatchFailureCounter`: Returns the number of records that did not match any of the patterns specified by the `patterns match` configuration option.
53+
54+
## Example: Add the default timestamp to an event
55+
The following `date` processor configuration can be used to add a default timestamp in the `@timestamp` filed applied to all events:
56+
57+
```yaml
58+
- date:
59+
from_time_received: true
60+
destination: "@timestamp"
61+
```
62+
63+
## Example: Parse a timestamp to convert its format and time zone
64+
The following `date` processor configuration can be used to parse the value of the timestamp applied to `dd/MMM/yyyy:HH:mm:ss` and write it in `yyyy-MM-dd'T'HH:mm:ss.SSSXXX` format:
65+
66+
```yaml
67+
- date:
68+
match:
69+
- key: timestamp
70+
patterns: ["dd/MMM/yyyy:HH:mm:ss"]
71+
destination: "@timestamp"
72+
output_format: "yyyy-MM-dd'T'HH:mm:ss.SSSXXX"
73+
source_timezone: "America/Los_Angeles"
74+
destination_timezone: "America/Chicago"
75+
locale: "en_US"
76+
```
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,49 @@
1+
---
2+
layout: default
3+
title: decompress
4+
parent: Processors
5+
grand_parent: Pipelines
6+
nav_order: 40
7+
---
8+
9+
# decompress
10+
11+
The `decompress` processor decompresses any Base64-encoded compressed fields inside of an event.
12+
13+
## Configuration
14+
15+
Option | Required | Type | Description
16+
:--- | :--- | :--- | :---
17+
`keys` | Yes | List<String> | The fields in the event that will be decompressed.
18+
`type` | Yes | Enum | The type of decompression to use for the `keys` in the event. Only `gzip` is supported.
19+
`decompress_when` | No | String| A [Data Prepper conditional expression](https://opensearch.org/docs/latest/data-prepper/pipelines/expression-syntax/) that determines when the `decompress` processor will run on certain events.
20+
`tags_on_failure` | No | List<String> | A list of strings with which to tag events when the processor fails to decompress the `keys` inside an event. Defaults to `_decompression_failure`.
21+
22+
## Usage
23+
24+
The following example shows the `decompress` processor used in `pipelines.yaml`:
25+
26+
```yaml
27+
processor:
28+
- decompress:
29+
decompress_when: '/some_key == null'
30+
keys: [ "base_64_gzip_key" ]
31+
type: gzip
32+
```
33+
34+
## Metrics
35+
36+
The following table describes common [abstract processor](https://github.com/opensearch-project/data-prepper/blob/main/data-prepper-api/src/main/java/org/opensearch/dataprepper/model/processor/AbstractProcessor.java) metrics.
37+
38+
| Metric name | Type | Description |
39+
| ------------- | ---- | -----------|
40+
| `recordsIn` | Counter | The ingress of records to a pipeline component. |
41+
| `recordsOut` | Counter | The egress of records from a pipeline component. |
42+
| `timeElapsed` | Timer | The time elapsed during execution of a pipeline component. |
43+
44+
### Counter
45+
46+
The `decompress` processor accounts for the following metrics:
47+
48+
* `processingErrors`: The number of processing errors that have occurred in the `decompress` processor.
49+

_data-prepper/pipelines/configuration/processors/delete-entries.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ layout: default
33
title: delete_entries
44
parent: Processors
55
grand_parent: Pipelines
6-
nav_order: 51
6+
nav_order: 41
77
---
88

99
# delete_entries

_data-prepper/pipelines/configuration/processors/dissect.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ layout: default
33
title: dissect
44
parent: Processors
55
grand_parent: Pipelines
6-
nav_order: 52
6+
nav_order: 45
77
---
88

99
# dissect

_data-prepper/pipelines/configuration/processors/drop-events.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ layout: default
33
title: drop_events
44
parent: Processors
55
grand_parent: Pipelines
6-
nav_order: 53
6+
nav_order: 46
77
---
88

99
# drop_events

0 commit comments

Comments
 (0)