Allow configurable and dynamic s3 path #134

arihantsurana · 2018-06-25T05:28:54Z

Currently, the S3 loader writes all files to a single S3 directory. This makes it hard to use with Athena or other Hive-compatible query engines, because time-based partitions cannot be derived from the directory structure. It also makes the data on S3 harder to maintain.
Please add an optional configuration setting that lets the loader write to a directory structure whose placeholders are filled in from the S3 loader's server timestamp.
For example:

configure to store in monthly chunks:
s3_path = "base_dir/enriched/good/yr={YYYY}/mo={MM}"

store in base directory:
s3_path = "some_dir/raw"

store in hourly chunks:
s3_path = "base_dir/enriched/good/date={YYYY-MM-dd}/hour={HH}"
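
As a rough illustration of the requested behaviour, the sketch below expands the bracketed placeholders in an `s3_path` value using a timestamp. It is written in Python for brevity and is not the loader's actual implementation; the function name `resolve_s3_path` and the limited token set (`YYYY`, `MM`, `dd`, `HH`) are assumptions drawn only from the examples above.

```python
import re
from datetime import datetime, timezone

# Tokens recognised inside {...}; an assumption based on the examples above.
_TOKEN_RE = re.compile(r"YYYY|MM|dd|HH")
# A placeholder is any run of text enclosed in a single pair of braces.
_PLACEHOLDER_RE = re.compile(r"\{([^{}]*)\}")


def resolve_s3_path(template: str, ts: datetime) -> str:
    """Fill each {...} placeholder in `template` with fields taken from `ts`.

    Text outside braces is left untouched, so a template without
    placeholders is returned unchanged.
    """
    values = {
        "YYYY": f"{ts.year:04d}",
        "MM": f"{ts.month:02d}",
        "dd": f"{ts.day:02d}",
        "HH": f"{ts.hour:02d}",
    }

    def fill(match: re.Match) -> str:
        # Replace only the known tokens; separators such as "-" are kept.
        return _TOKEN_RE.sub(lambda m: values[m.group(0)], match.group(1))

    return _PLACEHOLDER_RE.sub(fill, template)


if __name__ == "__main__":
    # Stand-in for the loader's server timestamp.
    now = datetime(2018, 6, 25, 5, 28, 54, tzinfo=timezone.utc)
    for template in (
        "base_dir/enriched/good/yr={YYYY}/mo={MM}",
        "some_dir/raw",
        "base_dir/enriched/good/date={YYYY-MM-dd}/hour={HH}",
    ):
        print(resolve_s3_path(template, now))
```

Because each resolved segment has the `key=value` form, the resulting directories follow the Hive-style partition layout that Athena can map to partition columns.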


arihantsurana · 2018-06-26T00:51:12Z

Added a PR to address this: #135

arihantsurana mentioned this issue Jun 26, 2018

Issue-134: allow configurable and dynamic s3 path #135

Merged

BenFradet changed the title ~~allow configurable and dynamic s3 path~~ Allow configurable and dynamic s3 path Jul 4, 2018

BenFradet closed this as completed in f0a5ba8 Jul 4, 2018

peel pushed a commit that referenced this issue Feb 24, 2020

Allow configurable and dynamic s3 path (closes #134)

7cf2988
