Skip to content

Export to clickhouse for metrics_v0 fails #3490

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
ccerv1 opened this issue Apr 2, 2025 · 1 comment
Open

Export to clickhouse for metrics_v0 fails #3490

ccerv1 opened this issue Apr 2, 2025 · 1 comment
Labels
c:data Gathering data (e.g. indexing)

Comments

@ccerv1
Copy link
Member

ccerv1 commented Apr 2, 2025

What is it?

Here is an example run that failed repeatedly: https://admin-dagster.opensource.observer/runs/5a505b16-71cd-40e9-be7c-26519a577dcf

To replicate, try to materialize this asset: https://admin-dagster.opensource.observer/assets/clickhouse_metrics/metrics_v0

Error log is:

dagster._core.errors.DagsterExecutionStepExecutionError: Error occurred while executing op "trino_clickhouse_export":

  File "/usr/local/lib/python3.12/dist-packages/dagster/_core/execution/plan/execute_plan.py", line 245, in dagster_event_sequence_for_step
    yield from check.generator(step_events)
  File "/usr/local/lib/python3.12/dist-packages/dagster/_core/execution/plan/execute_step.py", line 501, in core_dagster_event_sequence_for_step
    for user_event in _step_output_error_checked_user_event_sequence(
                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/dagster/_core/execution/plan/execute_step.py", line 184, in _step_output_error_checked_user_event_sequence
    for user_event in user_event_sequence:
                      ^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/dagster/_core/execution/plan/execute_step.py", line 88, in _process_asset_results_to_events
    for user_event in user_event_sequence:
                      ^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/dagster/_core/execution/plan/compute.py", line 190, in execute_core_compute
    for step_output in _yield_compute_results(step_context, inputs, compute_fn, compute_context):
                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/dagster/_core/execution/plan/compute.py", line 159, in _yield_compute_results
    for event in iterate_with_context(
                 ^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/dagster/_utils/__init__.py", line 478, in iterate_with_context
    with context_fn():
         ^^^^^^^^^^^^
  File "/usr/lib/python3.12/contextlib.py", line 158, in __exit__
    self.gen.throw(value)
  File "/usr/local/lib/python3.12/dist-packages/dagster/_core/execution/plan/utils.py", line 86, in op_execution_error_boundary
    raise error_cls(

The above exception was caused by the following exception:
clickhouse_connect.driver.exceptions.DatabaseError: HTTPDriver for https://w541fomho5.us-central1.gcp.clickhouse.cloud:8443 received ClickHouse error code 62
 Code: 62. DB::Exception: Syntax error: failed at position 222 ('['): [varchar], `sql_source_path` varchar, `aggregation_function` varchar, ) ENGINE = MergeTree() ORDER BY () . Expected one of: NULL, NOT, DEFAULT, MATERIALIZED, EPHEMERAL, ALIAS, AUTO_INCREMENT, PRIMARY KEY, data type, identifier. (SYNTAX_ERROR) (version 24.10.1.11489 (official build))


  File "/usr/local/lib/python3.12/dist-packages/dagster/_core/execution/plan/utils.py", line 56, in op_execution_error_boundary
    yield
  File "/usr/local/lib/python3.12/dist-packages/dagster/_utils/__init__.py", line 480, in iterate_with_context
    next_output = next(iterator)
                  ^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/dagster/_core/execution/plan/compute.py", line 128, in gen_from_async_gen
    yield loop.run_until_complete(async_gen.__anext__())
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.12/asyncio/base_events.py", line 691, in run_until_complete
    return future.result()
           ^^^^^^^^^^^^^^^
  File "/usr/src/app/warehouse/oso_dagster/resources/sqlmesh.py", line 138, in trino_clickhouse_export
    await transfer(
  File "/usr/src/app/warehouse/metrics_tools/transfer/coordinator.py", line 46, in transfer
    await destination.importer.import_table(destination.table, export_reference)
  File "/usr/src/app/warehouse/metrics_tools/transfer/clickhouse.py", line 54, in import_table
    create_table(
  File "/usr/src/app/warehouse/oso_dagster/utils/clickhouse.py", line 69, in create_table
    result = client.command(command % params)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/clickhouse_connect/driver/httpclient.py", line 361, in command
    response = self._raw_request(payload, params, headers, method, fields=fields, server_wait=False)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/clickhouse_connect/driver/httpclient.py", line 474, in _raw_request
    self._error_handler(response)
  File "/usr/local/lib/python3.12/dist-packages/clickhouse_connect/driver/httpclient.py", line 395, in _error_handler
    raise OperationalError(err_str) if retried else DatabaseError(err_str) from None

@ccerv1 ccerv1 added the c:data Gathering data (e.g. indexing) label Apr 2, 2025
@github-project-automation github-project-automation bot moved this to Backlog in OSO Apr 2, 2025
Copy link

linear bot commented Apr 2, 2025

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
c:data Gathering data (e.g. indexing)
Projects
Status: Backlog
Development

No branches or pull requests

1 participant