Converting a Spark not-nullable column to an Exasol column fails while saving #60

Closed
3cham opened this issue Jan 29, 2020 · 1 comment · Fixed by #63

3cham (Contributor) commented Jan 29, 2020

While creating a table in Exasol, we infer the column information from the Spark schema.
Spark marks a column as not nullable if it contains no null values, even though it may still contain empty strings. Our connector then sets that column to NOT NULL in the Exasol DDL. This causes writes to Exasol to fail whenever empty strings occur, because Exasol stores empty strings as `null`.
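
To illustrate, here is a minimal sketch of the kind of schema-to-DDL mapping described above (the `columnDdl` helper and the type mapping are illustrative assumptions, not the connector's actual API):

```scala
import org.apache.spark.sql.types._

// Illustrative sketch: render one Spark field as an Exasol column definition.
def columnDdl(field: StructField): String = {
  val exasolType = field.dataType match {
    case StringType  => "CLOB"          // assumed mapping, for illustration only
    case IntegerType => "DECIMAL(10,0)"
    case LongType    => "DECIMAL(19,0)"
    case DoubleType  => "DOUBLE"
    case _           => "VARCHAR(2000000)"
  }
  // The problematic part: a not-nullable Spark field becomes NOT NULL,
  // even though it may legally contain empty strings.
  val constraint = if (field.nullable) "" else " NOT NULL"
  s""""${field.name}" $exasolType$constraint"""
}

// columnDdl(StructField("city", StringType, nullable = false))
// => "city" CLOB NOT NULL  -- inserting "" then fails, since Exasol stores "" as null
```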

I suggest we remove this nullability check, for the sake of world peace, since it took a lot of time to track down the cause of this problem :)

Let me know your opinions, @morazow @jpizagno

morazow (Contributor) commented Jan 30, 2020

Hey @3cham,

Good to hear from you!

Yes, it is a good point! From my side, I would prefer to remove the `NOT NULL` only for string types, but I guess we can remove it for all types. The constraint just puts the validation on the database side; if users want that, they can filter the dataframe before inserting or run an ALTER command to add the constraint later inside the database. So it should be safe to remove it from the connector.
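
As a concrete example of those two options, a hedged sketch (the dataframe, column, and table names are hypothetical):

```scala
import org.apache.spark.sql.DataFrame
import org.apache.spark.sql.functions.col

// Option 1: filter out nulls and empty strings on the Spark side before
// inserting, so the data satisfies a NOT NULL constraint in Exasol.
def dropEmpty(df: DataFrame, column: String): DataFrame =
  df.filter(col(column).isNotNull && col(column) =!= "")

// Option 2: load without the constraint, then add it inside Exasol, e.g.
//   ALTER TABLE MY_SCHEMA.MY_TABLE MODIFY (MY_COLUMN VARCHAR(200) NOT NULL);
// (illustrative SQL; check the Exasol manual for the exact MODIFY syntax)
```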

Please feel free to send a pull request!

morazow added a commit that referenced this issue Mar 30, 2020
We create an Exasol table, if it does not already exist, before saving the
Spark dataframe. The `NOT NULL` constraint was added to the create table DDL
if the Spark schema field is not nullable.

However, this can be a problem on the Exasol side, because Exasol stores
`null` when the string is empty for `VARCHAR` or `CLOB` column types.
Therefore, the not null constraint fails when inserting empty strings.

This commit removes the `NOT NULL` constraint from string types even if
they are not nullable.

Fixes #60.
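
A minimal sketch of what the change amounts to, assuming a small helper that renders the nullability constraint (the helper name is hypothetical, not the connector's actual code):

```scala
import org.apache.spark.sql.types.{StringType, StructField}

// Skip NOT NULL for string types, since Exasol stores empty strings as null
// and the constraint would then reject otherwise valid rows.
def notNullConstraint(field: StructField): String =
  field.dataType match {
    case StringType           => ""          // even when !field.nullable
    case _ if !field.nullable => " NOT NULL"
    case _                    => ""
  }
```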
morazow added a commit that referenced this issue Apr 9, 2020