Working solution serialization bug. by VersusFacit · Pull Request #5874 · dbt-labs/dbt-core

VersusFacit · 2022-09-18T10:31:40Z

resolves #5436
resolves #5385

Description

Ho, boy! What an odyssey. This PR reworks the adapter logging functions for dbt so that they conform to the intended spec of the logging classes. The resulting behavior makes our log functions reminiscent of Python's print().

Reproducing the errors

5385 models/asdf.sql:

{% set some_list = ['a', 'b', 'c'] %}
{{ log(some_list, info = true) }}
select 1 as id

DBT_ENV_SECRET_WHATEVER=1234 dbt run -s asdf

5436 asdf2.sql:

dbt --no-use-colors --log-format json run -s asdf2 | jq .msg

select * from foo.bar

——————

dbt-bigquery==1.0.0
dbt-core==1.0.8

1 of 1 START table model my_dataset.asdf........................................ [RUN]
Unhandled error while executing model.jaffle_shop.asdf
Pickling client objects is explicitly not supported.
Clients have non-trivial state that is local and unpickleable.
1 of 1 ERROR creating table model my_dataset.asdf............................... [ERROR in 0.61s]
Finished running 1 table model in 1.47s.
Completed with 1 error and 0 warnings:
Pickling client objects is explicitly not supported.
Clients have non-trivial state that is local and unpickleable.

dbt-bigquery==1.1.0
dbt-core==1.1.2

dbt-bigquery==1.2.0
dbt-bigquery==1.3.0b2

dbt-core==1.3.0b2
dbt-core==1.2.1

1 of 1 START table model my_dataset.asdf ....................................... [RUN]
1 of 1 ERROR creating table model my_dataset.asdf .............................. [ERROR in 0.53s]
Finished running 1 table model in 1.17s.
Completed with 1 error and 0 warnings:
Compilation Error in macro statement (macros/etc/statement.sql)
Object of type NotFound is not JSON serializable

in macro materialization_table_bigquery (macros/materializations/table.sql)
called by macro statement (macros/etc/statement.sql)

Error 1: Serialization Woes

json.dumps assumes that every element within a dictionary can be rendered as a JSON string, but complex objects inside the argument cause a serialization error. This happens precisely at line 209 of core/dbt/events/functions.
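A minimal sketch of this failure mode, not dbt's actual code. The `NotFound` class is a hypothetical stand-in for an adapter exception and `event` is an illustrative payload; the point is only that `json.dumps` rejects a non-serializable value and succeeds once that value is coerced with `str()`.

```python
import json


class NotFound(Exception):
    """Hypothetical stand-in for an adapter exception object."""


# Illustrative event payload whose "msg" field holds an exception, not a string.
event = {"code": "E001", "msg": NotFound("Not found: Dataset foo")}

try:
    json.dumps(event)
    error_text = None
except TypeError as exc:
    # json.dumps raises TypeError for objects it cannot encode.
    error_text = str(exc)

# Coercing the value to str before serializing avoids the error.
safe_event = {**event, "msg": str(event["msg"])}
serialized = json.dumps(safe_event)

print(error_text)
print(serialized)
```

The first print should show the same "is not JSON serializable" wording as the run output above; the second shows the payload with the exception rendered as text.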

Several classes have a msg parameter that the spec declares as a string. The spec does not always match reality: some arguments are not strings, which leads to trouble when logger member functions are called. If we force the message given to logger functions to be a string, the spec matches reality and the error vanishes. Local proof of it working on development.

This error is what the user OwenKephart graciously documented over in Bigquery issue 206 as a desired output for my test case model, designed to fail.

QED.

Error 2: Secret scrubbing implodes when provided a non-str type

Jinja's log hands Python a value that should be a string but isn't always one, yet our scrub_secrets function expects a string.

By making sure these MacroEvent* types are all in fact strings, we solve the current problem and hopefully, future ones.
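A rough sketch of why a non-string breaks scrubbing, assuming the scrubber is built on `str.replace`. The `scrub` function here is a hypothetical, simplified stand-in and not dbt's `scrub_secrets`; the mask string is also made up.

```python
from typing import List


def scrub(msg: str, secrets: List[str]) -> str:
    """Hypothetical, simplified scrubber: mask each secret found in msg."""
    scrubbed = msg
    for secret in secrets:
        scrubbed = scrubbed.replace(secret, "*****")
    return scrubbed


secrets = ["1234"]
some_list = ["a", "b", "c"]  # what {{ log(some_list) }} might hand to Python

try:
    scrub(some_list, secrets)  # a list has no .replace method
    failure = None
except AttributeError as exc:
    failure = str(exc)

# Stringifying first keeps the scrubber's str assumption true.
ok = scrub(str(some_list), secrets)
masked = scrub("token=1234", secrets)

print(failure)
print(ok)
print(masked)
```

Note that the failure only appears when at least one secret is set, since the loop body never runs otherwise; that matches the reproduction above, which sets a DBT_ENV_SECRET_ variable.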

Checklist

I have read the contributing guide and understand what's expected of me
I have signed the CLA
I have run this code in development and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
I have opened an issue to add/update docs, or docs changes are not required/relevant for this PR
I have run changie new to create a changelog entry

github-actions · 2022-09-18T10:32:01Z

Thank you for your pull request! We could not find a changelog entry for this change. For details on how to document a change, see the contributing guide.

core/dbt/events/adapter_endpoint.py

gshank

We're converting all of these Exception objects to strings in the next structured logging update anyway, because everything in the event has to be encoded into a protobuf message.

colin-rogers-dbt

LGTM

nathaniel-may · 2022-09-22T14:50:53Z

So it looks like #5385 has the exact same problem (passing a non-string to log). Since it's Jinja log, the first time the value exits Jinja-land into our control is in fire_event. My question to you is: do you think we should solve this lower in the call stack to knock both of these bugs out?

VersusFacit · 2022-10-17T00:41:45Z

Revised in light of the recent structured logging change.

PR description revised, mostly expanded to describe the second bug.

Employs the same subclass-as-trait paradigm to . And for good measure, I did check the new structured logging code on main still exhibits the bugs described in the linked issues before folding these changes into the code.

colin-rogers-dbt · 2022-10-17T15:28:46Z

core/dbt/events/base_types.py

+class AdapterEventStringFunctor:
+    def __post_init__(self):
+        super().__post_init__()
+        if not isinstance(self.base_msg, str):


do we need to check if base_msg exists?

Thought about this. We're talking data classes so these objects must be instantiated with params. A little toy example:

from dataclasses import dataclass

@dataclass
class Item:
    a: str
    b: str
    c: str

asdf = Item()

Try running this and it complains:

TypeError: __init__() missing 3 required positional arguments: 'a', 'b', and 'c'

Hence, if a dataclass exists, it's gotta have the params. And this functor should only be used in classes that have this param. I think that renders an existential check redundant? (if it's not, tell me)
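For illustration, here is a sketch of the mixin pattern under discussion. `BaseEvent`, `StringFunctor`, and `AdapterDebug` are hypothetical names, and the coercion inside the `if` is an assumption, since the quoted diff cuts off before that branch's body.

```python
from dataclasses import dataclass
from typing import Any


@dataclass
class BaseEvent:
    """Hypothetical base class; it exists so super().__post_init__() resolves."""

    def __post_init__(self) -> None:
        pass


class StringFunctor:
    """Mixin that coerces base_msg to str after dataclass initialization.

    The coercion in the `if` body is an assumption, not the PR's exact code.
    """

    def __post_init__(self) -> None:
        super().__post_init__()
        if not isinstance(self.base_msg, str):
            self.base_msg = str(self.base_msg)


@dataclass
class AdapterDebug(StringFunctor, BaseEvent):
    """Hypothetical event type whose spec says base_msg is a str."""

    base_msg: Any


# The generated __init__ calls __post_init__, which stringifies the list.
event = AdapterDebug(base_msg=["a", "b", "c"])
print(repr(event.base_msg))
```

Because `base_msg` is a required field, `AdapterDebug()` with no arguments raises a TypeError before `__post_init__` ever runs, which is the reasoning behind skipping an existence check.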

colin-rogers-dbt

Makes sense, just one question. Also, do we need unit tests to validate this behavior?

VersusFacit · 2022-10-17T19:45:47Z

Unsure about unit tests. Is it worth having tests to check that the intended spec holds for something like the structured logging? That feels extra to me, but I'm also open to writing some quick ones if it seems like a good idea from your POV!

colin-rogers-dbt

LGTM

VersusFacit self-assigned this Sep 18, 2022

VersusFacit requested a review from a team as a code owner September 18, 2022 10:31

VersusFacit requested a review from gshank September 18, 2022 10:31

cla-bot bot added the cla:yes label Sep 18, 2022

VersusFacit requested a review from a team as a code owner September 18, 2022 10:36

VersusFacit requested a review from colin-rogers-dbt September 18, 2022 10:36

gshank approved these changes Sep 19, 2022

VersusFacit added backport 1.1.latest labels Sep 20, 2022

colin-rogers-dbt approved these changes Sep 21, 2022

leahwicz mentioned this pull request Sep 22, 2022

[CT-756] Stringify user-provided messages to {{ log() }} #5385

Closed

Fleid mentioned this pull request Sep 29, 2022

[Bug] "dbt run --log-format json" throws exception "Encountered an error:Object of type CompilationException is not JSON serializable" #5357

Closed

1 task

Create functors to initialize event types with str-type member attributes. Before this change, the spec of various classes expected base_msg and msg params to be str's. This assumption did not always hold true. post_init hooks ensures the spec is obeyed.

ef4eae9

VersusFacit force-pushed the CT-803/serialization_errors_in_AdapterLogger branch from e07829f to ef4eae9 Compare October 17, 2022 00:35

VersusFacit requested review from colin-rogers-dbt and nathaniel-may October 17, 2022 00:36

Add new changelog.

bc1d3b0

VersusFacit added the backport 1.3.latest label Oct 17, 2022

Add msg type change functor to a few other events that could use it.

2dae7e7

VersusFacit force-pushed the CT-803/serialization_errors_in_AdapterLogger branch from 7e59699 to 2dae7e7 Compare October 17, 2022 01:20

colin-rogers-dbt reviewed Oct 17, 2022

colin-rogers-dbt approved these changes Oct 17, 2022

VersusFacit merged commit ff2f1f4 into main Oct 18, 2022

VersusFacit deleted the CT-803/serialization_errors_in_AdapterLogger branch October 18, 2022 19:20

jtcohen6 mentioned this pull request Jan 10, 2023

[CT-1783] [Bug] AttributeError: 'dict' object has no attribute 'replace' when logging a dict if an DBT_ENV_SECRET_ env var is set #6568

Closed

2 tasks

iknox-fa mentioned this pull request Jan 30, 2023

testing merge strat - do not merge ns/add to committer list #6783

Closed

6 tasks

VersusFacit merged 3 commits into main from CT-803/serialization_errors_in_AdapterLogger
