Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

release/2.0.9-spark-3.5 #598

Closed
wants to merge 5 commits into from
Closed

Conversation

shriyavanvari
Copy link
Contributor

Issue #, if available:

Description of changes:
Update release version name in pom.xml to 2.0.9-spark-3.5

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

yannis-mentekidis and others added 5 commits September 6, 2024 16:59
* Replace 'withColumns' with 'select' (#582)

'withColumns' was introduced in Spark 3.3, so it won't
work for Deequ's <3.3 builds.

* Replace rdd with dataframe functions in Histogram analyzer (#586)

Co-authored-by: Shriya Vanvari <svanvari@amazon.com>

* Updated version in pom.xml to 2.0.8-spark-3.5 (#578)

Co-authored-by: Yannis Mentekidis <mentekid@amazon.com>

---------

Co-authored-by: Josh <5685731+marcantony@users.noreply.github.com>
Co-authored-by: Shriya Vanvari <vanvari.shriya@gmail.com>
Co-authored-by: Shriya Vanvari <svanvari@amazon.com>
Co-authored-by: Yannis Mentekidis <mentekid@users.noreply.github.com>
Co-authored-by: Yannis Mentekidis <mentekid@amazon.com>
* Fix row level bug when composing outcome

- When a check fails due to a precondition failure, the row level results are not evaluated correctly.
- For example, let's say a check has a completeness constraint which passes, and a minimum constraint which fails due to a precondition failure.
- The row level results will be the results for just the completeness constraint. There will be no results generated for the minimum constraint, and therefore the row level results will be incorrect.
- We fix this by adding a default outcome for when the row level result column is not provided by the analyzer.

* Added similar logic to RowLevelConstraint as well

Skipped RowLevelGroupedConstraint because only UniqueValueRatio/Uniqueness use it, and they don't use preconditions.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants