Avoid leading and trailing zeros in test_timestamp_seconds_rounding_necessary #9970

thirtiseven · 2023-12-06T06:02:16Z

timestamp_seconds checks for "rounding necessary" before it checks for overflow. The Decimal(20,7) case satisfies both conditions, so it should complain about rounding necessary. However, the first element in the df with the failing seed is 1793879511158.1649100, which ends in trailing zeros, so it bypasses the rounding necessary check and passes the overflow check next.
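
As a rough illustration of why that value slips past the first check, the sketch below tests whether a decimal number of seconds can be expressed as a whole number of microseconds (Spark timestamps have microsecond precision). The helper name `needs_rounding` is made up for this example and is not the plugin's or Spark's actual implementation.

```python
from decimal import Decimal

MICROS_PER_SECOND = 1_000_000  # Spark timestamps have microsecond precision


def needs_rounding(seconds: Decimal) -> bool:
    """Hypothetical helper: True if `seconds` is not a whole number of microseconds."""
    micros = seconds * MICROS_PER_SECOND
    # A fractional part left after scaling means digits would be lost.
    return micros != micros.to_integral_value()


# The value from the failing seed: the 7th fractional digit is 0.
print(needs_rounding(Decimal("1793879511158.1649100")))  # False
# Same value with a non-zero 7th fractional digit.
print(needs_rounding(Decimal("1793879511158.1649101")))  # True
```

With scale 7, only a non-zero last digit forces rounding, which is why trailing zeros in the generated data matter for this test.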

This PR adds a new parameter, full_precision, to the integration test DecimalGen so that it does not generate data with leading and trailing zeros, and sets it to True in test_timestamp_seconds_rounding_necessary.
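
A minimal sketch of the idea behind full_precision, assuming the goal is simply "first and last digits are never zero". The function `gen_full_precision_decimal` is hypothetical and is not the code added to DecimalGen in this PR.

```python
import random
from decimal import Decimal


def gen_full_precision_decimal(rand: random.Random, precision: int, scale: int) -> Decimal:
    """Hypothetical helper: a decimal with exactly `precision` significant digits,
    `scale` of them fractional, and no leading or trailing zero digit."""
    if precision < 1 or not 0 <= scale <= precision:
        raise ValueError("need precision >= 1 and 0 <= scale <= precision")
    digits = [rand.randint(1, 9)]  # first digit is never 0
    if precision > 1:
        digits += [rand.randint(0, 9) for _ in range(precision - 2)]
        digits.append(rand.randint(1, 9))  # last digit is never 0
    sign = rand.randint(0, 1)  # 0 = positive, 1 = negative
    # Building from a (sign, digits, exponent) tuple is exact, with no context rounding.
    return Decimal((sign, tuple(digits), -scale))


value = gen_full_precision_decimal(random.Random(0), precision=20, scale=7)
print(value)
```

For Decimal(20,7), a non-zero last digit means the value always has a non-zero digit in the 7th fractional place, so every generated row needs rounding when converted to microseconds.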

…cessary Signed-off-by: Haoyang Li <[email protected]>

res-life · 2023-12-06T09:18:00Z

LGTM

thirtiseven · 2023-12-06T09:25:04Z

build

revans2 · 2023-12-06T14:41:02Z

integration_tests/src/main/python/data_gen.py

@@ -244,7 +244,7 @@ def start(self, rand):

 class DecimalGen(DataGen):
    """Generate Decimals, with some built in corner cases."""
-    def __init__(self, precision=None, scale=None, nullable=True, special_cases=None, avoid_positive_values=False):
+    def __init__(self, precision=None, scale=None, nullable=True, special_cases=None, avoid_positive_values=False, full_precision=False):


Could we add a comment/doc string about what full_precision means? It is not clear from just the name of it.

winningsix

The first element in the df with the failed seed is 1793879511158.1649100, so it bypasses the rounding necessary

Could you elaborate on this and add it to the test as a comment?

As I understand it, the problem arises when we generate data with trailing/leading zeros, which leads to a misleading overflow exception.

Is this an issue with the test (data) design or with our code?

winningsix · 2023-12-07T00:44:53Z

integration_tests/src/main/python/date_time_test.py

-
-@pytest.mark.parametrize('data_gen', [DecimalGen(7, 7), DecimalGen(20, 7)], ids=idfn)
+
+# Make sure every decimal value in test data is rounding necessary by set full_precision=True to


typo: necessarily

I want to keep it as is because it's the error message, so I added quotes around it.

winningsix · 2023-12-07T00:45:37Z

integration_tests/src/main/python/date_time_test.py

-@pytest.mark.parametrize('data_gen', [DecimalGen(7, 7), DecimalGen(20, 7)], ids=idfn)
+
+# Make sure every decimal value in test data is rounding necessary by set full_precision=True to
+# avoid leading and trailing zeros


typo: tailing

It seems "trailing zeros" is correct; I made a typo in the PR title.

thirtiseven · 2023-12-07T03:29:04Z

build

thirtiseven · 2023-12-07T03:41:20Z

Could you elaborate on this and add it to the test as a comment?

As I understand it, the problem arises when we generate data with trailing/leading zeros, which leads to a misleading overflow exception.

Is this an issue with the test (data) design or with our code?

I added a test comment. It's an issue with the test data.

thirtiseven · 2023-12-07T03:41:31Z

build

winningsix · 2023-12-07T12:00:53Z

integration_tests/src/main/python/data_gen.py

@@ -244,7 +244,8 @@ def start(self, rand):

 class DecimalGen(DataGen):
    """Generate Decimals, with some built in corner cases."""
-    def __init__(self, precision=None, scale=None, nullable=True, special_cases=None, avoid_positive_values=False):
+    def __init__(self, precision=None, scale=None, nullable=True, special_cases=None, avoid_positive_values=False, full_precision=False):
+        """full_precision: If True, generate decimals with full precision without leading and trailing zeros."""


BTW, would trim_zeros be a better name?

I'm OK with either name, but I personally think trim_zeros means removing leading and trailing zeros from a number, rather than not generating them.

Sounds good. Merged

Avoid leading and tailing zeros in test_timestamp_seconds_rounding_ne…

a1bd436

…cessary Signed-off-by: Haoyang Li <[email protected]>

thirtiseven added the "test" (Only impacts tests) label Dec 6, 2023

thirtiseven self-assigned this Dec 6, 2023

thirtiseven linked an issue Dec 6, 2023 that may be closed by this pull request

[BUG] Failed case about test_timestamp_seconds_rounding_necessary[Decimal(20,7)][DATAGEN_SEED=1701412018] – src.main.python.date_time_test #9923

Closed

res-life previously approved these changes Dec 6, 2023

revans2 reviewed Dec 6, 2023

winningsix reviewed Dec 7, 2023

add comments and fix a typo

290521b

Signed-off-by: Haoyang Li <[email protected]>

thirtiseven dismissed res-life’s stale review via 290521b December 7, 2023 03:26

thirtiseven changed the title ~~Avoid leading and tailing zeros in test_timestamp_seconds_rounding_necessary~~ Avoid leading and trailing zeros in test_timestamp_seconds_rounding_necessary Dec 7, 2023

add test comment

8b7cf61

Signed-off-by: Haoyang Li <[email protected]>

winningsix approved these changes Dec 7, 2023

winningsix reviewed Dec 7, 2023

winningsix merged commit 3950fa0 into NVIDIA:branch-23.12 Dec 7, 2023
36 checks passed

thirtiseven mentioned this pull request Dec 8, 2023

[BUG] Failed case about test_timestamp_seconds_rounding_necessary[Decimal(20,7)][DATAGEN_SEED=1701412018] – src.main.python.date_time_test #9923

Closed

thirtiseven deleted the roundingNecessary branch December 19, 2023 08:44
