Ensure log entries use consistent ordering and types for columns #1404

matt-graham · 2024-06-21T17:19:57Z

As mentioned in #1227 (comment), there are currently instances of structured log entries where the columns value computed for the header entry, from the first log instance for a key, is not consistent with the columns value that would be computed from subsequent log instances for that key. This can happen because, for example, the columns dicts have keys (column names) in different orders, different values (types for a given key), or completely non-overlapping key-value pairs.

This PR makes a series of related changes:

The columns dict computed for the header log entry is now stored in the logger object and compared with the corresponding dict computed for each subsequent log entry with the same key. If they do not match, a new exception type, InconsistentLoggedColumnsError, is raised with details of the differences.
When converting logging data to a dictionary, if the data is a dictionary, or a dataframe that is converted to a dictionary, the resulting dictionary is sorted by its keys (which may be a mix of string and numeric values). This ensures consistent ordering of the columns even if the dictionaries / dataframes themselves do not order their keys / columns consistently. Similarly, set data is sorted before being converted to a dictionary.
A helper function is added to tlo.logging.helpers for logging the properties of an individual in the population dataframe as a dictionary in a way that keeps the types of the property values stable. This is achieved by returning the NumPy / pandas extension scalar types associated with the datatype of the array underlying a particular column / property, except for nullable booleans and categoricals, which have no corresponding scalar type and for which a length-1 array is used instead. The JSON encoding rules for these types are also updated accordingly.
For other cases where log entries were found to have inconsistent columns that are not covered by the fixes above, manual fixes to ensure type stability were added (typically wrapping values in a call to a built-in Python type such as float or str), and in some cases keys were aligned across different logger calls.
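
The key sorting and type stability points above can be illustrated with a minimal sketch using only the standard library. It is not the code in this PR: the names sort_key, sorted_dict and columns_of are hypothetical, and the choice to order numeric keys before string keys is an assumption made for the example.

```python
from numbers import Number


def sort_key(key):
    """Order numeric keys numerically first, then string keys alphabetically.

    Hypothetical helper: the ordering rule is an assumption for this sketch.
    """
    if isinstance(key, Number):
        return (0, key, "")
    return (1, 0, str(key))


def sorted_dict(data):
    """Return a copy of ``data`` with its keys in a deterministic order."""
    return {key: data[key] for key in sorted(data, key=sort_key)}


def columns_of(data):
    """Map each column name to the name of its value's type."""
    return {
        str(key): type(value).__name__
        for key, value in sorted_dict(data).items()
    }


first = {"b": 1.0, 10: "x", 2: 3}
later = {2: 5, "b": 2.5, 10: "y"}
# Same keys and types in a different insertion order give identical columns.
assert list(columns_of(first)) == ["2", "10", "b"]
assert list(columns_of(first).items()) == list(columns_of(later).items())

# An int where a float was logged earlier changes the columns ...
unstable = {"b": 2, 10: "z", 2: 7}
assert columns_of(unstable) != columns_of(first)
# ... and wrapping the value in float() restores type stability.
stable = {**unstable, "b": float(unstable["b"])}
assert columns_of(stable) == columns_of(first)
```

Sorting with an explicit key function avoids the TypeError that plain sorted() raises when comparing str and int keys directly.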

Hopefully, with the fixes here, we should be able to achieve consistent logging when suspending and resuming a simulation using the changes in #1227, compared to running contiguously.


matt-graham · 2024-07-02T08:54:44Z

In addition to the changes described above, this PR now also includes some more general changes / clean-up to the logging modules:

The functionality for old style logging messages has been removed from the Logger class as this no longer seems to be used anywhere.
The functionality for allowing logging of multirow dataframes has also been removed as this was only used in tests and never publicly exposed.
The repetition across the various level-specific logging methods has been removed by using partialmethod.
The interface to the Logger class has been cleaned up: all attributes that do not appear to be used outside of the class are hinted as private by underscore prefixing, attributes which are used externally are generally exposed as read-only properties, and some methods which did not require access to the class instance have been factored out into functions.
The direct coupling between the Logger and Simulation classes has been removed by instead having a global 'private' function _get_simulation_date, which can be set when initialising logging to a function which retrieves the current date from the simulation object. This simplifies unit testing the logging system and keeps a cleaner separation between the different framework components. A new reset function is also added to allow restoring the global state in the logging module (simulation date getter function and dictionary of initialised loggers) to its original values.
Some functions which were previously defined in the logging helpers module have been moved to the core module to avoid relying on importing internal (underscore prefixed) names.
Type hints have been added to all functions and methods in src/tlo/logging/core.py and src/tlo/logging/helpers.py.
The unit tests in tests/test_logging.py have been rewritten from scratch to give better coverage and also test the various factored out utility functions.

matt-graham · 2024-07-02T08:59:21Z

A remaining question is whether we want the current InconsistentLoggedColumnsError exception to instead be a warning. At the moment a simulation will fail if an attempt is made to create a log record for a key with a different structure from the first record, which was used to create the header. As these failures can occur quite far into the simulation for infrequently logged keys / log records triggered by rare events, this can lead to annoying late failures, so a warning may be a better option, as an inconsistent log entry structure is not a 'fatal' error.

tamuri · 2024-07-08T08:01:58Z

That sounds like a good idea, at least to start. We can monitor the warnings on the nightly scale run.

matt-graham · 2024-07-10T07:07:55Z

Now changed the exception to a warning.
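
A minimal sketch of what warning rather than raising could look like, using the standard warnings module. The warning class name InconsistentLoggedColumnsWarning and the check_columns function are hypothetical and are not taken from this PR; the message format is likewise an assumption.

```python
import warnings


class InconsistentLoggedColumnsWarning(UserWarning):
    """Hypothetical warning category; the real name is not given above."""


def check_columns(key, header_columns, columns):
    """Warn if ``columns`` differs from the columns recorded in the header.

    Returns True when the columns match (including their order).
    """
    if list(columns.items()) == list(header_columns.items()):
        return True
    header_items = set(header_columns.items())
    items = set(columns.items())
    # Sort the differences so the message is deterministic.
    missing = sorted(header_items - items)
    unexpected = sorted(items - header_items)
    warnings.warn(
        f"Inconsistent columns for key {key!r}: "
        f"in header only {missing}, in this entry only {unexpected}",
        InconsistentLoggedColumnsWarning,
        stacklevel=2,
    )
    return False


header = {"age": "int", "sex": "str"}
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    assert check_columns("person", header, {"age": "int", "sex": "str"})
    assert not check_columns("person", header, {"age": "float", "sex": "str"})
assert len(caught) == 1
```

With a dedicated warning category, a run that should still fail fast can opt back in to errors via warnings.simplefilter("error", InconsistentLoggedColumnsWarning).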

tamuri

All looks good - one minor suggestion (not a big deal).

src/tlo/logging/core.py

matt-graham added 7 commits June 17, 2024 14:56

Sort structured log dataframe entries and raise error on inconsistent columns

eff7fc2

Sort diff entries in logging error message

03db9ae

Add helper for converting dataframe row to dict for logging

2841be7

Add extra JSON encoding rules for logging

e73b101

Fix errors in modules due to unstable log columns

78fd12b

More unstable log columns fixes

33e5c43

Fix further instance of misaligned log entry key

56933b7

matt-graham marked this pull request as draft June 21, 2024 17:35

matt-graham added 22 commits June 24, 2024 14:30

Fix measles incidence age range log entry float / int instability

1104415

Ensure HSI event priorities are ints

5db2eed

Handle all pandas extension types in logging encoder + helper function

e575610

Use helper function to ensure type stability in equipment logging

22a0cd1

Fix isort spacing issues

aafd819

More manual fixes for log entry float/int type instability

55d7268

Ensure groupby on age_years includes combinations with zero counts

e3f59a8

Ensure stunting log contains all age_years combinations

d4e5fcb

Normalize NumPy scalar types and refactor logging

79381bc

Further logging refactoring

4369b95

Updating logging tests

6ed4f4d

Merge branch 'master' into mmg/inconsistent-log-columns-error-and-fixes

8a67794

Ensure numeric dict keys use natural sort order

ea46dcc

Automatically convert NumPy strings to Python type

4901ad8

Fix bug in length 1 extension array type handling

9e0990e

Add helper function for logging group by counts

12111e0

Remove new line to satisfy isort

9a11589

Use helper function to log person dict

619ec6e

Make district_num_of_residence property categorical

ca0e441

Make dummy missing facility level value str

482a500

Fix int/float switching in TB incidence logging

54d4eca

Fix import order

38180a1

matt-graham added 3 commits July 1, 2024 15:18

Have pylint ignore dynamic property access

3105876

Ensure type stability in CMD logging

f99c8a4

Fix end to end logging tests

c7301bf

matt-graham marked this pull request as ready for review July 2, 2024 14:28

matt-graham requested a review from tamuri July 2, 2024 14:28

Make inconsistent logging columns error a warning

383c5a8

tamuri approved these changes Jul 22, 2024

src/tlo/logging/core.py (outdated, resolved)

matt-graham added 3 commits July 23, 2024 11:16

Merge branch 'master' into mmg/inconsistent-log-columns-error-and-fixes

8eb9844

Remove superfluous logger level check

33d7d28

Fix merge conflict placeholders left in bad merge

524e9d5

matt-graham merged commit 625b4d9 into master Jul 24, 2024
59 checks passed

matt-graham deleted the mmg/inconsistent-log-columns-error-and-fixes branch July 24, 2024 08:35

This was referenced Jul 25, 2024

Add support for saving and loading simulation state to / from files #1227

Merged

Log entry for consumables item_codes_not_recognised in health system log is non-deterministic #1434

Closed

Warning generated about logged columns #1440

Open

tamuri mentioned this pull request Nov 6, 2024

Groups in population pyramid from calibration analyses not sorted correctly. #1506

Closed
