
Adapting the script classification_with_grn_and_vsn to be Backend-Agnostic #2023

Merged
6 commits merged on Jan 17, 2025

Conversation

Humbulani1234
Contributor

This PR adapts the script classification_with_grn_and_vsn.py from the structured_data examples to run on the JAX and PyTorch backends as well, i.e., it makes the script backend-agnostic.

Approach:

  • Modified the model architecture: removed the preprocessing layers from the model and applied them in the tf.data.Dataset pipeline instead. Refer to the Keras documentation for more.
  • Generated the .ipynb and .md files
  • Matched the 95% accuracy achieved by the original model
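A minimal sketch of the idea in the first bullet: categorical values are turned into integer indices in the data pipeline before they reach the model (in the actual script this is done with Keras preprocessing layers such as `StringLookup` applied through `tf.data.Dataset.map`), so the model itself contains no backend-specific preprocessing. The vocabulary and values below are illustrative assumptions, not the script's real data:

```python
# Hypothetical vocabulary for one categorical column; the real script builds
# vocabularies from the CSV data and uses Keras preprocessing layers instead.
VOCAB = ["private", "self-employed", "government"]

# Index 0 is reserved for out-of-vocabulary values, mirroring StringLookup's
# default behavior.
TOKEN_TO_INDEX = {token: i + 1 for i, token in enumerate(VOCAB)}

def encode(value):
    """Map a raw categorical value to an integer index before it reaches the model."""
    return TOKEN_TO_INDEX.get(value, 0)

batch = ["private", "government", "unknown"]
encoded = [encode(v) for v in batch]  # [1, 3, 0]
```

Because the encoding happens in the data pipeline, the model only ever sees integer tensors, which every backend can consume.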

Contributor

@hertschuh hertschuh left a comment


@Humbulani1234 ,

Thank you for the PR!

Comment on lines 185 to 198
"""
Clean the directory for the downloaded files except the .tar.gz file and
also remove the empty directories
"""

subprocess.run(
f'find {extracted_path} -type f ! -name "*.tar.gz" -exec rm -f {{}} +',
shell=True,
check=True,
)
subprocess.run(
f"find {extracted_path} -type d -empty -exec rmdir {{}} +", shell=True, check=True
)

Contributor

This could apply to any dataset, but I find this an unnecessary distraction. Can you remove?

Contributor Author

Removed

Comment on lines 450 to 494
# The reason for the per-backend calculation is that I couldn't find an
# equivalent backend-agnostic Keras operation. In the following case there's
# a keras.ops.matmul, but it was returning errors. I could have used the
# TensorFlow matmul for all backends, but due to jax jit tracing it results
# in an error.
def matmul_dependent_on_backend(tensor_1, tensor_2):
    """
    Function for executing matmul for each backend.
    """
    # jax backend
    if keras.backend.backend() == "jax":
        import jax.numpy as jnp

        result = jnp.sum(tensor_1 * tensor_2, axis=1)
    elif keras.backend.backend() == "torch":
        result = torch.sum(tensor_1 * tensor_2, dim=1)
    # tensorflow backend
    elif keras.backend.backend() == "tensorflow":
        result = keras.ops.squeeze(
            tf.matmul(tensor_1, tensor_2, transpose_a=True), axis=1
        )
    # unsupported backend exception
    else:
        raise ValueError(
            "Unsupported backend: {}".format(keras.backend.backend())
        )
    return result

# jax backend
if keras.backend.backend() == "jax":
    # These repetitive imports are intentional, to emphasize the backend
    # separation
    import jax.numpy as jnp

    result_jax = matmul_dependent_on_backend(v, x)
    return result_jax
# torch backend
if keras.backend.backend() == "torch":
    import torch

    result_torch = matmul_dependent_on_backend(v, x)
    return result_torch
# tensorflow backend
if keras.backend.backend() == "tensorflow":
    import tensorflow as tf

    result_tf = keras.ops.squeeze(tf.matmul(v, x, transpose_a=True), axis=1)
    return result_tf
Contributor

This definitely should not be needed.

What is the issue with keras.ops.squeeze(keras.ops.matmul(keras.ops.transpose(v), x), axis=1)?

Contributor Author

After careful thought, I've made it work. I also struggled with the Keras docstring of the op keras.ops.transpose; I don't think `axes` is explicit about the permutations. I had to read the TensorFlow docs to get a clear picture. Nonetheless, it is resolved.
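For the record, the reviewer's suggestion works because, for each example, `squeeze(matmul(transpose(v), x))` and `sum(v * x, axis=...)` compute the same weighted sum over the feature axis, which is why the backend-specific branches were never needed. A minimal pure-Python sketch of that identity (the shapes `v: (n, 1)` and `x: (n, d)` are assumptions for illustration, not the script's actual tensors):

```python
def matmul_then_squeeze(v, x):
    """transpose(v) is (1, n); matmul with x of shape (n, d) gives (1, d); squeeze -> (d,)."""
    n, d = len(x), len(x[0])
    vt = [v[i][0] for i in range(n)]  # transpose of an (n, 1) column vector
    return [sum(vt[i] * x[i][j] for i in range(n)) for j in range(d)]

def broadcast_sum(v, x):
    """v (n, 1) * x (n, d) broadcasts to (n, d); summing over axis 0 gives (d,)."""
    n, d = len(x), len(x[0])
    prod = [[v[i][0] * x[i][j] for j in range(d)] for i in range(n)]
    return [sum(prod[i][j] for i in range(n)) for j in range(d)]

v = [[1.0], [2.0], [3.0]]
x = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
assert matmul_then_squeeze(v, x) == broadcast_sum(v, x) == [22.0, 28.0]
```

With batched tensors the same identity holds per example, which is what the single `keras.ops` expression exploits across all three backends.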

Comment on lines +496 to +498
# to remove the build warnings
def build(self):
    self.built = True
Contributor

Thanks!

Contributor Author

Added more of this.

@@ -415,7 +520,7 @@ def create_model(encoding_size):
 learning_rate = 0.001
 dropout_rate = 0.15
 batch_size = 265
-num_epochs = 20
+num_epochs = 1  # maybe adjusted to a desired value
Contributor

Please revert after you're done testing.

Contributor Author

Reverted to 20, but left the comment.

@@ -108,13 +109,37 @@
     "income_level",
 ]

-data_url = "https://archive.ics.uci.edu/static/public/20/census+income.zip"
+data_url = "https://archive.ics.uci.edu/static/public/117/census+income+kdd.zip"
Contributor

Why the change in dataset?

It seems like the original dataset was easier to handle.

Contributor Author

The dataset description in the script says it must have 41 input features: 7 numerical features and 34 categorical features. The original dataset only had 14 features, and its target variable was <=50K or >50K, whereas in the script it is -50000 or 50000+.

@Humbulani1234
Contributor Author

PR sent addressing the comments.

@@ -415,7 +471,7 @@ def create_model(encoding_size):
 learning_rate = 0.001
 dropout_rate = 0.15
 batch_size = 265
-num_epochs = 20
+num_epochs = 1  # may be adjusted to a desired value
Contributor

Was this a different one or was it not actually reverted? (Please change back to 20)

Contributor Author

Changed. It was an oversight.

@Humbulani1234
Contributor Author

PR sent. Together with the .md and .ipynb files.

Contributor

@hertschuh hertschuh left a comment

LGTM. Thank you for the port!

@hertschuh hertschuh merged commit fcf47ee into keras-team:master Jan 17, 2025
3 checks passed