feat: Export products as dataframe #102

ShriramS-NI · 2025-03-06T11:44:00Z

This contribution adheres to CONTRIBUTING.md.

What does this Pull Request accomplish?

Adds functionality to return a normalized DataFrame, given a list of products as input.
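As a rough illustration of what "normalized" means here, the sketch below flattens a list of product-like records with `pandas.json_normalize`. The plain dictionaries stand in for the product models, the field names and values are hypothetical, and this is not necessarily how the utility in this PR is implemented.

```python
import pandas as pd

# Hypothetical product records; plain dicts stand in for the product models.
products = [
    {"id": "1", "name": "Amplifier", "properties": {"color": "red"}},
    {"id": "2", "name": "Sensor", "properties": {"voltage": "5V"}},
]

# json_normalize flattens nested dicts into dotted column names, so each
# unique property across all products gets its own `properties.<name>` column.
frame = pd.json_normalize(products)

print(frame.columns.to_list())
```

A product that lacks a given property ends up with a missing value in that property's column.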

Why should this Pull Request be merged?

This utility returns the products data as a DataFrame, which is convenient for further processing.

What testing has been done?

Unit tests have been added.

nisystemlink/clients/product/utilities/_dataframe_utilities.py

tests/unit/product/test_product_dataframe_utilities.py

rbell517 · 2025-03-06T19:51:39Z

tests/unit/__init__.py

Let's not add this unit folder; stick with the existing file layout instead. That would mean putting your tests in a product folder alongside core.

rbell517 · 2025-03-06T20:42:04Z

nisystemlink/clients/product/utilities/_dataframe_utilities.py

+            - A new column would be created for unique properties across all products. The property
+            columns would be named in the format `properties.property_name`.
+    """
+    products_dict_representation = [product.dict() for product in products]


`dict` has an `exclude_none` option that would apply here, since we're just going to drop those columns anyway. If we don't add them to the dict version of the object, that could save some fraction of the time spent checking which columns are empty. If that is reliable, we might be able to drop the `dropna` call altogether, which is probably a bigger win. Or maybe pandas is smart and can do this efficiently regardless. This SO post shows how to do a quick performance test to see if it makes any difference. It would probably be more significant on models that have more fields, like results.
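A quick comparison along these lines could look like the sketch below. It is only an approximation: plain dictionaries stand in for the output of `product.dict()` with and without `exclude_none`, the field names are hypothetical, and the timings will vary by machine and pandas version.

```python
import timeit

import pandas as pd


def make_records(count, include_none):
    """Build hypothetical product dicts, optionally carrying all-None fields."""
    records = []
    for i in range(count):
        record = {
            "id": str(i),
            "part_number": f"PN-{i}",
            "name": f"Product {i}",
            "properties": {"color": "red", "batch": str(i % 3)},
        }
        if include_none:
            # Mimics a dict built without exclude_none: unset fields are None.
            record["family"] = None
            record["updated_at"] = None
        records.append(record)
    return records


def normalize_then_dropna(records):
    """Normalize, then drop the columns whose values are all missing."""
    frame = pd.json_normalize(records)
    return frame.dropna(axis="columns", how="all")


def normalize_only(records):
    """Normalize records that never contained the None fields."""
    return pd.json_normalize(records)


with_none = make_records(1000, include_none=True)
without_none = make_records(1000, include_none=False)

frame_with_dropna = normalize_then_dropna(with_none)
frame_without_none = normalize_only(without_none)

dropna_seconds = timeit.timeit(lambda: normalize_then_dropna(with_none), number=20)
exclude_seconds = timeit.timeit(lambda: normalize_only(without_none), number=20)
print(f"normalize + dropna:   {dropna_seconds:.4f}s")
print(f"None fields excluded: {exclude_seconds:.4f}s")
```

In this sample both approaches yield the same columns, but only because the `None` fields are empty in every record. Whether that holds for real product data would need to be checked before removing `dropna`.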

rbell517 · 2025-03-06T21:00:38Z

tests/unit/product/test_product_dataframe_utilities.py

+        assert not products_dataframe.empty
+        assert (
+            products_dataframe.columns.to_list()
+            == expected_products_dataframe.columns.to_list()


In addition to verifying that the column names match the expected ones, would you also check the column data types? In particular, that the dates are dates and the arrays are arrays.
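One possible shape for such checks is sketched below, using `pandas.api.types.is_datetime64_any_dtype` for date columns and a per-cell `isinstance` check for list columns. The sample frame, its column names, and the helper `column_holds_lists` are hypothetical and not taken from the test file in this PR.

```python
import pandas as pd
from pandas.api.types import is_datetime64_any_dtype

# Hypothetical frame standing in for the utility's output.
products_dataframe = pd.DataFrame(
    {
        "id": ["1", "2"],
        "updated_at": pd.to_datetime(
            ["2025-03-06T11:44:00Z", "2025-03-06T19:51:39Z"], utc=True
        ),
        "keywords": [["k1", "k2"], ["k3"]],
    }
)


def column_holds_lists(series):
    """Return True if every non-missing cell in the column is a list."""
    return bool(series.dropna().map(lambda value: isinstance(value, list)).all())


# Date columns should carry a datetime dtype rather than plain strings.
assert is_datetime64_any_dtype(products_dataframe["updated_at"])

# List-valued columns are stored with object dtype, so check the cells.
assert column_holds_lists(products_dataframe["keywords"])
```

Because pandas has no dedicated list dtype, the list check inspects cell values instead of the column dtype.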

feat: export products as dataframe

42725e5

ShriramS-NI force-pushed the users/shriram/feat-products-dataframe-utility branch from 189fddd to 42725e5 on March 6, 2025 12:01

SSadaiyappan-NI approved these changes Mar 6, 2025

SSadaiyappan-NI reviewed Mar 6, 2025

tests/unit/product/test_product_dataframe_utilities.py (resolved)

refactor: update test file

2886cb2

SSSantosh18 requested changes Mar 6, 2025

SSSantosh18 reviewed Mar 6, 2025

nisystemlink/clients/product/utilities/_dataframe_utilities.py (resolved)

ShriramS-NI added 2 commits March 6, 2025 20:02

test: improvise fixtures

47447c6

update doc string

1c5d889

ShriramS-NI requested a review from SSSantosh18 March 6, 2025 14:46

SSSantosh18 reviewed Mar 6, 2025

nisystemlink/clients/product/utilities/_dataframe_utilities.py (outdated, resolved)

SSSantosh18 reviewed Mar 6, 2025

nisystemlink/clients/product/utilities/_dataframe_utilities.py (outdated, resolved)

SSSantosh18 reviewed Mar 6, 2025

nisystemlink/clients/product/utilities/_dataframe_utilities.py (outdated, resolved)

SSSantosh18 approved these changes Mar 6, 2025

tests/unit/product/test_product_dataframe_utilities.py (outdated, resolved)

refactor: doc strings and test file

84891eb

ShriramS-NI marked this pull request as ready for review March 6, 2025 15:15

ShriramS-NI requested review from rbell517, spanglerco and cameronwaterman as code owners March 6, 2025 15:15

rbell517 approved these changes Mar 6, 2025

rbell517 reviewed Mar 6, 2025
