[PNE-241] Data Manager support #943

lkubie · 2024-06-13T14:23:55Z

Citrine Python PR

Jira Ticket

PNE-241

Description

WIP draft for supporting data manager.

PR Type:

Breaking change (fix or feature that would cause existing functionality to change)
Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Maintenance (non-breaking change to assist developers)

Adherence to team decisions

I have added tests for 100% coverage
I have written Numpy-style docstrings for every method and class.
I have communicated the downstream consequences of the PR to others.
I have bumped the version in __version__.py

anoto-moniz · 2024-06-21T14:05:47Z

Note about how I PR, since I don't think I've done one for you before.
I'll likely start some comments with "nitpick" or the like. That's a non-blocking comment, which I use to mean "this isn't super important, but I figured I'd flag it, and leave it up to you if you wish to make the change."

live_tests/test_data_manager_changes.py

anoto-moniz · 2024-06-21T14:13:36Z

src/citrine/_utils/functions.py

+import warnings
+warnings.filterwarnings('always', category=DeprecationWarning)


I think this is just here for testing, but just flagging to ensure we remove it before merge.

Yep! Just testing. They weren't showing up in my notebook otherwise for some reason

src/citrine/_utils/functions.py

src/citrine/resources/dataset.py

src/citrine/resources/delete.py

anoto-moniz · 2024-06-21T14:52:40Z

tests/resources/test_gemd_resource.py

+            )
+
+def test_invalid_collection_construction(invalid_collection):
+    # assertion is within the construction of the invalid_collection


Fixtures should be used to construct data. If the construction is the test, it belongs in the test function, unless there's some very good reason.

Got it. I'll fix this and my other occurrences of a similar pattern

anoto-moniz · 2024-06-21T14:54:33Z

tests/utils/fakes/fake_dataset_collection.py

-    def __init__(self, project_id, session):
-        DatasetCollection.__init__(self, project_id=project_id, session=session)
+    def __init__(self, project_id, session, team_id):
+        DatasetCollection.__init__(self, team_id = team_id, project_id=project_id, session=session)


Hmmmmmmmm...not sure why this was never moved over to super. Mind making that change while you're here?

lkubie · 2024-06-21T15:46:43Z

Note about how I PR, since I don't think I've done one for you before. I'll likely start some comments with "nitpick" or the like. That's a non-blocking comment, which I use to mean "this isn't super important, but I figured I'd flag it, and leave it up to you if you wish to make the change."

I appreciate it! The more feedback the better. For example, I didn't know that the standard is always is None and that using == None wasn't OK

pacdaemon · 2024-07-09T10:00:18Z

src/citrine/resources/condition_template.py

@@ -58,8 +58,6 @@ def __str__(self):
 class ConditionTemplateCollection(AttributeTemplateCollection[ConditionTemplate]):
    """A collection of condition templates."""

-    _path_template = 'projects/{project_id}/datasets/{dataset_id}/condition-templates'


👍 , go it, now it is a property on the DataConceptsCollection

pacdaemon · 2024-07-09T10:20:42Z

src/citrine/resources/gemtables.py

@anoto-moniz here we will need to discuss how to pick the appropriate API endpoint for building the table depending on the desired execution scope (whole team or project), I've sent you a pm to start talking about it.

It should be the same as the rest. Add support for a team_id parameter, and expose it on the TeamCollection, while deprecating the project_id parameter.

pacdaemon · 2024-07-09T14:50:10Z

src/citrine/resources/dataset.py

                                  dataset_uid=self.uid,
-                                  project_id=self.project_id
+                                  team_id=self.team_id


I see, even the team_id is not mandatory for the dataset's collection both the project and team initialize the collection with the corresponding team_id. This is ok.

pacdaemon · 2024-07-10T14:20:51Z

src/citrine/resources/material_run.py

+        query = {
+            "criteria": [
+                {
+                    "datasets": str(self.dataset_id),


datasets is an array

anoto-moniz · 2024-07-10T16:19:56Z

src/citrine/resources/file_link.py

@@ -192,14 +193,44 @@ def __str__(self):
 class FileCollection(Collection[FileLink]):
    """Represents the collection of all file links associated with a dataset."""

-    _path_template = 'projects/{project_id}/datasets/{dataset_id}/files'
+    _path_template = 'teams/{team_id}/datasets/{dataset_id}/files'


Will the projects version of the endpoint be dead?

We'll kill it after a while, but given that we are using the datasetId we can freely use the teams-based one for both project->dataset and team->dataset scenarios. It is going to return exactly the same data.

anoto-moniz · 2024-07-10T16:27:42Z

tests/resources/test_dataset.py

For each of these tests, we need to check that the projects version gives a DeprecatedWarning.

anoto-moniz · 2024-07-10T16:29:08Z

tests/resources/test_project.py

We should be checking that invoking the data related endpoints through ProjectCollection raises a deprecation warning.

pacdaemon · 2024-07-19T20:22:53Z

This work was already handled on #949

Lenore Kubie added 18 commits June 12, 2024 14:19

not passing tests, intial pass

d132156

additional updates

6d479b3

working towards passing tests

64da79b

still fixing tests

66bef93

inline comments around collection paths

a1a9381

make Project class mathod

640b3f7

making more DRY

a71e6fd

Move _path_template to DataConceptsCollection to keep code more DRY

934075b

cleaning up code

399af7a

working on tests

8cba57a

depricate dataset-sharing-related Project methods

9e3e9cd

Ensuring calls are forming as expected

ac7b619

Ensuring calls are forming as expected

4eb24e5

checking more paths

b9aa005

passing tests but missing some coverage

10c2945

passing tests will full coverage

6e2352d

done with live tests

b3b9019

adding to live tests

2c9dc1c