Run ensure L2 flow dependently on both devices #285

velp · 2025-01-22T17:35:52Z

This PR changes ensure_l2_flow to synchronize both F5 devices dependently. This change implements:

graph-type flow for both synchronizations and it means if one device fails the changes on the second device will be reverted
revert function for EnsureVLAN
revert function for EnsureRouteDomain

The same changes (using graph flow) are coming soon for remove_l2_flow.

m-kratochvil

Very nice, and much appreciated. As usually I reviewed basic syntax only, actual code review to be done by @BenjaminLudwigSAP and @notandy

octavia_f5/controller/worker/flows/f5_flows.py

octavia_f5/tests/unit/controller/worker/test_l2_sync_manager.py

BenjaminLudwigSAP

Thanks for this PR! I left a few annotations and questions.

BenjaminLudwigSAP · 2025-01-23T10:36:01Z

octavia_f5/controller/worker/l2_sync_manager.py

-        ensure_l2_flow = self._f5flows.make_ensure_l2_flow(selfips, store=store)
-        e = self.taskflow_load(ensure_l2_flow, store=store)
+    def _do_ensure_l2_flow(self, data: dict):
+        ensure_l2_flow = graph_flow.Flow('ensure-l2-flow-from-all-devices')


I don't see the point in using a graph flow here. The way I understand it, a graph flow is for throwing multiple tasks/subflows into a bucket, which are then resolved according to their interdependencies. However, here we only have two subflows, one for each BigIP device. As far as I can see those can be parallelized, so an unordered_flow would suffice. Or is there another good reason to use graph_flow?

The main difference is that you can run flows on both devices in parallel but if one of them fails the second one will be also reverted. You can achieve that with unorder flow only if you put all tasks for both devices into one unordered flow and they will run one by one for a really long time.

BenjaminLudwigSAP · 2025-01-23T13:30:19Z

octavia_f5/controller/worker/flows/f5_flows.py

        """

        # make SelfIP creation subflow
-        ensure_selfips_subflow = unordered_flow.Flow('ensure-selfips-subflow')
+        ensure_selfips_subflow = unordered_flow.Flow(
+            f'ensure-selfips-subflow-{store["bigip"].hostname}')


store["bigip"].hostname is used so often in this function (and reevaluated each time), that I prefer it to be put into a dedicated variable. But it's not very important, I'm also okay with this happening in some future refactoring commit or something.

make sense, will change that. BTW, getting a value from hash-map (dict) costs nothing

BenjaminLudwigSAP · 2025-01-23T13:32:19Z

octavia_f5/controller/worker/flows/f5_flows.py

        for selfip_port in selfips:
            ensure_selfip_task = f5_tasks.EnsureSelfIP(
-                name=f"ensure-selfip-{selfip_port.id}", inject={'port': selfip_port})
+                name=f'ensure-selfip-{selfip_port.id}-{store["bigip"].hostname}',


I think it's better to swap these two interpolations (i. e. {store["bigip"].hostname}-{selfip_port.id}), because the SelfIP is subordinate to the device. But that is a style choice, so I'm fine if you don't adress this.

make sense, will change

BenjaminLudwigSAP · 2025-01-23T13:41:01Z

octavia_f5/controller/worker/flows/f5_flows.py

-            ensure_subnet_route_task = f5_tasks.EnsureSubnetRoute(name=f"ensure-subnet-route-{subnet_route_name}",
-                                                                  inject={'subnet_id': subnet_id})
+            ensure_subnet_route_task = f5_tasks.EnsureSubnetRoute(
+                name=f'ensure-subnet-route-{subnet_route_name}-{store["bigip"].hostname}',


I think it's better to swap these two interpolations (i. e. {store["bigip"].hostname}-{subnet_route_name}), because the route is subordinate to the device. But that is a style choice, so I'm fine if you don't adress this.

BenjaminLudwigSAP · 2025-01-23T13:59:01Z

octavia_f5/controller/worker/flows/f5_flows.py

+            inject={
+                'bigip': store['bigip'],
+                'network': store['network']
+            }


Please move all these identical inject dictionaries into a variable.
It can also include subnet_id, since afaik it's okay to inject that into tasks that don't take it as an argument.

hm, good point, maybe I can just do it like inject=store, will test

BenjaminLudwigSAP · 2025-01-23T14:26:59Z

octavia_f5/controller/worker/tasks/f5_tasks.py

+
+        if res and not res.ok:
+            LOG.warning("%s: Failed removing route domain for network_id=%s vlan_id=%s: %s",
+                        bigip.hostname, network.id, network.vlan_id, res.content)


Please reraise the error, like this: res.raise_for_status().

octavia_f5/controller/worker/l2_sync_manager.py

BenjaminLudwigSAP · 2025-01-23T15:21:39Z

octavia_f5/controller/worker/tasks/f5_tasks.py

+                network: f5_network_models.Network):
+        device_response = bigip.get(path=f"/mgmt/tm/net/vlan/~Common~vlan-{network.vlan_id}?expandSubcollections=true")
+        if device_response.status_code == 404:
+            return None


Please either reraise any error (device_response.raise_for_status()) or remove the RaisesIControlRestError annotation.

We do not need to that here, because if resource does not exist it's ok and we will create it in the next task

BenjaminLudwigSAP · 2025-01-23T15:22:20Z

octavia_f5/controller/worker/tasks/f5_tasks.py

+            device_response = bigip.get(path=path)
+
+        if device_response.status_code == 404:
+            return None


Please either reraise any error (device_response.raise_for_status()) or remove the RaisesIControlRestError annotation.

We do not need to that here, because if resource does not exist it's ok and we will create it in the next task

BenjaminLudwigSAP · 2025-01-23T15:35:28Z

octavia_f5/tests/unit/controller/worker/test_l2_sync_manager.py

+        }
+        # self.manager._do_ensure_l2_flow(data=data)
+        self.assertRaises(Exception, self.manager._do_ensure_l2_flow, data=data)
+        # check thath both devices were called and REVERT task were not called


You mean "were also called" ;)

velp · 2025-02-17T11:30:52Z

@BenjaminLudwigSAP please take a look again. I added an availability check for BigIP devices.

Run ensure L2 flow dependently on both devices

e537619

velp requested review from notandy, m-kratochvil and BenjaminLudwigSAP January 22, 2025 17:35

m-kratochvil requested changes Jan 22, 2025

View reviewed changes

octavia_f5/controller/worker/flows/f5_flows.py Outdated Show resolved Hide resolved

octavia_f5/tests/unit/controller/worker/test_l2_sync_manager.py Outdated Show resolved Hide resolved

octavia_f5/tests/unit/controller/worker/test_l2_sync_manager.py Outdated Show resolved Hide resolved

BenjaminLudwigSAP requested changes Jan 23, 2025

View reviewed changes

Apply fixes for review

da435de

velp requested review from BenjaminLudwigSAP and m-kratochvil January 23, 2025 19:01

velp added 2 commits February 17, 2025 12:07

Add availability check for ensure_l2_flow with revert action

a7a1626

Fix pylint test

7cad0f9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run ensure L2 flow dependently on both devices #285

Run ensure L2 flow dependently on both devices #285

velp commented Jan 22, 2025

m-kratochvil left a comment

BenjaminLudwigSAP left a comment

BenjaminLudwigSAP Jan 23, 2025

velp Jan 23, 2025

BenjaminLudwigSAP Jan 23, 2025

velp Jan 23, 2025 •

edited

Loading

BenjaminLudwigSAP Jan 23, 2025

velp Jan 23, 2025

BenjaminLudwigSAP Jan 23, 2025

BenjaminLudwigSAP Jan 23, 2025

velp Jan 23, 2025

BenjaminLudwigSAP Jan 23, 2025

BenjaminLudwigSAP Jan 23, 2025

velp Jan 23, 2025

BenjaminLudwigSAP Jan 23, 2025

velp Jan 23, 2025

BenjaminLudwigSAP Jan 23, 2025

velp commented Feb 17, 2025

Run ensure L2 flow dependently on both devices #285

Are you sure you want to change the base?

Run ensure L2 flow dependently on both devices #285

Conversation

velp commented Jan 22, 2025

m-kratochvil left a comment

Choose a reason for hiding this comment

BenjaminLudwigSAP left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

velp Jan 23, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

velp commented Feb 17, 2025

velp Jan 23, 2025 •

edited

Loading