[neutron] Mellanox Connectx5 ASAP2+LAG over VF+vxlan
What OS and kernel are you using?
From: Zoltan Langi <zoltan.langi at namecheap.com>
Sent: Friday, June 7, 2019 3:54 PM
To: openstack-discuss at lists.openstack.org
Subject: [neutron] Mellanox Connectx5 ASAP2+LAG over VF+vxlan
Hello everyone, I hope someone more experienced can help me with this problem I've been struggling for a while now.
I'm trying to set up ASAP2 ovs vxlan offload on a dual port Mellanox
ConnectX5 card between 2 hosts using LACP link aggregation and Rocky release.
When the LAG is not there, only one pf is being used, the offload works just fine, getting the line speed out of the vf-s.
(I initially followed this ASAP2 guide, works well:
Decided, to provide HA for the vf-s, may as well use LACP as it's supported for ASAP2 according to Mellanox:
So what I've done I've created a systemd script that puts the eswitch into switchdev mode on both ports before the networking starts at boot time, the bond0 comes up after the mode was changed just like in the docs.
The bond0 interface wasn't added to the ovs as the doc recommends as it keeps the vxlan tunnel, only the vf is there after openstack creates the vm.
The problem is only one direction of the traffic is offloaded when LAG is being used.
I opened a mellanox case and they recommended to install the latest ovs version which I did:
After using the latest OVS from master, the problem still exist, the offload simply doesn't work properly. The speed I am getting is way far away from the values that I get when only a single port is used.
Does anyone has any experience or any idea what should I look our for or check?
Thank you very much, anything is appreciated!