[xcvrd] [cmis manager] CMIS manager cannot automatically select correct host lane count when selecting module application #19336
Comments
If we have a port and a corresponding transceiver with the following configuration and capability, which app should be selected?
speed: 100G
This is a good question to discuss. From my POV, we should try host lane counts in the order 8 -> 4 -> 2 -> 1. So, I would prefer 100GAUI-4 in this case. @prgeor, @mihirpat1, what do you think?
Does configuring the breakout setting solve the problem? As for the example you showed,
How about setting the breakout configuration like below?
@ishidawataru The CMIS manager selects the application based on what is in the CONFIG_DB PORT table. Please share your CONFIG_DB PORT table dump here for 100G speed.
@prgeor The current CMIS manager implementation searches for a module application that matches both speed and host lane count. @Junchao-Mellanox is pointing out that this behavior causes the port link to go down with the following configuration and needs improvement.
I initially agreed and implemented sonic-net/sonic-platform-daemons#507 as a draft PR. Currently, I'm waiting for @Junchao-Mellanox's response.
Hi @ishidawataru, DPB is not a perfect solution for this. As far as I know, DPB has many limitations; for example, it cannot automatically adjust other port-related configuration when a breakout is applied. Also, it is not user friendly to ask a SONiC user to do an extra DPB configuration when they hit this.
@Junchao-Mellanox Does the SAI require any modification to support this? What happens when the port is configured as 100G with lanes 0,1,2,3,4,5,6,7 on a platform with 50G/lane, for example? Will lanes 0 and 1 be used in this case?
Hi @ishidawataru, it depends on how the vendor implements this. Currently, I don't see a problem on the NVIDIA platform regarding SAI.
@Junchao-Mellanox How does the NVIDIA SAI choose the lanes to use for a speed configuration if there are multiple choices?
Does the NVIDIA SAI choose the lane counts as you mentioned?
If that is the case, does it make sense to spec this behavior in SAI so that the
Hi @ishidawataru, this is not a problem for the ASIC-side configuration. The user can choose how many lanes the ASIC uses. Here is the SONiC config:
@Junchao-Mellanox I see. In that case, can xcvrd use that configuration as a hint to choose the module application?
Description
When the CMIS manager is enabled, the following configuration causes the port link to go down:
speed: 100G
lanes: 0,1,2,3,4,5,6,7
module supported application: 100GAUI-2, 400GAUI-8
The CMIS manager deduces a host lane count of 8 from "0,1,2,3,4,5,6,7" and then tries to find an application with speed 100G and host lane count 8. It cannot find one, because the only 100G application the module supports, 100GAUI-2, uses 2 host lanes.
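The lookup described above can be modeled in a few lines of Python. This is only a sketch of the behavior reported in this issue, not the actual xcvrd code: the `Application` type and the helpers `host_lane_count` and `find_app_strict` are hypothetical names introduced for illustration.

```python
from typing import NamedTuple, Optional, Sequence


class Application(NamedTuple):
    """Hypothetical stand-in for one application advertised by the module."""
    name: str
    speed_gbps: int
    host_lane_count: int


# The two applications listed in this issue.
MODULE_APPS = [
    Application("100GAUI-2", 100, 2),
    Application("400GAUI-8", 400, 8),
]


def host_lane_count(lanes: str) -> int:
    """Count the comma-separated entries of the port's 'lanes' value."""
    return len([lane for lane in lanes.split(",") if lane.strip()])


def find_app_strict(apps: Sequence[Application], speed_gbps: int,
                    lane_count: int) -> Optional[Application]:
    """Return the first application matching BOTH speed and host lane count."""
    for app in apps:
        if app.speed_gbps == speed_gbps and app.host_lane_count == lane_count:
            return app
    return None


count = host_lane_count("0,1,2,3,4,5,6,7")
# No 100G application uses 8 host lanes, so the strict lookup finds nothing.
result = find_app_strict(MODULE_APPS, 100, count)
```

With the configuration from this issue, `count` is 8 and `result` is `None`, which corresponds to the failure described here.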
The CMIS manager should be smart enough to automatically choose 100GAUI-2, that is, 100G over 2 host lanes.
A workaround for this issue is to set lanes to:
lanes: 0,1
But there is no CLI to set port lanes.
Steps to reproduce the issue:
Describe the results you received:
Link is down. And error log:
Describe the results you expected:
xcvrd should be able to automatically choose the best application possible by using the current speed and a subset of the lanes.
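One possible shape for this behavior is a fallback search that keeps the configured speed fixed and tries host lane counts in the order 8 -> 4 -> 2 -> 1 suggested in the comments, never exceeding the number of lanes configured on the port. The sketch below is an assumption about how such a selection could look, not the implementation in the draft PR; `Application` and `select_app_with_fallback` are hypothetical names.

```python
from typing import NamedTuple, Optional, Sequence


class Application(NamedTuple):
    """Hypothetical stand-in for one application advertised by the module."""
    name: str
    speed_gbps: int
    host_lane_count: int


def select_app_with_fallback(apps: Sequence[Application], speed_gbps: int,
                             port_lane_count: int) -> Optional[Application]:
    """Pick an application for the given speed, preferring more host lanes.

    Lane counts are tried in the order 8, 4, 2, 1; counts larger than the
    number of lanes configured on the port are skipped.
    """
    for lane_count in (8, 4, 2, 1):
        if lane_count > port_lane_count:
            continue
        for app in apps:
            if app.speed_gbps == speed_gbps and app.host_lane_count == lane_count:
                return app
    return None


# Module from this issue: only 100GAUI-2 and 400GAUI-8 are supported.
issue_module = [
    Application("100GAUI-2", 100, 2),
    Application("400GAUI-8", 400, 8),
]
chosen = select_app_with_fallback(issue_module, 100, 8)

# Assumed module (for illustration only) that advertises two 100G options.
two_option_module = [
    Application("100GAUI-2", 100, 2),
    Application("100GAUI-4", 100, 4),
]
preferred = select_app_with_fallback(two_option_module, 100, 8)
```

For the module in this issue, `chosen` is 100GAUI-2 even though the port has 8 lanes configured. For the assumed module with two 100G options, `preferred` is 100GAUI-4, since larger lane counts are tried first. Which subset of the port's lanes is then actually used is a separate question that this sketch does not address.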
Output of show version:
Output of show techsupport:
Additional information you deem important (e.g. issue happens only occasionally):