...

Primary Requirement ID

Secondary Requirement ID

Unit Test ID

LUTF ID / Issue

Unit Test Description

cfg-105
basic-cfg-err-001
test_udsp_basic_err_01
fail (return -22); this appears to be the known issue listed below
  • Setup: configure single network, 3 NIDs on the network 
  • Add UDSP rule that gives two of the interfaces highest priority
  • Execute UDSP "Show" command
  • Verify that the resulting YAML file contains policy description as expected
  • Execute UDSP "Delete" command providing policy index that doesn't exist
  • Verify that the command failed with an error


basic-cfg-err-002
test_udsp_basic_err_02
pass
  • Setup: configure single network, 3 NIDs on the network 
  • Verify that attempts to add UDSP rule for source or dst network fail: "0tcp", "0o2ib", "tcpp", "t", "tcp100000000000000000000000000000000000000000000000001"
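The rejected names above can be reproduced with a small pre-check. This is a minimal sketch and not LNet's actual parser: the allow-list of network types (only the two that appear on this page) and the 16-bit limit on the network number are assumptions made for illustration, and `is_valid_net_name` is a hypothetical helper name.

```python
import re

# Assumption: only the network types named on this page are listed here;
# a real Lustre build knows more LND types than these two.
KNOWN_NET_TYPES = ("tcp", "o2ib")

# Assumption: the network number must fit in 16 bits.
MAX_NET_NUMBER = 0xFFFF


def is_valid_net_name(name: str) -> bool:
    """Return True if name looks like '<type>' or '<type><number>'."""
    for net_type in KNOWN_NET_TYPES:
        if not name.startswith(net_type):
            continue
        suffix = name[len(net_type):]
        if suffix == "":
            return True
        # Only plain ASCII digits are accepted as a network number.
        if re.fullmatch(r"[0-9]+", suffix) and int(suffix) <= MAX_NET_NUMBER:
            return True
    return False


# The five names the test expects to be rejected.
INVALID_NAMES = [
    "0tcp", "0o2ib", "tcpp", "t",
    "tcp100000000000000000000000000000000000000000000000001",
]

for candidate in INVALID_NAMES:
    print(candidate, is_valid_net_name(candidate))
```

A check like this only mirrors the expectation of the test case; the authoritative result is still the error returned by the UDSP add command itself.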


basic-cfg-err-003
test case missing; tested manually
Jira: LU-15749 (Whamcloud Community Jira)
  • Setup: configure single network, 3 NIDs on the network 
  • Verify that attempts to add UDSP rule for source or dst NID fail: "@tcp", "1@tcp", "1.1.1@tcp", "1.1.1.1.1@tcp"
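The malformed NIDs above all fail a plain "dotted-quad address, then @, then network" check. The sketch below illustrates that check with the Python standard library. It is an assumption-laden simplification: `is_valid_ipv4_nid` is a hypothetical helper, it only covers IPv4-style NIDs, and it does not attempt any range or wildcard syntax that a UDSP rule may otherwise accept.

```python
import ipaddress


def is_valid_ipv4_nid(nid: str) -> bool:
    """Return True if nid has the form '<IPv4 address>@<network>'."""
    addr, sep, net = nid.partition("@")
    if not sep or not addr or not net:
        return False
    try:
        # IPv4Address requires exactly four decimal octets.
        ipaddress.IPv4Address(addr)
    except ValueError:
        return False
    return True


# The four NIDs the test expects to be rejected.
INVALID_NIDS = ["@tcp", "1@tcp", "1.1.1@tcp", "1.1.1.1.1@tcp"]

for candidate in INVALID_NIDS:
    print(candidate, is_valid_ipv4_nid(candidate))
```

The network part is deliberately not validated here, so the sketch stays focused on the address errors this test case exercises.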

...

Primary Requirement ID

Secondary Requirement ID

Unit Test ID

LUTF ID / Issue

Unit Test Description




test_udsp_routed_net_01

The test itself passed, but the unconfigure code at the end of the script returned an error:

error: "LUTFError:\n  node-name: mds_HOST\n  msg: lustre_rmmod failed\n  arg:\
          \ null\n  file name: /usr/lib64/lustre/tests/lutf/python/tests-infra/lnet.py\n\
          \  line number: 611\n  function: unconfigure\n"

If the unconfigure_lnet() call is commented out, the test passes:

lutf>>> R()
Tests:
  - name: lutf-udsp
    description: auster lutf
    SubTests:
      - status: PASS
        duration: 8
        return_code: 0
        name: udsp_routed_net_01
    duration: 8
    status: PASS
    submission: Fri Apr 22 17:07:56 UTC 2022

  • Configure single local network with two NIDs on the network (RemoteLayout1)
  • Configure two routers/gateways each providing access to a remote network
  • Configure remote peer to have access to the remote network with two NIDs
  • Add UDSP rule that designates a pair of a router and remote peer NIDs as preferred
  • Add UDSP rule that gives the same remote peer NID highest priority
  • Start traffic
  • Stop traffic
  • Verify that the preferred remote peer NID and router NID were used
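The final verification step can be expressed as a comparison of per-NID send counters taken before and after the traffic run. The following is a minimal sketch under stated assumptions: the counters are assumed to have already been collected into plain dictionaries (for example from a send_count field in LNet statistics output), the helper names are hypothetical, and the NIDs in the example are made up.

```python
def nids_with_new_sends(before, after):
    """Return the set of NIDs whose send counter grew between snapshots."""
    return {nid for nid, count in after.items() if count > before.get(nid, 0)}


def only_preferred_used(before, after, preferred):
    """True if some traffic was sent and all of it used preferred NIDs."""
    used = nids_with_new_sends(before, after)
    return bool(used) and used <= set(preferred)


# Hypothetical snapshots: NID -> send_count.
before = {"192.168.1.10@tcp": 5, "192.168.1.11@tcp": 7}
after = {"192.168.1.10@tcp": 9, "192.168.1.11@tcp": 7}

print(only_preferred_used(before, after, ["192.168.1.10@tcp"]))
```

Requiring a non-empty set of used NIDs keeps the check from passing trivially when no traffic was generated at all.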




pass - manually run
  • Configure single local network with two NIDs on the network (RemoteLayout1)
  • Configure two routers/gateways each providing access to a remote network
  • Configure remote peer to have access to the remote network with two NIDs
  • Assign higher route priority to the route using gateway 1
  • Add UDSP rule that designates a pair of a router (gateway 2) and remote peer NIDs as preferred
  • Start traffic
  • Stop traffic
  • Verify that the higher priority route was used (gateway 1)




fail - manually run

Failed to set up the environment as RemoteLayout2: the top path can be made to pass, but the bottom path does not work.

  • Configure two local networks (net1, net2) with two NIDs on each network (RemoteLayout2)
  • Configure remote peer to have access to the remote network with two NIDs
  • On peerA add UDSP rule that designates gateway 1 as preferred for all NIDs on peerB
  • Start traffic (peerA to peerB)
  • Stop traffic
  • Verify that if net1 was used then the paired router was used 

...

Primary Requirement ID

Secondary Requirement ID

Unit Test ID

Issue

Unit Test Description




pass - manually run

Add a UDSP rule for interface A.

Simulate the error with "lctl net_drop_add -s A -d B -i 2".

Run lctl ping to B from A.

Check "lnetctl net show -v 3": send_count increased on A as well as on the other interfaces, and at the same time the dropped count in the health stats increased.

Delete the rule with "lctl net_drop_del" and run lctl ping again.

send_count increased by 1 on A only.

* No decrease in the health value was observed, but a developer checked the LNet debug log, which showed the value changing.

  • MR Node with Multiple interfaces
  • Add UDSP rule that gives one of the local interfaces highest priority
  • Send a PING
  • Verify that the NID with the highest priority was used
  • Send a PING
  • Simulate an <error>
  • PING msg should be queued on resend queue
  • PING msg will be resent on a different interface
  • Failed interface's health value will be decremented
  • Failed interface will be placed on the recovery queue
  • Eventually the health score should recover so that only the preferred interface is used (sensitivity > 0)
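Because the health value may dip and recover between two manual checks, the "eventually recovers" step is easier to verify by polling. The sketch below is one possible way to do that and is not part of LUTF: `wait_for_health` is a hypothetical helper, the caller supplies the function that reads the current health value, and the target of 1000 in the example is an assumed maximum health value.

```python
import time


def wait_for_health(read_health, target, timeout_s=60.0, poll_s=1.0,
                    sleep=time.sleep, clock=time.monotonic):
    """Poll read_health() until it returns >= target or timeout_s elapses.

    Returns (recovered, samples). The samples list lets the caller also
    confirm that the value was below target at some point during the run.
    """
    samples = []
    deadline = clock() + timeout_s
    while True:
        value = read_health()
        samples.append(value)
        if value >= target:
            return True, samples
        if clock() >= deadline:
            return False, samples
        sleep(poll_s)


# Hypothetical readings standing in for a real health-value query.
readings = iter([950, 975, 1000])
recovered, samples = wait_for_health(lambda: next(readings), target=1000,
                                     sleep=lambda seconds: None)
print(recovered, samples)
```

Passing `sleep` and `clock` as parameters keeps the helper testable without real delays; in an actual run the defaults would be left in place.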



pass - manually run

Add a UDSP rule on tcp1.

Then add a net_drop rule for NID A on tcp1. Traffic either goes through the good interface on tcp1 or fails; the ping failed every other time.

  • MR Node with Multiple interfaces: configure two networks, two NIDs on each network
  • Add UDSP rule that gives one of the networks highest priority
  • Start traffic
  • Stop traffic
  • Verify that NIDs on the network with the highest priority were used
  • Start traffic
  • Simulate an <error> (drop on one of the NIDs on the highest priority network)
  • Failed interface's health value will be decremented
  • Failed interface will be placed on the recovery queue
  • Verify that the remaining interface on the failed network is used
  • Eventually the health score should recover so that only the two interfaces on the preferred network are used (sensitivity > 0)



pass - manually run
  • Configure single local network with two NIDs on the network 
  • Configure two routers/gateways each providing access to a remote network
  • Configure remote peer to have access to the remote network with two NIDs
  • Add UDSP rule that designates a router as preferred for all remote peer NIDs 
  • Start traffic
  • Stop traffic
  • Verify that the preferred router was used
  • Add a PUT drop rule on the router to drop traffic on one of the interfaces.
  • Verify that traffic goes to the other interfaces of the preferred router. There shouldn't be any drop in traffic.
  • Add a PUT drop rule on the router to drop traffic on all interfaces.
  • Verify that traffic goes through the non-preferred router until one of the interfaces of the preferred router recovers.

...