RPL rank and forward errors by laurentderu · Pull Request #652 · contiki-os/contiki

laurentderu · 2014-04-18T14:00:12Z

Currently, when a packet with a Forward Error set in the Hop-by-Hop option is received by an RPL node, it is forwarded back to its originator. When such a packet reaches the Border Router, it triggers a global repair. This means that any route error in the DAG will trigger a complete reconfiguration of the network, wasting a huge amount of energy because all the timers are reset and all the nodes must relearn their neighbourhood. In our testbed we have observed global repairs triggered several times per hour.

As a solution, we have removed the forwarding of packets with the forward error flag set. Instead, we send back a No-Path DAO to remove the offending route on the originating node, as suggested by the RFC. On the border router, the route is simply removed and the DIO timer is reset to trigger DAOs from all the children.
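A minimal sketch of one reading of this handling, not the code in this PR. The flag values follow the RPL Option layout in RFC 6553; the `fe_node` struct, its counters and `fe_on_packet()` are hypothetical names used only for illustration, with counters standing in for the real actions.

```c
#include <stdbool.h>
#include <stdint.h>

/* Flag bits of the RPL Option in the Hop-by-Hop header (RFC 6553). */
#define FE_FLAG_DOWN    0x80  /* 'O': packet is travelling down the DAG */
#define FE_FLAG_FWD_ERR 0x20  /* 'F': forwarding error */

/* Hypothetical per-node bookkeeping; each counter stands in for an action. */
struct fe_node {
  bool is_root;           /* true on the border router */
  int routes_removed;
  int no_path_daos_sent;
  int dio_timer_resets;
  int global_repairs;     /* never incremented in this sketch */
};

enum fe_verdict { FE_FORWARD, FE_DROP };

enum fe_verdict
fe_on_packet(struct fe_node *n, uint8_t flags, bool have_route)
{
  if(flags & FE_FLAG_FWD_ERR) {
    /* A child reported a forwarding error: remove the stale route and
       drop the packet instead of passing it further up the DAG. */
    n->routes_removed++;
    if(n->is_root) {
      /* The root asks its children for fresh DAOs; no global repair. */
      n->dio_timer_resets++;
    }
    return FE_DROP;
  }
  if((flags & FE_FLAG_DOWN) && !have_route) {
    /* No downward route: tell the sender with a No-Path DAO and drop,
       rather than returning the packet with the F flag set. */
    n->no_path_daos_sent++;
    return FE_DROP;
  }
  return FE_FORWARD;
}
```

In this sketch a route error stays local: the only network-wide effect is the DIO timer reset on the root.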

Also, packets with the rank error flag set are currently forwarded. This is forbidden by the RFC (except for the first error, as one-hop rank errors are tolerated). Instead, the packet should be dropped and the DIO timer must be reset to refresh the nodes' rank.
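The rank error rule can be sketched as follows. This is an illustration under stated assumptions, not the code in this PR: `re_must_drop()` and its parameters are hypothetical, the flag values follow RFC 6553, and raw rank values are compared for simplicity.

```c
#include <stdbool.h>
#include <stdint.h>

/* Flag bits of the RPL Option in the Hop-by-Hop header (RFC 6553). */
#define RE_FLAG_DOWN     0x80  /* 'O': packet is travelling down the DAG */
#define RE_FLAG_RANK_ERR 0x40  /* 'R': a rank inconsistency was already seen */

/* Returns true when the packet must be dropped. On the first inconsistency
   the R flag is set in *flags and the packet is still forwarded. */
bool
re_must_drop(uint8_t *flags, uint16_t sender_rank, uint16_t own_rank,
             int *dio_timer_resets)
{
  bool down = (*flags & RE_FLAG_DOWN) != 0;
  bool sender_closer = sender_rank < own_rank;

  /* Consistent cases: a downward packet comes from a node closer to the
     root, an upward packet from a node that is not closer. */
  if((down && !sender_closer) || (!down && sender_closer)) {
    if(*flags & RE_FLAG_RANK_ERR) {
      /* Second inconsistency on the path: drop and reset the DIO timer. */
      (*dio_timer_resets)++;
      return true;
    }
    /* First inconsistency: tolerated, but recorded in the packet. */
    *flags |= RE_FLAG_RANK_ERR;
  }
  return false;
}
```

For example, a downward packet from a sender of rank 512 arriving at a node of rank 256 is forwarded once with the R flag set, and dropped if it shows the same inconsistency at a later hop.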

laurentderu · 2014-04-24T11:42:35Z

The RPL test 08-rpl-dao-route-loss-2 fails; I will have a look at it.

adamdunkels · 2014-06-11T20:46:45Z

These DAO errors are problematic. We did a lot of experimentation with this configuration a while back at Thingsquare, as we use DAO routes a lot, and found then that being a bit more aggressive when it comes to rebuilding the network would make the network quicker to repair node outages (which is also the backstory to this particular regression test). Very nice to see more work go into this!

nvt · 2014-07-22T11:09:32Z

@laurentderu Is there any update on this PR coming soon? The functionality is of interest, but we need all tests to pass before merging.

laurentderu · 2014-07-31T06:48:15Z

It's still on my todo list :) I hope to rebase it and have a look at it next week.

laurentderu · 2014-08-07T13:44:56Z

The issue is caused by a (non-)interaction between NDP and RPL:

The receiver node has only one parent: node 8, or node 4 after the swap. It is a purely receiving node; its only outgoing traffic is the unicast DAOs to its parent. When node 8 is swapped with node 4, the receiver no longer receives DIOs from node 8, so no more DAOs are triggered. As the rank of node 4 is exactly the same as that of node 8, the receiver does not select node 4 as its new parent; and as there is no more outgoing unicast traffic, the rank of node 8 does not increase at all. Also, NUD on node 8 would only be triggered in the case of outgoing traffic towards node 8.

In 6LoWPAN-ND this issue is avoided because all hosts perform a periodic reachability check on all their default routers, so the receiver would discover that node 8 is unreachable and would switch to node 4.

I have made a small workaround in uip_ds6_neighbor_periodic(): when a neighbor leaves its REACHABLE state and is a default router, instead of going to the STALE state it enters the DELAY state in order to force a NUD on it. This mimics the 6LoWPAN-ND behavior, and I guess that it still consumes less energy than triggering a global repair.
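The transition can be sketched in isolation like this. It is a simplified model, not the body of uip_ds6_neighbor_periodic(): the `nd_neighbor` struct, the `ND_*` state names and `nd_periodic_sketch()` are hypothetical, and timers are reduced to a single boolean.

```c
#include <stdbool.h>

/* Neighbor cache states named after RFC 4861; identifiers are illustrative. */
enum nd_state { ND_INCOMPLETE, ND_REACHABLE, ND_STALE, ND_DELAY, ND_PROBE };

struct nd_neighbor {
  enum nd_state state;
  bool is_default_router;
  bool reachable_timer_expired;  /* stands in for the real reachable timer */
};

/* One periodic pass over a single neighbor entry. */
void
nd_periodic_sketch(struct nd_neighbor *nbr)
{
  if(nbr->state == ND_REACHABLE && nbr->reachable_timer_expired) {
    if(nbr->is_default_router) {
      /* Enter DELAY right away so that a NUD probe follows even when the
         node sends no upward traffic. */
      nbr->state = ND_DELAY;
    } else {
      /* Ordinary neighbors keep the usual REACHABLE -> STALE transition. */
      nbr->state = ND_STALE;
    }
  }
}
```

Only default routers take the early DELAY transition, so the extra probe traffic is limited to the entries a purely receiving node depends on.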

nvt · 2014-09-03T11:30:41Z

Thanks for looking into the problem and proposing a workaround. I'm reluctant to accept a change of the ND implementation's state machine to solve this problem because it may break standard compatibility. The ND implementation should follow RFC 4861, and unless you can show that the workaround is not a problem in this regard, I have to propose that we consider other fixes -- primarily within the RPL implementation. A less appealing alternative would be to embed the current workaround within a preprocessor conditional that checks whether RPL is enabled.

laurentderu · 2014-09-04T13:43:23Z

This workaround does not break standard compatibility with NDP: the STALE -> DELAY transition would occur anyway when the host sends a unicast packet to its neighbor (in this case, its preferred parent), so we are only anticipating the transition, not introducing an unexpected one.
But I agree with you: this workaround should only be enabled when RPL is enabled, because with pure NDP this is properly taken care of by the default router lifetime.

I just want to stress that if a node does not send upstream unicast traffic, with the current implementation it is not aware of changes in the network topology and could be rendered inaccessible. The current implementation resolves this by triggering a global repair, which is quite extreme.

Another, more RPL-centric workaround (but a bit more complex) would be to use the default router lifetime: discard the preferred parent when the router lifetime expires and trigger the selection of another parent. This would also require a modification of the default RPL route lifetime, which in Contiki is set to 6 months right now.
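A rough sketch of that alternative, to make the idea concrete. Everything here is an assumption for illustration: the `lt_parent` struct, the `lt_select_parent()` helper, and the premise that a parent's lifetime is refreshed elsewhere whenever it is heard from.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical candidate parent entry. */
struct lt_parent {
  uint16_t rank;
  uint32_t lifetime_s;  /* remaining router lifetime in seconds; 0 = expired */
};

/* Ages every candidate by elapsed_s seconds, then returns the index of the
   live candidate with the lowest rank, or -1 if all have expired. */
int
lt_select_parent(struct lt_parent *p, size_t count, uint32_t elapsed_s)
{
  int best = -1;
  for(size_t i = 0; i < count; i++) {
    p[i].lifetime_s = p[i].lifetime_s > elapsed_s
                        ? p[i].lifetime_s - elapsed_s : 0;
    if(p[i].lifetime_s == 0) {
      continue;  /* expired: no longer eligible as preferred parent */
    }
    if(best < 0 || p[i].rank < p[best].rank) {
      best = (int)i;
    }
  }
  return best;
}
```

With two candidates of equal rank, the first one stays preferred until its lifetime runs out, after which the second is selected. That is the switch missing in the node 8 / node 4 scenario.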

nvt · 2014-10-09T22:35:21Z

OK, I think that the simplest solution is to embed the ND block in a preprocessor conditional to check for RPL. Furthermore, it would be good to have a comment within this block that states the purpose of both the code and the conditional. I also have some minor line comments that follow.

@adamdunkels Do you think that this is a suitable solution?

nvt · 2014-10-09T22:36:11Z

core/net/ipv6/uip6.c

Please change into /* Packet cannot be forwarded. */

laurentderu · 2014-11-18T08:31:50Z

It seems Travis got stuck while trying to access github.com; could someone restart the job?

nvt · 2014-11-28T18:32:14Z

Sure, I'll restart the Travis tests.

laurentderu · 2014-12-01T16:26:13Z

Code rebased and updated as suggested, and Travis is happy this time.

adamdunkels · 2014-12-01T16:29:21Z

I think this looks good, 👍 from me!

nvt · 2014-12-01T16:41:49Z

👍

adamdunkels added the Core label Jun 3, 2014

nvt added high-priority and removed high-priority labels Jun 5, 2014

tim-ist mentioned this pull request Jun 5, 2014

RPL: nexthop not updated #496

Closed

g-oikonomou assigned nvt Jun 5, 2014

darconeous added the RPL label Jun 10, 2014

laurentderu closed this Aug 7, 2014

laurentderu deleted the pr-rpl-rank-and-fw-errors branch August 7, 2014 13:07

laurentderu restored the pr-rpl-rank-and-fw-errors branch August 7, 2014 13:07

laurentderu reopened this Aug 7, 2014

nvt reviewed Oct 9, 2014

laurentderu added 3 commits November 14, 2014 09:40

Do not trigger global repair when forwarding error is detected

778d40d

Rank error packet should not be forwarded

a964380

Drop forwarding error packet and send back DAO to originating parent

29f894c

laurentderu force-pushed the pr-rpl-rank-and-fw-errors branch from 308b43f to 9184c54 Compare November 14, 2014 14:28

Force NUD on default routers

35e876e

laurentderu force-pushed the pr-rpl-rank-and-fw-errors branch from 9184c54 to 35e876e Compare November 17, 2014 09:58

nvt pushed a commit that referenced this pull request Dec 1, 2014

Merge pull request #652 from cetic/pr-rpl-rank-and-fw-errors

63563ed

RPL rank and forward errors

nvt merged commit 63563ed into contiki-os:master Dec 1, 2014

laurentderu deleted the pr-rpl-rank-and-fw-errors branch January 8, 2015 14:35

arurke mentioned this pull request Jan 17, 2023

Sub-optimal RPL DAO inconsistency handling contiki-ng/contiki-ng#2386

Open

4 participants