Attempt to Fix merge of Sliced NodeCollections #3720

Open

med-ayssar wants to merge 11 commits into nest:master from med-ayssar:issue-3706

Contributor

med-ayssar commented Dec 26, 2025

No description provided.

thomaskroi1996 and others added 11 commits

December 23, 2025 08:08


          replace numpy.in1d() with numpy.isin() in testsuite

1648c57


          Adjust NTree::iterator to work with std20

80b5b3c


          Set std20 as default standard

05b61fa


          Remove unused functions

7e4bac2


          Use cpp::NodeCollection::slice func to slice a nodeCollection in PyNEST

2e0905c


          Fix Slice range in case of exactly one slice

01147b3


          Remove empty line

b3eef5f


          Use const reference

06a0260


          Use std::span instead of boost::span

67cb75a


          Remove boost include header for boost:span

8a052b3


          Attempt to fix merge of sliced NodeCollections

368cbdc

gtrensch added the S: Normal label

Contributor

gtrensch commented Jan 23, 2026

@med-ayssar, thank you for your contribution!

Could you please add a description to this PR? Since you called it an attempt, perhaps this PR should be considered a draft?

heplesser requested changes

Contributor

heplesser left a comment

@med-ayssar I am a bit concerned about this PR. In some places it seems to introduce unnecessary complexity into the code and in other places I fear considerable performance hits. Until now, adding sliced node collections has been prohibited, because it is difficult to implement. I have not seen (but may have overlooked) user requests to support this. In any case, it must not reduce performance. I also noticed that some tests now fail (see https://github.com/nest/nest-simulator/actions/runs/20521568875/job/58957854360#step:32:10332) while there are no tests for the new functionality.

nestkernel/nest.cpp

-                node_ids.reserve( n );
-                for ( auto node_ptr = array; node_ptr != array + n; ++node_ptr )
+                if ( n == 0 )

Contributor

heplesser Jan 23, 2026

Am I overlooking something here or does this change essentially turn 8 lines of code into 20 without changing the logic? I suspect that the new code will be less efficient, because it requires n-1 temporary single-element NodeCollections to be created and merged, while the original code just creates a vector of node IDs and then creates the NodeCollection in one go.

libnestutil/vector_util.h

+                Predicate< typename Container::value_type > pred = []( const auto& current, const auto& next )
+                { return next == current + 1; },
+                Filter< typename Container::value_type > filter = []( const auto& ) { return true; } )
+              {

Contributor

heplesser Jan 23, 2026

I find this code challenging to follow without comments explaining the logic.
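For readers following the thread, here is a minimal sketch of what a "split into contiguous slices" helper could look like. It is inferred only from the lambdas visible in the diff excerpt: a predicate that decides whether two neighbouring values belong to the same run, and a filter that decides whether a value takes part at all. The name `split_runs`, its signature, and the half-open index-pair return type are hypothetical and are not the PR's `vector_util::split_into_contiguous_slices`.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical stand-in, NOT the PR's implementation.
// Returns half-open index ranges [begin, end) into `values`. Each range is a
// maximal run in which every value passes `filter` and every neighbouring
// pair (current, next) satisfies `pred`.
template < typename T, typename Pred, typename Filter >
std::vector< std::pair< std::size_t, std::size_t > >
split_runs( const std::vector< T >& values, Pred pred, Filter filter )
{
  std::vector< std::pair< std::size_t, std::size_t > > runs;
  std::size_t i = 0;
  while ( i < values.size() )
  {
    if ( !filter( values[ i ] ) )
    {
      ++i; // value is filtered out and cannot start a run
      continue;
    }
    // Extend the run while the next value passes the filter and continues it
    std::size_t j = i + 1;
    while ( j < values.size() && filter( values[ j ] ) && pred( values[ j - 1 ], values[ j ] ) )
    {
      ++j;
    }
    runs.emplace_back( i, j );
    i = j;
  }
  return runs;
}
```

With the default-style predicate `next == current + 1` and an accept-all filter, the values `1, 2, 3, 7, 8, 10` would split into the index ranges `[0, 3)`, `[3, 5)` and `[5, 6)`. With a boolean mask, the predicate `current == next` and a filter that keeps only `true`, the result would be the runs of selected positions.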

nestkernel/nest.cpp

+                std::span< const bool > span { array, n };
+                auto slices = vector_util::split_into_contiguous_slices(
+                  span, true, []( auto current, auto next ) { return current == next; }, []( const auto& v ) { return v == true; } );

Contributor

heplesser Jan 23, 2026

This code seems even more complex and likely more performance-problematic than the variant working on node IDs directly.

nestkernel/node_collection.cpp

+                  // But here we iterate c1 and check if c2 contains.
+                  for ( const auto& elem : c1 )
+                  {
+                    if ( c2.contains( elem.node_id ) )

Contributor

heplesser Jan 23, 2026

For composite collections c2, contains() is a complex operation, so I am afraid this is too costly. See below for other ideas.

nestkernel/node_collection.cpp

                   }
-                  return NodeCollectionPTR( *this + *rhs_ptr );
+                  std::vector< NodeCollectionPrimitive > new_parts = to_primitives_();

Contributor

heplesser Jan 23, 2026

I think the following could be integrated with the code below if that is revised as suggested below.

nestkernel/node_collection.cpp

+                  {
+                    return std::make_shared< NodeCollectionComposite >( new_parts );
+                  }
                 }

Contributor

heplesser Jan 23, 2026

If I read the code above correctly, execution of the if-branch will always lead to either a throw or a return, so execution will never pass this point in that case. That needs to be clear from the code; right now it is hidden.

nestkernel/node_collection.cpp

Comment on lines 1036 to 1037

		std::sort( new_parts.begin(), new_parts.end(), primitive_sort_op );
		merge_parts_( new_parts );

Contributor

heplesser Jan 23, 2026

Wouldn't it be more efficient to apply merge-sort logic here, iterating over both sides in parallel? In that process, one would also detect overlaps, so that no extra overlap detection is needed, improving efficiency. Since parts1 and parts2 are sorted per precondition, no sorting is needed here. One can then cover the case where rhs_ptr is a primitive collection by just constructing a single-element vector containing it.
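The merge-sort idea above can be sketched as follows. This is an illustration under simplifying assumptions, not NEST code: `Range` is a hypothetical stand-in for a contiguous primitive that holds only a closed interval of node IDs and ignores anything else a real primitive carries (including the step of a sliced collection), and `merge_sorted_ranges` is an invented name.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Hypothetical stand-in for a contiguous primitive: closed range [first, last].
struct Range
{
  std::size_t first;
  std::size_t last;
};

// Merge two lists of ranges. Precondition: each list is sorted by `first`
// and has no overlaps within itself. Overlaps between the two sides are
// detected during the single linear pass, so no separate check is needed.
std::vector< Range >
merge_sorted_ranges( const std::vector< Range >& lhs, const std::vector< Range >& rhs )
{
  std::vector< Range > merged;
  merged.reserve( lhs.size() + rhs.size() );

  std::size_t i = 0;
  std::size_t j = 0;
  while ( i < lhs.size() || j < rhs.size() )
  {
    // Pick the range with the smaller start from the side that still has one
    const bool take_lhs = j == rhs.size() || ( i < lhs.size() && lhs[ i ].first <= rhs[ j ].first );
    const Range next = take_lhs ? lhs[ i++ ] : rhs[ j++ ];

    if ( !merged.empty() && next.first <= merged.back().last )
    {
      throw std::invalid_argument( "ranges overlap" );
    }
    if ( !merged.empty() && next.first == merged.back().last + 1 )
    {
      merged.back().last = next.last; // directly adjacent: coalesce
    }
    else
    {
      merged.push_back( next );
    }
  }
  return merged;
}
```

Because the output is built in ascending order, comparing each incoming range only with the last merged range is enough to catch an overlap. The case of a single primitive on the right-hand side would be handled by passing a one-element vector, as the comment suggests. Whether the same pass can be extended to sliced primitives with a step greater than one is left open here.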

nestkernel/node_collection.cpp

    
                if ( is_sliced_ )
                {
                  if ( part_it->overlapping( rhs ) )
                  for ( const auto& elem : *this )

Contributor

heplesser Jan 23, 2026

Iterating over all elements of a collection is potentially very expensive, since collections can be large. We need to find a way to avoid it.

nestkernel/node_collection.cpp

+                auto it = begin();
+                const auto end_it = end();
+                // Initialize the first range

Contributor

heplesser Jan 23, 2026

Might want to point out here that the algorithm is very similar to create_().
