C++11 std::set lambda comparison function
It's unlikely that the compiler will be able to inline a std::function call, whereas any compiler that supports lambdas would almost certainly inline the functor version, including if that functor is a lambda not hidden by a std::function
.
You could use decltype
to get the lambda's comparator type:
#include <set>
#include <iostream>
#include <iterator>
#include <algorithm>
int main()
{
auto comp = [](int x, int y){ return x < y; };
auto set = std::set<int,decltype(comp)>( comp );
set.insert(1);
set.insert(10);
set.insert(1); // Dupe!
set.insert(2);
std::copy( set.begin(), set.end(), std::ostream_iterator<int>(std::cout, "\n") );
}
Which prints:
1
2
10
See it run live on Coliru.
Yes, a std::function
introduces nearly unavoidable indirection to your set
. While the compiler can always, in theory, figure out that all use of your set
's std::function
involves calling it on a lambda that is always the exact same lambda, that is both hard and extremely fragile.
Fragile, because before the compiler can prove to itself that all calls to that std::function
are actually calls to your lambda, it must prove that no access to your std::set
ever sets the std::function
to anything but your lambda. Which means it has to track down all possible routes to reach your std::set
in all compilation units and prove none of them do it.
This might be possible in some cases, but relatively innocuous changes could break it even if your compiler managed to prove it.
On the other hand, a functor with a stateless operator()
has easy to prove behavior, and optimizations involving that are everyday things.
So yes, in practice I'd suspect std::function
could be slower. On the other hand, std::function
solution is easier to maintain than the make_set
one, and exchanging programmer time for program performance is pretty fungible.
make_set
has the serious disadvantage that any such set
's type must be inferred from the call to make_set
. Often a set
stores persistent state, and not something you create on the stack then let fall out of scope.
If you created a static or global stateless lambda auto MyComp = [](A const&, A const&)->bool { ... }
, you can use the std::set<A, decltype(MyComp)>
syntax to create a set
that can persist, yet is easy for the compiler to optimize (because all instances of decltype(MyComp)
are stateless functors) and inline. I point this out, because you are sticking the set
in a struct
. (Or does your compiler support
struct Foo {
auto mySet = make_set<int>([](int l, int r){ return l<r; });
};
which I would find surprising!)
Finally, if you are worried about performance, consider that std::unordered_set
is much faster (at the cost of being unable to iterate over the contents in order, and having to write/find a good hash), and that a sorted std::vector
is better if you have a 2-phase "insert everything" then "query contents repeatedly". Simply stuff it into the vector
first, then sort
unique
erase
, then use the free equal_range
algorithm.
A stateless lambda (i.e. one with no captures) can decay to a function pointer, so your type could be:
std::set<int, bool (*)(int, int)> numbers;
Otherwise I'd go for the make_set
solution. If you won't use a one-line creation function because it's non-standard you're not going to get much code written!