Why does std::set not have a "contains" member function?
I think it was probably because they were trying to make std::set
and std::multiset
as similar as possible. (And obviously count
has a perfectly sensible meaning for std::multiset
.)
Personally I think this was a mistake.
It doesn't look quite so bad if you pretend that count
is just a misspelling of contains
and write the test as:
if (myset.count(element))
...
It's still a shame though.
To be able to write if (s.contains())
, contains()
has to return a bool
(or a type convertible to bool
, which is another story), like binary_search
does.
The fundamental reason behind the design decision not to do it this way is that contains()
which returns a bool
would lose valuable information about where the element is in the collection. find()
preserves and returns that information in the form of an iterator, therefore is a better choice for a generic library like STL. This has always been the guiding principle for Alex Stepanov, as he has often explained (for example, here).
As to the count()
approach in general, although it's often an okay workaround, the problem with it is that it does more work than a contains()
would have to do.
That is not to say that a bool contains()
isn't a very nice-to-have or even necessary. A while ago we had a long discussion about this very same issue in the
ISO C++ Standard - Future Proposals group.
It lacks it because nobody added it. Nobody added it because the containers from the STL that the std
library incorporated where designed to be minimal in interface. (Note that std::string
did not come from the STL in the same way).
If you don't mind some strange syntax, you can fake it:
template<class K>
struct contains_t {
K&& k;
template<class C>
friend bool operator->*( C&& c, contains_t&& ) {
auto range = std::forward<C>(c).equal_range(std::forward<K>(k));
return range.first != range.second;
// faster than:
// return std::forward<C>(c).count( std::forward<K>(k) ) != 0;
// for multi-meows with lots of duplicates
}
};
template<class K>
containts_t<K> contains( K&& k ) {
return {std::forward<K>(k)};
}
use:
if (some_set->*contains(some_element)) {
}
Basically, you can write extension methods for most C++ std
types using this technique.
It makes a lot more sense to just do this:
if (some_set.count(some_element)) {
}
but I am amused by the extension method method.
The really sad thing is that writing an efficient contains
could be faster on a multimap
or multiset
, as they just have to find one element, while count
has to find each of them and count them.
A multiset containing 1 billion copies of 7 (you know, in case you run out) can have a really slow .count(7)
, but could have a very fast contains(7)
.
With the above extension method, we could make it faster for this case by using lower_bound
, comparing to end
, and then comparing to the element. Doing that for an unordered meow as well as an ordered meow would require fancy SFINAE or container-specific overloads however.