Boost.Locale
Collation

Boost.Locale provides a collator class, derived from std::collate, that adds support for primary, secondary, tertiary, quaternary, and identical comparison levels. The levels can be defined approximately as follows:

  1. Primary – ignore accents and character case, comparing base letters only. For example, "facade" and "Façade" are the same.
  2. Secondary – ignore character case but consider accents. "facade" and "façade" are different but "Façade" and "façade" are the same.
  3. Tertiary – consider both case and accents: "Façade" and "façade" are different. Ignore punctuation.
  4. Quaternary – consider all case, accents, and punctuation. The words must be identical in terms of Unicode representation.
  5. Identical – as quaternary, but compare code points as well.

There are two ways of using the collator facet: directly, by calling its member functions compare, transform, and hash, or indirectly, by using the comparator template class in STL algorithms.

For example:

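// Assumes using namespace std and using namespace boost::locale,
// and that loc is a locale created by boost::locale::generator.
wstring a = L"Façade", b = L"facade";
bool eq = 0 == use_facet<collator<wchar_t> >(loc).compare(collator_base::secondary, a, b);
wcout << a << L" and " << b << L" are " << (eq ? L"identical" : L"different") << endl;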

std::locale is designed to be useful as a comparison class in STL collections and algorithms. To get similar functionality with comparison levels, you must use the comparator class.

std::map<std::string, std::string, comparator<char, collator_base::secondary> > strings;
// strings now compares keys at the secondary level, using the global locale (std::locale())
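
For contrast, the plain std::locale usage mentioned above needs no Boost.Locale at all, but it offers no choice of comparison level. A minimal sketch using only the standard library follows; make_collated_map is a hypothetical helper and the keys are arbitrary sample data.

```cpp
#include <locale>
#include <map>
#include <string>

// Hypothetical helper: build a map whose keys are ordered by the collation
// rules of loc. std::locale::operator() compares two strings through the
// locale's std::collate facet, so the locale object itself is the comparator.
std::map<std::string, int, std::locale> make_collated_map(const std::locale& loc)
{
    std::map<std::string, int, std::locale> m(loc);
    m["apple"] = 1;
    m["Zebra"] = 2;
    m["mango"] = 3;
    return m;
}
```

With std::locale::classic() the keys are ordered by character code, so "Zebra" sorts before "apple"; a language-specific locale would typically order them differently.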

You can also set a specific locale or level when creating and using the comparator class:

comparator<char> comp(some_locale, some_level);
std::map<std::string, std::string, comparator<char> > strings(comp);