huh. unicode has rules for accent folding but not other diacritics

@codl that’s not the current character folding practices (that technical report was withdrawn and never made it out of the “draft stage”)

for identifiers and such, case folding is a core part of the standard defined in CaseFolding.txt and section 5.18 here: unicode.org/versions/latest/ch

for other string comparison operations, including removal of accents, that’s defined as a part of CLDR (which, if you thought the ordinary unicode website was hard to navigate… the CLDR website is even worse lmao)

@Lady oh wow you weren't lying about the cldr website

Follow

@codl the Unicode Collation Algorithm (section 1.1, Multi-Level Comparison) might be a better starting point lol unicode.org/reports/tr10/#Mult

Sign in to participate in the conversation
📟🐱 GlitchCat

A small, community‐oriented Mastodon‐compatible Fediverse (GlitchSoc) instance managed as a joint venture between the cat and KIBI families.