The composed forms—`nfc` and nfkc
—represent characters in the fewest
bytes possible. So é
is represented as the single letter é
. The
decomposed forms—`nfd` and nfkd
—represent characters by their
constituent parts, that is e
+ ´
.
The canonical forms—`nfc` and nfd
—represent ligatures like ffi
or
œ
as a single character, while the compatibility forms—`nfkc` and
nfkd
—break down these composed characters into a simpler multiletter
equivalent: f
+ f
+ i
or o
+ e
.