UTF32 detection routines are uniformly prefixed with mulle_utf32_is_
and
take a sole mulle_utf32_t
argument. UTF16 detection routines are uniformly
prefixed with mulle_utf16_is_
and take a sole mulle_utf16_t
argument.
E.g.
alphanumeric
is int mulle_utf32_is_alphanumeric( mulle_utf32_t c)
and
int mulle_utf16_is_alphanumeric( mulle_utf16_t c)
The naming and the semantics are derived from the Unicode 3.0 specification.
Name | Description |
---|---|
alphanumeric | UNICODE alphanumeric |
capitalized | UNICODE capitalized |
control | UNICODE control |
decimaldigit | UNICODE decimaldigit |
decomposable | UNICODE decomposable |
letter | UNICODE letter |
lowercase | UNICODE lowercase |
nonbase | UNICODE nonbase |
nonpercentescape | UNICODE nonpercentescape |
privatecharacter | UNICODE privatecharacter |
punctuation | UNICODE punctuation |
symbol | UNICODE symbol |
uppercase | UNICODE uppercase |
These are based in UNICODE but are extended for usability.
Name | Description |
---|---|
noncharacter | The UNICODE noncharacters plus all surrogates plus the last two codepoints of a bitplane |
newline | 0xa, 0xb, 0xc, 0xd, 0x85, 0x2028, 0x2029 |
whitespace | The UNICODE whitespaces plus 0x9 |
whitespaceornewline | The UNICODE whitespaces plus 0x9, 0xb, 0xc, 0xd, 0x85 |
These are characters that are valid in various parts of URLs.
scheme
://user
:password
@host
/path
#fragment
?query
Name | Description |
---|---|
validurlfragment | Allowable characters in fragment part of URL |
validurlhost | Allowable characters in host part of URL |
validurlpassword | Allowable characters in password part of URL |
validurlpath | Allowable characters in path part of URL |
validurlquery | Allowable characters in query part of URL |
validurlscheme | Allowable characters in scheme part of URL |
validurluser | Allowable characters in user part of URL |
UTF32 character conversion routines are uniformly prefixed with
mulle_utf32_
and take a sole mulle_utf32_t
argument.
The return value is mulle_utf32_t
.
UTF16 character
conversion routines are uniformly prefixed with mulle_utf16_
and take
a sole mulle_utf16_t
argument. The return value is mulle_utf16_t
.
E.g.
tolower
is mulle_utf32_t mulle_utf32_tolower( mulle_utf32_t c)
and
mulle_utf16_t mulle_utf16_tolower( mulle_utf16_t c)
Name | Description |
---|---|
tolower | Convert to lowercase from uppercase, returns argument, if no conversion is possible. |
totitlecase | Convert to titlecase. returns argument, if no conversion is possible. |
toupper | Convert to uppercase from lowercase, returns argument, if no conversion is possible. |