Next: Character Sets, Previous: Character implementation, Up: Characters [Contents][Index]
MIT/GNU Scheme implements the full Unicode character repertoire,
defining predicates for Unicode characters and their associated
integer values. A Unicode code point is an exact non-negative
integer strictly less than #x110000. A Unicode scalar
value is a Unicode code point that doesn’t fall between #xD800
inclusive and #xE000 exclusive; in other words, any Unicode
code point except for the surrogate code points.
Returns #t if object is a Unicode code point, otherwise
it returns #f.
Returns #t if object is a Unicode scalar value, otherwise
it returns #f.
Returns #t if object is any “bitless” character
corresponding to a Unicode code point, except for those with general
category other:surrogate or other:not-assigned.
Returns the Unicode general category of char (or code-point) as a descriptive symbol:
| Category | Symbol |
|---|---|
| Lu | letter:uppercase |
| Ll | letter:lowercase |
| Lt | letter:titlecase |
| Lm | letter:modifier |
| Lo | letter:other |
| Mn | mark:nonspacing |
| Mc | mark:spacing-combining |
| Me | mark:enclosing |
| Nd | number:decimal-digit |
| Nl | number:letter |
| No | number:other |
| Pc | punctuation:connector |
| Pd | punctuation:dash |
| Ps | punctuation:open |
| Pe | punctuation:close |
| Pi | punctuation:initial-quote |
| Pf | punctuation:final-quote |
| Po | punctuation:other |
| Sm | symbol:math |
| Sc | symbol:currency |
| Sk | symbol:modifier |
| So | symbol:other |
| Zs | separator:space |
| Zl | separator:line |
| Zp | separator:paragraph |
| Cc | other:control |
| Cf | other:format |
| Cs | other:surrogate |
| Co | other:private-use |
| Cn | other:not-assigned |