utf8rewind  1.3.0
System library for processing UTF-8 encoded text
Macros
Category flags

Macros

#define UTF8_CATEGORY_LETTER_UPPERCASE   0x00000001
 Uppercase letter code points, Lu in the Unicode database. More...
 
#define UTF8_CATEGORY_LETTER_LOWERCASE   0x00000002
 Lowercase letter code points, Ll in the Unicode database. More...
 
#define UTF8_CATEGORY_LETTER_TITLECASE   0x00000004
 Titlecase letter code points, Lt in the Unicode database. More...
 
#define UTF8_CATEGORY_LETTER_MODIFIER   0x00000008
 Modifier letter code points, Lm in the Unicode database. More...
 
#define UTF8_CATEGORY_LETTER_OTHER   0x00000010
 Other letter code points, Lo in the Unicode database. More...
 
#define UTF8_CATEGORY_LETTER
 Combined flag for all letter categories. More...
 
#define UTF8_CATEGORY_CASE_MAPPED
 Combined flag for all letter categories with case mapping. More...
 
#define UTF8_CATEGORY_MARK_NON_SPACING   0x00000020
 Non-spacing mark code points, Mn in the Unicode database. More...
 
#define UTF8_CATEGORY_MARK_SPACING   0x00000040
 Spacing mark code points, Mc in the Unicode database. More...
 
#define UTF8_CATEGORY_MARK_ENCLOSING   0x00000080
 Enclosing mark code points, Me in the Unicode database. More...
 
#define UTF8_CATEGORY_MARK
 Combined flag for all mark categories. More...
 
#define UTF8_CATEGORY_NUMBER_DECIMAL   0x00000100
 Decimal number code points, Nd in the Unicode database. More...
 
#define UTF8_CATEGORY_NUMBER_LETTER   0x00000200
 Letter number code points, Nl in the Unicode database. More...
 
#define UTF8_CATEGORY_NUMBER_OTHER   0x00000400
 Other number code points, No in the Unicode database. More...
 
#define UTF8_CATEGORY_NUMBER
 Combined flag for all number categories. More...
 
#define UTF8_CATEGORY_PUNCTUATION_CONNECTOR   0x00000800
 Connector punctuation category, Pc in the Unicode database. More...
 
#define UTF8_CATEGORY_PUNCTUATION_DASH   0x00001000
 Dash punctuation category, Pd in the Unicode database. More...
 
#define UTF8_CATEGORY_PUNCTUATION_OPEN   0x00002000
 Open punctuation category, Ps in the Unicode database. More...
 
#define UTF8_CATEGORY_PUNCTUATION_CLOSE   0x00004000
 Close punctuation category, Pe in the Unicode database. More...
 
#define UTF8_CATEGORY_PUNCTUATION_INITIAL   0x00008000
 Initial punctuation category, Pi in the Unicode database. More...
 
#define UTF8_CATEGORY_PUNCTUATION_FINAL   0x00010000
 Final punctuation category, Pf in the Unicode database. More...
 
#define UTF8_CATEGORY_PUNCTUATION_OTHER   0x00020000
 Other punctuation category, Po in the Unicode database. More...
 
#define UTF8_CATEGORY_PUNCTUATION
 Combined flag for all punctuation categories. More...
 
#define UTF8_CATEGORY_SYMBOL_MATH   0x00040000
 Math symbol category, Sm in the Unicode database. More...
 
#define UTF8_CATEGORY_SYMBOL_CURRENCY   0x00080000
 Currency symbol category, Sc in the Unicode database. More...
 
#define UTF8_CATEGORY_SYMBOL_MODIFIER   0x00100000
 Modifier symbol category, Sk in the Unicode database. More...
 
#define UTF8_CATEGORY_SYMBOL_OTHER   0x00200000
 Other symbol category, So in the Unicode database. More...
 
#define UTF8_CATEGORY_SYMBOL
 Combined flag for all symbol categories. More...
 
#define UTF8_CATEGORY_SEPARATOR_SPACE   0x00400000
 Space separator category, Zs in the Unicode database. More...
 
#define UTF8_CATEGORY_SEPARATOR_LINE   0x00800000
 Line separator category, Zl in the Unicode database. More...
 
#define UTF8_CATEGORY_SEPARATOR_PARAGRAPH   0x01000000
 Paragraph separator category, Zp in the Unicode database. More...
 
#define UTF8_CATEGORY_SEPARATOR
 Combined flag for all separator categories. More...
 
#define UTF8_CATEGORY_CONTROL   0x02000000
 Control category, Cc in the Unicode database. More...
 
#define UTF8_CATEGORY_FORMAT   0x04000000
 Format category, Cf in the Unicode database. More...
 
#define UTF8_CATEGORY_SURROGATE   0x08000000
 Surrogate category, Cs in the Unicode database. More...
 
#define UTF8_CATEGORY_PRIVATE_USE   0x10000000
 Private use category, Co in the Unicode database. More...
 
#define UTF8_CATEGORY_UNASSIGNED   0x20000000
 Unassigned category, Cn in the Unicode database. More...
 
#define UTF8_CATEGORY_COMPATIBILITY   0x40000000
 Flag used for maintaining backwards compatibility with POSIX functions, not found in the Unicode database. More...
 
#define UTF8_CATEGORY_IGNORE_GRAPHEME_CLUSTER   0x80000000
 Flag used for checking only the general category of code points at the start of a grapheme cluster. More...
 
#define UTF8_CATEGORY_ISCNTRL
 Flag used for maintaining backwards compatibility with POSIX iscntrl function. More...
 
#define UTF8_CATEGORY_ISPRINT
 Flag used for maintaining backwards compatibility with POSIX isprint function. More...
 
#define UTF8_CATEGORY_ISSPACE
 Flag used for maintaining backwards compatibility with POSIX isspace function. More...
 
#define UTF8_CATEGORY_ISBLANK
 Flag used for maintaining backwards compatibility with POSIX isblank function. More...
 
#define UTF8_CATEGORY_ISGRAPH
 Flag used for maintaining backwards compatibility with POSIX isgraph function. More...
 
#define UTF8_CATEGORY_ISPUNCT
 Flag used for maintaining backwards compatibility with POSIX ispunct function. More...
 
#define UTF8_CATEGORY_ISALNUM
 Flag used for maintaining backwards compatibility with POSIX isalnum function. More...
 
#define UTF8_CATEGORY_ISALPHA
 Flag used for maintaining backwards compatibility with POSIX isalpha function. More...
 
#define UTF8_CATEGORY_ISUPPER
 Flag used for maintaining backwards compatibility with POSIX isupper function. More...
 
#define UTF8_CATEGORY_ISLOWER
 Flag used for maintaining backwards compatibility with POSIX islower function. More...
 
#define UTF8_CATEGORY_ISDIGIT
 Flag used for maintaining backwards compatibility with POSIX isdigit function. More...
 
#define UTF8_CATEGORY_ISXDIGIT
 Flag used for maintaining backwards compatibility with POSIX isxdigit function. More...
 

Detailed Description

Flags to be used with utf8iscategory, to check whether code points in a string are part of that category.

Macro Definition Documentation

#define UTF8_CATEGORY_LETTER_UPPERCASE   0x00000001

Uppercase letter code points, Lu in the Unicode database.

#define UTF8_CATEGORY_LETTER_LOWERCASE   0x00000002

Lowercase letter code points, Ll in the Unicode database.

#define UTF8_CATEGORY_LETTER_TITLECASE   0x00000004

Titlecase letter code points, Lt in the Unicode database.

#define UTF8_CATEGORY_LETTER_MODIFIER   0x00000008

Modifier letter code points, Lm in the Unicode database.

#define UTF8_CATEGORY_LETTER_OTHER   0x00000010

Other letter code points, Lo in the Unicode database.

#define UTF8_CATEGORY_LETTER
Value:
#define UTF8_CATEGORY_LETTER_MODIFIER
Modifier letter code points, Lm in the Unicode database.
Definition: utf8rewind.h:1275
#define UTF8_CATEGORY_LETTER_LOWERCASE
Lowercase letter code points, Ll in the Unicode database.
Definition: utf8rewind.h:1263
#define UTF8_CATEGORY_LETTER_TITLECASE
Titlecase letter code points, Lt in the Unicode database.
Definition: utf8rewind.h:1269
#define UTF8_CATEGORY_LETTER_UPPERCASE
Uppercase letter code points, Lu in the Unicode database.
Definition: utf8rewind.h:1257
#define UTF8_CATEGORY_LETTER_OTHER
Other letter code points, Lo in the Unicode database.
Definition: utf8rewind.h:1281

Combined flag for all letter categories.

#define UTF8_CATEGORY_CASE_MAPPED
Value:
#define UTF8_CATEGORY_LETTER_LOWERCASE
Lowercase letter code points, Ll in the Unicode database.
Definition: utf8rewind.h:1263
#define UTF8_CATEGORY_LETTER_TITLECASE
Titlecase letter code points, Lt in the Unicode database.
Definition: utf8rewind.h:1269
#define UTF8_CATEGORY_LETTER_UPPERCASE
Uppercase letter code points, Lu in the Unicode database.
Definition: utf8rewind.h:1257

Combined flag for all letter categories with case mapping.

#define UTF8_CATEGORY_MARK_NON_SPACING   0x00000020

Non-spacing mark code points, Mn in the Unicode database.

#define UTF8_CATEGORY_MARK_SPACING   0x00000040

Spacing mark code points, Mc in the Unicode database.

#define UTF8_CATEGORY_MARK_ENCLOSING   0x00000080

Enclosing mark code points, Me in the Unicode database.

#define UTF8_CATEGORY_MARK
Value:
#define UTF8_CATEGORY_MARK_SPACING
Spacing mark code points, Mc in the Unicode database.
Definition: utf8rewind.h:1310
#define UTF8_CATEGORY_MARK_ENCLOSING
Enclosing mark code points, Me in the Unicode database.
Definition: utf8rewind.h:1316
#define UTF8_CATEGORY_MARK_NON_SPACING
Non-spacing mark code points, Mn in the Unicode database.
Definition: utf8rewind.h:1304

Combined flag for all mark categories.

#define UTF8_CATEGORY_NUMBER_DECIMAL   0x00000100

Decimal number code points, Nd in the Unicode database.

#define UTF8_CATEGORY_NUMBER_LETTER   0x00000200

Letter number code points, Nl in the Unicode database.

#define UTF8_CATEGORY_NUMBER_OTHER   0x00000400

Other number code points, No in the Unicode database.

#define UTF8_CATEGORY_NUMBER
Value:
#define UTF8_CATEGORY_NUMBER_OTHER
Other number code points, No in the Unicode database.
Definition: utf8rewind.h:1342
#define UTF8_CATEGORY_NUMBER_LETTER
Letter number code points, Nl in the Unicode database.
Definition: utf8rewind.h:1336
#define UTF8_CATEGORY_NUMBER_DECIMAL
Decimal number code points, Nd in the Unicode database.
Definition: utf8rewind.h:1330

Combined flag for all number categories.

#define UTF8_CATEGORY_PUNCTUATION_CONNECTOR   0x00000800

Connector punctuation category, Pc in the Unicode database.

#define UTF8_CATEGORY_PUNCTUATION_DASH   0x00001000

Dash punctuation category, Pd in the Unicode database.

#define UTF8_CATEGORY_PUNCTUATION_OPEN   0x00002000

Open punctuation category, Ps in the Unicode database.

#define UTF8_CATEGORY_PUNCTUATION_CLOSE   0x00004000

Close punctuation category, Pe in the Unicode database.

#define UTF8_CATEGORY_PUNCTUATION_INITIAL   0x00008000

Initial punctuation category, Pi in the Unicode database.

#define UTF8_CATEGORY_PUNCTUATION_FINAL   0x00010000

Final punctuation category, Pf in the Unicode database.

#define UTF8_CATEGORY_PUNCTUATION_OTHER   0x00020000

Other punctuation category, Po in the Unicode database.

#define UTF8_CATEGORY_PUNCTUATION
Value:
#define UTF8_CATEGORY_PUNCTUATION_FINAL
Final punctuation category, Pf in the Unicode database.
Definition: utf8rewind.h:1386
#define UTF8_CATEGORY_PUNCTUATION_CONNECTOR
Connector punctuation category, Pc in the Unicode database.
Definition: utf8rewind.h:1356
#define UTF8_CATEGORY_PUNCTUATION_CLOSE
Close punctuation category, Pe in the Unicode database.
Definition: utf8rewind.h:1374
#define UTF8_CATEGORY_PUNCTUATION_DASH
Dash punctuation category, Pd in the Unicode database.
Definition: utf8rewind.h:1362
#define UTF8_CATEGORY_PUNCTUATION_INITIAL
Initial punctuation category, Pi in the Unicode database.
Definition: utf8rewind.h:1380
#define UTF8_CATEGORY_PUNCTUATION_OPEN
Open punctuation category, Ps in the Unicode database.
Definition: utf8rewind.h:1368
#define UTF8_CATEGORY_PUNCTUATION_OTHER
Other punctuation category, Po in the Unicode database.
Definition: utf8rewind.h:1392

Combined flag for all punctuation categories.

#define UTF8_CATEGORY_SYMBOL_MATH   0x00040000

Math symbol category, Sm in the Unicode database.

#define UTF8_CATEGORY_SYMBOL_CURRENCY   0x00080000

Currency symbol category, Sc in the Unicode database.

#define UTF8_CATEGORY_SYMBOL_MODIFIER   0x00100000

Modifier symbol category, Sk in the Unicode database.

#define UTF8_CATEGORY_SYMBOL_OTHER   0x00200000

Other symbol category, So in the Unicode database.

#define UTF8_CATEGORY_SYMBOL
Value:
#define UTF8_CATEGORY_SYMBOL_OTHER
Other symbol category, So in the Unicode database.
Definition: utf8rewind.h:1426
#define UTF8_CATEGORY_SYMBOL_MATH
Math symbol category, Sm in the Unicode database.
Definition: utf8rewind.h:1408
#define UTF8_CATEGORY_SYMBOL_MODIFIER
Modifier symbol category, Sk in the Unicode database.
Definition: utf8rewind.h:1420
#define UTF8_CATEGORY_SYMBOL_CURRENCY
Currency symbol category, Sc in the Unicode database.
Definition: utf8rewind.h:1414

Combined flag for all symbol categories.

#define UTF8_CATEGORY_SEPARATOR_SPACE   0x00400000

Space separator category, Zs in the Unicode database.

#define UTF8_CATEGORY_SEPARATOR_LINE   0x00800000

Line separator category, Zl in the Unicode database.

#define UTF8_CATEGORY_SEPARATOR_PARAGRAPH   0x01000000

Paragraph separator category, Zp in the Unicode database.

#define UTF8_CATEGORY_SEPARATOR
Value:
#define UTF8_CATEGORY_SEPARATOR_PARAGRAPH
Paragraph separator category, Zp in the Unicode database.
Definition: utf8rewind.h:1452
#define UTF8_CATEGORY_SEPARATOR_SPACE
Space separator category, Zs in the Unicode database.
Definition: utf8rewind.h:1440
#define UTF8_CATEGORY_SEPARATOR_LINE
Line separator category, Zl in the Unicode database.
Definition: utf8rewind.h:1446

Combined flag for all separator categories.

#define UTF8_CATEGORY_CONTROL   0x02000000

Control category, Cc in the Unicode database.

#define UTF8_CATEGORY_FORMAT   0x04000000

Format category, Cf in the Unicode database.

#define UTF8_CATEGORY_SURROGATE   0x08000000

Surrogate category, Cs in the Unicode database.

#define UTF8_CATEGORY_PRIVATE_USE   0x10000000

Private use category, Co in the Unicode database.

#define UTF8_CATEGORY_UNASSIGNED   0x20000000

Unassigned category, Cn in the Unicode database.

#define UTF8_CATEGORY_COMPATIBILITY   0x40000000

Flag used for maintaining backwards compatibility with POSIX functions, not found in the Unicode database.

#define UTF8_CATEGORY_IGNORE_GRAPHEME_CLUSTER   0x80000000

Flag used for checking only the general category of code points at the start of a grapheme cluster.

#define UTF8_CATEGORY_ISCNTRL
Value:
#define UTF8_CATEGORY_CONTROL
Control category, Cc in the Unicode database.
Definition: utf8rewind.h:1466
#define UTF8_CATEGORY_COMPATIBILITY
Flag used for maintaining backwards compatibility with POSIX functions, not found in the Unicode data...
Definition: utf8rewind.h:1497

Flag used for maintaining backwards compatibility with POSIX iscntrl function.

#define UTF8_CATEGORY_ISPRINT
Value:
#define UTF8_CATEGORY_SYMBOL
Combined flag for all symbol categories.
Definition: utf8rewind.h:1432
#define UTF8_CATEGORY_COMPATIBILITY
Flag used for maintaining backwards compatibility with POSIX functions, not found in the Unicode data...
Definition: utf8rewind.h:1497
#define UTF8_CATEGORY_NUMBER
Combined flag for all number categories.
Definition: utf8rewind.h:1348
#define UTF8_CATEGORY_SEPARATOR
Combined flag for all separator categories.
Definition: utf8rewind.h:1458
#define UTF8_CATEGORY_LETTER
Combined flag for all letter categories.
Definition: utf8rewind.h:1287
#define UTF8_CATEGORY_PUNCTUATION
Combined flag for all punctuation categories.
Definition: utf8rewind.h:1398

Flag used for maintaining backwards compatibility with POSIX isprint function.

#define UTF8_CATEGORY_ISSPACE
Value:
#define UTF8_CATEGORY_SEPARATOR_SPACE
Space separator category, Zs in the Unicode database.
Definition: utf8rewind.h:1440
#define UTF8_CATEGORY_COMPATIBILITY
Flag used for maintaining backwards compatibility with POSIX functions, not found in the Unicode data...
Definition: utf8rewind.h:1497

Flag used for maintaining backwards compatibility with POSIX isspace function.

#define UTF8_CATEGORY_ISBLANK
Value:
#define UTF8_CATEGORY_SEPARATOR_SPACE
Space separator category, Zs in the Unicode database.
Definition: utf8rewind.h:1440
#define UTF8_CATEGORY_PRIVATE_USE
Private use category, Co in the Unicode database.
Definition: utf8rewind.h:1484
#define UTF8_CATEGORY_COMPATIBILITY
Flag used for maintaining backwards compatibility with POSIX functions, not found in the Unicode data...
Definition: utf8rewind.h:1497

Flag used for maintaining backwards compatibility with POSIX isblank function.

#define UTF8_CATEGORY_ISGRAPH
Value:
#define UTF8_CATEGORY_SYMBOL
Combined flag for all symbol categories.
Definition: utf8rewind.h:1432
#define UTF8_CATEGORY_COMPATIBILITY
Flag used for maintaining backwards compatibility with POSIX functions, not found in the Unicode data...
Definition: utf8rewind.h:1497
#define UTF8_CATEGORY_NUMBER
Combined flag for all number categories.
Definition: utf8rewind.h:1348
#define UTF8_CATEGORY_LETTER
Combined flag for all letter categories.
Definition: utf8rewind.h:1287
#define UTF8_CATEGORY_PUNCTUATION
Combined flag for all punctuation categories.
Definition: utf8rewind.h:1398

Flag used for maintaining backwards compatibility with POSIX isgraph function.

#define UTF8_CATEGORY_ISPUNCT
Value:
#define UTF8_CATEGORY_SYMBOL
Combined flag for all symbol categories.
Definition: utf8rewind.h:1432
#define UTF8_CATEGORY_COMPATIBILITY
Flag used for maintaining backwards compatibility with POSIX functions, not found in the Unicode data...
Definition: utf8rewind.h:1497
#define UTF8_CATEGORY_PUNCTUATION
Combined flag for all punctuation categories.
Definition: utf8rewind.h:1398

Flag used for maintaining backwards compatibility with POSIX ispunct function.

#define UTF8_CATEGORY_ISALNUM
Value:
#define UTF8_CATEGORY_COMPATIBILITY
Flag used for maintaining backwards compatibility with POSIX functions, not found in the Unicode data...
Definition: utf8rewind.h:1497
#define UTF8_CATEGORY_NUMBER
Combined flag for all number categories.
Definition: utf8rewind.h:1348
#define UTF8_CATEGORY_LETTER
Combined flag for all letter categories.
Definition: utf8rewind.h:1287

Flag used for maintaining backwards compatibility with POSIX isalnum function.

#define UTF8_CATEGORY_ISALPHA
Value:
#define UTF8_CATEGORY_COMPATIBILITY
Flag used for maintaining backwards compatibility with POSIX functions, not found in the Unicode data...
Definition: utf8rewind.h:1497
#define UTF8_CATEGORY_LETTER
Combined flag for all letter categories.
Definition: utf8rewind.h:1287

Flag used for maintaining backwards compatibility with POSIX isalpha function.

#define UTF8_CATEGORY_ISUPPER
Value:
#define UTF8_CATEGORY_COMPATIBILITY
Flag used for maintaining backwards compatibility with POSIX functions, not found in the Unicode data...
Definition: utf8rewind.h:1497
#define UTF8_CATEGORY_LETTER_UPPERCASE
Uppercase letter code points, Lu in the Unicode database.
Definition: utf8rewind.h:1257

Flag used for maintaining backwards compatibility with POSIX isupper function.

#define UTF8_CATEGORY_ISLOWER
Value:
#define UTF8_CATEGORY_LETTER_LOWERCASE
Lowercase letter code points, Ll in the Unicode database.
Definition: utf8rewind.h:1263
#define UTF8_CATEGORY_COMPATIBILITY
Flag used for maintaining backwards compatibility with POSIX functions, not found in the Unicode data...
Definition: utf8rewind.h:1497

Flag used for maintaining backwards compatibility with POSIX islower function.

#define UTF8_CATEGORY_ISDIGIT
Value:
#define UTF8_CATEGORY_COMPATIBILITY
Flag used for maintaining backwards compatibility with POSIX functions, not found in the Unicode data...
Definition: utf8rewind.h:1497
#define UTF8_CATEGORY_NUMBER
Combined flag for all number categories.
Definition: utf8rewind.h:1348

Flag used for maintaining backwards compatibility with POSIX isdigit function.

#define UTF8_CATEGORY_ISXDIGIT
Value:
#define UTF8_CATEGORY_PRIVATE_USE
Private use category, Co in the Unicode database.
Definition: utf8rewind.h:1484
#define UTF8_CATEGORY_COMPATIBILITY
Flag used for maintaining backwards compatibility with POSIX functions, not found in the Unicode data...
Definition: utf8rewind.h:1497
#define UTF8_CATEGORY_NUMBER
Combined flag for all number categories.
Definition: utf8rewind.h:1348

Flag used for maintaining backwards compatibility with POSIX isxdigit function.