Anaplan supports Unicode, with the exception of some control characters. However, the characters you import can affect how text functions behave and how name length limits are counted.
Note: You can import special characters into Anaplan as data. However, try to avoid special characters in list item names, or in other values used as mappings in an import, because they can cause errors when you import.
Unicode planes
Unicode provides code points that combine to create a range of characters for different scripts and languages.
Unicode code points are organized into blocks on either:
- The Basic Multilingual Plane (BMP), which contains the characters for most scripts and languages in common use.
- The sixteen supplementary planes (referred to here as the SMP), which support less common scripts, emoji, and complex ideograms outside of the BMP.
For more information about which plane different characters are part of, see the Unicode Code Charts.
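To check which plane a character belongs to, the following minimal Python sketch may help. It relies only on the fact that each Unicode plane holds 65,536 code points, so the plane number is the code point shifted right by 16 bits. The helper names `unicode_plane` and `is_bmp` are hypothetical and are not part of Anaplan.

```python
def unicode_plane(ch: str) -> int:
    """Return the Unicode plane number of a single character.

    Plane 0 is the BMP; planes 1 to 16 are the supplementary planes.
    """
    return ord(ch) >> 16


def is_bmp(ch: str) -> bool:
    """Return True if the character is on the Basic Multilingual Plane."""
    return unicode_plane(ch) == 0


if __name__ == "__main__":
    for ch in ["A", "\u00e9", "\u6f22", "\U0001F600"]:
        print(f"U+{ord(ch):04X} plane={unicode_plane(ch)} bmp={is_bmp(ch)}")
```

A check like this could be run on source data before an import to find characters outside the BMP.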
Effects of multilingual planes
In Anaplan, characters from the BMP have a length of one, and characters from the SMP have a length of two.
This length affects the behavior of the LEFT, LENGTH, MID and RIGHT functions in the Classic engine. Each function considers a character from the SMP to be equivalent to two characters from the BMP.
Additionally, this length affects the 60-character limit for list item names and for other names you can define, such as module names. Each character from the SMP counts as two characters towards this limit.
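The counting rule described above can be sketched in Python as follows. This is an illustration only, not Anaplan's implementation: the helper names `weighted_length` and `fits_name_limit` are hypothetical, and the sketch assumes that every character outside the BMP counts as two and every other character counts as one, which is the same as the number of UTF-16 code units.

```python
NAME_LIMIT = 60  # name length limit stated in the text above


def weighted_length(text: str) -> int:
    """Count BMP characters as 1 and supplementary-plane characters as 2."""
    return sum(2 if ord(ch) > 0xFFFF else 1 for ch in text)


def fits_name_limit(name: str, limit: int = NAME_LIMIT) -> bool:
    """Return True if the name stays within the limit under this counting rule."""
    return weighted_length(name) <= limit


if __name__ == "__main__":
    name = "Region \U0001F600"  # seven BMP characters plus one emoji
    print(weighted_length(name))  # 9
    print(fits_name_limit(name))  # True
```

Under this rule, a name made up only of supplementary-plane characters reaches the limit after 30 characters rather than 60.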
Normalization
You should normalize your data when you import characters that have variants, such as accented letters. This can reduce errors and ensure consistency. For example, certain calculation functions handle canonically equivalent characters differently when their encodings differ.
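One way to normalize data before an import is with Python's standard `unicodedata` module, as in the sketch below. The choice of the NFC form is an assumption, because the text above does not say which normalization form to use. The helper name `normalize_text` is hypothetical.

```python
import unicodedata


def normalize_text(value: str, form: str = "NFC") -> str:
    """Return the value in the given Unicode normalization form (NFC assumed)."""
    return unicodedata.normalize(form, value)


if __name__ == "__main__":
    composed = "\u00e9"     # e with acute accent as a single code point
    decomposed = "e\u0301"  # letter e followed by a combining acute accent

    print(composed == decomposed)                                  # False
    print(normalize_text(composed) == normalize_text(decomposed))  # True
```

The two strings look identical on screen, but they compare as equal only after both are brought into the same normalization form.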
Control characters not supported in imports
The following Unicode code points and the control characters they represent cannot be imported into an Anaplan model:
Unicode code point | Character description |
0000 | Null character |
0001 | Start of heading |
0002 | Start of text |
0003 | End-of-text character |
0004 | End-of-transmission character |
0005 | Enquiry character |
0006 | Acknowledge character |
0007 | Bell character |
0008 | Backspace |
000B | Vertical tab |
000C | Form feed |
000E | Shift out |
000F | Shift in |
0010 | Data link escape |
0011 | Device control 1 |
0012 | Device control 2 |
0013 | Device control 3 |
0014 | Device control 4 |
0015 | Negative-acknowledge character |
0016 | Synchronous idle |
0017 | End-of-transmission block |
0018 | Cancel character |
0019 | End of medium |
001A | Substitute character |
001B | Escape character |
001C | File separator |
001D | Group separator |
001E | Record separator |
001F | Unit separator |
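The code points in the table above can be found and removed from source data before an import. The following Python sketch is one possible approach. The character class is taken directly from the table (0000 to 0008, 000B, 000C, and 000E to 001F), so tab (0009), line feed (000A), and carriage return (000D) are left in place. The helper names `find_unsupported` and `strip_unsupported` are hypothetical.

```python
import re

# Code points listed in the table: 0000-0008, 000B, 000C, 000E-001F.
UNSUPPORTED_CONTROLS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f]")


def find_unsupported(text: str) -> list[str]:
    """Return the distinct unsupported code points in the text, as sorted hex strings."""
    return sorted({f"{ord(ch):04X}" for ch in UNSUPPORTED_CONTROLS.findall(text)})


def strip_unsupported(text: str) -> str:
    """Remove every control character that appears in the table."""
    return UNSUPPORTED_CONTROLS.sub("", text)


if __name__ == "__main__":
    raw = "Widget\x00 A\tB\x1f"
    print(find_unsupported(raw))         # ['0000', '001F']
    print(repr(strip_unsupported(raw)))  # 'Widget A\tB'
```

Whether to remove these characters or reject the affected rows depends on the data. Reporting them first with `find_unsupported` makes that decision easier to review.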