The following file extensions are supported when setting up file format configurations for Email Messages: .msg, .eml.
For more information, please see the following sections:
To learn more about working with file format configurations, please see the following pages:
- View file format configurations
- Modify file format configurations
- Create file format configurations
- Test and validate file format configurations
Default Email Message Configuration
Every file format has a default configuration to ensure that a file can be translated; however, it does not handle every complex property that could be thrown your way when translating a source file. The default configuration for Email Messages does the following:
- Uses US-ASCII as the default character encoding.
- Translates email title and body.
- Hides tags preceeding and trailing text.
Uses SRX Rules for text segmentation.
Custom Email Message Configurations
If you are performing a Email Message translation, a custom file format configuration might be necessary to achieve the right results in your target file. Outside of the default configuration selections, Wordbee Translator offers many additional choices for configuring:
- A different character encoding.
- To not translate the email title and/or body.
Translate sender ("From") and destinations ("To", "Cc", "Bcc").
Show whitespaces at beginning and end to the translator.
Do not show texts containing neither letters nor digits to the translator.
- To not hide tags that preceed or trail text.
- Exclude specific content (texts, segments, regular expressions) from the translation.
- Always split text at line breaks.
Email Message Configuration Options
When setting up a file format configuration for Email Messages, there are many options to choose from to ensure extraction is successful. The General Tab contains options for changing the type of encoding, general content options, exclusion of content, and text segmentation. The options are described in general below based on individual sections.
- Encoding - This option may be used to change the type of encoding used for the Email Message source file. The default setting is US-ASCII.
- Content - This section contains options for content within the source file and may be used to choose what information in the email will be translated or not translated (title, body, sender, destinations, etc.). It also provides optins for showing or hiding beginning and ending whitespaces or texts that do not contain letters nor digits. Additionally, tags preceding and trailing text may be shown or hidden.
- Exclude Content - This option may be used to define content that should not be extracted for translation. For each piece of text (segments), the system looks for the texts or regular expression patterns entered in this section of the configuration. If a text/pattern matches, then it is possible to mark the segment as either translatable or not translatable. The system checks one pattern after the other until one matches. Text that matches none of the patterns is considered translatable.
- Text Segmentation - Enable/Disable SRX rules for text segmentation and choose to spilt or not split text at line breaks.