Microsoft Word Files
The following file extensions are supported when setting up file format configurations for Microsoft Word: .doc, .docx, .dot, .dotx, .docm, .dotm.
For more information, please see the following sections:
Default Word Configuration
Every file format has a default configuration that ensures a file can be translated; however, it does not handle every complex property that a source file may contain. The default configuration for Microsoft Word files does the following:
- Extracts document properties, headers, footers, the Table of Contents, and user comments for translation.
- Handles whitespace and symbols.
- Uses SRX rules for text segmentation and splits text at line breaks.
- Ignores formatting changes applied to whitespace, which reduces markup.
- Ignores Asian font changes when the source language is not Asian.
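The segmentation behavior in the list above can be pictured with a small Python sketch. It is only an illustration: real SRX rules are XML-based, language-specific break and no-break rules, and the single regular expression below (`BREAK_RULE`) and the `segment` function are hypothetical stand-ins, not the rules Wordbee Translator ships.

```python
import re

# Hypothetical, simplified stand-in for one SRX break rule: break after
# sentence-ending punctuation when whitespace and an uppercase letter follow.
BREAK_RULE = re.compile(r"(?<=[.!?])\s+(?=[A-Z])")


def segment(text: str) -> list[str]:
    """Split text at line breaks first, then apply the break rule per line."""
    segments = []
    for line in text.splitlines():
        for piece in BREAK_RULE.split(line):
            piece = piece.strip()
            if piece:  # skip empty pieces produced by blank lines
                segments.append(piece)
    return segments


if __name__ == "__main__":
    for s in segment("Hello world. This is a test.\nSecond line"):
        print(s)
```

A production rule set would also need no-break exceptions (for abbreviations such as "e.g."), which this sketch leaves out.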
Custom Word Configurations
If you use Microsoft Word files for translations, a custom file format configuration might be necessary to achieve the right results in your target file. Beyond the default settings, Wordbee Translator offers many additional options for:
- Specifying what will not be translated.
- Managing fonts for certain languages.
- Reducing markup.
- Handling embedded files.
To learn more about available file format configurations for Word, please see the Word Configuration Options page.
A Few Examples
The following are just a few examples of what you can do with a custom Word configuration in Wordbee Translator. For more advanced examples, please see our Word Questions and Answers page.
Excluding Colored Text
For example, red and green text within a Microsoft Word file may need to be left untranslated. The default configuration has no rule for this, but it can easily be accomplished by creating a custom configuration for Microsoft Word files.
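To make the idea of a color-based exclusion rule concrete, here is a minimal Python sketch of how such a rule could be evaluated against the XML of a .docx file. It is not how Wordbee Translator implements the option. It assumes directly applied run colors (the `w:color` element in WordprocessingML); colors inherited from styles or themes are not handled, and the function name and the hex values used are illustrative assumptions.

```python
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used inside word/document.xml.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}


def split_runs_by_color(document_xml: str, excluded: set[str]) -> list[tuple[str, bool]]:
    """Return (text, translatable) pairs, one per non-empty text run.

    `excluded` holds uppercase hex colors (for example "FF0000") whose runs
    should be left untranslated. Hypothetical helper for illustration only.
    """
    root = ET.fromstring(document_xml)
    result = []
    for run in root.iter(f"{{{W}}}r"):
        text = "".join(t.text or "" for t in run.findall("w:t", NS))
        if not text:
            continue
        color = run.find("w:rPr/w:color", NS)
        value = color.get(f"{{{W}}}val", "").upper() if color is not None else ""
        result.append((text, value not in excluded))
    return result
```

Called with `{"FF0000", "00B050"}` as the excluded set, the function would flag runs in those two colors as not translatable; the exact hex values depend on the colors used in the document.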
Excluding Specific Segments or Words
Another example is the need to omit certain segments of text from translation. In this case, add the segment to be omitted to the "Do not translate" portion of the Microsoft Word file format configuration.
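Conceptually, a "Do not translate" rule protects matching text so that only the remainder is sent for translation. The Python sketch below shows one way to picture this, by replacing matches with numbered placeholders. The `protect` function, the placeholder format, and the sample patterns are all assumptions made for illustration; they do not describe the configuration syntax or internals of Wordbee Translator.

```python
import re


def protect(segment: str, patterns: list[str]) -> tuple[str, list[str]]:
    """Replace text matching any pattern with numbered placeholders.

    Returns the masked segment and the protected strings in order, so they
    could be restored after translation. Hypothetical helper for illustration.
    """
    if not patterns:
        return segment, []

    protected: list[str] = []

    def stash(match: re.Match) -> str:
        protected.append(match.group(0))
        return f"{{{len(protected)}}}"  # yields "{1}", "{2}", ...

    combined = re.compile("|".join(f"(?:{p})" for p in patterns))
    return combined.sub(stash, segment), protected


if __name__ == "__main__":
    masked, kept = protect(
        "Install Wordbee Translator on server SRV-42.",
        [r"Wordbee Translator", r"SRV-\d+"],
    )
    print(masked)
    print(kept)
```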
Extracting Embedded Files
Custom configurations may also be used to extract embedded files, such as Excel or additional Word files, that have been included as part of the source document.
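For background, a .docx file is a ZIP package, and embedded objects are typically stored under the `word/embeddings/` folder inside it. The Python sketch below lists and reads those parts with the standard library. It only illustrates where embedded files live; it is not the extraction logic used by Wordbee Translator, it does not apply to the legacy binary .doc format, and the function name is an assumption.

```python
import io
import zipfile

# Folder where Word typically stores embedded objects in a .docx package.
EMBEDDINGS_DIR = "word/embeddings/"


def extract_embedded_files(docx_bytes: bytes) -> dict[str, bytes]:
    """Return {file name: content} for every part under word/embeddings/."""
    embedded = {}
    with zipfile.ZipFile(io.BytesIO(docx_bytes)) as package:
        for name in package.namelist():
            if name.startswith(EMBEDDINGS_DIR) and not name.endswith("/"):
                embedded[name[len(EMBEDDINGS_DIR):]] = package.read(name)
    return embedded
```

For a file on disk, the bytes can be read with `pathlib.Path("report.docx").read_bytes()`, where `report.docx` is a placeholder name.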