description | ms.assetid | title | ms.topic | ms.date |
---|---|---|---|---|
Linguistic and Unicode Considerations |
a210bffc-fe71-4909-bc5c-d440890265c9 |
Linguistic and Unicode Considerations |
article |
05/31/2018 |
This section contains a list of linguistic and Unicode considerations that might affect word breaker and stemmer implementation. The list is not an exhaustive one.
This section includes the following topics:
- For a list of lanuages supported by word breakers, see Languages Supported by Windows Search.
- If you need to identify the language of a piece of text, you can use Language Auto-Detection (LAD), which is available in Windows 7 and later. For more information, see Extended Linguistic Services (ELS).
- For applicable reference documentation, see Data Add-in Interfaces.