Improvements to Chinese language search in SharePoint


There is a longstanding issue with Chinese language search in SharePoint where users got strange results and the summary text highlighting was wrong. Chinese is hard to do word segmentation on (e.g. there is no punctuation character for word separation like the space character in English) and there are issues with neologisms introducing ambiguity. As part of my team's commitment to improving SharePoint for our Chinese customers, we have an experimental project to automatically detect these neologisms and allow customers to add them to SharePoint's custom dictionary. In the process, we found a major cause of poor search results and have made fixes for both SharePoint 2013 and 2016. You can read about it here.

Comments (0)

Skip to main content