New word breakers may return different results in SQL Server 2012 Full-Text Search

If you rely on SQL Server Full-Text Search, then this is important information, even if you can’t recite the definition of word breaking and stemming. Word breakers identify word boundaries and break words into their components. Stemmers identify alternate forms of a word, such as “run,” “ran,” and “running.” Word breakers and stemmers are applied…


Custom proximity search with NEAR in SQL Server 2012

When you perform a proximity search with NEAR in versions of SQL Server prior to SQL Server 2012 "Denali," we decide what’s NEAR for you. In SQL Server 2012, however, you’re in control! With the new customizable NEAR operator, your full-text query can now specify 2 optional conditions that you couldn’t specify before: How near…


Looping over document similarity details in Semantic Search

Statistical Semantic Search, which is new in SQL Server 2012 “Denali,” indexes documents and text columns to support queries that go beyond the simple keyword search of LIKE or of Full-Text Search. Semantic Search lets you query for 3 types of information about your documents: Key phrases. Return the statistically significant phrases in each document….


Introducing Fulltext Statistical Semantic Search in SQL Server code name ‘Denali’ Release!

We are very excited to introduce a new Statistical Semantic Search feature with Fulltext Search in Denali. With an optional new parameter ‘statistical_semantics’ you can now augment your Fulltext keywords index with two new indexes – one that captures top 100 statistically relevant words from a document and another that based on these relevant words,…

2

Getting a custom IFilter working with SQL Server 2008/R2 (IFilterSample)

In this post I will cover how to get a custom IFilter working on SQL Server 2008/R2 (64-bit or 32-bit). For the purpose of this demo, I am using the IFilterSample that is available with Windows Server 2008 SDK.  Once you have installed the SDK you can find the sample here -> C:\Program Files\Microsoft SDKs\Windows\v6.1\Samples\winui\wsearch\IFilterSample…


Fulltext Search Improvements in SQL Server code-named ‘Denali’ CTP1

Fulltext Search codebase has been significantly revamped to address both query performance and throughput on large scale (millions of documents) with concurrent updates. With SQL2008 we moved all the index storage in the database file and majority of population logic in to the core engine to make fulltext search an integral and fully manageable engine…

1

SQL Server 2008/R2 Fulltext Search Fix for improving queries performance during concurrent index updates

SQL Server 2008/R2 Fulltext Search Fix for improving queries performance during concurrent index updates – http://support.microsoft.com/kb/958947 As you may already know, Microsoft has made significant improvements since SQL Server 2005 to fully integrate full-text search components inside the SQL Server execution engine and storage (http://technet.microsoft.com/en-us/library/cc721269(SQL.100).aspx). Full-text index population and storage happen within SQL Server (not…

0

Part2 – SQL Full text Search and International Characters: Interesting Scenarios, solutions and deep-dive using DMV’s

Continuing on my last post which dealt with Chinese characters, I will cover some user scenarios on Germanic characters like German/Swedish/Norwegian with SQL FTS. Read on… Scenario1 – German language  Mr Customer: Hey, we have some full text search catalogs for German words, and we noticed that FREETEXTTABLE fails to find text clearly defined in…


Part1 – SQL Full text Search and International Characters: Interesting Scenarios, solutions and deep-dive using DMV’s

As part of my work I come across varied scenarios that customers encounter with SQL Server. I recently had the opportunity to work on some interesting scenarios using SQL Full-text search and international characters (non-English). To most people, the concept of word-breaking especially with non-English characters is still a mystery. In this blog series, I will…