Part # 1 : Working with SCL (Spam Confidence Level)

Article
02/11/2009

What is the SCL?

The spam confidence level (SCL) is the normalized value assigned to a message that indicates, based on the characteristics of a message (such as the content, message header, and so forth), the likelihood that the message is spam.

What are the SCL values?

There are eleven values available to categorize spam, as outlined in the following table.

SCL Value	Spam Categorization
-1	Reserved by Microsoft® Exchange Server 2003 for messages submitted internally. A value of -1 should not be overwritten because it is this value that is used to eliminate false positives for internally-submitted e-mail.
0	Assigned to messages that are not spam.
1 to 9	Extremely low likelihood that the message is spam. ...ranging to... Extremely high likelihood that the message is spam.

This array allows you to choose how aggressive or conservative you want your spam filtering to be by selecting a threshold value above which you consider a message to be spam. If you want to aggressively filter spam, you can choose a fairly low threshold, such as an SCL value of 5, which would catch a higher number of spam messages. However, a higher number of false positives would also be caught. To filter spam more conservatively, you can choose a higher threshold, such as an SCL value of 8, which would catch fewer spam messages, with a lower number of false positives being caught.

How this values are getting assigned to messages?

Spam filtering algorithms assign spam ratings, scores, or probabilities to messages. This value is referred to as the algorithm's raw score. The raw scores are then normalized to a set of standard SCL values and assigned to a message by the spam filtering algorithm. Raw scores are normalized to a set of standard SCL values for the following reasons:

· Configuration settings for the handling of spam are based on the SCL value. Actions performed on messages will typically be determined by thresholds, for example, "move all messages with an SCL value greater than x to the Junk E-mail folder."

· As algorithms evolve, the raw scores they produce may change in meaning. Normalizing the raw scores ensures that the user experience stays relatively constant throughout the evolution of an algorithm.

· Developers will create different spam filtering algorithms that will distinctly assign raw scores. Normalizing these varying raw scores will present a standard value to the end user.

How it’s mapped?

Because different filters will have unique methods of rating messages, the precise mapping of a raw score to an SCL value will vary. The following are general guidelines for mapping raw scores to SCL values:

· Binary results. If an algorithm produces a binary result where the message is determined to be either not spam or spam, a rating of 0 should be used for not spam and a rating of 9 for spam.

· Distribution. If an algorithm produces a distribution of raw scores, a rating of 0 should be assigned to messages determined to not be spam. The remaining raw scores should be mapped in the range of 1 to 9, determined by the probability that the message is spam.

How can I configure Exchange 2003 to block unsolicited commercial e-mail (spam) with Intelligent Message Filter?

Microsoft Exchange Intelligent Message Filter helps companies reduce the amount of unsolicited commercial e-mail (UCE), or spam, received by users. The Intelligent Message Filter is based on Microsoft SmartScreen Technology from Microsoft Research. By using e-mail characteristics tracked by SmartScreen technology, Intelligent Message Filter can help determine whether each incoming e-mail message is likely to be spam. Based on this likelihood, you can choose to block e-mail messages at the gateway or at the mailbox store.

How we can expose the SCL value in Outlook?

There is a wonderful article from James Webster which discusses regarding this.

Part # 1 : Working with SCL (Spam Confidence Level)

Additional resources