
While many businesses work with SharePoint Server to store files for important projects, file servers still play a role in most corporate environments. File Classification helps system administrators better manage data stored on servers and prioritize critical data for retention and e-discovery purposes.
File servers often contain unwieldy folder hierarchies that evolve over time without any particular planning for how data might be best structured to ensure it can be managed effectively. Without knowing where critical business data is located, file management becomes an almost impossible task. Sysadmins must rely on users to store documents in the right places so that security can be set appropriately and stale data can be archived to reduce storage costs.
The File Classification Infrastructure (FCI) is new in all editions of Windows Server 2008 R2. It allows files to be classified automatically based on location or content, manually from Microsoft Office 2010 and via an Application Programming Interface (API) using scripts or FCI-compatible applications. Classification Properties can be defined to add classification information to files. Any file stored on a New Technology Files System (NTFS) volume can be classified. Microsoft Office files store classification information within the document as well as in the NTFS Alternate Data Stream, so if a file is moved to a non-NTFS volume, it will retain its file classification properties.
Once classified, files retain the original folder hierarchy even after they expire and are moved to an assigned directory. Other management tasks can be automated using custom commands, based on classification information or additional criteria. File owners can also be notified of tasks pending on their files by e-mail. The reporting feature allows sysadmins to generate reports in a variety of formats that provide an overview of where files are located on a server based on classification information.
Classification Properties define the form in which classification information can be added to files when a Classification Rule is applied. Classification Properties don’t contain any information about the final classification of files. FCI supports seven different property types:
| Classification Property |
Description |
|
Yes/No (Boolean) |
Used to create a flag. For example, to indicate whether the information in a file contains personal data. |
|
Date/Time |
To assign a date/time. |
|
Number |
For assigning a number label. |
|
Multiple Choice List |
For assigning more than one flag to a file. |
|
Ordered List |
One flag must be chosen from multiple options. |
|
String |
To assign a text label. |
|
Multi String |
To assign more than one text label. |
Let’s start by installing the File Server Resource Manager (FSRM) on your file server and defining three Classification Properties that can be used to assign classification information to files:

Now that you’ve defined some Classification Properties, you need to create one or more Classification Rules to classify files based on their location or content. Create three rules to identify files that contain the word Contoso and classify each file with a Project_Name of Contoso, Business_Impact as High and Personal_Info as No.


FCI uses Windows Search to identify content in files. Classification Parameters can be set to String, StringCaseSensitive or RegularExpression (more information on using .NET Regular Expressions for matching text can be found here [1]). You must set up multiple rules if you need to search for multiple strings or expressions. The Content Classifier combines multiple rules so that files are searched only once. Repeat steps 1 through 5 to create two more rules to set Business_Impact as High and Personal_Info as No, and don’t forget to click Advanced each time to set Additional Classification Parameters.
The process of automatically classifying files can be run on demand or according to a schedule.

To test the new Classification Rules, make sure there is at least one file in c:\user files, or equivalent directory, that contains the word Contoso.

The Contoso project was completed two years ago and all related files can be archived. Now let’s set up a management task to move all files classified as Project_Name=Contoso to a directory called Expired.

You should see that any files containing the word Contoso have been moved to the c:\expired folder.
Links:
[1] http://msdn.microsoft.com/en-us/library/hs600312.aspx