PDF iFilter Test, with SharePoint 2010

[Updated with the Adobe PDF iFilter result]

Foxit recently released version 2.0 of its PDF iFilter. One interesting improvement is that it is “optimized for multi-core processor”. Foxit claims 3 times better performance compared with its own version 1.0. Let’s see how it does in my test.

Machine:

Dell 2950 (I bought this two years ago)

2 x quad core Intel Xeon E5410 @ 2.33GHz, 8GB Memory, 3 x 10000 rpm HD RAID 5

File set:

22,559 PDF Files in 84 folders, 15.7 GB total, from People’s Daily Archive

Content Source:

File Share, on the same machine, to avoid network latency and bottlenecks

Software Platform:

Windows Server 2008 R2

SQL Server 2008 R2 Nov CTP

SharePoint Server 2010 Beta

Foxit PDF iFilter 2.0 x64

TET PDF iFilter 3.0 x64

Adobe PDF iFilter 9.0 x64

After each crawl, I reset the index and restarted the osearch14 service.

Test Result (updated with the Adobe PDF iFilter result)

Vendor | Crawl Time (h:m:s) | Crawl Time (s) | Files per Second | Success (including warnings) | Errors
Foxit  | 0:13:00            | 780            | 29.02            | 22637                        | 0
TET    | 1:17:44            | 4664           | 5.07             | 22637                        | 0
Adobe  | 8:29:43            | 30583          | 0.74             | 22637                        | 0
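
For anyone who wants to check the arithmetic, here is a minimal Python sketch that recomputes the seconds and files-per-second columns from the crawl times and success counts in the results table above. The helper names `to_seconds` and `files_per_second` are only illustrative; they are not part of SharePoint or any iFilter. Recomputed this way, Foxit and Adobe match the table, while TET comes out at roughly 4.85 rather than the 5.07 listed, so one of the TET figures may have been mistyped.

```python
def to_seconds(hms: str) -> int:
    """Convert an h:m:s string such as '1:17:44' to whole seconds."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def files_per_second(items: int, hms: str) -> float:
    """Crawled items divided by the elapsed crawl time in seconds."""
    return items / to_seconds(hms)


# Figures copied from the results table: (successful items, crawl time).
results = {
    "Foxit": (22637, "0:13:00"),
    "TET": (22637, "1:17:44"),
    "Adobe": (22637, "8:29:43"),
}

for vendor, (items, crawl_time) in results.items():
    rate = files_per_second(items, crawl_time)
    print(f"{vendor}: {to_seconds(crawl_time)} s, {rate:.2f} files/s")
```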

Amazing! The full crawl finished in just 13 minutes. If you remember my last test, Foxit version 1.0 managed 10.73 files per second on SharePoint 2007 x64 with the same hardware spec.
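
To put a number on the version-to-version comparison, the short sketch below divides the two throughput figures quoted in this post. The constant and function names are made up for illustration. Keep in mind that the two runs used different SharePoint versions, so the ratio is only a rough indication and not a clean measurement of the iFilter alone.

```python
# Throughput figures quoted in this post (files per second).
FOXIT_V1_RATE = 10.73  # Foxit 1.0 on SharePoint 2007 x64 (earlier test)
FOXIT_V2_RATE = 29.02  # Foxit 2.0 on SharePoint Server 2010 Beta (this test)


def speedup(new_rate: float, old_rate: float) -> float:
    """How many times faster the new rate is than the old one."""
    return new_rate / old_rate


ratio = speedup(FOXIT_V2_RATE, FOXIT_V1_RATE)
print(f"Foxit 2.0 vs 1.0: {ratio:.2f}x")  # prints "Foxit 2.0 vs 1.0: 2.70x"
```

That works out to roughly 2.7 times the earlier throughput, which is in the neighborhood of the claimed 3 times.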

Adobe PDF iFilter is single-threaded. Only one CPU core was working during the full indexing process.

Here are the screenshots of the CPU usage:

Vendor | Screenshot | Comment
Foxit  | snap0012   | Foxit PDF iFilter 2 is able to use multiple cores. Most of the time, CPU usage is around 90%.
TET    | snap0013   | TET PDF iFilter 3 is also able to use multiple cores. CPU usage is 100% all the time.
Adobe  | snap0015   | Adobe PDF iFilter 9 is single-threaded. It can only use one core of the machine.

Nice job, Foxit!

Jie.