Bulk Convert DOC to DOCX


Microsoft has a bulk conversion utility that can convert multiple DOC files to DOCX files.  This blog post presents very basic information on its use.  This utility has been around for a while, but a fair number of people don’t know of its existence.  Because it’s a powerful tool designed to convert all files on a server (or even a bunch of servers), its use is a little cryptic.  This is fine for IT professionals, but if all you want to do is bulk convert a single directory of DOC files, this post shows what you need to do.

This blog is inactive.
New blog: EricWhite.com/blog

Blog TOC
Before you can use this tool, you need to install the Microsoft Office Compatibility Pack.  While the compatibility pack is primarily used with Office 2003 to allow it to load and save Open XML files, in essence, the compatibility pack is comprised of file format converters, which the Office File Converter (OFC) uses.  To use OFC, you need to install the compatibility pack even if you only have Office 2007 installed.  Download the compatibility pack here.

Next, you need to download and install the Microsoft Office Migration Planning Manager (OMPM).  This contains a bunch of stuff, but for bulk file conversion, we are primarily interested in just one tool, OFC.EXE.  Download the Microsoft Office Migration Planning Manager here.

When installing, as usual, you will select the installation directory.  OFC.EXE is in the Tools directory under the installation directory.  In addition, in that directory, there is a sample OFC.INI file.  To run OFC.EXE, you need to create an OFC.INI, and place it in the directory where you run OFC.EXE from.

For my simple example, I want to convert all DOC files found in the C:UsersericwhitDocuments8-09-19-Doc, and place the converted DOCX files in C:UsersericwhitDocuments8-09-19-Docx.  Here is the OFC.INI file for this scenario:

[Run]
LogDestinationPath=C:UsersericwhitDocuments8-09-19-Conversion-Log
TimeOut = 3
 
[ConversionOptions]
; FullUpgradeOnOpen: if set to 1, Word documents will be fully converted to the OpenXML format
;                    if set to 0 (default), Word documents will be saved in the OpenXML format in compatibility mode
FullUpgradeOnOpen=1
 
CABLogs=0
 
MacroControl=0
 
[FoldersToConvert]
fldr=C:UsersericwhitDocuments8-09-19-Doc
 
[ConversionInfo]
SourcePathTemplate=********************
DestinationPathTemplate=C:UsersericwhitDocuments8-09-19-Docx
 

The SourcePathTemplate and DestinationPathTemplate provide extremely flexible configuration for bulk conversion of a large number of directories.  They allow an IT professional to, say, convert files from one entire server, place the converted files on a different server, and use various sub-paths of the path of the source document to form the path where the DOCX will be placed.  Fully configuring SourcePathTemplate and DestinationPathTemplate is beyond the scope of this blog post.  The configuration above basically specifies to just take all of the DOC files in the specified fldr directory, and place them in the specified DestinationPathTemplate directory.

So, place a modified version of the above INI file in some directory, and run OFC.EXE from that directory.  You will see something like this:

 Your newly created DOCX files will be in the directory you specified in DestinationPathTemplate.

Comments (39)

  1. Voici les quelques liens techniques intéressants autour de Open XML de cette dernière semaine : Créer

  2. Voici les quelques liens techniques intéressants autour de Open XML de cette dernière semaine : Créer

  3. Gary Nevills says:

    Thanks for posting this, very big time saver.  I was wondering if there is a simple command to put in the INI file to do a folder and all of it’s subfolders

  4. Kingsley Starling says:

    If you want to make sure that files in all subfolders are also dealt with, use the ConvertSubFolders setting.

    For example, add the following line to the [FoldersToConvert] section:

    ConvertSubFolders=1

  5. Callum says:

    I was wondering if this could be used to convert from DOCX to just DOC?

    Thanks,

    Callum

  6. Hi Callum,

    This utility only converts from DOC to DOCX.

    -Eric

  7. Leo says:

    This creates duplicate copies in the new format, it does not convert in the sense of the word. what most people need is to convert the doc file to docx, not to make another copy in a different location. How can you tell this tool to remove the old doc files after it’s created the new docx files?

  8. Hi Leo, yes, it’s true that OFC.EXE only converts files, and doesn’t remove old ones.  To implement something like this is pretty easy using PowerShell.

    -Eric

  9. Leo says:

    I’m not familiar wih using that, has anyone got something already that I could use or toy with?

  10. Kyle Finley says:

    A few months ago I thought of this. I wanted an explorer add-in that would allow me to right click…

  11. Robert says:

    Why not have a built in way for it to clean up after a successful conversion?  We have hundreds of thousands of files to convert and do not want to create duplicates.  We want to benefit from the reduced file size to shrink our file storage needs.  Powershell is not much of a solution, if I didn’t like GUI’s, I would switch to Unix!

  12. Hi Robert,

    I don’t know the reasoning of the dev team, but I would guess that it’s because removing documents gets into the whole area of document retention policies – who can delete which documents – what about documents that are owned by someone who doesn’t want them deleted, etc.  This is an extremely complicated issue, and adding functionality into a document conversion tool that encroaches on the issues of document retention would really complicate the tool.

    One approach – you can configure OFC.INI to convert documents and place the new documents in a parallel directory structure.  You can then delete the original directory structure with a single command, which reduces your burden significantly.

    -Eric

  13. Robert says:

    I can see your point, but can you imagine moving 1500 users cheese on them?  I have thought about converting one home directory at a time, putting all the converted docs in one place, and then moving them back.  Can you see where I am going here?  

    Anyway, I guess after about 10 or so years, all the old docs and spreadsheets will be gone.

    I just wish a tool could be developed to make this task feasible for large amounts of files so the end users do not feel the pain.  I guess until something third party comes out since Microsoft evidently has no intentions of doing it, we will not benefit from the XML format except in newly created files.

  14. Cohen says:

    Actually you don’t need the ofc just the compatibility pack. You can use the commandline (good for batch scripting/ powershell for those looking for a different behaviour than the ofc offers):

    "C:Program FilesMicrosoft OfficeOffice12wordconv.exe" -oice -nme <input file> <output file>

  15. Babz says:

    I have converted the file doc to docx format. But I could find that the Document variables in the file has got corrupted. And also, I have tried the convert in the Office Menu Button.

  16. Riaan says:

    Hi Guys,

    Same problem here… this really sucks!

    We have a 900GB user drive, and using this tool means I have to almost duplicate the drive capacity to have the same amount of free space.  Wonder why they didn’t just put the option in there for the "business" to decide..

    Riaan

  17. Davey says:

    I have been struggling with this issue forsome time now. I have several hundred documents created in MSOffice 98 for Mac, and I want some way to convert them in bulk. I have tried this conversion program, and it only converts a handful of documents-I’m assuming just the ones that were written in a more recent version of office. Is there any way anyone can helpme figure out how to convert these files, other than loading each one up on the mac and going through the steps to savethem in a different format? Thanks.

    Davey.vee AT Gmail DOT com

  18. Max says:

    it is taking all the files and put them all in one folder it is not creating the subfolders!

    could u tell me how can i make the INI creat the subfolders ?

    it will take long time to creat the subfolders and arrange the files again.

  19. Davide Russo says:

    I have try this tool, but when i open the converted docx, it seems that these stay in compatibility modus. I think the compatibility pack is only for office2007?

    I have also tried FullUpgradeOnOpen=1, but the compatibility modus on the document stay.

  20. E-Z-E says:

    You shouldn't need twice the storage space as the new files should be significantly smaller.  However, I have found that files with inserted images do not compress much.

    Re: removing orginal files…If you feel confident that the file conversion was successful, just do a search for .doc, select all, hit Delete.

  21. Clint says:

    I did a bulk convert on a folder, one of the main reasons was to reduce the overall size of the word documents.  But I found that using the ofc utility treats the file differently than converting it using Word.  There were 452 files and 128MB in the .doc format, after converting to .docx format, the folder size grew to 141MB and 434 files, as not all the files in the original folder ended up being word documents.  I checked one of the files that got larger 30MB to 46MB and opened the original in Word 2007, saved as a 2007 document and the resulting file size was 27MB.

    Any thoughts on what may be causing this? I have compatibility on in both scenarios, there are no macros… We have about 5 Gigs of .doc files and I was hoping to reduce that, not grow it.  Any help would be appreciated.

    Thank you

  22. Jim says:

    Very helpful, especially since the ofc.exe will crash, providing no feedback at all, if you type the SourcePathTemplate and DestinationPathTemplate incorrectly–which on my machine, included leaving them in their original format.

  23. Jim says:

    It might be helpful if you also posted a link, where we could log bugs against ofc.exe….  When I converted from source = "c:StuffOld" to "c:Stuff", the directories directly underenath "old" vanished, leaving me with hundreds of directories splattered into the "stuff" directory, and trying to figure out which ones went where.

  24. Matt Hatch says:

    I am trying to convert word files (Office 97-2003) that open up and you must select RTF (Rich Text Formatting) before using the file as they were all converted from PDFs.  Is there a way to change the script to account for this and if so what is it?  I tried a test file for a normal word 2003 doc and it worked fine.  Just need to change the scrpit around a bit to account for this extra step. Please contact me at Mstretton@hatch.ca if you can be of any help.

  25. Pepe says:

    Hi Riaan and Robert, please tell if you found the solution. I would like to do the same in my company, but I can't duplicate the files and require the users to clean the drives! I guess you are asking for the same as I do. I would be very grateful if you help me.

    My email address is xola87@yahoo.com

  26. billy sherlock says:

    i have a quick questiion… i have 30,000 files to do and have 100's of nested folders, how can i leave them in the folders or moved them in to new location and keep the same folder names with new file.

  27. Zarfishan says:

    http://saaspose.com/api/words also allows you to convert .doc file to .docx formats and to many other formats.

  28. Paul H says:

    @billy sherlock

    Use the following for source and destination at the end of the ini file. Also make sure you have ConvertSubFolders=1 under [FoldersToConvert] as well.

    SourcePathTemplate=*

    DestinationPathTemplate=*1

  29. Emanuel says:

    Hi, I', getting an error: Failed to get date-time attributes.

    What should I do in order to fix this?

    Regards,

    Emanuel Santos

  30. Christos says:

    I have office 2010 installed on my computer and ofc won't run because "Format converters are not installed on this machine"

  31. Arne says:

    @Christos,

    despite having installed Office 2010, you need the FileFormatConverter Tool to let ofc.exe run.

    Please refer to technet.microsoft.com/…/cc179019(v=office.14).aspx for further details.

    And as described, the files will just be converted from 2003 format to 2007 format.

    After converting *.doc to 2007 *.docx Format you will still get the hint in Word 2010, that the

    document is jsut opened in 'compatibility mode'.

    Hope This helps. Regards Arne

  32. ISK says:

    Convert docx to pdf: Visit:  http://www.youtube.com/watch

  33. beginner says:

    Hi, why not just do it with a simple easy DOS command:

    1/ Put all the files you want to change into one new folder in C:  (you can name it anything but I will call it "Test")

    2/ then (via Start) go to "Run" and type  CMD  (and enter) this opens a (black) DOS command space

    3/ in the black command space you type   CDTest  (and enter)   (CD means change directory)

    You are now in the folder "Test"

    4/Make sure you are now in your target folder "Test" or else you will chance other stuff also!

    Now type the DOS command   REN .  *.DOCX   (what you typed is: REN asterix dot asterix space asterix dot docx)

    What you did here is renaming for all files everything what is AFTER the dot into the new extention "docx"

    (Note: what is before the dot will not be renamed, since the asterix makes everything before the dot remain the same).

    P.s. BEFORE START:

    a/ make sure you can see the extensions after the dot since sometimes Windows hides them

    b/ Make also sure that you have everything in a COPY FOLDER in case yo make a mistake!!

    c/ experiment first with a folder with some files that would be no loss if you made a mistake!

    good luck!

  34. POE says:

    @Beginner

    Thats not a conversion but a simple renaming and it will not work!

    Regards Poe

  35. Dylan says:

    This converted all my .doc files to .docm files. I thought it was supposed to convert them to .docx, did I do something wrong?

  36. Anon says:

    This open source utility might be better:-

    sourceforge.net/…/b2xtranslator

  37. Michael says:

    Hi

    I what i need to add for add source sub folders , and while making the conversation it will be also create sub folders

  38. Boris says:

    Hi,

    what should I type in the DestinationPathTemplate if I want the new files to stay in the source folder of the DOC files?

  39. Jesse Pepper says:

    Hi there, I presume this runs the word executable is that correct?  Is this safe to do on an unattended machine (automated process that converts .doc files to .docx as they come in, but with no human interaction).  Microsoft suggests that running word in headless mode is not suitable for unattended environments since the application may still show dialogs to the user.  Does OFC.exe just call into word?

    If so, does anyone know of a solution that is safe to run on an unattended computer?