Update: Mar 6th 2010. I finally updated this comparative analysis of free PDF to Word conversion software, re-writing it from the ground up, and in the process doubling the size of the article since the last iteration
If you’ve seen this article before you will notice that some new conversion options have been added while a few that were previously mentioned are gone. This is because since I first wrote this a number of excellent conversion options have become available, and I wanted to focus on the services that offer the highest quality. Ten different programs/services are presented here; 5 desktop-based apps and 5 online conversion services. Of the six options presented in the original article only two remain.
Online Conversion services: (6) NuancePDF (7) PDF to Word Free from NitroPDF; (8) Koolwire; (9) Zamzar, and (10) PDFOnline.com.Before I move on to the comparison section, a few issues to address:
Will the document be editable once it is converted?: typically, yes; however the real answer is it depends on your original document and how it was created. Generally speaking you will be able to edit any converted PDF that was created electronically and published using software, which comprises the overwhelming majority of forms and documents that are on the internet. The exception is if your PDF was scanned from a paper document; in this case most converters will consider this to be no different from a photograph, and when converting will produce images inside a .DOC file rather than text. What you need in this case is “Optical Character Recognition” (OCR) software, or a converter than can do OCR (the only option mentioned in this article that can do OCR is NuancePDF); otherwise what you can use free OCR software such as FreeOCR. These programs can “read” documents based on images and convert them to text, but in most cases it will be somewhat labor intensive to get a good conversion, especially if you have a large number of pages.
Online vs. desktop conversions: there are two categories of tools for PDF to Word file conversions. The first is a program that you would install on your PC that would convert the PDF file for you and save the output on your hard drive, which is what you would typically expect. The second category are online conversion websites. With these, you would need to email or upload your file that you want to convert remotely to a website. The actual conversion would occur remotely on their server, and subsequently the converted document is either emailed back to you to a valid email address that you provide or a link is produced that you can click to download the resulting file.
Typically, online conversion services offer a higher quality conversion, although depending on your document that may or may not be the case. Two issues that have to be considered with online conversions are reliability and security. With respect to the former issue (reliability), while most online services will send your converted document within a matter of minutes, it is sometimes the case that your file will arrive hours or even days afterwards. As for the latter issue (security), virtually all of the online services mentioned here promise that your files will be transferred via a secure connection, that your document will not be looked at by a human, and that your email will not be used or shared with any other party. Still, it is up to you to decide whether you are comfortable sending certain private or business documents remotely for conversion.
Converting to images: if you are interested in converting your PDF to images then you’re in the wrong place. For this try the freeware program aptly entitled JPEG to PDF or use the excellent free PDF reader called PDF X-Change Viewer which has an export to images option.
Documents used in the conversion tests:To perform this test I converted five documents. The first is a PDF document I created using Open Office’s export to PDF function that had the following elements (1) a two-column table with special formatting, which contained (2) three thumbnail-sized jpeg images. My PDF document also contained (3) working hyperlinks; and (4) it featured text that was bolded; (5) contained bulleted points, as well as (6) a numbered list. For the sake of simplicity my PDF was only a single page. To see what the document looked like click on the screenshot to the right.
The second document was a simple scanned image that was converted to PDF, to test whether any of the services would perform OCR (optical character recognition) and render an editable document. (only one did). The third document was a trade brochure that contained text, images, charts, and tables, and was 21 pages long, which I imagined would be typical of what most people would need to convert. The fourth document: a single-page PDF I created of a Russian website, for the sole purpose of testing whether any of these services support special characters (none did). Lastly, the fifth document consisted of a bunch of tables full of data, to see how these programs dealt with this.
The objective was to get a converted document that could be loaded into MS Word where the text/contents could be edited and subsequently saved as .DOC format, which is to say any of the following formats would be satisfactory: .DOC, .HTML, and .RTF.
Overview of programs/online services: #1: Free PDF to Word Converter from SmartSoft: a free, fully functional desktop-based PDF to Word program that produces excellent results. (See my full review of this program here). More info as follows:
- Interface: very simple interface and straightforward; single-click conversions; drag and drop files in batch.
- Commercial use: allowed; the issue is not addressed on their FAQ or anywhere on their site, therefore it is safe to assume that commercial use is ok.
- Download size: a mere 3.82 megs
- OCR support: none
- Strengths: excellent general PDF to DOC conversion for a desktop app.
- Weaknesses: text is handled via floating windows, no continuous editable stream makes editing large documents cumbersome.
- The results: I give this one 8/10, if just for being a well rounded free converter. Your converted document will generally look like the original. See the results sections below for a detailed discussion and head-to-head comparison against the other software mentioned in this article.
- Download size: a whopping 148 megs
- Commercial use: free for commercial use.
- How to use: run gDoc creator from the start menu, use the dropdown to select the conversion settings you want, then drag and drop your PDFs onto the interface.
- OCR support: none
- Bundled trialware: installs gDoc Fusion, a bundled trialware which you don’t really need. The software will by default launch this program at the conclusion of each conversion process, which can be somewhat annoying. However, there is a preference (button next to Open…) to turn automatic viewing off.
- Strengths: excellent handling/extraction of continuous editable text if that is what you’re after, fast; drag and drop interface, virtual printers to create PDFs and XPS from any printable document.
- Weaknesses: handling of images and tables is rudimentary (lumps these elements together into a single image for each page). Huge download size. Installs virtual printers to create PDFs and XPS from any printable document (a weakness if you don’t want or need them, that is, but could also be a strength).
- The results: I give this one 8/10. Your converted document will generally look like the original. See the results sections below for a detailed discussion and head-to-head comparison against the other software mentioned in this article.
This program is very competent and performed an excellent job. More info as follows:
- Interface: simple, easy, drag-and-drop. Adds a right click “convert PDF to Word” context menu entry, which can potentially be useful.
- Download size: 6.3 megs
- OCR support: none
- Commercial use: NOT allowed. If you are using this software for a business/commercial use you will need to purchase a license.
- Strengths: excellent handling of images, and formatting, can convert in batch.
- Weaknesses: text handling could have been better. Restricted to non-commercial use. Forcing you to “like” them on Facebook before receiving the free version of the software.
- The results: I give this one 8/10. This is a good general PDF to Word converted, and your converted document will look very much like the original. See the results sections below for a detailed discussion and head-to-head comparison against the other software mentioned in this article.
- Download size: a mere 220K for the add-in. OpenOffice itself is about 150 megs download or so. For instructions on how to install OpenOffice extensions go here.
- OCR support: none
- Commercial use: free for commercial use.
- Strengths: instantly open and edit PDFs, good handling of images and formatting, a good option for quick, on-the-fly edits of PDF documents.
- Weaknesses: text is handled via floating windows, not a continuous editable stream (which can make editing large documents cumbersome). Takes a long time (or flatly would not load) a couple of long, complicated documents that I threw at it.
- The results: I give this one 7.5/10 overall. Your converted document will generally look like the original. See the results sections below for a detailed discussion and head-to-head comparison against the other software mentioned in this article.
- Download size: a mere 690K installer.
- OCR support: none
- Commercial use: is not broached on their site, but the program is billed as “totally free” and it is safe to assume that this includes commercial use.
- Strengths: flexible handling of text, fast.
- Weaknesses: handling of images and formatting is considerably poor . Weak rendering of overall look and feel and layout.
- The results: I give this one 5/10 in terms of results. Your converted document is unlikely to look like the original if it contains images and formatting. See the results sections below for a detailed discussion and head-to-head comparison against the other software mentioned in this article.
- Download size: despite being an online service you will need to download the free PDF reader software, approx 19 megs.
- Reliability: good. At the time of this writing it was the most responsive of all online conversion services mentioned here, sending my files back within minutes. However, I previously had to wait for my files to arrive over an entire weekend.
- OCR support: Yes, and surprisingly high quality. This is the only conversion service mentioned in this article that offers OCR.
- Commercial use: allowed. I scoured their website and read their legal notices without finding any specific mention of this issue.
- Privacy: they will not sell or share your email (from their privacy policy, which doesn’t comment on files sent for conversion). Aside from that, their terms of use link was not clickable, which is a strange omission. This is not to say that your files will not be held in confidence and disposed of properly, but that I wasn’t able to read a clear policy that expressly stated this.
- Strengths: excellent handling of images and formatting, OCR support, integration with a desktop PDF reader, good handling of text and images, secure upload connection, can convert PDF to Excel as well.
- Weaknesses: no clear terms of service statement on their site. Response variability, which afflicts most popular online services, although I will say that at the moment this service is being quite reliable. Colors can at times be slightly off in the converted document.
- The results: I give this one 9/10 overall. Your converted document will generally look like the original. See the results section below for more detailed discussion and comparison against the other software mentioned in this article.
- Reliability: variable. At the time of this writing they were scheduled to update their servers within 2 weeks or so, at which point they promise better responsiveness. In my previous use of this service I generally received my files within minutes but also remember having to wait up to 12 hours for the files to appear.
- OCR support: None.
- Commercial use: is allowed. I scoured their website and read their legal notices without finding anything to the contrary.
- Privacy: they promise to not look at your documents, not share your email, and delete your files as soon as they send you the email response.
- Strengths: excellent handling of images and formatting, as well as text. A related service can also convert PDF to Excel format.
- Weaknesses: images can at times end up behind other elements, giving the impression that they did not make it into the converted document; variability in response time, which afflicts most popular online services.
- The results: I give this one 8.5/10 overall. Your converted document will generally look like the original. See the results section below for more detailed discussion and comparison against the other software mentioned in this article.
Still, Koolwire does a great job. Here are more notes on this one.
- How it works: you can email a file that is less than 10 megs in size. Files over 10 megs can be uploaded through the Koolwire website.
- Reliability: variable. One big negative is that it will flatly refuse to convert some complicated files (which the others incidentally had no problems with). Otherwise the files convert either within a few minutes. I previously had to wait up to 5 hours though.
- OCR support: None.
- Commercial use: allowed.
- Privacy policy: their legal blurb states that users will not receive any unsolicited emails at any point. They also maintain “the right (but not the obligation) to verify Content (i.e. view your documents) at their sole discretion”. Files (input and output) will be deleted upon being emailed and/or downloaded.
- Strengths: good handling of text, excellent handling of tables.
- Weaknesses: response variability; will fail to convert some files without explanation, handling of images and tables is rudimentary (lumps these elements together into a single image for each page).
- The results: I give this one 7.5/10 overall. Your converted document will generally look like the original. See the results section below for more detailed discussion and comparison against the other software mentioned in this article.
- How it works: max uploadable file size is 100 megs in the free version. It will let you upload files in batch, which can be very useful. You will have to specify the desired output format (.DOC) in the dropdown.
- Reliability: good. I was always able to access my converted files within a matter of minutes.
- OCR support: None.
- Privacy policy: your files are deleted once the conversion happens and the converted files are deleted once the download link expires. Also, to quote “files stored for download are only accessible by Zamzar”. Another quote: “Zamzar does not rent, sell, or share your personal information or email address with any other companies.”
- Strengths: good handling of text.:
- Weaknesses: response variability, handling of images and tables is rudimentary (lumps these elements together into a single image for each page).
- Results: I give this one 7/10 overall. See the results section below for more detailed discussion and comparison against the other software mentioned in this article.
- Reliability: high, since you do not need to wait for a return email and your conversions will occur more or less on the spot.
- OCR support: None.
- Commercial use: seems to be allowed, since the issue is not broached on the site.
- Privacy policy: they will not send you unsolicited email and, to quote their site , “will not monitor, edit, or disclose any personal information about you or the documents you submit for conversion, including their contents, without your prior permission”.
- Strengths: good handling of both text and images, best handling of tables. An excellent overall converter.
- Weaknesses: none that I can think of. This one was surprisingly excellent.
- The results: I give this one 9/10 overall. Your converted document will generally look like the original. See the results section below for more detailed discussion and comparison against the other software mentioned in this article.
Comparison of Results:
Image handling: a common issue here is that many services blend all images and visual elements on the page into a single page-wide background image, as opposed to rendering distinct images and distinct formatting. This is obviously ok if you are merely interested in editing the text, but will make editing a document holistically somewhat difficult.- Free PDF to Word Converter from SmartSoft: does not create a single background image that lumps all images within it, which is a good thing. Instead it will render distinct, individual images, which can be easily moved around or manipulated. **Excellent**
- gDOC Creator: blends all images and formatting within a page into a single background image. This makes it difficult and/or labor intensive to edit images or change image placement, but can make life simpler if you just want to edit the text.
- AnyBizSoft PDF to Word Free: image handling seems to vary but is generally excellent; at times it will combine all images and formatting on a page into a single background image, while other times the images are extracted individually and in the correct placement. It seems to depends on the source. **Excellent**
- OpenOffice with PDF import extension: images are rendered correctly where they should be, and are generally easy to work with. **Excellent**
- Some PDF to Word Converter: unpredictable. Will extract individual images at times, while other times it simply will not extract any images at all. In general image placement and layout will most certainly be incorrect.
- NuancePDF: does not create a single background image that lumps all images into it; rather, it seems to merge adjoining clusters of images together, which actually kind of works well and generally speaking is not a bad way to handle images. **Excellent**
- PDF to Word Free from NitroPDF: image handling seems to vary but is generally excellent; at times it will combine all images and formatting on a page into a single background image, while other times the images are extracted individually and in the correct placement. It seems to depends on the source. **Excellent**
- Koolwire: blends all images and formatting within a page into a single background image
- Zamzar: blends all images and formatting within a page into a single background image
- PDFOnline.com: image handling seems to vary but is generally excellent; at times it will combine all images and formatting on a page into a single background image, while other times the images are extracted individually and in the correct placement. It seems to depends on the source. **Excellent**
- Free PDF to Word Converter from SmartSoft: each line of text was its own text box, filling the page with dozens of separate text boxes and potentially making the editing process labor intensive. In some complex documents case can be handled incorrectly (e.g. “PuBlishing firMs” instead of “Publishing Firms”.
- gDOC Creator: excellent text handling; a continuous editable stream of text uninterrupted by carriage returns where they shouldn’t be. Text always rendered correctly. **Excellent**
- AnyBizSoft PDF to Word Free: continuous text in a single text box, but with carriage returns at the end of each line, interrupting flow. Text was always rendered correctly.
- OpenOffice with PDF import extension: each line of text was its own text box, filling the page with dozens of separate text boxes and potentially making the editing process labor intensive.
- Some PDF to Word Converter: this one allows the user to change the settings for text extraction in order to get exactly what you want (using text boxes, auto merge, deleting line breaks). If you just want to extract the text quickly from a PDF it might be a good option.
- NuancePDF: a continuous editable stream of text uninterrupted by carriage returns where they shouldn’t be. However, the text case sometimes did not render correctly (e.g. “IncrEasInG portalIzatIon” instead of “Increasing Portalization”).
- PDF to Word Free from NitroPDF: a continuous editable stream of text uninterrupted by carriage returns where they shouldn’t be. However, the text case sometimes did not render correctly (e.g. “IncrEasInG portalIzatIon” instead of “Increasing Portalization”).
- Koolwire: seems to handle text well, producing a continuous editable stream of text uninterrupted by carriage returns where they shouldn’t be. However, I couldn’t tell if it had any of the case rendering problems mentioned above because it flatly refused to convert the complicated document where they occured (without explanation).
- Zamzar: continuous text in a single text box, but with carriage returns at the end of each line, interrupting flow. Also, the text case sometimes did not render correctly (e.g. “IncrEasInG portalIzatIon” instead of “Increasing Portalization”).
- PDFOnline.com: a continuous editable stream of text uninterrupted by carriage returns where they shouldn’t. Moreover the text case always rendered correctly. **Excellent**
If your document is table-intensive or if tables are your primary focus you would be better off with something that converts to Excel rather than to Word. Check out PDF to Excel Free, an online PDF to Excel service from the makers of PDF to Word Free, or try using”Excel” as the output format in NuancePDF.
- Free PDF to Word Converter from SmartSoft: no actual tables produced, but did an excellent job of “recreating” tables using text boxes, including the formatting. But no actual tables produced.
- gDOC Creator: recreated tables using text boxe and formatting, with an end result that looked quite acceptable.
- AnyBizSoft PDF to Word Free: recreated tables using text boxes and formatting, but the two elements were not quite in sync and the end result looked somewhat messy.
- OpenOffice with PDF import extension: no tables, of course, since the editing takes place in the drawing program (OpenOffice Draw).
- Some PDF to Word Converter: created a complete and total mess. Neither the content no’r the look of the table was preserved.
- NuancePDF: recreated tables using text boxes and formatting, with an end result that looked quite acceptable.
- PDF to Word Free from NitroPDF: recreated tables using text boxes and formatting, with an end result that looked quite acceptable.
- Koolwire: was able to recreate actual tables, as well as recreate the formatting. Very impressive. **Excellent**
- Zamzar: recreated tables using text boxes and formatting, but the two elements were not quite in sync and the end result looked somewhat messy.
- PDFOnline.com: was able to recreate actual tables, as well as recreate the formatting. Very impressive. **Excellent**
- Free PDF to Word Converter from SmartSoft: no
- gDOC Creator: yes
- AnyBizSoft PDF to Word Free: yes
- OpenOffice with PDF import extension: no (documents re-exported to PDF did not maintain working hyperlinks)
- Some PDF to Word Converter: no
- NuancePDF: no
- PDF to Word Free from NitroPDF: yes
- Koolwire: yes
- Zamzar: yes
- PDFOnline.com: yes
- NuancePDF: is the only converter that is able to do this. Check out the image on the right for an illustration.
- Not a single conversion option successfully dealt with this, I’m afraid.
- Free PDF to Word Converter from SmartSoft: 8/10. If you want a well-rounded desktop-based converter that you could also use commercially this is the one for you.
- gDOC Creator: 8/10. gDOC Creator’s handling of text is among the best discussed in this article. If your primary concern is editing the text of PDF documents (and you want to use a desktop app rather than a web service) then this is the one I would recommend.
- AnyBizSoft PDF to Word Free: 8/10. A good desktop converter, but cannot be used for commercial projects (which takes off a point in my scoring system)
- OpenOffice with PDF import extension: 7.5/10. This one is recommended for quick, on the fly edits. If you use OpenOffice then installing the PDF Import plugin is a must. If you are planning to do extensive editing of large documents, however, you would be better served by some of the other options presented here.
- Some PDF to Word Converter: 5/10. Use only to extract text from PDFs quickly.
- NuancePDF: 9/10. Excellent quality conversions, as well as support for OCR. This is your destination of choice if your conversions with other software/services simply renders DOC or RTF files with images inside and no editable text.
- PDF to Word Free from NitroPDF: 8/10. excellent quality conversions, but hampered by variable reliability. This may change, however, once their promised server upgrade is implemented.
- Koolwire: 7.5. ok conversions, but not as advanced as some of the other online conversion options.
- Zamzar: 7/10. ok conversions, but no reason to use this one given that other online services are better.
- PDFOnline.com: 9/10. This was completely unexpected. It provides the best quality PDF to Word conversion; will produce a download link instantly and will not keep you waiting. Highly reommended.
No comments:
Post a Comment