This is a read-only archive. Find the latest Linux articles, documentation, and answers at the new Linux.com!

Linux.com

Feature: Reviews

OpenOffice.org extension will add PDF editing

By Bruce Byfield on July 03, 2008 (7:00:00 PM)

Share    Print    Comments   

Easy PDF editing is coming to OpenOffice.org, but you'll have to be patient for a few months. Recently posted to the OpenOffice.org Extensions site, the Sun PDF Import extension (SPI) is only in beta, and only works with recent developer builds of OpenOffice.org 3.0, which is scheduled for September release. Right now, the quality of the final release is anybody's guess, but the beta's capabilities fall squarely in the middle of the available PDF import tools.

To investigate SPI, you need to download and install a snapshot build of OpenOffice.org 3.0. Then, from within the build, you can install SPI from Tools -> Extension Manager. The next time you start OpenOffice.org 3.0, you'll be able to open PDF files from any of the options for opening an existing document in the File menu.

By default, SPI opens PDF files in the Draw application, although you could also use Impress, which shares much of the same code. This default might seem strange at first, especially if your PDF file is text. Actually, though, using Draw is logical, given the limitations of the PDF format. No application -- not even Acrobat, the proprietary PDF editor made by Adobe, the company that wrote the PDF specifications -- is able to edit more than a single line while preserving format. Given this limitation, importing to Draw makes sense, because it can treat each line as a separate text object for editing. Although rearrangement of a paragraph requires line by line editing with SPI, and can be tricky if you need to add an extra line, the extension leaves you no worse off than any other PDF editor.

But at least you are in a relatively friendly user interface. Aside from the limitation of editing one line at a time, the worst problem you are likely to have is the automatic capitalization of the first character of each line if you have Autocorrect turned on while you edit.

In testing, SPI's success at importing text depended largely on the fonts being used in the document. For best results, you need to have the fonts in the imported PDF file installed on your system; otherwise, SPI will use a substitute font that may not correspond to the letter spacing of the original. Also, while common fonts such as Helvetica or Times Roman create few problems, SPI seems to have trouble reading the metrics of some PostScript fonts and displaying them correctly. Usually, the display problem takes the form of a line of text that, converted to a text object, extends far beyond the page margins, and makes reformatting tedious if not impossible. At times, too, the problem leaves random spaces scattered throughout all the lines.

Graphics in imported PDF files had similar mixed success during testing. Many import into Draw without any trouble, with text wrapping around them in the same style as the original document. However, some PNG images -- but not all -- were imported vertically inverted, and, in another case, a graphic became an uneditable object. And, in some complex layouts, the positioning of some graphics was off by perhaps a dozen pixels.

In the beta, SPI cannot handle PDF forms, and text alignment is not always preserved, with fully justified text showing a strong tendency to import as left-aligned. Nor are hyperlinks supported, although they are a basic necessity for many online documents.

Otherwise, the list of what SPI can handle is much larger than the list of problems: Text frames, sections, multilevel lists, and table formatting including border and background color, are all imported without any problems, all of which makes for a promising start for the extension.

For now, though, the problems with rendering fonts and graphics mean that SPI, like OpenOffice.org 3.0, is not ready for production use.

Still, in its current state, SPI is ahead of Abiword, which simply extracts the text from a PDF file and not the graphics, and KWord, which preserves line division but not most other text formatting.

But SPI's current state is behind that of Inkscape, whose main limitations are a restriction to single-page imports and the failure to preserve hyperlinks. Nor is SPI as reliable as PDFedit, which, despite being aimed at advanced users, remains the most reliable PDF editor for the GNU/Linux desktop. Still, a lot of development can happen in the next few months, and if SPI continues as it has started, its final release just might become an essential OpenOffice.org extension.

Bruce Byfield is a computer journalist who writes regularly for Linux.com.

Share    Print    Comments   

Comments

on OpenOffice.org extension will add PDF editing

Note: Comments are owned by the poster. We are not responsible for their content.

OpenOffice.org extension will add PDF editing

Posted by: Anonymous [ip: 205.167.180.132] on July 03, 2008 08:19 PM
"No application -- not even Acrobat, the proprietary PDF editor made by Adobe, the company that wrote the PDF specifications -- is able to edit more than a single line while preserving format."

They can't figure out a way to automatically link the single-line text elements in a natural fashion? Honestly I'm very surprised... I'm thinking of the MS Publisher feature that links two text boxes so the content flows from one to the other. It seems like that would be extremely easy to implement.

#

Re: OpenOffice.org extension will add PDF editing

Posted by: Anonymous [ip: 71.230.208.105] on July 04, 2008 04:10 AM
The problem is in how the PDF standard works. Each block of text, encrypted or not, is checksummed. Likewise, each page is its own object. (I'm speaking a bit loosely as it's been a few years since I've written PDF code). Between the checksumming of each object on each page and some of the other features, it's actually quite difficult to remove or change a single object without recomputing changes across the whole document.

#

OpenOffice.org extension will add PDF editing

Posted by: Anonymous [ip: 147.163.1.235] on July 04, 2008 07:22 AM
I appreciate very much a better export than a new edit option.
I need to generate PDF with embedded fonts but only Acrobat distiller offers a fully functional way to to this.
I need to switch to Windows OS every time I ned to generate PDF with embedded fonts, a very frustating practice.
Do you know a way to generate PDF with embedded fonts form OO or other Linux applications?

#

Re: OpenOffice.org extension will add PDF editing

Posted by: Anonymous [ip: 89.57.42.171] on July 04, 2008 08:05 AM
To my knowledge is scribus able to embed fonts.
http://www.scribus.net/

#

Re: OpenOffice.org extension will add PDF editing

Posted by: Anonymous [ip: 69.81.118.3] on July 08, 2008 12:07 AM
I thought OOo did font embedding. I've always been able to print PDFs generated from it without the need for the fonts to be installed on the computer I'm printing from.

#

OpenOffice.org extension will add PDF editing

Posted by: Anonymous [ip: 203.208.110.227] on July 04, 2008 11:32 AM
I think this would be a killer extension. What an addition this would make to the already great OpenOffice.org.

http://tnetech.net

#

OpenOffice.org extension will add PDF editing

Posted by: Anonymous [ip: 190.42.5.156] on July 05, 2008 03:48 AM
What I'd like to know is what can we do with that pdf after imported to OOo. I would like to be able to rearrange pages and put multiple pages in one easily, also cut out the borders and add watermarks. I know there are command line apps for some of this, but a GUI app would be killer.

#

No one else excited about Hybrid-PDFs?

Posted by: Anonymous [ip: 86.14.232.225] on July 05, 2008 10:17 PM
I think this review missed the most exciting feature in this extension: hybrid-PDF creation. Hybrid PDFs are PDFs with the ODF source xml embedded. These can be circulated and viewed the same as any other PDF, but can also be openned by OOo-3.0 (with the extension) in the original component (i.e. text docs in writer), edited and re-saved with NO loss of fidelity.

#

OpenOffice.org extension will add PDF editing

Posted by: Anonymous [ip: 87.235.201.94] on July 17, 2008 05:45 PM
"the extension leaves you no worse off than any other PDF editor". That's what I call high goals...

It seems "no worse" is all OOo devs aim for. They aren't making a product better than the competition, only "no worse" than the competition. No wonder nobody but geeks and public administrations choose it.

#

This story has been archived. Comments can no longer be posted.



 
Tableless layout Validate XHTML 1.0 Strict Validate CSS Powered by Xaraya