Handling PDF: Difference between revisions
No edit summary |
No edit summary |
||
Line 1: | Line 1: | ||
[[Category:Developing_with_Qt]]<br />[toc align_right="yes&quot; depth="2&quot;] | |||
= Handling PDF = | |||
This page discusses various available options for working with "Portable Document Format &#40;PDF&amp;#41;":http://en.wikipedia.org/wiki/Portable_Document_Format documents in your Qt application. Please also read the general considerations outlined on the [[Handling_Document_Formats | Handling Document Formats]] page. | |||
p{width:60%;border:solid 1px #99a;background:#eef;color:#335;padding:2pt 4pt;font-size:0.9em;line-height:150%;font-style:italic}. Note that this information is collaboratively collected by the community, with no promise of completeness or correctness. In particular, use your own research and judgment when evaluating third-party libraries or tools! | |||
== | == Reading / Writing == | ||
=== Using QPrinter === | |||
For {color:#709}creating PDF documents from scratch, you can use Qt's built-in print support which also allows "printing&quot; to PDF files. To do so you can set up a [[Doc:QPrinter]] instance like this: <code>QPrinter printer(QPrinter::HighResolution);<br />printer.setOutputFormat(QPrinter::PdfFormat);<br />printer.setOutputFileName("path/to/file.pdf&quot;);</code> Since QPrinter inherits [[Doc:QPaintDevice]], anything that supports outputting graphical content to a QPaintDevice (or has convenience API for printing with QPrinter) can thus be used for generating PDFs: | |||
* ''''' | * '''''manual QPainter painting'''''<br />The most basic (but not necessarily simplest) way of creating PDF documents with QPrinter is by manually painting the document's content with Qt's "Arthur paint system&quot;:/doc/qt-4.8/qt4-arthur.html.<br />Just pass the QPrinter object as a reference to the constructor of [[Doc:QPainter]] (or, alternatively, to "QPainter::begin&quot;:/doc/qt-4.8/qpainter.html#begin for an already existing QPainter) and then perform any painting operations with that QPainter instance like you usually would (with intermittent calls to "QPrinter::newPage&quot;:/doc/qt-4.8/qprinter.html#newPage whenever you want to move on to the next PDF page). | ||
* '''''Scribe'''''<br />For a more high-level API for creating structured rich-text documents, use Qt's Scribe framework (see [[Handling_Document_Formats | Handling Document Formats]]). You can export the whole document or a part of it to PDF with "QTextDocument::print&quot;:/doc/qt-4.8/qtextdocument.html#print or "QTextEdit::print&quot;:/doc/qt-4.8/qtextedit.html#print (again, using a QPrinter object set up as shown above). | |||
* '''''Graphics View'''''<br />Qt's "Graphics View framework&quot;:/doc/qt-4.8/graphicsview.html can be a more suitable alternative for creating PDF documents with content that is mainly based on arbitrarily positioned and transformed 2D graphical items rather than continuous flowed rich text.<br />To export the content of a graphics scene or view (or a part of it) to PDF, you need to manually initialize a QPainter configured to paint on a PDF-creating QPrinter (as described above), and pass it to "QGraphicsScene::render&quot;:/doc/qt-4.8/qgraphicsscene.html#render or "QGraphicsView::render&quot;:/doc/qt-4.8/qgraphicsview.html#render. | |||
=== Using third-party libraries === | |||
If you need more control over the output when {color:#709}creating PDF documents, or you need to {color:#709}parse existing PDF documents (anything from extracting specific information to assembling a full in-memory document object tree) and maybe even {color:#709}modify their structure or content before writing them back to disk, refer to third-party PDF reading/writing libraries: | |||
table{width:95%;margin-left:2.5%}.<br />| |''. API |''. {color:#709}parsing |''. {color:#709}modifying |''. {color:#709}creating |''. platforms |''. license |<br />| "'''poppler-qt4'''":http://freedesktop.org/wiki/Software/poppler | C+''/Qt | {color:#580}yes | {color:#920}? | {color:#920}? | Win, Mac?, Linux, … | GPL v2'' {color:#458}[strong copyleft] |<br />| "'''Hummus'''":http://pdfhummus.com/ | C++ | {color:#580}yes | {color:#580}yes | {color:#580}yes | Win, Mac, Linux | Apache 2.0 {color:#458}[permissive] |<br />| "'''PoDoFo'''":http://podofo.sourceforge.net | C++ | {color:#580}yes | {color:#580}yes | {color:#580}yes | Win, Mac, Linux | LGPL {color:#458}[weak copyleft] | | |||
=== Using batch conversion tools === | |||
If all else fails, there is always the option of using an existing tool to automatically convert between PDF files and a more manageable format, and let your Qt application read/write that format instead. The conversion tool could be bundled with your application or specified as a prerequisite, and controlled via [[Doc:QProcess]]. Some possibilities are: | |||
table{width:95%;margin-left:2.5%}.<br />|''. |''. executable names |''. {font-family:monospace}.pdf to: |''. … to {font-family:monospace}.pdf |''. platforms |''. license |<br />| "'''poppler-utils'''":http://freedesktop.org/wiki/Software/poppler | pdftotext, pdftocairo, pdftohtml | {font-family:monospace}.txt .svg .html … | {font-family:monospace}<s>% | Win, Mac?, Linux, … | GPL v2+ {color:#458}[strong copyleft]|<br />| "'''Inkscape'''":http://inkscape.org | inkscape | {font-family:monospace}.svg … | {font-family:monospace}.svg … | Win, Mac, Linux, … | GPL v2 {color:#458}[strong copyleft]| | |||
<br />h2. Rendering | |||
<br />h3. Using third-party libraries/tools | |||
<br />For rendering pages or elements from existing PDF documents to image files or in-memory pixmaps (useful e.g. for thumbnail generation or implementing custom viewers), third-party libraries can be used: | |||
<br />table{width:95%;margin-left:2.5%}.<br />| |''. API |''. can render |''. output to |''. platforms |''. license |<br />| "'''poppler-qt4'''":http://freedesktop.org/wiki/Software/poppler | C+''/Qt | pages, …? | QImage | Win, Mac?, Linux, … | GPL v2'' {color:#458}[strong copyleft] |<br />| "'''muPDF'''":http://mupdf.com | C | pages | RGBA byte array | Win, Mac, Linux, … | GPL v3+ {color:#458}[strong copyleft]; or commercial | | |||
<br />Alternatively, the task can be delegated to existing command-line tools: | |||
<br />table{width:95%;margin-left:2.5%}.<br />| |''. executable names |''. can render |''. output to |''. platforms |''. license |<br />| "'''poppler-utils'''":http://freedesktop.org/wiki/Software/poppler | pdftocairo, pdftoppm, pdfimages | pages, image elements | {font-family:monospace}.png .jpg .svg .ppm … | Win, Mac?, Linux, … | GPL v2+ {color:#458}[strong copyleft]|<br />| "'''muPDF'''":http://mupdf.com | pdfdraw | pages | {font-family:monospace}.png, .ppm, .pgm, .pam, .pbm | Win, Mac, Linux, … | GPL v3+ {color:#458}[strong copyleft]; or commercial | | |||
<br />h2. Interactive Viewing | |||
<br />h3. Calling an external viewer application | |||
<br />If your application merely needs to let the user view/read certain PDF documents on demand, displaying them within the UI of the application itself might not be necessary, and delegating the task to an existing viewer application can be a viable option. | |||
<br />Many users have already chosen and installed a stand-alone PDF viewer according to their personal preferences, so simply letting the operating system open the PDF file with whatever it considers the default viewer for such files, might be the easiest (and potentially most user-friendly) choice.<br />To do so, simply pass the PDF file's URL to "QDesktopServices::openUrl&quot;:/doc/qt-4.8/qdesktopservices.html#openUrl. If you're downloading the file from the Internet, store it on disk using [[Doc:QTemporaryFile]] first, since not all viewers can handle remote URLs. | |||
<br />h3. Using a third-party Qt widget | |||
<br />The following widgets provide native PDF viewing for Qt applications: | |||
<br />table{width:95%;margin-left:2.5%}.<br />| |''. class name |''. platforms |''. license |<br />| "'''XpdfWidget/Qt'''":http://www.glyphandcog.com/XpdfWidgetQt.html | XpdfWidget | Win, Mac, Linux, … | commercial | | |||
<br />h3. Embedding a third-party ActiveX control | |||
<br />If you are exclusively targeting the Windows platform, you can embed an existing ActiveX component for viewing PDFs in your Qt applications by instantiating it as a [[Doc:QAxWidget]] (see "Qt's ActiveX Framework&quot;:/doc/qt-4.8/activeqt.html). | |||
<br />The following PDF viewers provide such an ActiveX control: | |||
<br />table{width:95%;margin-left:2.5%}.<br />| |''. DLL file |''. ActiveX control name |''. platforms |''. license |<br />| "'''Adobe Reader'''":http://get.adobe.com/reader/ | Acropdf.dll | {font-family:monospace}AxAcroPDFLib.AxAcroPDF | Win, Mac, Linux, … | {color:#458}freeware''(for commercial redistribution see "here&quot;:http://www.adobe.com/products/reader/distribution.html)_ | | |||
<br />In the case of the Adobe Reader control, opening a PDF file is done with:<br /><code>dynamicCall("LoadFile&amp;#40;const QString&amp;#41;", pathToFile)<code> | |||
<br />h3. Embedding a third-party browser plugin | |||
<br />A more cross-platform technology for embedding reusable components is the "NPAPI&quot;:http://en.wikipedia.org/wiki/NPAPI browser plugin architecture</s> which Qt's WebKit-based browser framework "happens to support&quot;:/doc/qt-4.8/qtwebkit.html#netscape-plugin-support. You'll need to set up a simple HTML page containing appropriate{font-family:monospace}&lt;embed&amp;gt;…&lt;/embed&amp;gt;% tags, and let a [[Doc:QWebView]] display it (with "QWebSettings::PluginsEnabled&quot;:/doc/qt-4.8/qwebsettings.html#WebAttribute-enum set to true). | |||
The following applications provide a reusable NPAPI plugin for viewing PDF: | |||
{ | table{width:95%;margin-left:2.5%}.<br />| |''. plugin name |''. platforms |''. license |<br />| "'''Adobe Reader'''":http://get.adobe.com/reader/ | nppdf | Win, Mac, (Linux)[1], … | {color:#458}freeware''(for commercial redistribution see "here&quot;:http://www.adobe.com/products/reader/distribution.html)_ | | ||
| | |||
| | |||
| | |||
| Win, Mac | |||
| | |||
| | |||
fn1{font-size:0.9em;line-height:150%;font-style:italic;margin-left:0.5em;margin-right:0.5em;color:#555}. While in theory it should work on all Desktop platforms, application developers have "reported problems&quot;:/forums/viewthread/14055 in trying to get it to work with Qt Webkit on Linux. | |||
As an alternative to using QWebView for running the plugin, it is possible to use a third-party solution that allows embedding NPAPI plugins in a Qt application without the overhead of a full web browser instance: | |||
table{width:95%;margin-left:2.5%}.<br />| |''. component type |''. has special convenience API for |''. platforms |''. license |<br />| "'''QtitanMultimedia'''":http://www.devmachines.com/products/qtitanmultimedia-overview.html | QWidget | Adobe Reader, … | Win, Linux | {color:#458}commercial | | |||
=== | === Implementing a custom viewer === | ||
p{border:dashed 1px #a94;background:#fbf3dd;color:#530;padding:2pt 4pt;margin-left:2pt;margin-right:2pt;font-size:0.9em;line-height:150%;font-style:italic}. TODO: Tips for implementing a custom interactive viewer, using Qt and the PDF parsing and rendering libraries mentioned above | |||
p{color:#fff;border-bottom:solid 1px #ccc}. . | |||
== | == See Also == | ||
* [[Handling_Document_Formats | Handling Document Formats]] | |||
** ''other "text document&quot; formats:'' | |||
*** [[Handling_HTML | HTML]] | |||
*** [[Handling_RTF | RTF]] | |||
*** [[Handling_Microsoft_Word_(file_format) | Microsoft Word]] | |||
*** [[Handling_OpenDocument_Text | OpenDocument Text]] | |||
[ | |||
| | |||
** ''other | |||
*** [[ | |||
*** [[ | |||
*** [[ | |||
*** [[ | |||
Revision as of 14:13, 23 February 2015
[toc align_right="yes" depth="2"]
Handling PDF
This page discusses various available options for working with "Portable Document Format (PDF&#41;":http://en.wikipedia.org/wiki/Portable_Document_Format documents in your Qt application. Please also read the general considerations outlined on the Handling Document Formats page.
p{width:60%;border:solid 1px #99a;background:#eef;color:#335;padding:2pt 4pt;font-size:0.9em;line-height:150%;font-style:italic}. Note that this information is collaboratively collected by the community, with no promise of completeness or correctness. In particular, use your own research and judgment when evaluating third-party libraries or tools!
Reading / Writing
Using QPrinter
For {color:#709}creating PDF documents from scratch, you can use Qt's built-in print support which also allows "printing" to PDF files. To do so you can set up a Doc:QPrinter instance like this:
QPrinter printer(QPrinter::HighResolution);<br />printer.setOutputFormat(QPrinter::PdfFormat);<br />printer.setOutputFileName("path/to/file.pdf&quot;);
Since QPrinter inherits Doc:QPaintDevice, anything that supports outputting graphical content to a QPaintDevice (or has convenience API for printing with QPrinter) can thus be used for generating PDFs:
- manual QPainter painting
The most basic (but not necessarily simplest) way of creating PDF documents with QPrinter is by manually painting the document's content with Qt's "Arthur paint system":/doc/qt-4.8/qt4-arthur.html.
Just pass the QPrinter object as a reference to the constructor of Doc:QPainter (or, alternatively, to "QPainter::begin":/doc/qt-4.8/qpainter.html#begin for an already existing QPainter) and then perform any painting operations with that QPainter instance like you usually would (with intermittent calls to "QPrinter::newPage":/doc/qt-4.8/qprinter.html#newPage whenever you want to move on to the next PDF page).
- Scribe
For a more high-level API for creating structured rich-text documents, use Qt's Scribe framework (see Handling Document Formats). You can export the whole document or a part of it to PDF with "QTextDocument::print":/doc/qt-4.8/qtextdocument.html#print or "QTextEdit::print":/doc/qt-4.8/qtextedit.html#print (again, using a QPrinter object set up as shown above).
- Graphics View
Qt's "Graphics View framework":/doc/qt-4.8/graphicsview.html can be a more suitable alternative for creating PDF documents with content that is mainly based on arbitrarily positioned and transformed 2D graphical items rather than continuous flowed rich text.
To export the content of a graphics scene or view (or a part of it) to PDF, you need to manually initialize a QPainter configured to paint on a PDF-creating QPrinter (as described above), and pass it to "QGraphicsScene::render":/doc/qt-4.8/qgraphicsscene.html#render or "QGraphicsView::render":/doc/qt-4.8/qgraphicsview.html#render.
Using third-party libraries
If you need more control over the output when {color:#709}creating PDF documents, or you need to {color:#709}parse existing PDF documents (anything from extracting specific information to assembling a full in-memory document object tree) and maybe even {color:#709}modify their structure or content before writing them back to disk, refer to third-party PDF reading/writing libraries:
table{width:95%;margin-left:2.5%}.
| |. API |. {color:#709}parsing |. {color:#709}modifying |. {color:#709}creating |. platforms |. license |
| "poppler-qt4":http://freedesktop.org/wiki/Software/poppler | C+/Qt | {color:#580}yes | {color:#920}? | {color:#920}? | Win, Mac?, Linux, … | GPL v2 {color:#458}[strong copyleft] |
| "Hummus":http://pdfhummus.com/ | C++ | {color:#580}yes | {color:#580}yes | {color:#580}yes | Win, Mac, Linux | Apache 2.0 {color:#458}[permissive] |
| "PoDoFo":http://podofo.sourceforge.net | C++ | {color:#580}yes | {color:#580}yes | {color:#580}yes | Win, Mac, Linux | LGPL {color:#458}[weak copyleft] |
Using batch conversion tools
If all else fails, there is always the option of using an existing tool to automatically convert between PDF files and a more manageable format, and let your Qt application read/write that format instead. The conversion tool could be bundled with your application or specified as a prerequisite, and controlled via Doc:QProcess. Some possibilities are:
table{width:95%;margin-left:2.5%}.
|. |. executable names |. {font-family:monospace}.pdf to: |. … to {font-family:monospace}.pdf |. platforms |. license |
| "poppler-utils":http://freedesktop.org/wiki/Software/poppler | pdftotext, pdftocairo, pdftohtml | {font-family:monospace}.txt .svg .html … | {font-family:monospace}% | Win, Mac?, Linux, … | GPL v2+ {color:#458}[strong copyleft]|
| "Inkscape":http://inkscape.org | inkscape | {font-family:monospace}.svg … | {font-family:monospace}.svg … | Win, Mac, Linux, … | GPL v2 {color:#458}[strong copyleft]|
h2. Rendering
h3. Using third-party libraries/tools
For rendering pages or elements from existing PDF documents to image files or in-memory pixmaps (useful e.g. for thumbnail generation or implementing custom viewers), third-party libraries can be used:
table{width:95%;margin-left:2.5%}.
| |. API |. can render |. output to |. platforms |. license |
| "poppler-qt4":http://freedesktop.org/wiki/Software/poppler | C+/Qt | pages, …? | QImage | Win, Mac?, Linux, … | GPL v2 {color:#458}[strong copyleft] |
| "muPDF":http://mupdf.com | C | pages | RGBA byte array | Win, Mac, Linux, … | GPL v3+ {color:#458}[strong copyleft]; or commercial |
Alternatively, the task can be delegated to existing command-line tools:
table{width:95%;margin-left:2.5%}.
| |. executable names |. can render |. output to |. platforms |. license |
| "poppler-utils":http://freedesktop.org/wiki/Software/poppler | pdftocairo, pdftoppm, pdfimages | pages, image elements | {font-family:monospace}.png .jpg .svg .ppm … | Win, Mac?, Linux, … | GPL v2+ {color:#458}[strong copyleft]|
| "muPDF":http://mupdf.com | pdfdraw | pages | {font-family:monospace}.png, .ppm, .pgm, .pam, .pbm | Win, Mac, Linux, … | GPL v3+ {color:#458}[strong copyleft]; or commercial |
h2. Interactive Viewing
h3. Calling an external viewer application
If your application merely needs to let the user view/read certain PDF documents on demand, displaying them within the UI of the application itself might not be necessary, and delegating the task to an existing viewer application can be a viable option.
Many users have already chosen and installed a stand-alone PDF viewer according to their personal preferences, so simply letting the operating system open the PDF file with whatever it considers the default viewer for such files, might be the easiest (and potentially most user-friendly) choice.
To do so, simply pass the PDF file's URL to "QDesktopServices::openUrl":/doc/qt-4.8/qdesktopservices.html#openUrl. If you're downloading the file from the Internet, store it on disk using Doc:QTemporaryFile first, since not all viewers can handle remote URLs.
h3. Using a third-party Qt widget
The following widgets provide native PDF viewing for Qt applications:
table{width:95%;margin-left:2.5%}.
| |. class name |. platforms |. license |
| "XpdfWidget/Qt":http://www.glyphandcog.com/XpdfWidgetQt.html | XpdfWidget | Win, Mac, Linux, … | commercial |
h3. Embedding a third-party ActiveX control
If you are exclusively targeting the Windows platform, you can embed an existing ActiveX component for viewing PDFs in your Qt applications by instantiating it as a Doc:QAxWidget (see "Qt's ActiveX Framework":/doc/qt-4.8/activeqt.html).
The following PDF viewers provide such an ActiveX control:
table{width:95%;margin-left:2.5%}.
| |. DLL file |. ActiveX control name |. platforms |. license |
| "Adobe Reader":http://get.adobe.com/reader/ | Acropdf.dll | {font-family:monospace}AxAcroPDFLib.AxAcroPDF | Win, Mac, Linux, … | {color:#458}freeware(for commercial redistribution see "here":http://www.adobe.com/products/reader/distribution.html)_ |
In the case of the Adobe Reader control, opening a PDF file is done with:dynamicCall("LoadFile&#40;const QString&#41;", pathToFile)
h3. Embedding a third-party browser plugin
A more cross-platform technology for embedding reusable components is the "NPAPI":http://en.wikipedia.org/wiki/NPAPI browser plugin architecture which Qt's WebKit-based browser framework "happens to support":/doc/qt-4.8/qtwebkit.html#netscape-plugin-support. You'll need to set up a simple HTML page containing appropriate{font-family:monospace}<embed&gt;…</embed&gt;% tags, and let a Doc:QWebView display it (with "QWebSettings::PluginsEnabled":/doc/qt-4.8/qwebsettings.html#WebAttribute-enum set to true).
The following applications provide a reusable NPAPI plugin for viewing PDF:
table{width:95%;margin-left:2.5%}.
| |. plugin name |. platforms |. license |
| "Adobe Reader":http://get.adobe.com/reader/ | nppdf | Win, Mac, (Linux)[1], … | {color:#458}freeware(for commercial redistribution see "here":http://www.adobe.com/products/reader/distribution.html)_ |
fn1{font-size:0.9em;line-height:150%;font-style:italic;margin-left:0.5em;margin-right:0.5em;color:#555}. While in theory it should work on all Desktop platforms, application developers have "reported problems":/forums/viewthread/14055 in trying to get it to work with Qt Webkit on Linux.
As an alternative to using QWebView for running the plugin, it is possible to use a third-party solution that allows embedding NPAPI plugins in a Qt application without the overhead of a full web browser instance:
table{width:95%;margin-left:2.5%}.
| |. component type |. has special convenience API for |. platforms |. license |
| "QtitanMultimedia":http://www.devmachines.com/products/qtitanmultimedia-overview.html | QWidget | Adobe Reader, … | Win, Linux | {color:#458}commercial |
Implementing a custom viewer
p{border:dashed 1px #a94;background:#fbf3dd;color:#530;padding:2pt 4pt;margin-left:2pt;margin-right:2pt;font-size:0.9em;line-height:150%;font-style:italic}. TODO: Tips for implementing a custom interactive viewer, using Qt and the PDF parsing and rendering libraries mentioned above
p{color:#fff;border-bottom:solid 1px #ccc}. .
See Also
- Handling Document Formats
- other "text document" formats: