How to convert PDF files to images

Question

I need to convert PDF files to images  If the PDF file is multi-page I just need one image that contains all of the PDF pages   Is there an open source solution which is not charged like the Acrobat product

User · Answer

Disclaimer I worked on this component at Software Siglo XXI   You could use Super Pdf2Image Converter to generate a TIFF multi-page file with all the rendered pages from the PDF in high resolution  It s available for both 32 and 64 bit and is very cheap and effective  I d recommend you to try it   Just one line of code     GetImage outputFileName  firstPage  lastPage  resolution  imageFormat   Converts specifies pages to image and save them to outputFileName  tiff allows multi-page or creates several files    You can take a look here  http   softwaresigloxxi com SuperPdf2ImageConverter html

User · Answer

Apache PDFBox also works great for me   Usage with the command line tool   javar -jar pdfbox-app-2 0 19 jar PDFToImage -quality 1 0  -dpi 150 -prefix out dir page -format png

User · Answer

You can use Ghostscript to convert PDF to images   To use Ghostscript from  NET you can take a look at Ghostscript NET library  managed wrapper around the Ghostscript library    To produce image from the PDF by using Ghostscript NET  take a look at RasterizerSample   To combine multiple images into the single image  check out this sample  http   www niteshluharuka com 2012 08 combine-several-images-to-form-a-single-image-using-c

User · Answer

Regarding PDFiumSharp  After elaboration I was able to create PNG files from a PDF solution   This is my code   using PDFiumSharp  using System Collections Generic  using System Drawing  using System IO   public class Program       static public void Main String   args                var renderfoo   new Renderfoo           renderfoo RenderPDFAsImages   C  Temp example pdf     C  temp               public class Renderfoo        public void RenderPDFAsImages string Inputfile  string OutputFolder                string fileName   Path GetFileNameWithoutExtension Inputfile            using  PDFiumSharp PdfDocument doc   new PDFiumSharp PdfDocument Inputfile                         for  int i   0  i  lt  doc Pages Count  i                                  var page   doc Pages i                   using  var bitmap   new System Drawing Bitmap  int page Width   int page Height                                         var grahpics   Graphics FromImage bitmap                       grahpics Clear Color White                       page Render bitmap                       var targetFile   Path Combine OutputFolder  fileName         i     png                        bitmap Save targetFile                                                        For starters  you need to take the following steps to get the PDFium wrapper up and running    Run the Custom Code tool for both tt files via right click in Visual Studio Compile the GDIPlus Project Copy the compiled assemblies  from the GDIPlus project  to your project Reference both PDFiumSharp and PDFiumsharp GdiPlus assemblies in your project Make sure that pdfium x64 dll and or pdfium x86 dll are both found in your project output directory

User · Answer

Using Android default libraries like AppCompat  you can convert all the PDF pages into images  This way is very fast and optimized  The below code is for getting separate images of a PDF page  It is very fast and quick   ParcelFileDescriptor fileDescriptor   ParcelFileDescriptor open new File  pdfFilePath pdf    MODE READ ONLY       PdfRenderer renderer   new PdfRenderer fileDescriptor       final int pageCount   renderer getPageCount        for  int i   0  i  lt  pageCount  i              PdfRenderer Page page   renderer openPage i           Bitmap bitmap   Bitmap createBitmap page getWidth    page getHeight   Bitmap Config ARGB 8888           Canvas canvas   new Canvas bitmap           canvas drawColor Color WHITE           canvas drawBitmap bitmap  0  0  null           page render bitmap  null  null  PdfRenderer Page RENDER MODE FOR DISPLAY           page close             if  bitmap    null              return null           if  bitmapIsBlankOrWhite bitmap               return null           String root   Environment getExternalStorageDirectory   toString            File file   new File root   filename     png             if  file exists    file delete            try               FileOutputStream out   new FileOutputStream file               bitmap compress Bitmap CompressFormat PNG  100  out               Log v  Saved Image -    file getAbsolutePath                 out flush                out close              catch  Exception e                e printStackTrace                                                                               private static boolean bitmapIsBlankOrWhite Bitmap bitmap        if  bitmap    null          return true       int w   bitmap getWidth        int h   bitmap getHeight        for  int i    0  i  lt  w  i              for  int j   0  j  lt  h  j                  int pixel    bitmap getPixel i  j               if  pixel    Color WHITE                    return false                                    return true

User · Answer

There is a free nuget package  Pdf2Image   which allows the extraction of pdf pages to jpg files or to a collection of images  List   in just one line         string file    quot c   tmp  test pdf quot            List lt System Drawing Image gt  images   PdfSplitter GetImages file  PdfSplitter Scale High            PdfSplitter WriteImages file   quot c   tmp quot   PdfSplitter Scale High  PdfSplitter CompressionLevel Medium    All source is also available on github Pdf2Image

User · Answer

The thread  converting PDF file to a JPEG image  is suitable for your request   One solution is to use a third-party library   ImageMagick is a very popular and is freely available too  You can get a  NET wrapper for it here  The original ImageMagick download page is here    Convert PDF pages to image files using the Solid Framework Convert PDF pages to image files using the Solid Framework  dead link  the deleted document is available on Internet Archive   Convert PDF to JPG Universal Document Converter 6 Ways to Convert a PDF to a JPG Image   And you also can take a look at the thread  How to open a page from a pdf file in pictureBox in C     If you use this process to convert a PDF to tiff  you can use this class to retrieve the bitmap from TIFF   public class TiffImage       private string myPath      private Guid myGuid      private FrameDimension myDimension      public ArrayList myImages   new ArrayList        private int myPageCount      private Bitmap myBMP       public TiffImage string path                MemoryStream ms          Image myImage           myPath   path          FileStream fs   new FileStream myPath  FileMode Open           myImage   Image FromStream fs           myGuid   myImage FrameDimensionsList 0           myDimension   new FrameDimension myGuid           myPageCount   myImage GetFrameCount myDimension           for  int i   0  i  lt  myPageCount  i                          ms   new MemoryStream                myImage SelectActiveFrame myDimension  i               myImage Save ms  ImageFormat Bmp               myBMP   new Bitmap ms               myImages Add myBMP               ms Close                      fs Close              Use it like so   private void button1 Click object sender  EventArgs e        TiffImage myTiff   new TiffImage  D   Some tif          imageBox is a PictureBox control  and the    operators pass back       the Bitmap stored at that position in the myImages ArrayList in the TiffImage     this pictureBox1 Image    Bitmap myTiff myImages 0       this pictureBox2 Image    Bitmap myTiff myImages 1       this pictureBox3 Image    Bitmap myTiff myImages 2

User · Answer

https   www codeproject com articles 317700 convert-a-pdf-into-a-series-of-images-using-csharp I found this GhostScript wrapper to be working like a charm for converting the PDFs to PNGs  page by page  Usage  string pdf filename     quot C  TEMP test pdf quot               var pdf2Image   new Cyotek GhostScript PdfConversion Pdf2Image pdf filename   for  var page   1  page  lt  pdf2Image PageCount  page          string png filename     quot C  TEMP test quot    page    quot  png quot       pdf2Image ConvertPdfPageToImage png filename  page      Being built on GhostScript  obviously for commercial application the licensing question remains

User · Answer

As for 2018 there is still not a simple answer to the question of how to convert a PDF document to an image in C   many libraries use Ghostscript licensed under AGPL and in most cases an expensive commercial license is required for production use   A good alternative might be using the popular  pdftoppm  utility which has a GPL license  it can be used from C  as command line tool executed with System Diagnostics Process  Popular tools are well known in the Linux world  but a windows build is also available   If you don t want to integrate pdftoppm by yourself  you can use my PdfRenderer popular wrapper  supports both classic  NET Framework and  NET Core  - it is not free  but pricing is very affordable

User · Answer

The PDF engine used in Google Chrome  called PDFium  is open source under the  BSD 3-clause  license  I believe this allows redistribution when used in a commercial product   There is a  NET wrapper for it called PdfiumViewer  NuGet  which works well to the extent I have tried it  It is under the Apache license which also allows redistribution    Note that this is NOT the same  wrapper  as https   pdfium patagames com  which requires a commercial license     There is one other PDFium  NET wrapper  PDFiumSharp  but I have not evaluated it    In my opinion  so far  this may be the best choice of open-source  free as in beer  PDF libraries to do the job which do not put restrictions on the closed-source   commercial nature of the software utilizing them  I don t think anything else in the answers here satisfy that criteria  to the best of my knowledge

User · Answer

I kind of bumped into this project at SourceForge  It seems to me it s still active    PDF convert to JPEG at SourceForge Developer s site   My two cents

User · Answer

The NuGet package Pdf2Png is available for free and is only protected by the MIT License  which is very open   I ve tested around a bit and this is the code to get it to convert a PDF file to an image  tt does save the image in the debug folder    using cs pdf to image  using PdfToImage   private void BtnConvert Click object sender  EventArgs e        if openFileDialog1 ShowDialog      DialogResult OK                try                       string PdfFile   openFileDialog1 FileName              string PngFile    Convert png               List lt string gt  Conversion   cs pdf to image Pdf2Image Convert PdfFile  PngFile               Bitmap Output   new Bitmap PngFile               PbConversion Image   Output                    catch Exception E                        MessageBox Show E Message

User · Answer

I used PDFiumSharp and ImageSharp in a  NET Standard 2 1 class library        lt summary gt      Saves a thumbnail  jpg  to the same folder as the PDF file  using dimensions 300x423      which corresponds to the aspect ratio of  A  paper sizes like A4  ratio h w sqrt 2        lt  summary gt       lt param name  pdfPath  gt Source path of the pdf file  lt  param gt       lt param name  thumbnailPath  gt Target path of the thumbnail file  lt  param gt       lt param name  width  gt  lt  param gt       lt param name  height  gt  lt  param gt  public static void SaveThumbnail string pdfPath  string thumbnailPath       int width   300  int height   423        using var pdfDocument   new PdfDocument pdfPath       var firstPage   pdfDocument Pages 0        using var pageBitmap   new PDFiumBitmap width  height  true        firstPage Render pageBitmap        var imageJpgPath   string IsNullOrWhiteSpace thumbnailPath            Path ChangeExtension pdfPath   jpg             thumbnailPath      var image   Image Load pageBitmap AsBmpStream             Set the background to white  otherwise it s black  https   github com SixLabors ImageSharp issues 355 issuecomment-333133991     image Mutate x   gt  x BackgroundColor Rgba32 White         image Save imageJpgPath  new JpegEncoder

[c#] How to convert PDF files to images

Examples related to c#

Examples related to image

Examples related to pdf