ExcelBanter


John Taylor

Convert PDF to Excel?
 
G'day,

I can't offer any easy or foolproof method, but will give one suggestion
that *may* help (especially if you're desperate).

You mentioned that you had tried saving from Acrobat to html, so I'm
guessing that you think you can get the data into Excel, maybe with fewer
problems, if you can first get it into an html file.

The following steps produce a "reasonable" html file, but I don't know if it
will suit your requirements.

You need to download and install Ghostscript (free) from
http://www.cs.wisc.edu/~ghost/

Next download pdf2htmlgui.exe (free) from
http://guiguy.wminds.com/downloads/pdf2htmlgui

Next download pdftohtml.exe (free) - there's a link on the above site.

After installing Ghostscript, run pdf2htmlgui and select the PDF to convert,
the html to create, the page numbers to be converted, "generate complex
document", "generate no frames", and "ignore images".

If it's not quite what you're after you could try different combinations of
the options offered.
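
For anyone who would rather skip the GUI, the same conversion can probably be scripted by calling pdftohtml directly. The sketch below is only an assumption about how the GUI choices map to command-line switches: -c for "generate complex document", -noframes for "generate no frames", -i for "ignore images", and -f / -l for the first and last page. The helper names and the file names are made up for illustration, and the switches may differ between pdftohtml versions, so check the tool's own help output first.

```python
import subprocess

def build_pdftohtml_command(pdf_path, html_path, first_page=None, last_page=None):
    """Build a pdftohtml command line (hypothetical helper, not part of any tool).

    Assumed mapping from the GUI options mentioned above:
      -c         "generate complex document"
      -noframes  "generate no frames"
      -i         "ignore images"
      -f / -l    first and last page to convert
    """
    cmd = ["pdftohtml", "-c", "-noframes", "-i"]
    if first_page is not None:
        cmd += ["-f", str(first_page)]
    if last_page is not None:
        cmd += ["-l", str(last_page)]
    cmd += [pdf_path, html_path]
    return cmd

def convert(pdf_path, html_path, first_page=None, last_page=None):
    """Run the conversion; assumes pdftohtml is installed and on the PATH."""
    cmd = build_pdftohtml_command(pdf_path, html_path, first_page, last_page)
    subprocess.run(cmd, check=True)  # raises CalledProcessError if the tool fails
```

Calling convert("report.pdf", "report.html", 1, 30) would then attempt pages 1 to 30 of a file named report.pdf (a placeholder name).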

HTH

Cheers,

John

-----------------------------------------

From: "MC"
Subject: Convert PDF to Excel?
Date: Thursday, 18 January 2007 6:59 AM

Well, when it opens in Word, it often converts the tables I want to pictures.
I can select it as text when it is a PDF, or select the table in segments and
copy as table, but I want to be able to easily import it into Excel. If I
save as html from Acrobat, it saves as distorted pictures.

If I save it as plain text, when it opens in Word, it is disorganized.
Instead of being like this:

1. Item 111111
2. Item 111111

It comes up like this:

1.
2.

11111
11111

Item
Item

But this is about 30 pages of data, and it is too tedious to go through and
check that the line items match up.
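
If the plain-text export really does come out as separate blocks like the ones above (all the numbers, then all the values, then all the item names), the rows could in principle be stitched back together by script. This is a minimal sketch under strong assumptions: the blocks are separated by blank lines, each block holds one column, and every block lists its entries in the same order. The function names are hypothetical, and real output over 30 pages may well break these assumptions (page breaks, wrapped item names), which is why the code refuses to guess when the block lengths differ.

```python
import csv
import io

def split_blocks(text):
    """Split text into blocks of non-empty lines separated by blank lines."""
    blocks, current = [], []
    for line in text.splitlines():
        stripped = line.strip()
        if stripped:
            current.append(stripped)
        elif current:
            blocks.append(current)
            current = []
    if current:
        blocks.append(current)
    return blocks

def realign(text):
    """Treat each block as one column and zip the columns back into rows."""
    blocks = split_blocks(text)
    if len({len(block) for block in blocks}) > 1:
        # Columns of different length cannot be matched up safely.
        raise ValueError("blocks have different lengths; cannot align rows")
    return [list(row) for row in zip(*blocks)]

def to_csv(rows):
    """Render rows as CSV text that Excel can open."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()

sample = "1.\n2.\n\n11111\n11111\n\nItem\nItem\n"
print(to_csv(realign(sample)))
```

The columns come out in the order the blocks appear in the text, so they may need reordering in Excel afterwards.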

Ultimately, I used to be able to import it into Excel when it was a forms
document, after saving it as text with line bands. Each line item had a
unique code that I could use in VLOOKUPs to get the values, so I could do
what I wanted with the data.

Now that it is a PDF file I am not having the same ability with the data.
Hopefully my question makes sense.

If you want an example of the data, click here:

http://www.ffiec.gov/nicpubweb/nicwe...E ND=99991231

and go to the financial data at the bottom of the page, choosing the second
one (Y-9C) and getting any date's report. You'll see how the document opens
as a pdf with line items and then the values on the right side, with each
item having a unique code.

How can I get it to open in Excel with these three columns?

Thanks so much for your help.



