View Single Post
  #1   Report Post  
Posted to microsoft.public.excel.worksheet.functions
joeu2004 joeu2004 is offline
external usenet poster
 
Posts: 2,059
Default How to import PDF into Excel?

I want to import some data from PDF files into Excel. Is there a
straight-forward way to do this?

For example, see http://muddybuddy.com/pdf/sanjose/results-09.pdf.

What I have done in the past is: open the PDF file, save as text, and write
a VBA macro to read the text file and parse the data line-by-line, putting
it into a worksheet in the form that I require.

The issue is: the data in this particular file does not follow a consistent
pattern when it is saved to text. For example compare the data for "Bib:"
numbers 349, 299, 479 and 1084.

(Aside: Can anyone explain why? The data appears consistently in the PDF
file.)

The issue is not insurmountable. I can recognize and deal with the
different patterns in my parser.

The problem is: I don't know (yet) how many different patterns are
possible. I have found 4 so far. But I would have to look carefully at all
1032 entries to determine if there are other forms.

(Actually, I would simply parse what I know and see what is missing, then
add a parser for the missing pattern. But that's tedious.)