Posted to microsoft.public.excel,microsoft.public.excel.programming
Harlan Grove
I give up-- I need help Pleeeezze? Anybody up to a little challange?

Mctabish wrote...
I receive data every few weeks (about 65 files, each having up to 7000 or
so) that I need to evaluate. I want to work in Excel, but the data comes in
a PDF format. I cannot receive data in anything but PDF (too simple, I
know)

....

Your best choices are either convincing whoever sends you these files
to send .XLS or even plain text files rather than .PDF files, or using
better tools than Excel to deal with these files.

The fields are fixed length, except the description field (second to last
field). Each record wraps around 2 or 3 lines.
I need to get each field into its own cell. I probably could get it IF I
can figure out how to get each record into its own row....

I have tried to save to TEXT from Adobe, then opened it up in Word and
tried to separate at the records, but was not successful.

....

There's a utility program named pdf2text that used to be available on
Simtel. I haven't been able to find it there today, but it may be
available from other sites (I haven't looked). If you can't find it,
then you could consider GhostScript, which can convert PDF files to
other formats. If these files have inconsistent fields in different
records but a common record terminator or separator, you could use
scripting languages like Perl or Python to parse the actual fields and use
default values for any missing fields. You could also do this in Excel
once you had the text from the .PDF files, but it's more difficult in
Excel.
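
For what it's worth, here is a rough Python sketch of that kind of parsing, once the PDF text has been extracted to a plain text file. Everything about the layout is an assumption on my part, since I haven't seen the actual files: I'm supposing each record starts with a 6-digit ID, that two fixed-width fields come before the description, and that one 8-character field follows it. The names LEADING_WIDTHS, TRAILING_WIDTH and RECORD_START are made up for illustration and would need to be changed to match the real data.

```python
import csv
import re

# Hypothetical layout, adjust to the real files:
# widths (including the trailing space) of the fixed fields before the description
LEADING_WIDTHS = [7, 11]
# width of the single fixed field after the description
TRAILING_WIDTH = 8
# assumed marker for the first line of a record: a 6-digit ID and a space
RECORD_START = re.compile(r"^\d{6} ")


def join_wrapped_lines(text):
    """Merge the 2 or 3 physical lines of each record into one logical line."""
    records = []
    for line in text.splitlines():
        line = line.rstrip()
        if not line:
            continue  # skip blank lines
        if RECORD_START.match(line) or not records:
            records.append(line)  # a new record begins here
        else:
            records[-1] += " " + line.strip()  # continuation of the previous record
    return records


def split_fields(record, default=""):
    """Cut fixed fields from both ends; the description is whatever is left."""
    fields = []
    pos = 0
    for width in LEADING_WIDTHS:
        fields.append(record[pos:pos + width].strip() or default)
        pos += width
    rest = record[pos:]
    description = rest[:-TRAILING_WIDTH].strip() or default
    trailing = rest[-TRAILING_WIDTH:].strip() or default
    return fields + [description, trailing]


def records_to_csv(text, out):
    """Write one CSV row per record to an open file-like object."""
    writer = csv.writer(out)
    for record in join_wrapped_lines(text):
        writer.writerow(split_fields(record))
```

Calling records_to_csv with the extracted text and a file opened for writing with newline="" should give a .CSV file that Excel opens with one record per row and one field per cell. Slicing the fixed fields off both ends is what lets the variable-length description take up whatever remains in the middle.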