ExcelBanter

ExcelBanter (https://www.excelbanter.com/)
-   Excel Programming (https://www.excelbanter.com/excel-programming/)
-   -   scraping PDFs (https://www.excelbanter.com/excel-programming/444862-scraping-pdfs.html)

Jeff[_66_]

scraping PDFs
 
Hello,

I have a stack of PDFs (created electronically thankfully) that I need to parse a bit of text from. Been looking through the forum and PlanetPDF a bit for solutions, most posts are for working with Distiller the other way 'round, or outdated.

My current solution, which 'works' in a grim fashion, is to ducttape the handy pdftohtml (http://pdftohtml.sourceforge.net/) to a vba call, then parse one of the resulting html frames.

It ain't pretty, so I wondered how others might've approached this?

Thanks for your insights.


All times are GMT +1. The time now is 12:14 AM.

Powered by vBulletin® Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.
ExcelBanter.com