Reply
 
LinkBack Thread Tools Search this Thread Display Modes
  #1   Report Post  
Posted to microsoft.public.excel.programming
external usenet poster
 
Posts: 3
Default scraping PDFs

Hello,

I have a stack of PDFs (created electronically thankfully) that I need to parse a bit of text from. Been looking through the forum and PlanetPDF a bit for solutions, most posts are for working with Distiller the other way 'round, or outdated.

My current solution, which 'works' in a grim fashion, is to ducttape the handy pdftohtml (http://pdftohtml.sourceforge.net/) to a vba call, then parse one of the resulting html frames.

It ain't pretty, so I wondered how others might've approached this?

Thanks for your insights.
Reply
Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules

Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Web Scraping With Loops qcan Excel Programming 2 February 25th 08 10:26 PM
Web scraping mickbarry Excel Worksheet Functions 2 February 1st 06 10:20 AM
Web Screen Scraping scottymelloty[_19_] Excel Programming 0 November 29th 05 01:53 PM
scraping text from the active window geoffrey pritchard tillingsley Excel Programming 1 July 21st 03 12:45 AM


All times are GMT +1. The time now is 07:39 PM.

Powered by vBulletin® Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright ©2004-2024 ExcelBanter.
The comments are property of their posters.
 

About Us

"It's about Microsoft Excel"