ExcelBanter

ExcelBanter (https://www.excelbanter.com/)
-   Excel Programming (https://www.excelbanter.com/excel-programming/)
-   -   CreateDocumentFromUrl (https://www.excelbanter.com/excel-programming/379743-createdocumentfromurl.html)

Jack Clift[_3_]

CreateDocumentFromUrl
 
I am writing an application that uses oMSHTML.createDocumentFromUrl to parse
a series of .html files located on my local disk drive (not the web).

It appears that even though the files are found locally, this instruction is
trying to download the web resources referenced in the file (say images or
the like) - the reason for the file being on the local disk in the first
place is that the internet connection is slow at best and may or may not be
live.

Is there anyway to use this (or similar instructions) that do not attempt to
download items from to the web?

All that I want from the parsing exercise is information that is in the file
(in a table). My only option appears to be to write a routine that deletes
all text in the html file that is not between a "<table " and "</table tags.
Crude to say the least.

thanks

JC

NickHK

CreateDocumentFromUrl
 
Jack,
You can open HTML in Excel. That will give you the data.
Maybe using a web query through a local web server would work also, as it
does not appear to work if you just point the browser to the local file.

NickHK

"Jack Clift" wrote in message
...
I am writing an application that uses oMSHTML.createDocumentFromUrl to

parse
a series of .html files located on my local disk drive (not the web).

It appears that even though the files are found locally, this instruction

is
trying to download the web resources referenced in the file (say images or
the like) - the reason for the file being on the local disk in the first
place is that the internet connection is slow at best and may or may not

be
live.

Is there anyway to use this (or similar instructions) that do not attempt

to
download items from to the web?

All that I want from the parsing exercise is information that is in the

file
(in a table). My only option appears to be to write a routine that

deletes
all text in the html file that is not between a "<table " and "</table

tags.
Crude to say the least.

thanks

JC




Martin Fishlock

CreateDocumentFromUrl
 
Jack,

You could try opening the file as text and getting the table information
like that, it is much the same idea that you had.



--
Hope this helps
Martin Fishlock, Bangkok, Thailand
Please do not forget to rate this reply.


"Jack Clift" wrote:

I am writing an application that uses oMSHTML.createDocumentFromUrl to parse
a series of .html files located on my local disk drive (not the web).

It appears that even though the files are found locally, this instruction is
trying to download the web resources referenced in the file (say images or
the like) - the reason for the file being on the local disk in the first
place is that the internet connection is slow at best and may or may not be
live.

Is there anyway to use this (or similar instructions) that do not attempt to
download items from to the web?

All that I want from the parsing exercise is information that is in the file
(in a table). My only option appears to be to write a routine that deletes
all text in the html file that is not between a "<table " and "</table tags.
Crude to say the least.

thanks

JC



All times are GMT +1. The time now is 03:34 PM.

Powered by vBulletin® Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.
ExcelBanter.com