ExcelBanter - CreateDocumentFromUrl

ExcelBanter (https://www.excelbanter.com/)

- Excel Programming (https://www.excelbanter.com/excel-programming/)

- - CreateDocumentFromUrl (https://www.excelbanter.com/excel-programming/379743-createdocumentfromurl.html)

CreateDocumentFromUrl

I am writing an application that uses oMSHTML.createDocumentFromUrl to parse
a series of .html files located on my local disk drive (not the web).

It appears that even though the files are found locally, this instruction is
trying to download the web resources referenced in the file (say images or
the like) - the reason for the file being on the local disk in the first
place is that the internet connection is slow at best and may or may not be
live.

Is there anyway to use this (or similar instructions) that do not attempt to
download items from to the web?

All that I want from the parsing exercise is information that is in the file
(in a table). My only option appears to be to write a routine that deletes
all text in the html file that is not between a "<table " and "</table tags.
Crude to say the least.

thanks

JC

CreateDocumentFromUrl

Jack,
You can open HTML in Excel. That will give you the data.
Maybe using a web query through a local web server would work also, as it
does not appear to work if you just point the browser to the local file.

NickHK

"Jack Clift" wrote in message
...
I am writing an application that uses oMSHTML.createDocumentFromUrl to
parse
a series of .html files located on my local disk drive (not the web).

It appears that even though the files are found locally, this instruction
is
trying to download the web resources referenced in the file (say images or
the like) - the reason for the file being on the local disk in the first
place is that the internet connection is slow at best and may or may not
be
live.

Is there anyway to use this (or similar instructions) that do not attempt
to
download items from to the web?

All that I want from the parsing exercise is information that is in the
file
(in a table). My only option appears to be to write a routine that
deletes
all text in the html file that is not between a "<table " and "</table
tags.
Crude to say the least.

thanks

JC

CreateDocumentFromUrl

Jack,

You could try opening the file as text and getting the table information
like that, it is much the same idea that you had.

--
Hope this helps
Martin Fishlock, Bangkok, Thailand
Please do not forget to rate this reply.

"Jack Clift" wrote:

I am writing an application that uses oMSHTML.createDocumentFromUrl to parse
a series of .html files located on my local disk drive (not the web).

It appears that even though the files are found locally, this instruction is
trying to download the web resources referenced in the file (say images or
the like) - the reason for the file being on the local disk in the first
place is that the internet connection is slow at best and may or may not be
live.

Is there anyway to use this (or similar instructions) that do not attempt to
download items from to the web?

All that I want from the parsing exercise is information that is in the file
(in a table). My only option appears to be to write a routine that deletes
all text in the html file that is not between a "<table " and "</table tags.
Crude to say the least.

thanks

JC