How can I extract data from Html page??

  • Thread starter Thread starter mark h
  • Start date Start date
M

mark h

Guest
Hi,
I would like to know if this software enables me to extract data like :
address, Fax, Phone or else on a web page. For exemple, I would like to extact and send to excel the Address:, Phone:, Fax:, Email:, Web:
I would need to have one company on one line in excel with all the details. Can I do it with a software or do you know one soft that would do it?
 
I think the only way to do it is create a program that suit your need. If you are going to do so, I recommend that you use perl (Active perl for windows), PHP (command line) or any scripting language that supports good text processing. If the library it shipped with supports DOM library and XPath, then you can use it to parse the downloaded HTML file into DOM and use XPath to find the node that contains the data you need and extract it or else, use regular expression instead.
 
Back
Top