FRIHOST FORUMS SEARCH FAQ TOS BLOGS COMPETITIONS
You are invited to Log in or Register a free Frihost Account!


Scrape email address from HTML page content





bgillingham
I have been working on a script that is close to being an automated BOT, but I have total control over which pages get scanned.

I would use this code as part of my manual building of my links directory. It can be found here: http://betterwindowssoftware.com/directory/.

The thing that I am trying to do is to analyze the HTML and come up with a score, add them to my web directory, and email the result to the domain's contact email. Most of this is already done and working, but I haven't put in anything to look for email addresses yet; the Meta Descripton, and Meta Keywords, favicon, and all of the href links in the page are all that I scrape at this point in time.

I know that there are plenty of ways to keep your email address private - especially from page-scraping bots. I also know that I would have to avoid spamming anybody. Any advice or recommendations are deeply appreciated.
Related topics
Validate email address with PHP
for your comments
"Hiding" Your Email Address from Bots
How to wrap a html page ?
how to publish .flv files on html page
Hide Email Address With Javascript - Worth It?
Need super simple method to send a form to an email address
Email address help
planning a site in CSS
submit email to an email address
Free Tsismosa.com Email Address
"email address will not be abused" notice on sign-
Contact form or email address?
Changing of my email address, after I am registered
Reply to topic    Frihost Forum Index -> Scripting -> Php and MySQL

FRIHOST HOME | FAQ | TOS | ABOUT US | CONTACT US | SITE MAP
© 2005-2011 Frihost, forums powered by phpBB.