Can you hear the scraping noises?

No, you can’t hear them, but scrapers are visiting your site right now.  If you have valuable content, like directories or classifieds, then chances are that bots are trying to “scrape” your intellectual property and your site’s advertising income.

Bots, by simply imitating the activity of human users, copy your entire database and re-use it for their own purposes.  They may even build a parallel site and compete for your ad revenues!

Site owners that succeed in blocking bots benefit in a variety of ways:

  • Secure, stable advertising revenues
  • More visitors as you restore your exclusivity
  • Reduced server & network expenditures as bot traffic is eliminated
  • Reliable response times and protection from DoS attacks
  • Monetization of your intellectual property as would-be scrapers purchase your content

But – bots have grown more sophisticated

While Web Application Firewalls (WAFs) and Port Monitoring Appliances were able to provide some elementary protection against bots, these solutions have been rendered obsolete.  Why?

Most anti-bot systems scan IP addresses bearing hostile traffic, and then blacklist these addresses.
But, today, it’s a cinch for bots to change IPs, forge user-agents, and restart user sessions. An effective solution must block such sophisticated bots, and continue to heuristically learn as new scraping technologies appear.

The Solution – SiteBlackBox

SiteBlackBox offers an innovative Software-as-a-Service that employs a cloud-based risk assessment engine that is constantly amassing new intelligence.

As bots take steps to pass or to bypass a challenge – for example, by utilizing captcha farms, creating new sessions, or changing IP addresses – SiteBlackBox identifies their behavior and immediately blocks their access to the web site.  It neutralizes traffic from hostile sources, while ensuring the highest quality-of-experience to legitimate users.

Only SiteBlackBox provides real-time protection from sophisticated bots:

  • Recognition of bots that dynamically change IP addresses and user-agents
  • Recognition of captcha farms and  OCR software
  • Selective blocking of specific sessions on IP addresses shared by multiple users
  • Recognition and blocking of zero-day distributed attacks
  • Global device reputation database
  • Blocks malicious bots using white-listed organizations’ IP addresses and apps

 

Want to quiet the scraping noises?

Contact us now for a demo and a free trial!

Telephone:                        +972-3-7363041
U.S./Canada Toll Free:  +1-888-777-7020
Email:                                   info@siteblackbox.com
Web:                                     http://www.siteblackbox.com

 

Posted in anti-scraping solutions, web-scraping | Tagged , , , , , | Leave a comment

My web site’s ad revenues are up 10%. Want to know how?

Want to read how DCH – Digital Community Holdings (http://dch.com), a conglomerate of over 100 local and community-based classified sites, increased its ad revenues by 10%?

DCH, like so many classified sites, is plagued by bots that scrape data and use the information for their own purposes, sometimes duplicating the data and establishing a competitive site!

What’s more, bots never click on your ads.  So while bots are busy attacking your site and choking off system resources, they reduce click-through rates on site-based advertising, and lower eCPM (Effective Cost per Mille) – which is widely used by web publishers to set advertising rates on their sites.

In June 2012, DCH invited SiteBlackBox to protect its flagship site, http://cartrucktrader.com in a free trial, and within two months, their ad revenues were up by 10%!  Want to read how?

Click here to read how SiteBlackBox helped DCH increase ad revenues.

Rob Hage, DCH’s CEO, said: I recommend giving SiteBlackBox a trial run.  The results should be as clear to them as they are to us!”

Want to improve your ad revenues?

Contact us now for a demo and a free trial!

 

Telephone:                        +972-3-7363041

U.S./Canada Toll Free:  +1-888-777-7020

Email:                                   info@siteblackbox.com

Web:                                     http://www.siteblackbox.com

 

Posted in anti-scraping solutions, Uncategorized, web-scraping | Tagged , , , , , , | Leave a comment

Want to make your web site’s performance and profitability rock?

Want to make your web site’s performance and profitability rock?  It’s easy, once you get rid of the bots that attack your site, scrape your data, and choke up your servers!

Do you have valuable content on your site?  Directories, classifieds, databases of any kind?  Chances are that bots are at work right now, scraping content and using it for their own purposes.  They’re choking your network and server resources, too.  Your users can feel it, as response times slow to a crawl.

We’ve seen cases in which 95% of site traffic is hostile.  You don’t need a resource upgrade! You need SiteBlackBox.

SiteBlackBox is a powerful, innovative solution that protects web sites from the malicious bot-based activity.  SiteBlackBox monitors your site’s user traffic and discerns, in real-time, users from abusers, and humans from bots.  It neutralizes traffic from hostile sources, while ensuring the highest quality-of-experience to legitimate users.

Click here to read how SiteBlackBox improved a site’s performance and ended denial-of-service attacks.

SiteBlackBox blocks the most sophisticated bots using a cloud-based risk assessment engine that is constantly amassing new intelligence.  The engine was designed with the understanding that it’s a cinch for bots to change IPs, forge user-agents, and restart user sessions.

SiteBlackBox is designed to deter sophisticated bot attacks.  As bots take steps to pass or to bypass a challenge – for example, by utilizing captcha farms, creating new sessions, or changing IP addresses – they become more easily distinguished from real users, and can be immediately sanctioned by the web site.

Want to make your site rock?

Contact us now for a demo and a free trial!

 

Telephone:                        +972-3-7363041

U.S./Canada Toll Free:  +1-888-777-7020

Email:                                   info@siteblackbox.com

Web:                                     http://www.siteblackbox.com

 

Posted in Uncategorized | Leave a comment

Anonymity and automation – a deadly combination

The battle for the security of web sites and their valuable content has evolved into a highly automated struggle.

In today’s security and content-protection arms race, nobody is naïve anymore…

Any company with a web presence knows it’s going to be targeted for attacks.  So it deploys measures to fight hackers, intruders and abuse.  Some companies design their code to block or ignore suspicious requests, while others use WAFs (Web Application Firewalls) and IPSs (Intrusion Prevention Systems) that recognize anomalies and attempt to deter hostile traffic.

Bots are perfectly suited for the malicious activities of hackers, intruders, scrapers, spammers, abusers and spies.  As automated tools, bots are faster and more efficient in guessing passwords, crawling websites, scraping content, and finding vulnerabilities.

Hacking, DDoSing, scraping and spamming are made easy with this kind of advanced automation.  For example, if you wish to make millions of attempts to break a password, all you have to do is use a few tens of thousands of different IP addresses.  This is why many web security industry reports have identified Insufficient Anti-Automation as the major vulnerability in web sites today .

So, Is there anything that can be done to defend against sophisticated bots?

Click here to read more!

Web site owners have reported a 3-5 percent revenue increase following BotBlackBox deployment, increasing profitability and achieving return-on-investment within months.

Want to join eBay Classified, JunkMail Classified, Yellow Pages,Yad 2, Cellar Tracker, and many other companies that stopped the bots and improved their bottom line by tens of thousands of dollars per month?

See how it works

Contact us now for a demo and a free trial!

Telephone:                        +972-3-7363041
U.S./Canada Toll Free:  +1-888-777-7020
Email:                                   info@siteblackbox.com
Web:                                     http://www.siteblackbox.com

Posted in anti-hacking solutions | Leave a comment

Spam bots can destroy your company’s reputation – but you can stop them!

Spam bots invade your web site, collecting your customers’ email addresses and phone numbers.  When the spamming begins, you’ll be busy handling complaints from annoyed customers.  Can the bots be stopped?

“SiteBlackBox helped restore our credibility.  The number of spamming complaints we receive has dropped sharply!”

Click here to read about a SiteBlackBox customer who eliminated spamming and improved customer satisfaction.

What Spam Bots Do

Spam bots are automated web crawling systems used to systematically “scrape” your users’ email addresses and phone numbers from web sites.  These data are then passed on to spammers, who pester customers with unwanted emails and telemarketing pitches – undermining customer loyalty and causing severe damage to your business.

The Solution:  BotBlackBox

BotBlackBox is a powerful, innovative Software-as-a-Service (SaaS) designed to protect your valuable online assets by neutralizing traffic from scraping, spamming and other hostile sources, while ensuring the highest quality-of-experience to legitimate users.

When you deploy BotBlackBox on your website, you protect yourself from costly bot-inflicted damage:

  • Content Scraping
  • Spamming of advertisers’ email addresses and phone numbers
  • Fraudulent injection of rogue advertisements

Web site owners have reported a 3-5 percent revenue increase following BotBlackBox deployment, increasing profitability and achieving return-on-investment within months.

Want to put an end to spamming headaches?

Contact us now for a demo and a free trial!

Telephone:                        +972-3-7363041
U.S./Canada Toll Free:    +1-888-777-7020
Email:                                   info@siteblackbox.com
Web:                                     http://www.siteblackbox.com

Posted in anti-scraping solutions, spam bots, web-scraping | Tagged , , | Leave a comment

Block the bots… and double your ad revenues!

Bots invade your web site and scrape your data.  But they never click on your ads!  As a result, you suffer from poor click-through rates, low Google/Bing rankings, and reduced ad revenues.  What would happen if you could block the bots?

Click here  to read about a SiteBlackBox customer who eliminated bots and doubled advertising revenues!

What Bots Do

Bots are automated web crawling systems used to systematically abuse web sites and their content.  They imitate the activity of human users, methodically collecting data from web sites.  Using bots, hackers and unscrupulous competitors can “scrape” entire directories, causing severe damage to your business.

The Solution:  BotBlackBox

BotBlackBox is a powerful, innovative Software-as-a-Service (SaaS) designed to protect your valuable online assets by neutralizing traffic from scraping, spaming and other hostile sources, while ensuring the highest quality-of-experience to legitimate users.

When you deploy BotBlackBox, you are assured that your on-line assets remain in your control. Web site owners have reported a 3-5 percent revenue increase following BotBlackBox deployment, increasing profitability and achieving return-on-investment within months.

Want to Increase Your Ad Revenues?

Contact us now for a demo and a free trial!

Telephone:                        +972-3-7363041
U.S./Canada Toll Free:  +1-888-777-7020
Email:                                   info@siteblackbox.com
Web:                                     http://www.siteblackbox.com

Posted in anti-scraping solutions, content aggregation, Content aggregation and web-scraping, web-scraping | Tagged , , | Leave a comment

Airlines seek anti-scraping solutions

A growing number of airlines are suffering from the attempts of travel sites to steal their precious information for the purpose of reselling flights. This has led a growing interest in sophisticated anti-scraping solutions that can tell apart users from abusers and humans from bots with minimum or no interruption to legitimate users.

Source: Ryanair

This became evident after Ryanair admitted that it had tried to block third parties from using information from its site. Earlier this week, Travel Weekly reported that the Irish low-cost carrier had “made what is being seen as the most aggressive move yet to prevent travel agents booking its flights using the screen-scraping method.

”Ryanair has reportedly added a new verification step to its booking process to ensure the inquiry is being made by a genuine human being rather than a machine. The procedure requires the customer to enter a unique code. The move allows agents to book flights manually, but this prevents copying large numbers of flights. Ryanair decided, however, to remove the verification measure after it learned that the overall number of bookings has dropped.

“Ryanair’s case illustrates that airlines should refrain from using solutions that disrupt each session by default,” said Shay Rapaport, CEO of SiteBlackBox. “They should, instead, use solutions that take surgical strikes at hostile traffic, while allowing uninterrupted experience for legitimate users.”

Travel Weekly reported that the Irish budget carrier had fought and lost a number of legal cases around Europe in its attempts to prevent third party sales, although many firms are able to get around attempts to stifle screen scraping by using companies based outside of the EU.

Last year, a Dutch court ruled that flight search website Wegolo had to stop extracting flight information from Ryanair’s website. The court forced Wegolo to hand over to the airline any profits made from unlawful transactions and pay damages and costs.

Ryanair said at that time that it welcomed agreements with “genuine price-comparison websites” that show Ryanair’s fares without charging fees for the service.

Posted in anti-scraping solutions | Tagged , , , , | 1 Comment

Canadian Court Declares Scraping Illegal

A Canadian court set a legal precedent when it ruled that “scraping” and using online content without permission violated the terms of use and copyright of  a website.

The Supreme Court of British Columbia ruled that property search website Zoocasa illegally indexed, stored and displayed photos and descriptions of properties for sale that had been posted on Century 21 Canada’s website and awarded the real estate company $33,000 in damages.

Judge Robert Punnett clarified that the unauthorized use of online listings was illegal.

The court ruled that by accessing Century 21’s site, Zoocasa had legally agreed to accept the site’s terms of use, which forbids the copying or reuse of its content.

The court found that the photographs and property descriptions reproduced from the Century 21 website constituted “a substantial portion of each real estate listing page on the Century 21 website, not only with respect to quantity but also in their overall significance respecting the property listing described.”

“This is an important step towards forbidding abuse of website resources and exploitation of Internet anonymity,” said Shay Rapaport, CEO of SiteBlackBox. “Yet I think it is unlikely that this legal ratio will sustain, since it could also imply that site users agree to any caprice mentioned in the Terms & Conditions page. Personally I find the legal approach of seeing uninvited bot activity as a form of trespassing much more precise, especially if the site uses means to block bots and the bots try to bypass those means.”

“Websites such as Yellow Pages, whose unique and costly information is stolen by bots on a regular basis, can’t afford to wait for the completion of long legal procedures and may even be exposed to sites that are operating from countries where scraping is not forbidden by law,” Rapaport added.

“Such businesses need to actively protect themselves by adopting advanced solutions to clean their traffic from automated abusers and robots. These systems must be specifically designed to protect valuable online assets and take surgical strikes at hostile traffic, while allowing uninterrupted experience to legitimate users.”

Posted in Content aggregation and web-scraping | Tagged , , , , , | Leave a comment

Common misconceptions about aggregators

1.       Aggregator sites are no threat (they are, at worst case, co-opetition).

Many sites that aggregate your content and re-publish it would argue that they are, just like you, a viable organ in a content “ecosystem”.  They may even use the common smoke-screen of bringing some traffic to your site (via deep-links) making you think it is a win-win situation. In most cases, this is a misrepresentation.  The questions you should ask yourself are simple: Is this aggregator site suggesting a service that resembles mine? Do users use this site for the same purpose they’d use mine? Do we monetize our traffic in similar ways? If the answer to these questions is yes,  then what you have here is a competitor. Your competitor has a crystal clear goal in mind; to be more relevant than you are at any point in time. It is not your content this aggregator is after, but rather your user-base.

2.       The actual losses my business sustains from aggregators are insignificant.

At some point in time you have probably calculated how much a customer is worth to you, how much it costs you to acquire a customer and to how much a single user session or page-hit monetizes. This is probably the most important math for online publishers.   Can you estimate to how many page views and user sessions your data is exposed on the competitor’s site and how much you could (and should) be making? Can you estimate how many end-users, whether your clients or not, who get exposed to this aggregator, start thinking of it as a viable alternative to your services? How many clients do you lose? How much are they worth to you? What is the tipping point between losing customers to worning out your brand?

3.       My brand is strong and will not be harmed by small players.

Who said that diamonds are forever? At the bottom line a brand name means consumers’ habits. If a competing site aggregates your content (and probably more content from other sites), it is creating an alternative to using your services and the next thing it will do is eat away your brand. Perhaps right now many users think of you when they need the service you provide. However, the aggregator’s goal is to convince consumers they have a more efficient alternative. Once users start choosing rather than following what seemed to be the best of brands, you lose their loyalty and slowly start losing them. This is the slippery curve to the tipping point where your brand breaks.

4.       Security products, like insurance, have no tangible ROI.

Perhaps it is true, but anti-scraper services are not security products per se. It is your investment, your user-base and your brand that are being safeguarded when we block competitors from stealing your content. If you lose your content to competitors, you consequently loose traffic and lose consumer’s loyalty. These are actual financial losses which can be roughly calculated.
On top of this, in many cases non-competitor entities extract your data for commercial purposes. One such example is real-estate agencies scraping real-estate boards. Once you’re able to enforce business policies, you can sell this data rather then giving away a free ride. We’ve seen clients of ours doing this and turning our BotBlackBox into a revenue channel.

Posted in Content aggregation and web-scraping | 2 Comments