Case Study: Discovering Ad Channels by Adify

Good content can be hard to find. As individuals, we spend several hours each week browsing through web pages and trying to find content that's interesting. Advertising networks know that, and they want to make sure the ads they deliver are displayed next to the best and most interesting content. To do this, they crawl the web, trying to identify websites with interesting content. The more websites they know about, the more potential ad channels they have.

Adify is one of the top advertising networks in the country, and they use 80legs to help power their web crawling and analysis of interesting web content as a component of their market mappingTM methodology. To do this, Adify has created their own custom 80legs code to process the content of a web page and determine whether or not the web page or domain provides interesting and relevant content. Over time, they have built several applications on the 80legs platform to tell them whether or not a domain fits potential advertisers' needs. With the scale and customization provided by 80legs, Adify can do this quickly, easily and cost-effectively.

Adify has crawled over 50 million targeted websites with 80legs. When coupled with Adify's proprietary insights and oother industry leading sources of analytics, these crawls help expand Adify's extensive database of websites and create a comprehensive map of potential advertising channels on the web. By mapping the Internet in this manner and creating market mapsTM, Adify is able to provide their customers strategic guidance on content monetization.

Case Study: Sentiment Analysis by Lingway

Sentiment analysis is in big demand these days. Lingway uses natural language processing (NLP) to understand how people feel about various brands. Lingway specializes in processing text data, but they rely on the specialty of 80legs to gather that data from the Web.

Here's how Lingway's workflow handles data extraction and collection:

  • Search engines are used to generate a list of URLs related to given keywords about a brand.
  • The URL list is uploaded to 80legs as a seed list, and a web crawl is started from this seed list.
  • During the web crawl, a custom data extractor (aka "80app") is used to process and cleanup the text content of a web page.
  • The results generated by the 80legs web crawl are then fed into Lingway's NLP tools, which determine sentiment.

The 80legs API and 80app framework, along with the raw bandwidth and web crawling speed provided by 80legs, lets Lingway crawl the web in a very short time for any given topic. 80legs helps Lingway with massive distributed data cleanup and enhances the performance of its own product.