Home‎ > ‎Talks‎ > ‎

Crawling The Web

This talk will discuss different strategies & methods to crawl the web:
  •      Real-time priority decisions
  •      Web-Page's parsing strategies
  •      Domain information discovery
  •      Crawler scaling considerations
 Also, I will give an overview of message-based, distributed crawler architecture.

Speaker: 
Sharon Djebnoun - sharon [dot] djebnoun [at] picscout [dot] com
Comments