Open Access Dissertation
Doctor of Philosophy (PhD)
College of Technology
Dr. Ali Eydgahi, Ph.D., (Chair)
Dr. Daniel Fields, Ph.D.
Dr. Huei Lee, Ph.D.
Dr. Alphonso Bellamy, Ph.D.
Web robots also known as crawlers or spiders are used by search engines, hackers and spammers to gather information about web pages. Timely detection and prevention of unwanted crawlers increases privacy and security of websites. In this research, a novel method to identify web crawlers is proposed to prevent unwanted crawler to access websites. The proposed method suggests a five-factor identification process to detect unwanted crawlers. This study provides the pretest and posttest results along with a systematic evaluation of web pages with the proposed identification technique versus web pages without the proposed identification process. An experiment was performed with repeated measures for two groups with each group containing ninety web pages. The outputs of the logistic regression analysis of treatment and control groups confirm the novel five-factor identification process as an effective mechanism to prevent unwanted web crawlers. This study concluded that the proposed five distinct identifier process is a very effective technique as demonstrated by a successful outcome.
Aghamohammadi, Alireza, "A novel defense mechanism against web crawler intrusion" (2013). Master's Theses and Doctoral Dissertations. 544.