• Journal of Internet Computing and Services
    ISSN 2287 - 1136 (Online) / ISSN 1598 - 0170 (Print)
    https://jics.or.kr/

Numerical Formula and Verification of Web Robot for Collection Speedup of Web Documents


Kim Weon, Kim Jeong Geun, Kim Young-Ki, Hong Een-Kee, Chin Yong-Ok, Journal of Internet Computing and Services, Vol. 5, No. 6, pp. 1-10, Dec. 2004
Full Text:
Keywords: web robot, Multi-agent, Amdahl's Law, Nemerical Formula, Dynamic URL Partition algorithm, Performance, Scheduling, QoS, Proportional Fairness, WFQ, CDMA2000 l¡¿EV-DO

Abstract

A web robot is a software that has abilities of tracking and collecting web documents on the Internet(l), The performance scalability of recent web robots reached the limit CIS the number of web documents on the internet has increased sharply as the rapid growth of the Internet continues, Accordingly, it is strongly demanded to study on the performance scalability in searching and collecting documents on the web. 'Design of web robot based on Multi-Agent to speed up documents collection ' rather than 'Sequentially executing Web Robot based on the existing Fork-Join method' and the results of analysis on its performance scalability is presented in the thesis, For collection speedup, a Multi-Agent based web robot performs the independent process for inactive URL ('Dead-links' URL), which is caused by overloaded web documents, temporary network or web-server disturbance, after dividing them into each agent. The agents consist of four component; Loader, Extractor, Active URL Scanner and inactive URL Scanner. The thesis models a Multi-Agent based web robot based on 'Amdahl's Law' to speed up documents collection, introduces a numerical formula for collection speedup, and verifies its performance improvement by comparing data from the formula with data from experiments based on the formula. Moreover, 'Dynamic URL Partition algorithm' is introduced and realized to minimize the workload of the web server by maximizing a interval of the web server which can be a collection target.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from November 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[APA Style]
Weon, K., Geun, K., Young-Ki, K., Een-Kee, H., & Yong-Ok, C. (2004). Numerical Formula and Verification of Web Robot for Collection Speedup of Web Documents. Journal of Internet Computing and Services, 5(6), 1-10.

[IEEE Style]
K. Weon, K. J. Geun, K. Young-Ki, H. Een-Kee, C. Yong-Ok, "Numerical Formula and Verification of Web Robot for Collection Speedup of Web Documents," Journal of Internet Computing and Services, vol. 5, no. 6, pp. 1-10, 2004.

[ACM Style]
Kim Weon, Kim Jeong Geun, Kim Young-Ki, Hong Een-Kee, and Chin Yong-Ok. 2004. Numerical Formula and Verification of Web Robot for Collection Speedup of Web Documents. Journal of Internet Computing and Services, 5, 6, (2004), 1-10.