Selection of Suitable PageRank Calculation for Analysis of Differences Between Expected and Observed Probability of Accesses to Web Pages
Konferenční objektOtevřený přístuppeer-reviewedpostprintDatum publikování
2018
Vedoucí práce
Oponent
Název časopisu
Název svazku
Vydavatel
Springer
Abstrakt
We describe various approaches how to calculate the value of PageRank in this paper. There are few methods how to calculate the PageRank, from the basic historical one to more enhanced versions. Most of them are using the original value of the damping factor. We describe the experiment we realised using our method for analysing differences between expected and observed probability of accesses to web pages of the selected portal. We used five slightly different methods for PageRank estimation using both the original value of damping factor and the value calculated from data in the web server log file. We assumed and confirmed that the estimation/calculation of the damping factor would have a significant impact on the estimation of the PageRank. We also wrongly assumed that the estimation/calculation of the damping factor would have a significant impact on the number of suspicious pages. We also compared the computational complexity of used PageRank methods, and the most effective method seems to be a method with the estimated value of the damping factor.
Rozsah stran
p. 139-150
ISSN
0302-9743
Trvalý odkaz na tento záznam
Projekt
GA16-19590S/Analýza témat a sentimentu vícenásobných textových zdrojů pro finanční rozhodování
Zdrojový dokument
Multi-disciplinary Trends in Artificial Intelligence
Vydavatelská verze
https://link.springer.com/chapter/10.1007/978-3-030-03014-8_12
Přístup k e-verzi
embargoed access
Název akce
12th International Conference on Multi-disciplinary Trends in Artificial Intelligence, MIWAI 2018 (18.11.2018 - 20.11.2018, Hanoj)
ISBN
978-3-030-03013-1
Studijní obor
Studijní program
Signatura tištěné verze
Umístění tištěné verze
Přístup k tištěné verzi
Klíčová slova
Web usage mining, Web structure mining, PageRank, Damping factor, Support, Observed visit rate, Expected visit rate