A study on website log data analysis methodology by transition probability

  • Authors

    • Jae Kyeong Lee
    • Mi Hwan Hyun
    • Dong Gu Shin
    2018-04-03
    https://doi.org/10.14419/ijet.v7i2.12.11118
  • Weblog, Markov Chain, Transition Matrix, Probability Transition Matrix, Steady State
  • Background/Objectives: To measure occupancy using transition probability matrix as a data analysis method to predict future requirements for web use. From this study, Executives facing business challenges can enhance the decision-making process for management and can be provided quantified evidence.

    Methods/Statistical analysis: Transition matrix and transition probability matrix are estimated if web users’ webpage use patterns are tied with frequency, using web log data. Occupancy is forecasted based on a Markov chain model.

    Findings: Data analysis from the perspective of web log-based marketing mostly focuses on increasing traffic and improving transition rates. However, general-purpose tools such as Google Analytics provide diverse web log data. In assumption of independence on users’ page reload, occupancy can be easily estimated through matrix on page reload (transition). As a result, we obtained slightly different results from the usual method that reported only frequency. In particular, rather than making business decisions with the frequency of absolute concepts, we were able to identify the top priority services through the percentage value of relative concepts.

    Improvements/Applications: The occupancy prediction using transition matrix is about future prediction based on previous information. However, it differs from marketing techniques in that it is estimated based on probability. In addition, it is able to predict more accurately through a probability model.

     

  • References

    1. [1] Online Creative Economy Town, South Korea online idea commercialization platform, www.creativekorea.or.kr

      [2] Y.H. Kim, U.M. Kim, M.S. Jung, W.J. Kang, A Web Usage Predicton model by Transition Probability Matrix, Communications of the Korean Institute of Information Scientists and Engineers, Vol. 31, No.2, pp. 31.

      [3] D.S. Kang, A Study on the Determinants of User Satisfaction of e-Government Services, skku, 2009

      [4] American Customer Satisfaction Index, www.theacsi.org

      [5] Robert Cooley, Pang-Ning Tan, and JaideepSrivastave. Discovery of interesting usage patterns from web data. Technical Report TR 99-022, University of Minnesota, 1999.

      [6] V.N. Padmanabham and J.C.Mogul. Using predictive prefetching to improve world wide web latency. Computer Communication Review, 199

      [7] Ramesh R. Sarukkai. Link prediction and path analysis using markov chains. In Ninth International World Wide Web Conference, 200I

      [8] gor Cadez, David Heckerman, Christopher Meek, Padhraic Smyth, and Steven White. Vidualization of navigational patterns on web site using model based clustering. Technical Report MSR-TR-0018, Microsoft Research, Microsoft Corporation, 2000.

      [9] Mukund Deshpande and George Karypis. Selective Markov Models for Predicting Web-Page Accesses, ACM Transactions on Internet Technology, Volume 4, Isuue 2, May 2004.

      [10] Jianhan Zhu, Jun Hong, and John G. Hughes, Using Markov Chains for Link Prediction in Adaptive Web Sites, Springer-Verlag Berlin Heidelberg, 2002.

  • Downloads

  • How to Cite

    Kyeong Lee, J., Hwan Hyun, M., & Gu Shin, D. (2018). A study on website log data analysis methodology by transition probability. International Journal of Engineering & Technology, 7(2.12), 171-173. https://doi.org/10.14419/ijet.v7i2.12.11118