出版社:Vilnius University, University of Latvia, Latvia University of Agriculture, Institute of Mathematics and Informatics of University of Latvia
摘要:This paper describes still encountered problems of documents visual content comparison in contemporary computerized workplaces. There are many ways for creating HTML documents and plenty of invisible to user data that carries no information in terms of content. Such circumstances make the automation and visualization process of HTML document comparison rather complex. Introduced algorithm compares versions of HTML documents and displays changes in a result document. The comparison is carried out in such a way that all style and metadata of the document is preserved. Furthermore, the design phases and implementation aspects of the algorithm are investigated in order to share achieved results, to create an effectively working tool and draw guidelines for future work.
关键词:document content comparison; style data preservation; changes visual tracking; HTML document.