首页    期刊浏览 2024年10月05日 星期六
登录注册

文章基本信息

  • 标题:Web Scraper Revealing Trends of Target Products and New Insights in Online Shopping Websites
  • 作者:Habib Ullah ; Zahid Ullah ; Shahid Maqsood
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2018
  • 卷号:9
  • 期号:6
  • DOI:10.14569/IJACSA.2018.090658
  • 出版社:Science and Information Society (SAI)
  • 摘要:Trillions of posts from Facebook, tweets in Twitter, photos on Instagram and e-mails on exchange servers are overwhelming the Internet with big data. This necessitates the development of such tools that can detect the frequent updates and select the required information instantly. This research work aims to implement scraper software that is capable of collecting the updated information from the target products hosted in fabulous online e-commerce websites. The software is implemented using Scrapy and Django frameworks. The software is configured and evaluated across different e-commerce websites. Individual website generates a greater amount of data about the products that need to be scraped. The proposed software provides the ability to search a target product in a single consolidated place instead of searching across various websites, such as amazon.com, alibaba.com and daraz.pk. Furthermore, the scheduling mechanism enables the scraper to execute at a required frequency within a specified time frame.
  • 关键词:Django QuerySet (DQS); e-commerce; hamming distance algorithm (HDA); Levenshtein distance algorithm (LDA); scraper; scheduling mechanism
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有