首页    期刊浏览 2024年11月08日 星期五
登录注册

文章基本信息

  • 标题:Business Data Extraction Using a Programming Language
  • 本地全文:下载
  • 作者:Fabio Bragato Do Carmo ; Vinícius Medeiros Magnani ; Rafael Confetti Gatsios
  • 期刊名称:Theoretical Economics Letters
  • 印刷版ISSN:2162-2078
  • 电子版ISSN:2162-2086
  • 出版年度:2022
  • 卷号:12
  • 期号:1
  • 页码:195-215
  • DOI:10.4236/tel.2022.121011
  • 语种:English
  • 出版社:Scientific Research Publishing
  • 摘要:In the era of great informational quantity, the presence of technologies that assist in the extraction, transformation, and loading of data has become increasingly necessary. The term Big Data, usually used to describe this volume of information, requires the user to have knowledge of multiple tools such as Excel, VBA, SQL, Tableau, Python, Spark, AWS, and so on. In this context, the present work aims to study data extraction techniques using different methodologies. At the end of the work, a library of functions in the Python language will be made available that will deliver a compilation of stock price information available on the Yahoo Finance website as well as balance sheets from financial institutions released by Bacen. The main resource used will be Web Scraping, which is a method that aims to automate data collection via the web. Once the collection of functions has been structured, it will be made available for public enjoyment through the GitHub platform.
  • 关键词:Data ExtractionBig DataPythonWeb Scraping
国家哲学社会科学文献中心版权所有