首页    期刊浏览 2024年11月25日 星期一
登录注册

文章基本信息

  • 标题:Comparison and Evaluation of Different Methods for the Feature Extraction from Educational Contents
  • 本地全文:下载
  • 作者:Jose Aguilar ; Camilo Salazar ; Henry Velasco
  • 期刊名称:Computation
  • 电子版ISSN:2079-3197
  • 出版年度:2020
  • 卷号:8
  • 期号:2
  • 页码:30-49
  • DOI:10.3390/computation8020030
  • 出版社:MDPI Publishing
  • 摘要:This paper analyses the capabilities of different techniques to build a semantic representation of educational digital resources. Educational digital resources are modeled using the Learning Object Metadata (LOM) standard, and these semantic representations can be obtained from different LOM fields, like the title, description, among others, in order to extract the features/characteristics from the digital resources. The feature extraction methods used in this paper are the Best Matching 25 (BM25), the Latent Semantic Analysis (LSA), Doc2Vec, and the Latent Dirichlet allocation (LDA). The utilization of the features/descriptors generated by them are tested in three types of educational digital resources (scientific publications, learning objects, patents), a paraphrase corpus and two use cases: in an information retrieval context and in an educational recommendation system. For this analysis are used unsupervised metrics to determine the feature quality proposed by each one, which are two similarity functions and the entropy. In addition, the paper presents tests of the techniques for the classification of paraphrases. The experiments show that according to the type of content and metric, the performance of the feature extraction methods is very different; in some cases are better than the others, and in other cases is the inverse.
  • 关键词:feature extraction; content analysis; educational contents; semantic representation; information retrieval; recommendation system feature extraction ; content analysis ; educational contents ; semantic representation ; information retrieval ; recommendation system
国家哲学社会科学文献中心版权所有