首页    期刊浏览 2024年09月20日 星期五
登录注册

文章基本信息

  • 标题:映像・音声認識,自然言語処理の適用によるメタデータ生成の作業コスト削減効果に関する考察
  • 本地全文:下载
  • 作者:桑野 秀豪 ; 松尾 義博 ; 川添 雄彦
  • 期刊名称:映像情報メディア学会誌
  • 印刷版ISSN:1342-6907
  • 电子版ISSN:1881-6908
  • 出版年度:2007
  • 卷号:61
  • 期号:6
  • 页码:842-852
  • DOI:10.3169/itej.61.842
  • 出版社:The Institute of Image Information and Television Engineers
  • 摘要:We propose a task model that semi-automatically generates scene-based metadata based on mediaanalysis technology such as audio/visual indexing and natural-language processing to reduce the costs of generat-ing metadata.Our task model can shorten the task time by reusing both the results of media analysis and existingtext information such as program scripts.SceneCabinet,a metadata generation and editing system,can automati-cally extract scene-based metadata from videos.The system extracts meaningful video slices and textual informa-tion such as scene titles,synopses,and keywords using natural-language processing based on the results of speechrecognition and video OCR.Moreover,the system can import program scripts and use them to automaticallyextract keywords.SceneCabinet provides an intuitive user operation interface including a video browser with keyimages that are automatically detected based on scene changes,on-screen text,camerawork,speech,and music.Experiments showed that SceneCabinet could significantly reduce metadata generation costs.
  • 关键词:メタデータ生成;映像認識;音声認識;自然言語処理;ユーザインタフェース
国家哲学社会科学文献中心版权所有