Article Information

Title: VIETNAMESE TEXT EXTRACTION FROM BOOK COVERS
Authors: Phan Thi Thanh Nga; Nguyen Thi Huyen Trang; Nguyen Van Phuc; et al.
Journal: Tạp chí Khoa học Đại học Đà Lạt
Print ISSN: 0866-787X
Publication year: 2017
Volume: 7
Issue: 2
Pages: 142-152
Language: English
Publisher: Dalat University
Abstract: Automatic information extraction from images reduces cost, human intervention, and processing time. Converting printed book covers to machine-readable text for later automated processing would be useful for a wide range of users, such as librarians, bookshop keepers, and individual users. In this paper, we present a novel method for Vietnamese text extraction from images of scanned book covers. The proposed system accepts a snapshot of a book cover, filters the input image to enhance its quality, locates the regions containing text, and then uses an optical character recognizer (OCR) to extract the text. The last step filters the extracted text with the aid of a dictionary to produce the final text result. Experiments with the proposed system on our dataset delivered encouraging results.
Other abstract: Recognizing text from images helps reduce effort, cost, and processing time. Automatically digitizing book information by recognizing book covers is very helpful for people whose work directly involves storing and classifying books, such as librarians, nhân vi
Keywords: Book cover; OCR (Optical Character Recognition); Text information extraction; Vietnamese text detection.
Other keywords: Book cover; Vietnamese recognition; Text recognition.
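
The abstract names a final step in which the OCR output is filtered against a dictionary, but this record does not say how that filter works. The sketch below is only one plausible illustration, not the authors' method: it keeps tokens found in a word list and snaps near-misses (for example, a dropped diacritic) to the closest entry using Python's standard `difflib`. The word list `VOCABULARY`, the function names, and the similarity cutoff of 0.6 are all assumptions made for this example.

```python
import difflib
import unicodedata

# Hypothetical mini-dictionary for illustration only; a real system would
# load a full Vietnamese word list.
VOCABULARY = ["sách", "khoa", "học", "đại", "việt", "nam", "tiếng"]


def normalize(token: str) -> str:
    """Lowercase and NFC-normalize so composed and decomposed
    diacritics compare as equal strings."""
    return unicodedata.normalize("NFC", token).lower()


def filter_tokens(tokens, vocabulary=VOCABULARY, cutoff=0.6):
    """Keep tokens that are in the vocabulary, replace near-misses with
    the closest vocabulary entry, and drop everything else."""
    vocab = [normalize(word) for word in vocabulary]
    kept = []
    for token in tokens:
        word = normalize(token)
        if word in vocab:
            kept.append(word)
            continue
        # get_close_matches returns at most n candidates whose similarity
        # ratio is at least `cutoff`, best match first.
        matches = difflib.get_close_matches(word, vocab, n=1, cutoff=cutoff)
        if matches:
            kept.append(matches[0])
    return kept


if __name__ == "__main__":
    # "hoc" lacks its diacritic and "@@#" is OCR noise.
    print(filter_tokens(["Sách", "khoa", "hoc", "@@#"]))
```

A character-level similarity ratio is a crude stand-in for a language-aware corrector: with short Vietnamese syllables, several dictionary entries can differ only by tone marks, so a production filter would likely need context or frequency information to choose among candidates.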