Vui lòng dùng định danh này để trích dẫn hoặc liên kết đến tài liệu này: http://thuvienso.vanlanguni.edu.vn/handle/Vanlang_TV/31324
Toàn bộ biểu ghi siêu dữ liệu
Trường DCGiá trị Ngôn ngữ
dc.contributor.authorNguyen, Quoc‑Dung-
dc.contributor.authorLe, Duc‑Anh-
dc.contributor.authorPhan, Nguyet‑Minh-
dc.contributor.authorZelinka, Ivan-
dc.date.accessioned2021-06-12T14:40:28Z-
dc.date.available2021-06-12T14:40:28Z-
dc.date.issued2020-
dc.identifier.issn1433-7541-
dc.identifier.issn1433-755X-
dc.identifier.urihttp://thuvienso.vanlanguni.edu.vn/handle/Vanlang_TV/31324-
dc.description21p.; 3.2 MBvi
dc.description.abstractOptical character recognition (OCR) systems help to digitize paper-based historical achieves. However, poor quality of scanned documents and limitations of text recognition techniques result in different kinds of errors in OCR outputs. Postprocessing is an essential step in improving the output quality of OCR systems by detecting and cleaning the errors. In this paper, we present an automatic model consisting of both error detection and error correction phases for OCR post-processing. We propose a novel approach of OCR post-processing error correction using correction pattern edits and evolutionary algorithm which has been mainly used for solving optimization problems. Our model adopts a variant of the self-organizing migrating algorithm along with a fitness function based on modifications of important linguistic features. We illustrate how to construct the table of correction pattern edits involving all types of edit operations and being directly learned from the training dataset. Through efficient settings of the algorithm parameters, our model can be performed with high-quality candidate generation and error correction. The experimental results show that our proposed approach outperforms various baseline approaches as evaluated on the benchmark dataset of ICDAR 2017 Post-OCR text correction competition.vi
dc.language.isoenvi
dc.publisherPattern Analysis and Applicationsvi
dc.subjectOCRvi
dc.subjectN-gramsvi
dc.subjectSimilarityvi
dc.subjectContextvi
dc.subjectCorrection patternvi
dc.subjectEvolutionary algorithmvi
dc.titleOCR Error Correction Using Correction Patterns And Self‑Organizing Migrating Algorithmvi
dc.typeArticlevi
Bộ sưu tập: Bài báo khoa học giảng viên

Các tập tin trong tài liệu này:
Tập tin Mô tả Kích thước Định dạng  
BBKH2578_OCR error correction using correction patterns.pdf
  Giới hạn truy cập
OCR Error Correction Using Correction Patterns And Self‑Organizing Migrating Algorithm3.17 MBAdobe PDFXem/Tải về  Yêu cầu tài liệu


Khi sử dụng các tài liệu trong Thư viện số phải tuân thủ Luật bản quyền.