Text line segmentation of Ottoman manuscripts

Kutal, Seçilay and Aptoula, Erchan and Yanıkoğlu, Berrin (2025) Text line segmentation of Ottoman manuscripts. In: 33rd Signal Processing and Communications Applications Conference (SIU), Istanbul, Turkiye

Full text not available from this repository. (Request a copy)

Abstract

Text line segmentation is a key challenge in historical document analysis, especially for Ottoman manuscripts with complex calligraphy and overlapping text. This paper presents a comparative study of various deep-learning approaches for the automatic segmentation of plain text lines in Ottoman manuscripts. In the study based on YOLO and U-Net architectures, a series of adjustments and sub-approaches are used to address the challenges of Ottoman script. The U-Net focuses on fine-grained pixel-by-pixel segmentation, whereas YOLO handles the problem in two strategies: direct instance segmentation and instance segmentation applied after object detection using oriented bounding boxes to simplify the segmentation. Approaches are evaluated on manually labeled 25-paged Ottoman manuscripts. The comparative analysis reveals that the YOLO segmentation approach achieves the highest performance for plain text line segmentation with a pixel-level IoU score of 83.9% and F1-score of 91%.
Item Type: Papers in Conference Proceedings
Uncontrolled Keywords: computer vision; handwritten manuscript; Ottoman; segmentation; text line
Divisions: Center of Excellence in Data Analytics
Faculty of Engineering and Natural Sciences
Depositing User: Erchan Aptoula
Date Deposited: 26 Sep 2025 15:06
Last Modified: 26 Sep 2025 15:06
URI: https://research.sabanciuniv.edu/id/eprint/52563

Actions (login required)

View Item
View Item