Text line segmentation of Ottoman manuscripts

Warning The system is temporarily closed to updates for reporting purpose.

Kutal, Seçilay and Aptoula, Erchan and Yanıkoğlu, Berrin (2025) Text line segmentation of Ottoman manuscripts. In: 33rd Signal Processing and Communications Applications Conference (SIU), Istanbul, Turkiye

Full text not available from this repository. (Request a copy)

Abstract

Text line segmentation is a key challenge in historical document analysis, especially for Ottoman manuscripts with complex calligraphy and overlapping text. This paper presents a comparative study of various deep-learning approaches for the automatic segmentation of plain text lines in Ottoman manuscripts. In the study based on YOLO and U-Net architectures, a series of adjustments and sub-approaches are used to address the challenges of Ottoman script. The U-Net focuses on fine-grained pixel-by-pixel segmentation, whereas YOLO handles the problem in two strategies: direct instance segmentation and instance segmentation applied after object detection using oriented bounding boxes to simplify the segmentation. Approaches are evaluated on manually labeled 25-paged Ottoman manuscripts. The comparative analysis reveals that the YOLO segmentation approach achieves the highest performance for plain text line segmentation with a pixel-level IoU score of 83.9% and F1-score of 91%.
Item Type: Papers in Conference Proceedings
Uncontrolled Keywords: computer vision; handwritten manuscript; Ottoman; segmentation; text line
Divisions: Center of Excellence in Data Analytics
Faculty of Engineering and Natural Sciences
Depositing User: Erchan Aptoula
Date Deposited: 26 Sep 2025 15:06
Last Modified: 26 Sep 2025 15:06
URI: https://research.sabanciuniv.edu/id/eprint/52563

Actions (login required)

View Item
View Item