Kutal, Seçilay and Aptoula, Erchan and Yanıkoğlu, Berrin (2025) Text line segmentation of Ottoman manuscripts. In: 33rd Signal Processing and Communications Applications Conference (SIU), Istanbul, Turkiye
Full text not available from this repository. (Request a copy)
Official URL: https://dx.doi.org/10.1109/SIU66497.2025.11112097
Abstract
Text line segmentation is a key challenge in historical document analysis, especially for Ottoman manuscripts with complex calligraphy and overlapping text. This paper presents a comparative study of various deep-learning approaches for the automatic segmentation of plain text lines in Ottoman manuscripts. In the study based on YOLO and U-Net architectures, a series of adjustments and sub-approaches are used to address the challenges of Ottoman script. The U-Net focuses on fine-grained pixel-by-pixel segmentation, whereas YOLO handles the problem in two strategies: direct instance segmentation and instance segmentation applied after object detection using oriented bounding boxes to simplify the segmentation. Approaches are evaluated on manually labeled 25-paged Ottoman manuscripts. The comparative analysis reveals that the YOLO segmentation approach achieves the highest performance for plain text line segmentation with a pixel-level IoU score of 83.9% and F1-score of 91%.
| Item Type: | Papers in Conference Proceedings |
|---|---|
| Uncontrolled Keywords: | computer vision; handwritten manuscript; Ottoman; segmentation; text line |
| Divisions: | Center of Excellence in Data Analytics Faculty of Engineering and Natural Sciences |
| Depositing User: | Erchan Aptoula |
| Date Deposited: | 26 Sep 2025 15:06 |
| Last Modified: | 26 Sep 2025 15:06 |
| URI: | https://research.sabanciuniv.edu/id/eprint/52563 |


