Skip to main content
The British Library is starting round two of their Optical Character Recognition (OCR) contest, and are calling on anyone with OCR software to apply. More details can be found here: https://blogs.bl.uk/digital-scholarship/2019/02/automatic-transcription-of-historical-arabic-scientific-manuscripts-round-2.html

In the RBSCL Digital Collections, the lack of quality OCR software has hampered research use as well as some potentially interesting projects. Most notably, AUC's student newspaper The Caravan (aka Caravan, Campus Caravan, Caravan Weekly, and AUC Review) has been published in English and Arabic since its founding in 1925. Unfortunately the Arabic OCR text is generally not as accurate or user friendly as the English, thus reducing some of the usefulness. Additionally, it could be interesting to study how Arabic and English word usage changes over time, but without good Arabic OCR it would require a lot of manual entry of data.

Hopefully the British Library contest makes progress in much the way the first contest did.

Comments

Popular posts from this blog

Manuscripts | Digital Resources and Projects in Islamic Studies

We recently came across this interesting and extensive initiative by "The Maydan" . Below is a description of this ongoing project as it appeared on their website. The Maydan is proud to introduce a new initiative highlighting digital resources and projects in the field of Islamic Studies. Included in this roundup are manuscripts collections, digitized manuscripts, and manuscripts catalogues from universities and libraries around the world. *This is an ongoing project – if you know of similar resources that we may have missed  please email us at publish@themaydan.com . **In the near future we plan to add to this page essays from scholars who have engaged these manuscript collections/catalogues. Collections Open Islamicate Texts Initiative (OpenITI ) The Zaydi Manuscript Tradition, Institute for Advanced Study The Sinai Palimpsests Project Mingana Collection, University of Birmingham The Minassian Collection of Qur’anic Manuscripts, Brown University ...

Welcome on Board!

This blog intends to be an open space for digital humanists, librarians, scholars, and researchers working in or on Egypt and the Middle East to share their respective projects and discuss any ideas and tools regarding digital humanities. This blog is created and managed by the Digital Humanities Program at the American University in Cairo library. If you would like to contribute please contact Abdel Aziz Galal , Digital Humanities librarian at AUC. Please consider joining our mailing list .

The Baki Project

The Department of Near Eastern Languages and Civilization at the University of Washington is currently working on a project revolving around Mahmud AbdulBaki (1526-1600) who wrote poetry under the penname Baki (Bāḳī = the Enduring) during the reigns of 4 Ottoman sultans.  As the acclaimed “Sultan of Poets” during the so-called “Golden Age” of Ottoman literature, Baki’s influence as a poet echoed down through the centuries.  He was also a regular guest at the salons and private entertainments of Sultan Suleyman the Magnificent (reigned 1520-1566) and a noted scholar and jurist who rose to become the Chief Magistrate of the European Provinces, the second highest canon law position in the Empire.  Whether or not he was the “best” poet ever among the Ottomans is still argued today but very few would claim that he was not the most famous. Among the goals of the project is to bring digital technologies to bear on the problems of dealing with large and comp...