Skip to main content
The British Library is starting round two of their Optical Character Recognition (OCR) contest, and are calling on anyone with OCR software to apply. More details can be found here: https://blogs.bl.uk/digital-scholarship/2019/02/automatic-transcription-of-historical-arabic-scientific-manuscripts-round-2.html

In the RBSCL Digital Collections, the lack of quality OCR software has hampered research use as well as some potentially interesting projects. Most notably, AUC's student newspaper The Caravan (aka Caravan, Campus Caravan, Caravan Weekly, and AUC Review) has been published in English and Arabic since its founding in 1925. Unfortunately the Arabic OCR text is generally not as accurate or user friendly as the English, thus reducing some of the usefulness. Additionally, it could be interesting to study how Arabic and English word usage changes over time, but without good Arabic OCR it would require a lot of manual entry of data.

Hopefully the British Library contest makes progress in much the way the first contest did.

Comments

Popular posts from this blog

Manuscripts | Digital Resources and Projects in Islamic Studies

We recently came across this interesting and extensive initiative by "The Maydan" . Below is a description of this ongoing project as it appeared on their website. The Maydan is proud to introduce a new initiative highlighting digital resources and projects in the field of Islamic Studies. Included in this roundup are manuscripts collections, digitized manuscripts, and manuscripts catalogues from universities and libraries around the world. *This is an ongoing project – if you know of similar resources that we may have missed  please email us at publish@themaydan.com . **In the near future we plan to add to this page essays from scholars who have engaged these manuscript collections/catalogues. Collections Open Islamicate Texts Initiative (OpenITI ) The Zaydi Manuscript Tradition, Institute for Advanced Study The Sinai Palimpsests Project Mingana Collection, University of Birmingham The Minassian Collection of Qur’anic Manuscripts, Brown University ...

Middle East Digital Humanities Digest Blog

The Middle East Digital Humanities Digest is a blog created by the Digital Humanities program at the American University in Cairo Libraries & Learning technologies. AUC is a leading English-language, American-accredited institution of higher education and center of the intellectual, social, and cultural life of the Arab world. Its community of students, parents, faculty and staff, trustees, alumni, and other generous sponsors represent more than 60 countries. The University stands as a crossroads for the world’s cultures and a vibrant forum for reasoned argument, spirited debate, and understanding across the diversity of languages, facilities, and human experiences. MEDHD Vision: Quality guide of content for digital humanities projects and content in Egypt and the Middle East. MEDHD Audience: (Our ideal reader & Bloggers) MEDHD  blog intends to be a space for digital humanists, librarians, scholars, and researchers, and students who show great interest or experience...

Welcome on Board!

This blog intends to be an open space for digital humanists, librarians, scholars, and researchers working in or on Egypt and the Middle East to share their respective projects and discuss any ideas and tools regarding digital humanities. This blog is created and managed by the Digital Humanities Program at the American University in Cairo library. If you would like to contribute please contact Abdel Aziz Galal , Digital Humanities librarian at AUC. Please consider joining our mailing list .