Skip to main content

Vicav – Viennese Corpus of Arabic Varieties

The Vienna Corpus of Arabic Varieties (VICAV) was set up with two main purposes in mind: to serve as a virtual research platform targeting the particular needs of Arabic dialectology and to serve as a test bed for newly developed text technological methodologies and tools.
As part of these efforts, it was designed as a means to promote the efficient exchange of ideas and experiences in an active international community. Being located at the border between areal and corpus linguistics, the aim is to gather varying digital language resources for a number of different localities. The description of the different varieties will hinge on language profiles, concise and uniformly structured form sheets that offer information on the research history, available literature, salient grammatical features etc. of particular varieties. VICAV makes accessible bibliographies, dictionaries, glossaries, and different types of transcribed texts.
The project aims at providing a platform of exchange for a scientific community which is increasingly producing digital data but still lacks the infrastructure to make it widely available. The project is conducted as a co-operation between the Department of Oriental Studies of the University of Vienna and the ACDH. It is also part of CLARIN-AT’s project bundle aiming at more language resources for under-resourced linguistic varieties. The ACDH’s research interests in this project are related to issues of digital lexicography, visualisation of digital language resources in a multilingual environment and the application of (de-facto) standards such as TEI, LMF and/or MAF in the creation of digital language resources.

Comments

Popular posts from this blog

Welcome on Board!

This blog intends to be an open space for digital humanists, librarians, scholars, and researchers working in or on Egypt and the Middle East to share their respective projects and discuss any ideas and tools regarding digital humanities. This blog is created and managed by the Digital Humanities Program at the American University in Cairo library. If you would like to contribute please contact Abdel Aziz Galal , Digital Humanities librarian at AUC. Please consider joining our mailing list .

Women are oppressed, coeds are elected, and men are swindled: A brief intro into text analysis using AUC's student newspaper

My next foray into digital humanities ( you can read about mapping the nationalities of AUC students here ) involves the venerable students newspaper the Caravan (aka the AUC Review , Campus Caravan , and Caravan Weekly ). The first issue was published in 1925 and it is still going strong today. Currently, we have issues up to 1996 available in our Digital Library though some years are missing (either because of scanning issues or we don’t have them at all, in the latter case please let us know if you have copies). The Caravan has been bilingual through most of its history, though this project will focus on the English issues only. With the excellent work done by the digitization lab we have over 4,000 English pages scanned, and through ABBYY FineReader we’ve generated text files for each page, creating a corpus to explore. Unfortunately for some pages the text recognition leaves a lot to be desired; often this is caused by poor quality printing or ABBYY being confused. ...

The Baki Project

The Department of Near Eastern Languages and Civilization at the University of Washington is currently working on a project revolving around Mahmud AbdulBaki (1526-1600) who wrote poetry under the penname Baki (Bāḳī = the Enduring) during the reigns of 4 Ottoman sultans.  As the acclaimed “Sultan of Poets” during the so-called “Golden Age” of Ottoman literature, Baki’s influence as a poet echoed down through the centuries.  He was also a regular guest at the salons and private entertainments of Sultan Suleyman the Magnificent (reigned 1520-1566) and a noted scholar and jurist who rose to become the Chief Magistrate of the European Provinces, the second highest canon law position in the Empire.  Whether or not he was the “best” poet ever among the Ottomans is still argued today but very few would claim that he was not the most famous. Among the goals of the project is to bring digital technologies to bear on the problems of dealing with large and comp...