Nadir Durrani

Arabic Language Technologies, QCRI


Room 1139, RDC-1

HBKU Research Complex

Gharrafa, Al-Rayan

I am a Senior Scientist at Arabic Language Technologies (ALT), QCRI, where I work on several projects, including Interpretability (NeuroX), Machine Translation (Shaheen), Speech Synthesis (NatiQ), and language processing tools for Arabic (Farasa). QCRI is a unique place that combines academic research with productization. Please browse through my projects and research.

Previously, I was a Research Associate under Philipp Koehn at the Institute for Language, Cognition and Computation at the University of Edinburgh. I worked on several problems in statistical machine translation (SMT), such as Unsupervised Transliteration and Markov-based translation models.

Here is a periodically updated resume.

news

Jan 18, 2025 Fanar, an Arabic AI Large Language Model, is now open to the public! Please try it out and share your feedback by evaluating Fanar’s responses. Also, check out the report detailing its development and capabilities.
Nov 30, 2024 Our paper, ARADICE: Benchmarks for Dialectal and Cultural Capabilities in LLMs, has been accepted for presentation at COLING 2025. We introduce dialectal and cultural benchmarks for assessing how well Arabic LLMs handle dialects and culturally nuanced tasks. The benchmark and the underlying dialectal MT models have been released for public use.
Nov 11, 2024 More News ...

selected publications

  1. Discovering Latent Concepts Learned in BERT
    Fahim Dalvi, Abdul Rafae Khan, Firoj Alam, and 3 more authors
    In International Conference on Learning Representations Apr 2022
  2. What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models
    Fahim Dalvi, Nadir Durrani, Hassan Sajjad, and 3 more authors
    In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI, Oral presentation) Jan 2019
  3. What do Neural Machine Translation Models Learn about Morphology?
    Yonatan Belinkov, Nadir Durrani, Fahim Dalvi, and 2 more authors
    In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Jul 2017
  4. Integrating an Unsupervised Transliteration Model into Statistical Machine Translation
    Nadir Durrani, Hassan Sajjad, Hieu Hoang, and 1 more author
    In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, Volume 2: Short Papers Apr 2014
  5. A Joint Sequence Translation Model with Integrated Reordering
    Nadir Durrani, Helmut Schmid, and Alexander Fraser
    In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies Jun 2011