Schedule

We will stick to this schedule. If there are any important changes to the schedule, assignments, or reading materials, you’ll receive an email notification.

Week 1 - Technology for African Languages, Why Technology for African Languages, Current state of Technology for African Languages

In this module, we will have a high-level discussion about language technology (e.g., ChatGPT, Gemini, and Llama), the current state of technology for African languages, and the challenges facing the development of language technology for African languages.

Sep 4 Discussion Technology for African Languages
  • Slides

  • Pre-Class Reflection:
  • Take a look at the Ethnologue page and try to answer the following questions:
    • How many languages are spoken in the world today? How many of the spoken languages are from Africa? What countries have the most languages? What continents have the most indigenous languages?
  • Read through this blog and see if you can understand the state of natural language processing research for African languages in 2019.
  • Read through the Masakhane and AfricaNLP pages, then reflect on Masakhane, AfricaNLP, and the efforts being made to create technology for African languages.
  • Learn about African languages: One Thousand Languages
  • Reflect on technology in general, technology for African languages, and generative AI.

Week 2 - Introduction to main linguistic groups of Africa

Sep 11 Discussion Linguistic groups of Africa

Videos

  • The Amazing Languages of Africa - sounds, grammar and writing systems of African languages.
  • The Languages of Africa https://www.youtube.com/watch?v=1WhIiqHr0q0
  • Example of Khoisan - Siki Jo-An – ‘The Click Song’ | Blind Audition | The Voice SA: Season 3 | M-Net
  • Pelonomi Moiloa: Decolonizing Artificial Intelligence to empower local talent
  • Sabelosethu Mhlambi: Decolonizing AI

Additional materials

  • Chapter 3 (An Introduction to African Languages): Identify linguistic features prevalent in linguistic families
  • International Journal of American Linguistics
  • African Languages: An Introduction, Bernd Heine & Derek Nurse, Cambridge University Press, 2000
  • The Linguistic Face of Africa by Benard Odoyo Okal 2016

Week 3 - Introduction to (Ki)Swahili Language, Swahili as an African Language vs a dialect of Arabic

Week 4 - Introduction to NLP and Its applications

People commonly use NLP technology every day without even knowing it. For example, Google Search works out what you’re looking for from either text or speech, and Gmail generates smart reply suggestions based on your messages. This module will introduce NLP and its applications.

Sep 25 Discussion Introduction to NLP and LLMs

Week 5 - Multilingual NLP and corpus annotation

In this module, we will discuss existing African language datasets and practice named entity recognition (NER) data annotation for African languages using an existing tool.
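
To give a feel for what NER annotation produces, here is a minimal sketch in Python. It assumes the common BIO tagging scheme (B- marks the start of an entity, I- its continuation, O everything else); the annotation tool used in class may export a different format. The Swahili sentence and its tags are an illustrative example, and `bio_to_spans` is a hypothetical helper name.

```python
def bio_to_spans(tokens, tags):
    """Group BIO-tagged tokens into (entity_text, entity_type) pairs."""
    spans = []
    current_tokens, current_type = [], None
    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A new entity starts; close any entity that is still open.
            if current_tokens:
                spans.append((" ".join(current_tokens), current_type))
            current_tokens, current_type = [token], tag[2:]
        elif tag.startswith("I-") and current_tokens and tag[2:] == current_type:
            # Continuation of the currently open entity.
            current_tokens.append(token)
        else:
            # An "O" tag (or an inconsistent I- tag) ends any open entity.
            if current_tokens:
                spans.append((" ".join(current_tokens), current_type))
            current_tokens, current_type = [], None
    if current_tokens:
        spans.append((" ".join(current_tokens), current_type))
    return spans


# Illustrative sentence: "Amina lives in Dar es Salaam."
tokens = ["Amina", "anaishi", "Dar", "es", "Salaam"]
tags = ["B-PER", "O", "B-LOC", "I-LOC", "I-LOC"]
print(bio_to_spans(tokens, tags))
```

Each token gets exactly one tag, which is why multi-word names such as "Dar es Salaam" need the B-/I- distinction.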

Oct 2 Discussion Multilingual NLP and corpus annotation

Week 6 - NLP and social media

Oct 09 Mid-term TBD
  • [Mid-term exam]
Oct 09 Discussion NLP and social media
  • Readings
  • Video
  • Supplementary/Additional Readings
    • Ong, W. J. (1982). Orality and literacy: The technologizing of the word. London: Methuen.
    • African Languages and Information and Communication Technologies: Literacy, Access, and the Future

Week 7 - Information Extraction

This week we will discuss techniques for extracting semantic content from text, and how this process of information extraction turns the unstructured information embedded in texts into structured data.
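
As a toy illustration of turning unstructured text into structured data, the sketch below uses a single hand-written pattern to pull (person, place) records out of sentences. The pattern, the function name `extract_birthplaces`, and the example sentences are all made up for illustration; real information extraction systems rely on far more robust methods than one regular expression.

```python
import re

# Hypothetical pattern for sentences of the form "<Name> was born in <Place>".
BORN_IN = re.compile(r"(?P<person>[A-Z]\w+) was born in (?P<place>[A-Z]\w+)")


def extract_birthplaces(text):
    """Return one {'person': ..., 'place': ...} record per pattern match."""
    return [match.groupdict() for match in BORN_IN.finditer(text)]


# Made-up example text; the middle sentence does not match the pattern.
text = "Amina was born in Mombasa. She studies linguistics. Kofi was born in Kumasi."
records = extract_birthplaces(text)
for record in records:
    print(record)
```

The output is a list of records that could be loaded straight into a table or database, which is the sense in which extraction produces structured data.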

Oct 23 Discussion Information extraction

Week 8 - What is a Search engine – an under the hood view

As soon as computers were invented, we were asking them questions, because we need to know things. Systems in the early 1960s were answering questions about baseball statistics and scientific facts. In this module we will give an overview of search and search engines.
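
One data structure commonly found under the hood of a search engine is the inverted index, which maps each term to the documents containing it. The sketch below is a minimal version under simplifying assumptions (whitespace tokenization, lowercase matching, every query term must appear); the function names and the three tiny documents are illustrative only.

```python
from collections import defaultdict


def build_index(docs):
    """Map each lowercase term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents that contain every term in the query."""
    terms = query.lower().split()
    if not terms:
        return set()
    results = set(index.get(terms[0], set()))
    for term in terms[1:]:
        results &= index.get(term, set())
    return results


docs = {
    1: "swahili is spoken in east africa",
    2: "yoruba is spoken in west africa",
    3: "search engines index documents",
}
index = build_index(docs)
print(sorted(search(index, "spoken africa")))
print(sorted(search(index, "east africa")))
```

Looking terms up in the index avoids scanning every document at query time. Ranking the matching documents is a separate step that this sketch leaves out.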

Oct 30 Discussion Search Engine
  • [Slides coming soon]

Related Readings

Week 9 - The movie was okay – analyzing the sentiment of texts

This week we will discuss sentiment analysis: the extraction of sentiment, that is, the positive or negative orientation that a writer expresses toward some object.
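
A very simple way to approximate this idea is a lexicon-based scorer that adds up word polarities. The sketch below assumes a tiny hand-made lexicon (the words and weights are invented for illustration) and ignores negation, context, and sarcasm.

```python
# Hypothetical mini-lexicon: positive weights for positive words, negative for negative.
LEXICON = {"good": 1, "great": 2, "okay": 0, "bad": -1, "terrible": -2}


def sentiment_score(text):
    """Sum the lexicon weights of the words in the text; unknown words count as 0."""
    words = text.lower().replace(".", " ").replace(",", " ").split()
    return sum(LEXICON.get(word, 0) for word in words)


def label(text):
    """Map the numeric score to a coarse sentiment label."""
    score = sentiment_score(text)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


for sentence in [
    "The movie was okay",
    "The acting was great, but the ending was bad",
    "A terrible film.",
]:
    print(sentence, "->", label(sentence))
```

The second sentence shows the limits of this approach: the mixed opinion collapses to a single label, which is one reason learned classifiers are usually preferred.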

Nov 6 Discussion Sentiment of texts

Week 10 - Bridging the language barrier using machine translation

Translation, in its full generality, such as the translation of literature, or poetry, is a difficult, fascinating, and intensely human endeavor, as rich as any other area of human creativity. This module introduces machine translation (MT), the use of computers to translate from one language to another.

Nov 13 Discussion Machine Translation

Week 11 - Going beyond text processing; complexities of spoken language

One of the earliest goals of language processing in computers is to understand spoken language. This module will introduce Automatic Speech Recognition (ASR), Text-to-Speech (TTS) and how to build speech recognition systems for African languages.

Nov 20 Discussion complexities of spoken language

Week 12 - Project presentations

Dec 4 Present Present final project
  • Each group will choose one or more African languages and create a profile of each language:
    1. Language family, structure, where it is spoken, some statistics, interesting facts, a video or audio clip, etc.
    2. Work on any of the tasks and/or address a problem we covered in class for that language.
  • Prepare a 15-minute slide presentation and/or demonstration, followed by 5 minutes of Q&A.
  • Submit the slides plus an individual report on the project of at most 2 pages.