Building Digital Dictionaries for Local Languages in Indonesia with Python NLP

Track

AI / ML

Type

Talk

Level

Beginner

Language

English

Duration

20 minutes

Abstract

Learn how to build open-source bilingual dictionaries for Indonesia’s local languages using Python NLP libraries such as spaCy, NLTK, and Transformers. This talk explores practical methods for processing low-resource languages, including hybrid approaches that combine rule-based methods with transformer models. We will discuss how to build dictionary data pipelines and add features such as search, synonyms, antonyms, and lemmatization. The session also highlights the role of open-source collaboration in preserving linguistic diversity through technology.
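
As a rough illustration of the dictionary features mentioned above (search, synonyms, antonyms, lemmatization), here is a minimal sketch that uses only the Python standard library. It is not the speaker's pipeline: the `Entry` class, the `SUFFIXES` rules, the `lemmatize` and `lookup` helpers, and the small Javanese-to-Indonesian sample data are all hypothetical and chosen for illustration. A real system might replace the suffix rules with a spaCy or transformer-based component.

```python
from dataclasses import dataclass, field
from difflib import get_close_matches


@dataclass
class Entry:
    """One bilingual dictionary entry (hypothetical schema)."""
    headword: str
    translation: str
    synonyms: list[str] = field(default_factory=list)
    antonyms: list[str] = field(default_factory=list)


# Toy Javanese -> Indonesian sample data, for illustration only.
DICTIONARY = {
    e.headword: e
    for e in [
        Entry("omah", "rumah", synonyms=["griya"]),
        Entry("griya", "rumah", synonyms=["omah"]),
        Entry("buku", "buku"),
        Entry("gedhe", "besar", antonyms=["cilik"]),
        Entry("cilik", "kecil", antonyms=["gedhe"]),
    ]
}

# Assumed suffix rules; longer suffixes are tried first.
SUFFIXES = ("ne", "e")


def lemmatize(word: str, headwords) -> str:
    """Rule-based lemmatizer guided by the dictionary's headwords.

    A suffix is stripped only if the remaining stem is a known
    headword, so "gedhe" is not wrongly reduced to "gedh".
    """
    word = word.lower()
    if word in headwords:
        return word
    for suffix in SUFFIXES:
        if word.endswith(suffix):
            stem = word[: -len(suffix)]
            if stem in headwords:
                return stem
    return word


def lookup(query: str, dictionary=DICTIONARY, cutoff: float = 0.7):
    """Exact search on the lemma, then a fuzzy fallback for typos."""
    lemma = lemmatize(query, dictionary)
    if lemma in dictionary:
        return dictionary[lemma]
    close = get_close_matches(lemma, list(dictionary), n=1, cutoff=cutoff)
    return dictionary[close[0]] if close else None


if __name__ == "__main__":
    for q in ("omahe", "bukune", "gedhe"):
        entry = lookup(q)
        print(q, "->", entry.headword, "=", entry.translation)
```

Restricting suffix stripping to stems that already appear as headwords keeps the rules conservative, which matters for low-resource languages where no large annotated corpus is available to validate more aggressive rules.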

Speakers

Muh Naufal Muzhaffar
Sunan Kalijaga State Islamic University Yogyakarta