Building Digital Dictionaries for Local Languages in Indonesia with Python NLP

Tematický okruh

AI / ML

Druh

Talk

Úroveň

beginner

Jazyk

English

Místnost

Main Stream

Začátek

2025-12-06T01:50:00Z

Konec

2025-12-06T02:10:00Z

Čas trvání

20 minut

Abstrakt

Learn how to build open-source bilingual dictionaries for Indonesia’s local languages using Python NLP libraries such as spaCy, NLTK, and Transformers. This talk explores practical methods to process low-resource languages, including hybrid rule-based and transformer models. We will discuss how to build dictionary data pipelines and add features like search, synonyms, antonyms, and lemmatization. The session also highlights the role of open-source collaboration in preserving linguistic diversity through technology.

Přednášející

Muh Naufal Muzhaffar
Sunan Kalijaga State Islamic University Yogyakarta