Decodeswitch - Codeswitching language identification

Meng Xia

ML Engineer

GPT-3

Python

What's codeswitching?

I'm glad you asked. The NLP nerd in the past would start a lengthy textbook definition of that. But to make things simple. When people live in places where speaking multiple languages is a common practice, such as in Montreal, Quebec, les gens parlent en francais et anglais en meme temp. This is fine but most Natural Language Processing systems aren't equipped to deal with this, hence they need a codeswitching language identification preprocessor.
Decodeswitch was a codeswitching language detection machine learning model that I did some years ago. It combines subword language model with Conditional Random Field to produce a fast yet sufficiently accurate prediction. You can read more about it in the full webpage below.
Like this project
0

Posted Nov 14, 2022

Codeswitching - Identify the language of each word in a sentence mixed of English and Spanish.

Likes

0

Views

22

Tags

ML Engineer

GPT-3

Python

Lemma - Morphological Parser in Elixir
Lemma - Morphological Parser in Elixir
PokerMemory
PokerMemory
GitHub - Expostal: Geocoder for Elixir
GitHub - Expostal: Geocoder for Elixir
Billing and Payment system for a unicorn startup
Billing and Payment system for a unicorn startup