Decodeswitch - Codeswitching language identification

Meng Xia

What's codeswitching?

I'm glad you asked. The NLP nerd in the past would start a lengthy textbook definition of that. But to make things simple. When people live in places where speaking multiple languages is a common practice, such as in Montreal, Quebec, les gens parlent en francais et anglais en meme temp. This is fine but most Natural Language Processing systems aren't equipped to deal with this, hence they need a codeswitching language identification preprocessor.
Decodeswitch was a codeswitching language detection machine learning model that I did some years ago. It combines subword language model with Conditional Random Field to produce a fast yet sufficiently accurate prediction.
Like this project

Posted Nov 14, 2022

Codeswitching - Identify the language of each word in a sentence mixed of English and Spanish.

Billing and Payment system for a unicorn startup
Billing and Payment system for a unicorn startup
Statically typed PostgreSQL queries in TypeScript
Statically typed PostgreSQL queries in TypeScript

Join 50k+ companies and 1M+ independents

Contra Logo

© 2025 Contra.Work Inc