Marseille, France
Collocated with LREC 2020
The 4th Workshop will be Collocated with LREC 2020 on Saturday May 16th, 2020. Marseille, France

We have centralized many code-switching datasets, including the data from the CALCS series, into a single code-switching benchmark. Please consider using the improved version of the data and the public leaderboards available here: ritual.uh.edu/lince

Attention - Regarding COVID-19

Due to the current situation with COVID 19 LREC and all the associated workshops have been canceled. But we will continue with the review process and we have been told that the proceedings will still be issued. We will be in touch as more information becomes available. In the meantime we expect to send out notifications of the paper submissions as originally planned. We hope everyone stays healthy! Feel free to contact us at codeswitching_workshop@googlegroups.com if you have any questions or concerns.

Workshop Dates and Locations


  • Workshop date: Saturday May 16th, 2020
  • Paper submission: February 28th, 2020
  • Notification of acceptance: March 23rd, 2020
  • Camera ready submission deadline: April 5th, 2020
  • Main Event: https://lrec2020.lrec-conf.org/en/
  • Venue: Le Palais du Pharo
*All deadlines are at 23:59 GMT -08:00

What is Code Switching?

  • Code-switching (CS) is the phenomenon by which multilingual speakers switch back and forth between their common languages in written or spoken communication.
  • CS is typically present on the intersentential, intrasentential (mixing of words from multiple languages in the same utterance) and even morphological (mixing of morphemes) levels.
  • CS presents serious challenges for language technologies such as Parsing, Machine Translation (MT), Automatic Speech Recognition (ASR), information retrieval (IR) and extraction (IE), and semantic processing.
  • Traditional techniques trained for one language quickly break down when there is input mixed in from another language.
  • Even for problems that are considered solved for specific domains and languages, such as language identification, or part of speech tagging, performance degrades at a rate proportional to the amount and level of the mixed-language present.

Updates will be given through the workshop Google group: codeswitching_workshop@googlegroups.com, and the Twitter account: @WCALCS. Direct updates will be sent by email to the participants based on the information provided in the registration form.