Venue to be defined
Computational Approaches to Linguistic Code-Switching, CALCS 2021
Venue to be defined

First Call for Papers


Multilingual speakers will often mix languages when they communicate with other multilingual speakers in what is usually known as code-switching (CSW). CSW is typically present on the intersentential, intrasentential and even morphological levels. CSW presents serious challenges for language technologies such as Machine Translation (MT), Automatic Speech Recognition (ASR), language generation (LG), information retrieval (IR) and extraction (IE), and semantic processing. Traditional techniques trained for one language quickly break down when there is input mixed in from another. Recent work has shown that even powerful multilingual models, such as multilingual BERT, yield subpar performance on CSW data (cf. Aguilar and Solorio, 2020).

Considering the ubiquitous nature of CSW in informal text communication such as newsgroups, tweets, blogs, and other social media, and the number of multilingual speakers worldwide that use these platforms, addressing the challenge of processing CSW data continues to be of great practical value. This workshop aims to bring together researchers interested in technology for mixed language data, in either spoken or written form, and increase community awareness of the different efforts developed to date in this space.

Topics of Interest


The workshop will invite contributions from researchers working in NLP and speech approaches for the analysis and processing of mixed-language data. Topics of relevance to the workshop will include the following:

  1. Development of linguistic resources to support research on code-switched data;
  2. NLP approaches for any of language identification/named entity recognition/sentiment analysis/machine translation/language generation in code-switched data;
  3. NLP techniques for the syntactic analysis of code-switched data;
  4. Domain/dialect/genre adaptation techniques applied to code-switched data processing;
  5. Language modeling approaches to code-switched data processing;
  6. Crowdsourcing approaches for the annotation of code-switched data;
  7. Position papers discussing the challenges of code-switched data to NLP techniques;
  8. Methods for improving ASR in code switched data;
  9. Survey papers of NLP research for code-switched data;
  10. Sociolinguistic and/or sociopragmatic aspects of code-switching.

Important Dates


  • Workshop submission deadline (long, short and special track): March 15th, 2021
  • Notification of acceptance: April 15th, 2021
  • Workshop date: June 11th, 2021

Shared Task


TBD

Invited Speakers


Ozlem Cetinoglu    University of Stuttgart
Ngoc Thang Vu    University of Stuttgart


Additional invited speakers will be added soon.

Program Committee


Gustavo Aguilar    University of Houston
Elena Álvarez Mellado    University of Southern California
Segun Aroyehun    Insituto Politécnico Nacional
Kalika Bali    Microsoft Research India
Astik Biswas    Oracle
Monojit Choudhury    Microsoft Research India
Amitava Das    Wipro AI Lab
Indranil Dutta    Jadavpur University
Alexander Gelbukh    Insituto Politécnico Nacional
Genta Indra Winata    Hong Kong University of Science and Technology
Sudipta Kar    Amazon
Grande Lee    National University of Singapore
Els Lefever    Ghent University
Constantine Lignos    University of Pennsylvania
Yang Liu    Amazon
Manuel Mager    Universität Stuttgart
Parth Patwa    Indian Institute of Information Technology Sri City
Sai Krishna Rallabandi    Carnegie Mellon University
Yihong Theis    Kansas State University
Van Tung Pham    Nanyang Technological University

Organizers


Thamar Solorio (contact person)
Professor
Department of Computer Science
University of Houston
Shuguang Chen (webmaster)
Ph.D. Student
Department of Computer Science
University of Houston
Alan W. Black
Professor
Department of Computer Science
Carnegie Mellon University
Mona Diab
Research Scientist, Facebook AI
Professor, Department of Computer Science
George Washington University
Sunayana Sitaram
Senior Researcher
MSR India
Victor Soto
Applied Scientist
Amazon Alexa AI
Emre Yilmaz
Advanced Computer Scientist
SRI International