The workshop will provide a forum for highlighting current research on multilingual document analysis systems with particular emphasis on OCR. The predecessors to this workshop were held in conjunction with ICDAR1999 in Bangalore, India and ICDAR2009 in Barcelona, Spain and ICDAR 2013 in Washington, DC. A joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data was held in conjunction with ICDAR2011 in Beijing, China. The scope of 'Multilingual OCR' is defined to include systems that are capable of reading more than one language in the same document, as well as one-language-per-document systems that can be easily retargeted to new languages. The proposed workshop will provide a forum for technical discussions on three important themes: i) recent progress in the field and promising new techniques , ii) attempts to identify and address 'hard' open research problems, and iii) performance evaluation of multilingual OCR systems.
The topics that will be addressed by this Workshop are:
Proven Methodologies for OCR: Efficacy of existing methodologies for Latin script to other scripts (HMMs, Neural networks etc.)