Скачать слайды

UTIC-2015. Translation and multilingualism in the era of Big Data and Open Data. Kimmo Rossi

Kimmo Rossi
Translation technology is becoming more and more reliant on large and diverse data assets (translation memories, corpora, language models). At the same time, more data does not always mean better quality. High-quality linguistic data can be a competitive advantage, but who has access to such data? Open Data can be useful for translation, but it is still not widely available in all languages. The presentation will outline the concepts of Big Data, Linked Data and Open Data and show how successful harnessing of these assets can result in better translations, user-friendly multilingual services and more streamlined business processes. Examples on relevant European projects and actions are presented.

Kimmo Rossi graduated 1989 from Helsinki University of Technology (physics, IT, economics). Until 1994 he was co-owner and board member responsible for technology, quality and contractual issues for a Finnish translation and localization company Trantex. In 1994 he joined the European Commission where he worked 5 years in the translation tools and terminology department. Since 2000 he has been managing science, technology and innovation programs in the fields of statistics, data&content management and language technologies. Currently, he leads the Research & Innovation sector of Data Value Chain Unit (CNECT.G.3) managing a portfolio of 150 technology projects.