Language Data Business

Flitto is the world’s biggest language data creation platform. The diverse array of data accumulated through the platform, including text data (corpus), audio data, and image data, goes through Flitto’s thorough quality control process and is used to enhance machine learning engines for our customers around the world. In addition, we can also create and provide new data to our customers in a short period of time that is customized for specific conditions.

Image of language Data Business
Flow image of on the quality assurance of language data

Sample Data

Language Pair Format Encoding Size
English -> Chinese (Simplified) CSV UTF-8 4KB Download
English -> Arabic CSV UTF-8 4KB Download
English -> Indonesian CSV UTF-8 5KB Download
English -> Japanese CSV UTF-8 5KB Download
English -> French CSV UTF-8 5KB Download

* Other formats upon client's request

Flitto's CMS Initiative

Corpus Management System

As a language data company, Flitto has developed solutions that make it a pioneer in the industry.
We provide not only a translation platform, but also a new crowdsourcing-based corpus creation system.
Our greatest strength is our ability to create language data quickly and at reasonable prices through our user-participation-based 'Arcade' service, which makes up one part of our CMS features.

Image of customize business

What is Arcade?

Arcade is a service that allows users to earn points by taking on various language-related tasks,
such as the translation, editing, proofreading, and dictation of text, images, audio, and video.

Examples of Using Arcade

1. On the Web

2. Using the App