Become a Partner/Become a Customer/Contact Us          Login to Language Studio™ Enterprise Online
 
Join the Asia Online
Mailing List
Products & Technology
  • Ground Breaking Linguistic Technology
  • Products
  • Sentence Alignment
  • Word Segmentation
  • SOCRO - Optical Character Recognition (OCR)
Lost & found in translation

Great quotes about language and translation:

"It's a strange world of language in which skating on thin ice can get you into hot water."
-Franklin P. Jones

"Translators live off the differences between languages, all the while working toward eliminating them."
-Edmond Cary

"Learn a new language and get a new soul"
-Czech Proverb

"A different language is a different vision of life"
-Federico Fellini

"Swearing was invented as a compromise between running away and fighting"
-Peter Finley Dunne

"In certain trying circumstances, urgent circumstances, desperate circumstances, profanity furnishes a relief denied even to prayer"
-Mark Twain

Many critics, no defenders,
translators have but two regrets:
when we hit, no one remembers,
when we miss, no one forgets.
-Anonymous

"Common European thought is the fruit of the immense toil of translators. Without translators, Europe would not exist; translators are more important than members of the European Parliament."
-Milan Kundera

"I hope to finish the book before I'm 90. It keeps you alive. The secret of being a translator is not to be in a hurry. Sometimes it takes hours to find a single word."
-Dr. Leonard Rosenman

"There are few efforts more conducive to humility than that of the translator trying to communicate an incommunicable beauty. Yet, unless we do try, something unique and never surpassed will cease to exist except in the libraries of a few inquisitive book lovers."
-Edith Hamilton

"The best translators slip into the glove of a text and then turn it inside out into another language, and the whole thing comes out looking like a brand-new glove again. I'm completely in awe of this skill, since I happen to be both bilingual and a writer, but nevertheless a lousy translator."
-Alma Guillermoprieto

"Language forces us to perceive the world as man presents it to us."
-Julia Penelope

"The quantity of consonants in the English language is constant. If omitted in one place, they turn up in another. When a Bostonian "pahks" his "cah," the lost r's migrate southwest, causing a Texan to "warsh" his car and invest in "erl wells.""
-Author Unknown

"No one means all he says, and yet very few say all they mean, for words are slippery and thought is viscous."
-Henry Brooks Adams

"I personally believe we developed language because of our deep inner need to complain."
-Jane Wagner

"At no time is freedom of speech more precious than when a man hits his thumb with a hammer"
-Marshall Lumsden

"Language is the blood of the soul into which thoughts run and out of which they grow"
-Oliver Wendell Holmes

"If you can speak three languages you're trilingual. If you can speak two languages you're bilingual. If you can speak only one language you're an American"
-Author Unknown

"Dictionaries are like watches; the worst is better than none, and the best cannot be expected to go quite true."
-Samuel Johnson

"Language is the most imperfect and expensive means yet discovered for communicating thought"
-William James

"Language is the means of getting an idea from my brain into yours without surgery"
-Mark Amidon

"He who does not know foreign languages does not know anything about his own"
-Johann Wolfgang von Goethe, Kunst and Alterthum

"I would never use a long word where a short one would answer the purpose. I know there are professors in this country who 'ligate' arteries. Other surgeons only tie them, and it stops the bleeding just as well"
-Oliver Wendell Holmes

"Our language is funny - a fat chance and slim chance are the same thing"
-J. Gustav White

"Whenever ideas fail, men invent words"
-Martin H. Fischer

"Learning preserves the errors of the past, as well as its wisdom. For this reason, dictionaries are public dangers, although they are necessities"
-Alfred North Whitehead

"Every American child should grow up knowing a second language, preferably English"
-Mignon McLaughlin

"But if thought corrupts language, language can also corrupt thought"
-George Orwell

Ground Breaking Linguistic Technology Contact Us Contact Us

The Language Studio™ Suite is a comprehensive translation platform that contains a set of technologies ranging in scope from tools for linguistic analysis, data mining, real-time multilingual chat and language processing tools to fully integrated, automated, high speed translation (millions of words per day).

High quality automated translation is Asia Online’s flagship service. In addition to reducing the cost and time of traditional human translation services dramatically, automated translation enables customers to undertake translation of new content where such translation was not possible previously.

Years of development and research have gone into developing the core Language Studio™ platform that translates more than 500 language pairs. A large number of European, Middle Eastern and Asian languages are supported now, and more than 200 are currently under development.

Many of the technologies that have made Asia Online the quality leader in automated machine translation are available under license. Complex tasks such as word and sentence segmentation, Optical Character Recognition (OCR) and sentence alignment are made simple with the Language Studio™ platform.

The products and technologies listed below are a subset of Asia Online’s comprehensive linguistic technology portfolio. These include:

  • Alignment tools to ensure that parallel data is accurately paired and aligned
  • Translation memory quality assessment and cleanup tools
  • Translation memory normalization and scrubbing tools
  • Data format conversion tools
  • Data cleaning tools
  • Dictionary and glossary management tools
  • Optical Character Recognition – Asia Online is the first to deliver very high accuracy OCR services for a number of complex Asian language scripts.
  • Word and phrase segmentation - Asia Online software automatically segments Asian language words and phrases, which speeds automated spell-checking and indexed text searches

Contact an Asia Online team member today to discuss your technology requirements and how we can best serve your linguistic needs.

Top Top
Products Contact Us Contact Us

Language Studio™ Enterprise is a unique server-based platform that provides a real-time and batch processing translation environment designed to give near-human quality translation output. Language Studio™ Enterprise leverages Asia Online’s iterative human feedback loop to constantly improve the quality of translations.

Features include:

  • High volume commercial grade translation service
  • Fast and reliable. Built to scale to hundreds of thousands of words per minute
  • Flexible billing and account options
  • Multiple document formats
  • Discount rates based on translation volume

Language Studio™ PowerTrain is a server based platform at the heart of every customized translation engine. Language Studio™ PowerTrain studies and analyzes all of the data provided and builds statistical and data based models that enable Language Studio™ Enterprise to deliver near-human quality automated machine translations.

Features include:

  • Create new language pairs and custom domains
  • Improve custom domains using your post edited feedback
  • "brain" constantly learns how to translate better
  • Learns from existing human translation examples

Language Studio™ Enterprise and Language Studio™ PowerTrain are available are available in the following deployment models:

Software as a Service
(On-Demand / Cloud Based)
Asia Online hosts a shared infrastructure with a large server farm. All customer translations are load balanced across the server farm to ensure maximum performance.
Secure Dedicated Hosting Asia Online hosts dedicated servers for customers in its secure data center. Also suitable for location sensitive translation processing such as with Legal Discovery.
Onsite Licenses For customers with very high volumes or high security requirements.

Language Studio™ Pro is the world's first desktop product to tightly integrate the customization process when building a high-quality automated machine translation engine. A comprehensive suite of tools provides all that is necessary to prepare data to be trained with Language Studio™ PowerTrain. Publication quality comes via full customization and Language Studio™ Pro is the only tool suite in the market to provide a complete customization solution.

Features include:

  • A complete translation project management environment for customized translation engines
  • Data preparation and processing environment to build custom domains and custom engines
  • Manages feedback process for translation domain improvement
  • Automation of data cleaning
  • Corpus Crawling
  • Automated bilingual corpus alignment
  • Source content data analysis
  • Corpus quality analysis
    • Bilingual data
    • Monolingual data
  • Ongoing translation quality refinement

Language Studio™ Pro (Lite Edition) is a free desktop tool that allows for easy submission of translation jobs. Hot Drop Folders are also included so that, when a file is simply copied into a folder on your local machine or the network, it is automatically translated. Human and automated quality metric tools are also provided, making Language Studio™ Pro (Lite Edition) the only tool you need to translate and measure quality directly from your desktop.

Features include:

  • Translation Job Submission and Management
  • Editors
    • Spelling Correction Dictionary (DIC)
    • Runtime Glossary (RTG)
    • Non-Translatable Terms (NTT)
    • Post Translation Adjustments (PTA)
  • Provides key metrics to better understand translation quality
  • Human Translation Metrics:
    • Fully customizable
    • Supports LISA QA Model, SAE J2450 and more.
  • Automated Translation Metrics:
    • Bilingual Evaluation Units (BLEU)
    • F-Measure
    • Translation Error Rate (TER)
    • METEOR

Language Studio™ Data Collector is the first desktop tool that enables the rapid customization of automated machine translation. Simply point the tool in the direction of your existing documents, whether translated already or in just one language and the tool quickly gathers the information necessary to provide the best possible translations from the provided data. With the additional of glossaries and other linguistic knowledge, even great quality can be achieved. Language Studio™ Data Collector reduces the time to customize from weeks to days.

Features include:

  • Rapid customization
  • Corpus crawling
  • Data extraction
  • Data gathering
  • Glossary processing

Language Studio™ API is an Application Programmer Interface (API) that allows for direct integration into applications, workflow and translation memory management systems. Support is currently available for Microsoft .NET and JAVA. Language Studio™ API is architected to make translation and integration easy with asynchronous access to translation and other server based functions.

Features include:

  • Connect your applications and workflow directly to Language Studio™ Enterprise and Language Studio™ PowerTrain
  • Available as Microsoft .NET, Java interface.
  • Comprehensive sample code

Language Studio™ Multilingual Chat provides a low cost means to offer multilingual chat rooms with real-time translation between multiple languages. Ideal for dating, sports, news and other websites. With Language Studio™ Multilingual Chat, you website users from around the world can all talk to each other in their own language. Use the built in multilingual chat interface or integrate with your own web or third-party chat or instant messenger application.

Features include:

  • Real-time chat in multiple languages
  • Unlimited number of language pairs in one room
  • Anonymous or registered user support
  • Pre-defined chat rooms
  • User created chat rooms
  • API and Web Interface
  • Customizable look and feel

Language Studio™ Multilingual Customer Support (MCS) is a highly cost effective way to increase engagement with your international customers by providing direct interaction with a live support agent in any of the 500+ language pairs supported by Language Studio™ Enterprise. Language Studio™ MCS allows enterprises to broaden their market reach and increase revenues by engaging people in real time in their preferred language.

Whether it is a simple sales query or a complex support request, Language Studio™ MCS provides access to knowledge base recommendations, standard prepared responses and real-time translations. English-speaking contact center agents can easily engage with non-English speaking customers, improving satisfaction and increasing sales, while enabling faster time to purchase.

Features include:

  • Allow your English-speaking contact center agents to engage with your non-English speaking customers
  • Real-time chat in multiple languages
  • Knowledge base lookup and article recommendations
  • One agent can support multiple customers simultaneously
  • High frequency response database
  • Session transcript logging
  • Open support or linked to internal user database
  • Fully customizable look and feel
  • Customized engines for knowledge base translation can be reused for customer support

Top Top
Sentence Alignment Contact Us Contact Us

In order to build high quality SMT engines, alignment of sentences in paired languages is required. Asia Online has developed a unique set of tools for sentence alignment that is able to align mass volumes of data with high levels of accuracy. Support is provided for Romanized and Asian languages, including Chinese, Japanese, Korean and Hindi.

Many organizations have bilingual information available to them that could be used to help train an SMT engine. However, often the data is not in translation memory format. It is contained in raw documents that have traditionally had only a loose relationship in terms of sentence and paragraph positions.

Asia Online’s sentence alignment technology uses text analysis, length, numeric, punctuation, synonyms, inflections and many other matching criteria to determine which sentence in a source language document corresponds to a translated sentence in a target language document.

Asia Online currently offers alignment as a service to its customers and in the near future will offer these tools online via the Enterprise Translation Portal.

Languages currently supported include:

Asian Languages

Arabic (AR), Bahasa Malay (MS), Bahasa Indonesia (ID), Bengali (BN), Chinese (ZH), Gujarati (GU), Hindi (HI), Japanese (JA), Korean (KO), Punjabi (PA), Tagalog (TL), Tamil (TM), Thai (TH) and Vietnamese (VI).

European Languages

Bulgarian (BG), Czech (CS), Danish (DA), Dutch (NL), English (EN), Estonian (ET), Finnish (FI), French (FR), German (DE), Greek (EL), Hebrew (HE), Hungarian (HU), Irish (GA), Italian (IT), Latvian (LV), Lithuanian (LT), Maltese (MT), Polish (PL), Portuguese (PT), Romanian (RO), Russian (RU), Slovak (SK), Slovene (SL), Spanish (ES) and Swedish (SV).

Text in all supported languages can be aligned with text in any other supported language. The Asia Online sentence alignment tool has been designed with flexibility and productivity as key features. New languages can be added quickly within 5-10 days. If the language you require is not listed above, talk to us about adding support for your desired language.

While it is very difficult to obtain 100% alignment accuracy due to the nature of how humans translate from one language to another, our sentence alignment engine can be tuned to the specific linguistic characteristics of each language. This ensures that a very high level of accuracy is achieved. Human validation of a recent project which aligned 500,000 documents determined an error margin of just 1.8%.

Asia Online is working with many of its customers on alignment projects ranging from a few hundred marketing documents to more than 10 million translated patents.


Contact Asia Online now for a FREE data quality and SMT data suitability assessment  

Our professional linguists will analyze your data and explain what work is required to make it suitable for use as training material to produce a high quality SMT engine.

Top Top
Word Segmentation Contact Us Contact Us

Many Asian languages are written without spaces between the words and sentences. Incorrect word or sentence segmentation can completely distort the output and change the resulting meaning entirely. Asia Online has developed a set of tools that determines where best to segment words to ensure the correct meaning. These tools are specifically designed to work with complex Asian scripts and symbols.

The importance of proper word segmentation is paramount in Asian languages such as Japanese, Korean, Chinese and Thai where word boundaries are not clear in standard text. Before any words can be processed, the first task of any information processing system is to segment the initial text into a sequence of words. Accuracy is paramount. The Thai examples below show how segmenting in different places can change the meaning of a sentence completely.

Many traditional approaches use dictionaries, but this does not go far enough. The examples above are all valid using dictionary. Asia Online takes the approach of evaluating all possible segmentation paths through the data and then using n-grams (patterns of word sequences) to score each path, with the most statistically correct path selected. This approach allows for spelling errors, unknown words and mixed language text to be incorporated and handled correctly.

Language Supported:
Chinese (ZH), Japanese (JA), Korean (KO) and Thai (TH)
Top Top
SOCRO - Optical Character Recognition (OCR) Contact Us Contact Us

SOCRO (Smart OCR Optimization) is an OCR technology designed to learn and correct patterns of OCR errors and constantly improve. Created to address the problem of poor OCR capability for Asian languages, SOCRO can deliver OCR quality in trained fonts on Asian languages with accuracy rates as high as 99.6%.

As Asia Online developed SMT tools for Asian languages, it became apparent that Asian language OCR left a lot to be desired when compared to OCR of European languages. Asia Online’s own need for Thai language OCR was a key driver. When determining what tools were available, the best Thai OCR we could find achieved a result of around 70% accuracy. As Thai is an alphabet, 70% accuracy meant that nearly every word had a spelling error making the results almost useless for SMT purposes.

SOCRO was developed as a generic OCR solution that can work with Asia language scripts and characters and allows for fine tuning and adjustments. SOCRO can be adapted to almost any language. With SOCRO, document management vendors can generate Thai language OCR text with accuracy that rivals – and sometimes surpasses – the world’s best English language OCR software.

Language Supported:
Available Now: Thai (TH)
Under Development: Arabic (AR), Chinese (ZH), Hindi (HI), Japanese (JA) and Korean (KO)

Contact Asia Online to discuss adding SOCRO support for your language.

Top Top
The World Speaks One Language - Yours™
Home/ LSPs/ Enterprises & Publishers/ Translation/ Portals/ Solutions/ Products & Technology/ Partners/ Languages/ Resources/ News/ About Us/ Contact Us/ Mailing List