Become a Partner/Become a Customer/Contact Us          Enterprise Translation Portal
 
Join the Asia Online
Mailing List
Lost & found in translation
Word-sense disambiguation  is a major challenge for any machine translation platform and often even humans. Can the story below be translated?

To help with planting on the farm next to the power plant, the farmer taught his sow to sow so that he could produce more produce. A gust of wind blew the farmer over into the sewer where he hit a sharp object. He did not object to the object itself, but the wound from his fall wound up making him an invalid and his insurance was invalid. He did not think they would wind up seeing his new wound objectively. After a number of injections his wound got number, but he shed a tear when he saw the tear in his favorite shirt. He would have to subject himself to being the subject of a series of medical tests when got out of the sewer and as the sewer was wet and cold he was sure to get a cold. If I get ill, I’ll be in real trouble he thought.

The dump in the town was so full that it had to refuse more refuse so after being refused the locals had been dumping their refuse against the door leading to the sewer. He knew he had to close the door so he could pass into the corridor, but he could not get close enough to it because there was some old Polish furniture blocking him.

He had long wondered how he could intimate his feelings this to his most intimate friend the seamstress. With a little spit and polish the Polish furniture would make a fantastic present. To his surprise he discovered that the seamstress and the sewer had also fallen into the sewer, along with the oarsmen who had fallen while having a row about how to row. Since there is no time like the present, he thought it was time to present the present to the seamstress. It sent shivers down his spine as he was going to book a restaurant and give her his favorite book, but the spine was too damaged and he would save a few bucks with the polished Polish furniture which was much nicer than the book from Nice given to him by his niece. The buck does funny things when the does are present however and he thought if the others had not been present how he would love to make love with abandon with the seamstress right now under the abandoned bass drum with the bass painted on it.

His head was pounding like it was being squeezed in a vice, and he did not want to wait for the local vice squad to rescue them. He knew that if he could get the lead out he could lead them all out through the desert, but in order to do so he would have to dessert his desert. As they exited the sewer it was a great relief to see a dove that dove into the bushes. He didn’t want to relive that experience again. If only he could wind back time and had been more careful of the wind, he would have wound up being home to see the last over in cricket before the game was over.

Frequently Asked Questions
uestions Click on a question to display the answer

Translation
  • I have seen machine translation before and the quality has always been very disappointing. What makes Asia Online's translation platform any different or better?
  • What languages does Asia Online support?
  • Can you build a new language pair for me?
  • Where can I find more information about Machine Translation?
  • How does Asia Online charge for its translation services?
  • What is a Baseline Engine?
  • The text I wish to Translate refers to a specific topic, such as automotive engineering, or is written in a very specific writing style, how can Asia Online cope with this type of translation?
  • What computing resources or infrastructure do I need to avail of the Asia Online Translation Service?
  • My data is sensitive. How do I ensure that my data is secure?
  • My data is mission critical and cannot leave my secure environment. How can use the Asia Online translation technology when I can not give you the data?
  • How do I measure translation quality?
  • I have been told that you need a large amount of multilingual data to be able to build a custom domain. How much data do I need?
Portal
  • How do I advertise on the Asia Online portal?
  • How do I work with Asia Online to include my content on the Asia Online portal?
Technology
  • Does Asia Online license its technology to third parties for inclusion in their products?
  • Does Asia Online allow educational or non-profit use of its technologies?
Company
  • What is the background of the company?
  • What jobs are available at Asia Online?
  • I am interested in investing in Asia Online, how to I learn more?
Top Top
nswers - Translation
I have seen machine translation before and the quality has always been very disappointing. What makes Asia Online's translation platform any different or better?

Our goal is publication quality output, not gist quality where only the basic meaning can be understood. With the Asia Online tools, this can be achieved.

Traditionally machine translation used a technique called Rule Based Machine Translation (RBMT). RBMT systems have the rules of a language coded by programmers. They have limited flexability and are difficult to customize and improve. Stylistic control is not easily possible and each language pair must be built independently of other language pairs.

Asia Online uses a technique called Statistical Machine Translation (SMT). SMT translation platforms use parrallel corpora (original text with a matching translation in the target language) to learn statistically which phrase or partial phrase is matches the other. The parallel corpora is analyzed and the statistical result of the analysis is used to determine new translations. With data available a new language pair can be built very quickly. This has enabled Asia Online to develop over 500 language pairs. In addition to the ease of developing a new language pair, with the right parallel corpora, SMT can give far higher translation quality than RBMT.

Asia Online also differentiates itself from others who have created SMT platforms by using only very clean parallel corpora. Others such as Google scrape the Internet and other content sources to get their corpora. Relying on public and uncontrolled sources for corpora means that there is a significant amount of dirty data in the parallel corpora. This results in lower quality. Asia Online's language pairs have far less parallel corpora than Google, but the data is clean, so when an error occurs, it is due to gaps in the data. Data can be submitted to fill the gaps when they present themselves. With dirty data, many thousands of examples would be needed to override the dirty statistics. With Asia Online between 5-20 examples usually fixes a problem and it will then not occur again.

Contact Asia Online for a free demonstration  and see the quality difference for yourself.

What languages does Asia Online support?
Languages are supported by machine translation platforms in pairs, for example English into Chinese. Asia Online supports over 500 language pairs. See our list of language pairs currently supported and language pairs currently under development
Can you build a new language pair for me?
Asia Online has developed tools that enable the rapid development of a new language pair or a customized domain within an existing language pair. Depending on the languages involved, a new language pair or domain can be developed in a little as 2-3 weeks. Contact Asia Online for more information 
Where can I find more information about machine translation?
The Asia Online resources page has a large number of links to information about machine translation, articles and industry groups. Asia Online is also sponsoring a new website for businesses that are trying to understand how machine translation can fit their needs better. http://www.translationforbusiness.com is a site that aims to explain how companies can best utilize machine translation and provides a community group to discuss issues in the industry, best practice and more.
How does Asia Online charge for its translation services?

Asia Online provides a range of charging models to suit a variety of situations. The Asia Online Enterprise Translation Portal offers a solution for small-mid size organizations where no infrastructure is required and all translation is performed using the web portal. Using this model, Asia Online charges based on volume of translation and language pairs. There are a number of charging models to choose from.

Asia Online also licenses its software to large organizations that can support the infrastructure required to run a translation platform. Asia Online's translation platform is designed to scale to millions of words per day. The smallest implimentation requires 2 servers, with the ability to scale to thousands of servers. Asia Online provides a variety of onsite server licenses depending on customer requirements.

What is a Baseline Engine?

For each language pair Asia Online has developed a Baseline Engine. Baseline is the translation engine before it has been customized to focus on specific domains of knowledge. This is for general translation, but ideally you would add your custom data to tune the engine to your specific requirements. A baseline will typically translate general text such as news reasonably well, but would not have the same level of quality if the text was specific to a domain such as football or medical.

The text I wish to Translate refers to a specific topic, such as automotive engineering, or is written in a very specific writing style, how can Asia Online cope with this type of translation?

The topic of the text or writing style is called the Domain. The specific domain that you wish to translate shall be developed and layered on top of the baseline. Many domains are currently under development such as politics and IT, but others may need to be developed specifically to meet customer needs and to get the highest levels of quality possible.

The tuning for a specific domain of knowledge requires gathering additional data within that domain so that the knowledge in the engine is more focused and biased towards a particular subject. A domain may also be focused on the customers own preferred terminology.

What computing resources or infrastructure do I need to avail of the Asia Online Translation Service?

Using the Asia Online Enterprise Translation Portal is the easiest means. This requires no special infrastructure at the client site. The Portal is accessed over the Internet through any common browser. Charges are based on a use model, or a monthly model depending on needs and give access to all available language pairs.

For large scale translation projects, the Asia Online translation environment can be installed at a customer’s site. This requires a minimum of 2 PCs to do translation. The performance will typically be many thousands of words per minute. Pricing for these licenses varies depending on the number of language pairs and domains. There are a number of charging models to choose from.

My data is sensitive. How do I ensure that my data is secure?

This question falls into two categories:

1. Data being translated - Using the Asia Online Enterprise Translation Portal, data is encrypted using SSL between client and the Asia Online servers. No data is kept by Asia Online after translation and all data is secure at all times. Alternatively the Asia Online translation environment can be installed at a customer’s site so the customers data never leaves its own work environment. Both options provide runtime security at differing levels based on customer needs.

2. Training Data – the data used to create an engine or domain. Training is very CPU intensive and occupies a number of high end servers for multiple weeks where the training data is analysed and statistics extracted. This is not required for most customers. In the training environment the number and power of servers determines how fast you can train. Typically an 8 CPU machine with 32GB RAM can take a number of days to train and tune and engine built from 10 million sentences. As this is very complex, we typically do this in house for our customers. When done in house, every measure is taken to ensure customer data is secure. The resulting engine or domain data is then used in the customer’s engine and no other company or users have access to the data or engine.

Asia Online provides a range of charging models to suit a variety of situations. The Asia Online Enterprise Translation Portal offers a solution for small-mid size organizations where no infrastructure is required and all translation is performed using the web portal. Using this model, Asia Online charges based on volume of translation and language pairs. There are a number of charging models to choose from.

Asia Online also licenses its software to large organizations that can support the infrastructure required to run a translation platform. Asia Online's translation platform is designed to scale to millions of words per day. The smallest implimentation requires 2 servers, with the ability to scale to thousands of servers. Asia Online provides a variety of onsite server licenses depending on customer requirements.

My data is mission critical and cannot leave my secure environment. How can use the Asia Online translation technology when I can not give you the data?

In cases such as government or military where the training data may be highly sensitive, training can be done on the customer’s site with the purchase of special server licenses. Contact Asia Online for more information 

How do I measure translation quality?

Quality is always subjective and there are many ways to measure it. There is seldom agreement on one absolute measure. The most accurate quality measurements will typically be produced when humans are given a clear set of measurement guidelines. However, even this can be inaccurate at times. Consider the following: "It rained on Monday" vs. "On Monday it rained", or "The speedo was maxxed into the red line" vs. "The speedometer needle was hard against the red line". Individual preferences for linguistic style would score these variations differently.

Using humans to measure quality is usually slow and expensive. There are also many different human assessment metrics, such as scoring each sentence on a defined quality scale of 1 to 5 or similar approaches. But what is usually more important than measuring some absolute quality is understanding quality in the SMT translation engine process. It is useful to measure the improvement in quality over time, as data configurations are varied or measuring the difference between automated translations using different translation platforms and tools.

To facilitate the rapid evolution and evaluation of potential translation engines, some automated quality measurement tools have been developed. There are a number of different metrics. BLEU, F-Measure and Translation Error Rate (TER) are all useful indicators of quality. Asia Online has developed a set of free downloadable tools and utilities to help you to measure quality of automated translation engines. Each of the metrics uses a simple basic concept - have an original document translated by a human as a "gold standard" or reference for comparison. Then translate the same original document using the software and compare the automated translation output to the human reference. The Asia Online tools and the BLEU, F-Measure and TER measurements can be used to compare multiple different sources of translated output, which can help in understanding the best translation platform or software for your documents.

I have been told that you need a large amount of multilingual data to be able to build a custom domain. How much data do I need?

To build a custom domain, typically the more high quality data you have the higher the quality of translated output. Traditionally this meant you needed 400,000 or more multilingul segments. Because of the extensive foundation work that Asia Online has performed on our baseline engines and our data optimization and generation capabilities, we have been able to build high quality custom domains on as little as 20,000 multilingual segments. We have also developed a number of custom domains when there was no data available from customers by applying our data generation technologies.

Contact Asia Online  for a more detailed discussion on your custom domain requirments, custom domain case studies and a demonstration of some of the existing custom domains.

Top Top
nswers - Portal
How do I advertise on the Asia Online portal?
Asia Online provides competitive advertising rates and using IAB industry standard advertising sizes. More information can be found on our Portal Advertising page.
How do I work with Asia Online to include my content on the Asia Online portal?

Asia Online is always interested in talking to content partners about creative ways to monetize their content in a variety of Asian markets.

Contact Asia Online  to discuss your unique content and requirements with an Asia Online representative.

Top Top
nswers - Technology
Does Asia Online license its technology to third parties for inclusion in their products?
Asia Online is licensing its technology to a number of third parties for integration into their products. Contact Asia Online  to discuss your unique requirements with an Asia Online representative.
Does Asia Online allow educational or non-profit use of its technologies?

Asia Online encourages growth and education. We work with a number of educational and non-profit organizations. Contact Asia Online  to discuss your requirements with an Asia Online representative. In most cases, Asia Online will be happy to contribute and help.

Top Top
nswers - Company
What is the background of the company?
Everything you want to know about company and our background can be found on the About Us page.
What jobs are available at Asia Online?

The latest jobs are always listed on our careers  page.

I am interested in investing in Asia Online, how to I learn more?

Asia Online is currently funded by its founders, angels and series A and B institutional investors Japan Asia Investment Corporation (JAIC) and T-One Capital. For further investment information, please contact Asia Online  and an Asia Online representative will discuss your interest in more detail.

Top Top
The World Speaks One Language - Yours
Home/ Portal/ Translation/ Solutions/ Technology/ Tools & Downloads/ Resources/ News/ FAQ/ About Us/ Blog/ Contact Us/ Join Mailing List