How to make Translation easier with Machines, by Hao Zong, Global Tone Communication Technology...

Post on 15-Apr-2017


How to make translation easier with machines

HELLO!

Zong Hao (Peter), Natural Language Processing Director from Global Tone Communication Technology Co., Ltd.

zonghao@gtcom.com.cn

1. About GTC (中译语通)

Our History

• Established in 1973, the first and largest LSP in China

Our Resources

• 100+ languages for human translation

• 32 languages for machine translation

• Billions of professional corpora

Core Technologies

• Machine Translation

• Natural Language Processing

• ASR & TTS

• Semantic Search

• Deep Learning

2. Our Platforms

Cross-Language Big Data Analytical Platform

Global Language Service Platform

Open Translation Toolkit

2.1. YeeCloud

YeeCloud™ is an integrated service ecosystem based on cloud computing and language big data.

It provides smart language services for Internet users, commercial clients, professional translators and LSPs.

2.2. YeeKit

Multi-Language Machine Translation

• 32 Languages

• High Quality and No Latency

• Incremental Learning

Computer-Assisted Translation

• Free Cloud-Based Platform

• Cooperative Work and Process Management

• Online Translation Memory

YeeCaption Video Translation

• Real-time Subtitle Translation

• Automatic Search and Match

• Automatic Speech Recognition

Webpage Localization

• Browser Plug-in

• 11 Browsers Supported

• Viewing Foreign Webpages in Native Languages

Open Language Assets

• Billions of Corpora

• Cloud Synchronization

• Big Data Sharing

Developers Platform

• Open API

• Support for User Integration & Customized Development

2.3. YeeSight

Cross-Language Big Data Analytical Platform

1. Integrate, secure and analyze cross-language data at a massive scale

2. Enable users to interact smoothly with data and make better decisions

3. An intelligent engine combining machine translation and semantic search

• Analysis of the trends in social network data

• Intelligent analysis and forecast of market trends

• Analysis of economic and financial statistics

What is translation?

Social Science: from language A to language B.

Neural Science: expressing the same thing with a different interpretation.

Origin: Two men who recently spent five days on a coral island wished they had stayed there longer. They were taking a badly damaged boat from the Virgin Islands to Miami to have it repaired.

Translation: 最近有两个人,在珊瑚岛上度过了 5 天时间,而且想再呆得时间长一点。他们驾着一艘严重损坏的小船从维尔京群岛出发去迈阿密港修理。

Translation Market

• The top 100 firms range from $427M down to $4M, according to studies by Common Sense Advisory.

• $33.5 billion in 2012, and $37 billion in 2018.

  − The United States represents the largest single market for translation services.

  − Europe is a close second and Asia is the largest growth area.

• The translation industry is expected to grow by 42% between 2010 and 2020.

Big Growth

Globalization

Machine Translation

Computer Assisted Translation

Translate with machines

Bad Semantics

Just like combining the results of looking words up in a dictionary.

Poor Grammar

The grammar of machine translation is not stable; it seems good when the text being translated matches its training data.

Not Smooth

Not readable, and hard to understand.
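To make the "Bad Semantics" point concrete, here is a minimal sketch of what combining dictionary lookups amounts to. The glossary entries and the function name `word_by_word` are assumptions made for illustration; they do not describe any real machine translation engine.

```python
# Toy English-to-Chinese glossary. The entries are illustrative assumptions.
TOY_DICTIONARY = {
    "two": "两",
    "men": "男人",
    "spent": "花费",
    "five": "五",
    "days": "天",
    "on": "在",
    "a": "一个",
    "coral": "珊瑚",
    "island": "岛",
}


def word_by_word(sentence, dictionary):
    """Translate each token in isolation; unknown words are kept unchanged."""
    tokens = [tok.strip(".,") for tok in sentence.lower().split()]
    return [dictionary.get(tok, tok) for tok in tokens if tok]


if __name__ == "__main__":
    source = "Two men spent five days on a coral island."
    print(" ".join(word_by_word(source, TOY_DICTIONARY)))
```

Because every word is looked up on its own, word order, measure words and context are ignored, which is why such output reads like a string of dictionary entries rather than a sentence.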

So is machine translation basically crap?

Applications using machine translation

Origin: Two men who recently spent five days on a coral island wished they had stayed there longer. They were taking a badly damaged boat from the Virgin Islands to Miami to have it repaired.

Translation: 最近有两个人,在珊瑚岛上度过了 5 天时间,而且想再呆得时间长一点。他们驾着一艘严重损坏的小船从维尔京群岛出发去迈阿密港修理。

Machine Translation is Not Reliable

Long & Complicated | Middle | Short & Easy

Are Translators Reliable?

• Translation Memory

• Terminology Memory

Translation Network

our office

1,000,000,000 Translation memory

50%~100% Preferable matching rate

0.20 second

With Computer Assisted Translation System

Anyone Anywhere Anytime

Let’s review what CAT can do

Translation memory

Match the text being translated with appropriate translation memory.

Terminology memory

Gives the translation of pre-defined terminology.

Machine translation

Integrated with machine translation.

Dictionary

Get explanations of rare words.

Crowdsourcing

Split a large project into small pieces.

Others

Smart input method. Auto progress reminder. Teamwork.
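The translation memory lookup described above can be sketched as a fuzzy match with a minimum matching rate. This is only an illustration under stated assumptions: the in-memory dictionary, the function names `match_rate` and `lookup`, and the use of Python's `difflib.SequenceMatcher` as the similarity measure are all choices made here, not a description of how YeeKit computes its matching rate. The 0.5 threshold mirrors the 50%~100% matching rate mentioned in the slides, and the stored pair reuses the example sentences from this deck.

```python
from difflib import SequenceMatcher

# Hypothetical translation memory: source segment -> stored translation.
TRANSLATION_MEMORY = {
    "Two men who recently spent five days on a coral island wished they had stayed there longer.":
        "最近有两个人,在珊瑚岛上度过了 5 天时间,而且想再呆得时间长一点。",
    "They were taking a badly damaged boat from the Virgin Islands to Miami to have it repaired.":
        "他们驾着一艘严重损坏的小船从维尔京群岛出发去迈阿密港修理。",
}


def match_rate(a, b):
    """Similarity between two segments, from 0.0 (no overlap) to 1.0 (identical)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def lookup(segment, memory, threshold=0.5):
    """Return (rate, source, translation) for the best entry at or above the threshold, else None."""
    best = None
    for source, target in memory.items():
        rate = match_rate(segment, source)
        if rate >= threshold and (best is None or rate > best[0]):
            best = (rate, source, target)
    return best


if __name__ == "__main__":
    query = "They were taking a damaged boat from the Virgin Islands to Miami to have it repaired."
    hit = lookup(query, TRANSLATION_MEMORY)
    if hit is not None:
        rate, source, target = hit
        print(f"{rate:.0%} match: {target}")
```

A linear scan like this is fine for a handful of segments; a memory with a billion entries would need an index in front of the similarity check, which is outside the scope of this sketch.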

Android Project

Use your Android phone to do translation

iPhone Project

Use your iPhone to do translation

Tablet Project

Use your tablet to do translation

Desktop Project

Use your desktop to do translation

Not enough!

Voice Translation

• Convert the voice to text

• Translate the text to the target language

Image Translation

• Recognize the text

• Translate the text to the target language

• Reconstruct the image with the corresponding translation

Subtitle Translation

• Convert the voice to text with a timeline.

• Translate the voice's text to the target language.

• Add the translation to the original subtitle.
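The three subtitle steps above can be sketched as follows. The speech recognition step is assumed to have already produced timed cues, and `placeholder_translate` is a stand-in for a real machine translation call; the `Cue` class and all function names are hypothetical and are not part of YeeCaption. The output uses the common SubRip (.srt) layout as an example target format.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    start: float  # seconds from the beginning of the video
    end: float
    text: str


def add_translations(cues, translate):
    """Append the translation under the original text, keeping each cue's timing."""
    return [Cue(c.start, c.end, f"{c.text}\n{translate(c.text)}") for c in cues]


def to_srt_time(seconds):
    """Format seconds as HH:MM:SS,mmm, the timestamp style used in .srt files."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(cues):
    """Serialize cues as numbered SubRip blocks separated by blank lines."""
    blocks = [
        f"{i}\n{to_srt_time(c.start)} --> {to_srt_time(c.end)}\n{c.text}"
        for i, c in enumerate(cues, start=1)
    ]
    return "\n\n".join(blocks) + "\n"


def placeholder_translate(text):
    """Stand-in for a machine translation call (assumption, not a real API)."""
    return f"[translated] {text}"


if __name__ == "__main__":
    recognized = [
        Cue(0.0, 2.5, "Two men spent five days on a coral island."),
        Cue(2.5, 5.0, "They wished they had stayed there longer."),
    ]
    print(to_srt(add_translations(recognized, placeholder_translate)))
```

Keeping the timing on the cue and only extending its text means the translated line always appears on screen together with the original one.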

Web Translation

www.yeecloud.com

www.yeekit.com

Thanks!

www.yeecloud.com

www.yeekit.com