tonybaldwin | blog

non compos mentis

Posts Tagged ‘machine translation’

exorcising bad translations

This, my friends, is why Professional Translators are still a necessity.

Il Foglio, an Italian newspaper, has come out criticizing the NY Times, which (OMGSTFUBBQ…can’t believe they did this!) used a computer-generated translation of an article regarding the Vatican’s response to sexual abuse complaints.

The failure to translate led the American newspaper to argue that Cardinal Joseph Ratzinger was protecting a sexually abusive priest from Milwaukee.

The article, titled “New York Times does not translate,” starts by saying, “New York Times columnist Maureen Dowd returned to attack the Pope. Commenting on the words of exorcist Gabriele Amorth, who said that behind pedophile priests is the devil, Dowd suggested a way for the Catholic church to solve the problem: hire a ‘sexorcist.'” 1

Learn from this, kiddies.
When the text is important, neither Google Translate, nor Yahoo! BabelFish is truly your friend.

Go to Proz.com and find a real, professional translator.
Of course, if your text requires translation from any of French, Portuguese or Spanish to American English, I’ve got you covered, right here.

tony


posted with Xpostulate

Written by tonybaldwin

April 13, 2010 at 8:20 pm

Microsoft gives up on Machine Translation

Microsoft avoids being lost in translation with new framework

The Microsoft Translator team has given up and concluded that “no matter how many machines you throw at translation, it is still impossible to get the correct, error-free, contextually accurate translation every time.” Microsoft’s solution to this problem is the Collaborative Translations Framework, which supposedly combines the scale and speed of automatic machine translation with the accuracy and context awareness of human translation.

For the record, for once, I agree with Microsoft.

Written by tonybaldwin

March 18, 2010 at 6:37 am

fun with sp@m comments…(clear examples of obvious use of machine translations)

engrish.com

I wash it and pray...engrish.com

I get a lot of spam comments.

People drop in to leave links to online pharmaceutical sales sites, lonely wives dating, co-ed cam shows, get rich quick schemes…It’s all really annoying, er, amusing.

Nonetheless, I have chosen to allow a few through just this past 24 hours, and would like to draw your attention to them.

This comment, for instance, is rather amusing. The poster posts a link to some site advertising the sale of some online translator software. It is obvious that the poster used said software.
His comment?

“Greetings, naturally i solely thought i publish and let you be aware of your site layout is literally definitely nice”

I enjoyed that.

Here is another highly amusing comment.

I’m actually going to play right into this poster’s game, sort of, and encourage you to click through to his “blog” (it will open in a new tab/window if you click that link). This poster apparently copied articles from relevant news sources about the current health care debate in the US into his “blog,” and, in between those articles, sprinkled original posts advertising acne control medications.

The reason said blog is so much fun is that it is patently obvious that the posts are the product of machine translation, and they are absolutely HILARIOUS!

Here is an example:

The well-wishing of acne mutilation with which you are afflicted velleity lay down the law the well-intentioned of acne spoil output you mould want and testament use. In most cases, selecting the most owing acne cicatrix artifact in behalf of your outer layer becomes uneasy because you may establish a combination of unlike kinds of acne burn in your body. So it is sick you misappropriate possession of point to about yourself to affirm the the uniform that commitment be seemly in state of your acne scar. This reason, this article comes in jolly masterly as it commitment book you on how to receive rid of acne waste virgin fast.

If comments like these continue to come in, I will continue to share them. They are evidence to reassure me and my translator colleagues: at no time in the near future will we be supplanted by machine translation.

./tony

Written by tonybaldwin

March 13, 2010 at 9:31 am

This technology can make the language barrier is gone

Just for grins…

Engrish Mastars

English Mastary made simple....

First, let me state, for the millionth time, that I ❤ GOOGLE!

I use tons o’ google stuff…google search, gmail, google calendar (lifesaver!), google reader, google code, google groups, google plumbing, you name it…Google’s got it, I’m using it. So, I’m not doing this to pick on Google. Even so, a guy has to protect his own interests, no? So, in the interest of demonstrating precisely why even the great Google will not supplant professional, human translators, I took yesterday’s NYTimes article on Google Translate, and ran it through Google Translate. First, I translated it to French, then to Spanish, then back to English.

Now, I have to confess, the result is not unintelligible. Most readers will be able to make some coherent sense of most of the resulting text. Nonetheless, there will be confusion (and laughter). Now, imagine, if you will, the potential confusion, and quite possibly rather dire consequences, were this method of translation used for, say, the instructions on your medication, international treaties, safety regulations, medical device instruction manuals, and a whole smattering of other complex textual materials of real significance.

There’s going to be confusion

That, folks, is why I still have a job.

And now, for your reading pleasure, the resultant text:


MOUNTAIN VIEW, Calif. – In a meeting with Google in 2004, the discussion focused on an e-mail the company had received from a fan in South Korea. Sergey Brin, one of the founders of Google, ran the message through an automatic translation service that the company had a license.

The message says that Google is a search engine of your choice, but the result is as follows: “The footwear of sliced raw fish you want. Google the green onion!”

Mr. Brin said Google should be able to do better. Six years later, its free Google Translate supports 52 languages, more than any other similar system, and use hundreds of millions of times a week to translate web pages and other texts.

“What you see on Google Translate is the state of the art in computer translation is not limited to a particular area,” said Alon Lavie, research associate professor in the Language Technologies Institute at Carnegie Mellon University.

Google’s efforts to expand beyond Web search has been uneven. Your digital book project, was hanged in the courtyard, and the introduction of its social network, Buzz, has raised fears of intimacy. The model suggests that this can sometimes stumble when it comes to challenge the traditions and conventions of cultural enterprise.

However, Google’s rapid growth to higher levels of translation is a reminder of what can happen when Google releases its power of brute force calculation of complex problems.

The network of data centers built to search the web, now, when united, the biggest team in the world. Google uses this machine to push the limits of translation technology. Last month, for example, said he was working to combine your translation tool with image analysis, allowing a person, for example, taking a photo of a German phone menu and get the machine translation into English.

“Machine translation is one of the best examples that demonstrates the vision of Google, said Tim O’Reilly, founder and CEO of tech publisher O’Reilly Media.” This is not something that someone no one takes seriously. However, Google understands something about the data that nobody understands and is willing to make the investments needed to address these types of complex problems ahead of the market. “

Creating a machine translation has been considered one of the toughest challenges in artificial intelligence. For decades, scientists tried using a team approach standards – teaching language regime of both languages and dictionaries give necessary.

But in half of the 1990s, researchers began to promote a statistical approach. They found that if they feed thousands or millions of computers and their human translations generated parts, you can learn to make assumptions about the exact form to translate new texts.

It turns out that this technique, which requires huge amounts of data and lots of computing power, Google has increased.

“Our infrastructure is well suited to this” Vic Gundotra, Google engineering vice president, said. “We can not adopt approaches that others can only dream.

Machine translation systems are far from perfect, and even Google’s human translators will not work soon. Experts say it is extremely difficult for a team to break a sentence into two parts, and then bring them back.

But the Google service is good enough to convey the essence of a news article, and became a source for quick translations for millions of people. “If you need a rough-and-ready translation is the place to go,” said Philip Resnik, an expert in machine translation and associate professor of linguistics at the University of Maryland, College Park.

Like its competitors in the field, including Microsoft and IBM, Google has promoted its translation engine transcripts of the United Nations, which are translated by the man in six languages, and the European Parliament, which resulted in 23 . This material is used to form systems most commonly used languages.

However, Google has traveled the Web text, and data from their project to digitize books and other sources to go beyond these languages. For more obscure languages, published a guide to help users with translations, then add the text in its database.

Offer Google could make a big hole in the translation business sale software companies like IBM, but machine translation is not likely to be a great Moneymaker, at least not by the standards of advertising google. But Google’s efforts could bear fruit in several ways.

Because the ads are online everywhere, while making it easier for people to use the Web to benefit society. And the system could have interesting applications. Last week, the company said that using speech recognition to generate English language subtitles for videos from YouTube, which could then be translated into 50 languages.

This technology can make the language barrier is gone,” said Franz Och, Google’s chief scientist who heads the team of the automatic translation company. This would allow anyone to communicate with anyone else. “

Mr. Och, a German researcher who previously worked at the University of Southern California, said he was reluctant to join Google, fearing that it would be the translation as a side project. Larry Page, Google’s other founder, called to reassure him.

“I just said is something that is very important to Google,” he recalled recently by Mr. Och. Mr. Och signed in 2004 and quickly was able to bring the promise of Mr. Page in the test.

While many translation systems such as using Google for one billion words of text to create a model of a language, Google has gone much more: hundreds of billions of few words in English. “The models are getting better the process rather than text,” said Och.

The effort was worth it. A year later, Google has won a competition run by the government that proof of sophisticated translation systems.

Google has used a similar approach – computing power, mounds of data and statistics – to address other complex issues. In 2007, for example, began offering 800-GOOG-411, directory assistance calls free interpretation of spoken. It has allowed Google to get the votes of millions of people who do better in the English speech recognition.

A year later, Google launched a search for the voice system that was as good as the other companies that have taken years to build.

And last year, Google launched a service called glasses, which analyzes the image of the phone, which is an online database of more than one billion images, including pictures of her taken to the streets Street View service.

Mr. Och has acknowledged that the Google translation still needs improvement, but he said he feels better quickly. “The curve of the current quality improvement is still very strong,” he said.

http://www.nytimes.com/2010/03/09/technology/09translate.html

This article was translated by Google, the English, then French, Spanish, then back to English.

TRANSLATORS domain of man!*

🙂


Tony

*(this phrase was “Human Translators Rule!!” prior to the above treatment)

Just for fun, I ran that article through Simplified Chinese, then Czech, then back to English, again.

here is that result
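
For the geeks in the audience, the round-trip treatment used in this post (English to French to Spanish and back to English, or English to Simplified Chinese to Czech and back) can be sketched in a few lines of Python. This is only a rough sketch under stated assumptions: the `translate(text, src, dst)` callable is something you would have to supply yourself, and both `round_trip` and `toy_translate` are hypothetical names invented for illustration. Nothing here is Google's API, and it is not the tool I actually used.

```python
from typing import Callable, List, Tuple

# Hypothetical signature: translate(text, source_lang, target_lang) -> translated text.
# Plug in whatever translation service or library you actually have access to.
Translator = Callable[[str, str, str], str]


def round_trip(
    text: str, chain: List[str], translate: Translator
) -> Tuple[str, List[Tuple[str, str]]]:
    """Pass text through each language in chain, one hop at a time.

    chain lists every stop in order, e.g. ["en", "fr", "es", "en"].
    Returns the final text and the list of (source, target) hops made.
    """
    hops = []
    # Pair each language with the one that follows it: en>fr, fr>es, es>en.
    for src, dst in zip(chain, chain[1:]):
        text = translate(text, src, dst)
        hops.append((src, dst))
    return text, hops


def toy_translate(text: str, src: str, dst: str) -> str:
    """Stand-in translator for demonstration only: it just tags each hop."""
    return f"{text} [{src}>{dst}]"


if __name__ == "__main__":
    result, hops = round_trip(
        "Human Translators Rule!!", ["en", "fr", "es", "en"], toy_translate
    )
    print(hops)
    print(result)
```

The language labels are arbitrary strings passed straight through to whatever translator you plug in, so the second experiment is just a different `chain` (English, Simplified Chinese, Czech, English). Every hop feeds on the output of the one before it, which is exactly why the errors pile up.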

Written by tonybaldwin

March 10, 2010 at 10:42 am

Machine Translations, Google, and my job…

I just thought I’d share this, quickly: Google’s Computing Power Refines Translation Tool

Google’s efforts to expand beyond searching the Web have met with mixed success. Its digital books project has been hung up in court, and the introduction of its social network, Buzz, raised privacy fears. The pattern suggests that it can sometimes misstep when it tries to challenge business traditions and cultural conventions.

But Google’s quick rise to the top echelons of the translation business is a reminder of what can happen when Google unleashes its brute-force computing power on complex problems.


Being both a computer technology geek and a professional HUMAN translator, I have, of course, mixed feelings about MT, or Machine Translation. Personally, I don’t think MT will ever replace humans. Ever. Language is just too complex.
The internet is riddled with humorous examples of bad machine translation. Just take a look at Engrish.com, or, here’s a lovely example right here: Lost in Translation, Seriously.
Funny stuff.

Computers, of course, are a very powerful and useful tool in translation. I would never deny that. Computer technology has brought about a great many changes in the translation industry over the past several years. Many translators feel threatened by that technology. I prefer to embrace it, frankly. I see it as a tool, not a threat.

I confess, I use Google Translate sometimes. You already know I ❤ Google. Moreover, OmegaT, my preferred CAT (computer aided translation) tool, now has an optional integrated Google Translate feature, so that, while I am translating a document, OmegaT will show me the Google Translate result for each segment. I have to say that instances in which I can simply insert that result without editing it are few. Perhaps 15 to 20%. I suppose that’s not too bad, really, considering the success of earlier attempts at MT, but it is also a clear indication that, without MY intervention, the translation would come out terribly.

Sometimes this Google Translate feature is helpful, speeds things up, makes my work more efficient. I have found, however, that if I use Google Translate to translate an entire document, the revision process thereafter often becomes so cumbersome that the job becomes more work than it would have been had I simply translated the document on my own. Or with OmegaT, with Google Translate at my side. Using OmegaT with Google Translate, I have access to the utility of Google’s tool while using its results only when appropriate, and my work does become more efficient.

This becomes a sort of ménage à trois of Computer Aided Translation, Machine Aided Human Translation, and, of course, Human Translation. Or, we could just call it “Human Aided Machine Translation” (not a new term). No matter what you call it, machine translations will always, in my opinion, require human intervention. So, as I see it, Machine Translation is a useful tool. But it will never, ever take the place of professional, human translators.
Language, and the human brain, are simply too complex.


Written by tonybaldwin

March 9, 2010 at 11:05 am