Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Copier Auto-Translates Japanese to English

Posted by CowboyNeal on Thu Sep 27, 2007 10:56 PM
from the hopefully-better-than-babelfish dept.
StCredZero writes "Wild. Fuji has created a photocopier that automatically translates documents from Japanese to English. That's pretty nuts. Apparently, the copier can figure out what sections are text, OCR the text, send it to a translation engine, and put the english back into place."
+ -
story

Related Stories

This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • Not new (Score:5, Funny)

    by Anonymous Coward on Thursday September 27 2007, @11:02PM (#20777977)
    They've been using this for years to translate instruction manuals.
  • Manga and Anime (Score:3, Insightful)

    by biocute (936687) on Thursday September 27 2007, @11:11PM (#20777981) Homepage
    Imagine if you upload manga scans to Flickr, and it automatically translates them to English.

    Imagine if you upload anime to YouTube, and it automatically includes an English subtitle.
    • Re: (Score:3, Interesting)

      Automatic english subtitles for an anime? That's more of a job for speech recognition software(which is also being worked on, one of the profs in my department had a friend working on Japanese to English speech recognition/translation many years ago).

      This OCR based stuff would still be handy for automatically translating manga I suppose though.
      I know that there are a few things out there in Japanese that haven't been released in English yet I wouldn't mind.
    • Re:Manga and Anime (Score:5, Informative)

      by dancingmad (128588) on Thursday September 27 2007, @11:51PM (#20778297)
      Imagine if you upload anime to YouTube, and it automatically includes an English subtitle.

      Without the kanji, since a large number of Japanese words are homophones, I can't see this being practical in the near future. Text is different - with the kanji, it's not terribly difficult to look up the correct word and with kana grammar beside it, the task gets much easier. I can't see a machine understand a conversation in context, however.
  • by Spy Hunter (317220) on Thursday September 27 2007, @11:13PM (#20777991) Journal
    StCredZero "writes wildness. Fuji drew up the photocopying machine which automatically translates the document from English from Japanese. That is the clean nut. With respect to appearance, as for the copier the text, as for OCR what kind of section text, to send that to the translation engine, and in the place English". You reset, or can grasp.
      • by Helios1182 (629010) on Thursday September 27 2007, @11:29PM (#20778127)
        Either you are old, or a bit naive. I think in the next 10 years we will see significant improvement. It just happens that the general public doesn't have access to the state of the art research. Systems are improving all of the time. They won't, however; reach the level of a fluent human translator for a long time if ever. But for most documents a machine will be able to do a decent job.
        • by Valdrax (32670) on Friday September 28 2007, @01:54AM (#20778989)
          Either you are old, or a bit naive. I think in the next 10 years we will see significant improvement.

          Yeah, 'cause researchers have long promised us that AI will reach us in 10 years. <sarcasm>

          Seriously, I think you underestimate the difficulty of translating. Have you done any major foreign-language translation -- especially of conversational speech? My experience has primarily been with Japanese and English, and I'll tell you right now that it can be nightmarish.

          Sentence fragments are the worst part. Japanese has a completely different word order from English. All modifiers (including phrases and clauses) come before the word they modify, and the language has a Subject-Object-Verb order. "I just saw the man who stole my friend's watch last Tuesday" becomes "Just I Last Tuesday friend's watch stole man saw." Now try translating that from Japanese to English when the sentence is cut in half.

          Worse, the language has very different levels of allowed vagueness. "Complete" sentences in Japanese can contain just a descriptor or an action without any specification of who did/was what. Conversely, translating "3 of them" in English to Japanese is hard because you have to know "3 of what?" to know what counting suffix to use.

          Another problem is that many very different words sound exactly the same when conjugated to the gerund or perfective forms. English has a number of homonyms, but there are MANY more opportunities for mix-ups if you don't have access to kanji to tell the semantic meaning apart because Japanese has a much more limited range of phonemes. For example, take "katte" which is the gerund form of the verbs "kau" (buy), "kau" (keep/raise), "karu" (cut), "karu" (spur on), and "katsu" (win). That's 5 completely different verbs that conjugate to the same sound. If they're written phonetically or your going from speech, then you have to be able to understand the meaning behind the words to translate. (Did I mention earlier that you may not have an explicit subject and object to go off of?)

          Then you get into issues of translating things like politeness levels, different ways of addressing people, and other concepts that don't translate well into English or concepts like singular vs. plural that are dropped in going to Japanese. Let's not even consider puns and poetry!

          These are not trivial issues. An automatic translator would need to somehow be able to conceptualize what a person is trying to speak about, which would require understanding the story being told and an ability to predict where they are going with it. This will require strong AI.

          Accurate and intelligible translation is an art -- not a science -- because it requires an intuitive and empathetic ability to understand the mind of the speaker well enough to map their thoughts into a different method of expression.
  • by alshithead (981606) on Thursday September 27 2007, @11:14PM (#20777995)
    It translates to Engrish?
  • by Anonymous Coward on Thursday September 27 2007, @11:23PM (#20778081)
    [Reading from his book, "Jimmy James: Macho Business Donkey Wrestler," translated to Japanese and back again]
    Jimmy: I had a small house of brokerage on Wall Street. Many days no business comes to my hut. Jimmy has fear? A thousand times no. I never doubted myself for a minute, for I knew that my monkey-strong bowels were girded with strength like the loins of a dragon ribboned with fat and the opulence of buffalo...
    [pauses while turning page]
    Jimmy: dung.
  • It's impossible... (Score:5, Informative)

    by wanderingknight (1103573) on Thursday September 27 2007, @11:28PM (#20778117)
    ...for a machine, under the current paradigms (that is, no true artificial intelligence) to properly translate something. Translation is not an exact science, and you can't expect to get a decent translation by just having a word-per-word approach. Heck, not even a sentence-per-sentence or paragraph-per-paragraph approach would ever be enough. Translation requires deep social knowledge--you need to know what you are translating, from whom you are translating, for whom you are translating... that is, you need to enclose your translation in a sociological context. No machine can ever wish to do that without artificial intelligence. It's hard enough as it is to get a human being to understand that word-per-word translation is stupid--imagine telling that to your CPU core.

    Disclaimer: I'm a translation student myself ;)
    • by kklein (900361) on Thursday September 27 2007, @11:59PM (#20778345)

      Disclaimer: I'm a second language acquisition researcher and assessor.

      I concur. Absolutely. Language is not pure information; it's information shorthand. It assumes a high degree of already-shared knowledge about the world. Some of these assumptions are near-universal; many are not.

      Japanese and English (my languages) offer a great example, especially as it pertains to machine translation. Whereas English is a subject-predicate language, where basically all the information is encoded in the language stream, Japanese is a topic-comment language, where, once set, the "subject" is not re-stated until it changes. Beginning Anglophone learners of Japanese make the mistake of putting a "wa" to denote what they think of as the subject in every sentence, when it does not need to be there. "Wa" is a topic marker; not a subject marker.

      This is a fundamentally different way of thinking about language and, therefore, about the world. Germanic languages seek to operate regardless of context; Asian languages seek to augment (or "comment on") it. If you've ever felt that Japanese people who speak English are beating around the bush or being vague, part of that is cultural, but part of that is the language of the culture that does not require explicitness. A big part of learning Japanese or, for Japanese people, of learning English is learning how to think about the world and about human interactions in a very different way.

      Machines aren't human. They are information processors. They don't know what a "cat" is; they just know that it's a piece of code that can be slotted into a certain place in a set of syntax. Until machines are really intelligent (and I don't think that will be anytime soon), expect more crappy translation than useful. Anyone who tells you otherwise is probably selling something (a crappy machine translator, to be exact!).

    • Re:Great! (Score:5, Funny)

      by Divebus (860563) on Thursday September 27 2007, @11:57PM (#20778337)
      They probably used it to translate the instruction manual into Engrish. [engrish.com]
      • Re:Great! (Score:5, Funny)

        by PsamtikNerd (1101713) on Friday September 28 2007, @12:43AM (#20778603)
        They did something like this with a Japanese car manual a long time ago; it ended up something like: "If a passenger of foot should obstacle your passage, tootle the horn. Tootle him melodiously at first, but if he continues obstacles your passage, then tootle him with vigor."
              • Re:Great! (Score:4, Funny)

                by mikael (484) on Friday September 28 2007, @09:14AM (#20781061)
                That's the fun part of these automatic translators - put in perfectly good English, translate to a foreign language, and translate back to English. Good fun for a rainy afternoon.

                "Today is under construction... please do not be alarmed by the construction men hanging themselves from outside your balcony. We will take them down tomorrow..."

    • by cheros (223479) on Friday September 28 2007, @03:08AM (#20779287)
      What is wild is that anyone with half a working braincell would use a photocopier in an office where a copy of every document is sent to an uncontrolled 3rd party for translation.

      Yeah, put that baby in the CEO's office..

      (not the mention the fact that there's a huge gap between mechanical translations and the subtleties of language only a skilled translator and/or native speaker has any hope of translating).

      So, IMHO cute idea, but don't expect me to bu one any time soon.
    • by rde (17364) on Friday September 28 2007, @05:34AM (#20779903)
      Ooh, I hope not. If it does, then it means that it's got a real prediliction towards anime. That could produce some interesting translations, such as...

      Dear Sir,
      Your opponent is me! With regard to your memo dated 14th inst., I'll never forgive you, vampire bastard! Super ultra science business meeting, engage!
      Noooooooooooooooo!

      Yours faithfully,
      Bob Morton
      Chief Gundam Officer