TOSHIBA REVIEW
2009. VOL.64 NO.2

  Special Reports

Natural Language Processing Technologies
—On the Occasion of the IEEE Milestone Award for Toshiba's Japanese-Language Word Processor


Innovation Creation by Natural Language Processing
MORI Kenichi


Toshiba Natural Language Processing Technologies Starting from Japanese-Language Word Processors:History and Prospects
SUMITA Kazuo
Natural language is the fundamental means by which we convey our intentions to others and record our knowledge. Since its development of the first Japanese-language word processor, Toshiba has been further advancing natural language processing technologies and developing applications and services using them. In 2008, the Japanese-language word processor was recognized as an IEEE (Institute of Electrical and Electronics Engineers, Inc.) Milestone.
We will continue to promote the research and development of natural language processing for multiple languages in order to provide new intelligent solutions that improve the efficiency of office work, to realize new intelligent functions for digital media products, and to create innovative products such as speech-to-speech translators.


Introduction of IEEE Milestone Program Recognizing Important Historical Achievements
OHNO Eiichi
The IEEE (Institute of Electrical and Electronics Engineers, Inc.) Milestones are a program to recognize important historical achievements in the electrical, electronic, information, and communication system fields, which are the technology areas of IEEE. The IEEE Milestones are awarded in recognition of technological innovation and excellence for the benefit of society and industry. Seventy-eight milestones had been awarded worldwide,including seven in Japan, as of December 2007.
In 2008, the first Japanese-language word processor, which was developed by Toshiba Corporation in 1978, was selected as the recipient of the eighth IEEE Milestone in Japan.


Machine Translation Technology to Accelerate Globalization of Intellectual Property
KUMANO Akira
There is a large volume of intellectual property documentation, including patent documents, in Japan. Although these documents are worth accessing from other countries, very few of them are written in English. Machine translation is essential as a means of translating them into English.
However, specific problems are encountered in the machine translation of patent documents, particularly the difficulty of translating the long sentences in claims. Pre-editing is of assistance in this area. Dictionary building technology using a parallel corpus is also useful for the compilation of technical terms.
Toshiba has developed a machine translation technology as an accumulation of these technologies. This machine translation technology makes it possible to realize high-quality translations for widespread use in commercial products and Internet services.


Natural Language Information Retrieval for XML Database System
MANABE Toshihiko / KOKUBU Tomoharu
Toshiba has been developing an extensible markup language (XML) database system with flexible search functions. To enhance the search capability of this XML database system, we have newly developed a natural language information retrieval function on the system. XML documents are ranked in descending order by relevance scores in response to a user’s natural language query. In addition, both query expansion and query-based document summarization are realized in this function.
This natural language information retrieval function allows users to utilize the query language of the XML database in combination with Boolean search and full-text search.


Advanced Text Mining Technology for Corporate Reputation Information

SAKURAI Shigeaki
Toshiba has developed a technology that makes it possible to automatically discover, at an early stage, important threads that might cause significant damage to a particular corporation or organization from sets of articles related to specific topics on bulletin board sites. Using both text mining and natural language processing techniques, this technology performs original characterization of threads and can extract important threads and expressions related to topics in the threads using these characterizations.
We evaluated the effectiveness of the newly developed technology using articles collected from bulletin board sites, and confirmed that the results based on the technology corresponded to user-based results with high probability.


XML Structuring Technology for Various Types of Document Applications

FUME Kosei / ISHITANI Yasuto / GOTO Kazuyuki
The dramatic increase in the volume of electronic documents in the office environment has spurred demand for easy access to information resources and for their effective management.
Toshiba has developed an extensible markup language (XML) document structuring technology that facilitates exploitation of information resources corresponding to these needs. Utilizing natural language processing and XML, this technology makes it possible to extract document attributes, such as logical elements, logical structures, and term semantics, and embed them as machine-processable metadata. We have achieved various applications based on this technology, such as a document transformation system from paper to XML, a document categorization system, and an information access interface.


Japanese/ Chinese/ English Hybrid Speech Translation System

CHINO Tetsuro/ KAMATANI Satoshi
Toshiba has proposed a new hybrid machine translation (MT) method to overcome the language barrier in cross-linguistic communication. The proposed method utilizes both of two complementary methods of MT; namely, the example-based MT (EBMT) method that can produce natural translations within restricted domains, and the rule-based MT (RBMT) method that produces relatively halting translation with wide coverage.
We have developed an experimental hybrid speech translation system for Japanese, Chinese, and English, and confirmed a task achievement rate of about 70% within about two minutes in typical tasks in travel situations through field tests conducted in Japan, China, and Australia.


Approach to Development of Business Support Solutions Utilizing Japanese-Language Analysis Technologies

HAYAKAWA Rumi/ MATSUMOTO Shigeru/ SAITO Yoshimi
Toshiba Solutions Corporation has been advancing the research and development of technologies for more practical use of business documents that are being created and stored every day. For this purpose, our technologies support improvements in the quality of documents and classification accuracy.
We are currently focusing on the utilization of business document checking technology, business document classification technology, and paraphrase searching technology. Utilizing these technologies, we are building and evaluating prototype systems such as a document checking system for offshore development with China and an automatic classification system for patent documents.


  Feature Articles


Microphone Array Technique for Automotive Applications

AMADA Tadashi
Toshiba has been developing a microphone array technique for application as a noise canceller for speech recognition and hands-free communication in automotive environments. To cope with performance degradation of conventional microphone arrays in the reverberant environment in a car cabin, we have newly developed a robust method enabling acquisition of directional characteristics by means of offline learning methods and achievement of optimized performance in the target reverberant room. Experiments on speech recognition rates and noise suppression capabilities in real car environments showed that the proposed method achieves successful performance under reverberant conditions in cars.


W65T CDMA2000 1x Cellular Phone with Improved Usability and Functions

AKIYAMA Kenji/ FUKUMOTO Yuji/ MORI Hirofumi
In addition to multiple functions and high performance, demand has been increasing recently for cellular phones with greater ease of use and higher image quality, reflecting their role as a tool to enrich people's lives with images, music, sports, etc.
In response to these requirements, Toshiba has developed the W65T CDMA2000 1x (code division multiple access 2000 1x) cellular phone. The W65T offers easy operation through the use of a new-function"speedy controller" as well as the upgrading of various software functions. This model is also equipped with image quality compensation technology for one-segment broadcasting that offers both high image quality and low power consumption, technology for high-quality sound processing of data from the"Chaku-Uta" music download service, and a receiving antenna diversity function for voice calls.


International Standardization of Next-Generation Train Communication Network

KAMATA Keiichi
International standardization activities have recently become increasingly important for corporations in order to expand business opportunities and boost the competitiveness of products, including both systems and equipment.
Toshiba has developed a new high-speed metal transmission system that is highly versatile and expandable, and applied it to railways in Japan and other countries. This technology, called TEBus (Train Ethernet Bus), has been proposed to Working Group 43, Technical Committee 9 for Electrical Equipment and Systems for Railways, of the International Electrotechnical Commission (IEC/TC9/WG43) as a new vehicle bus for the train communication network standard through the Japanese committee. This proposal includes technologies that can enhance the appeal of railway transportation with the possibility of its application to the level of a train backbone network, which requires higher speed and reliability of data transmission,aimed at achieving greater safety, stability, and comfort.


Design and Construction of Ministry of Health, Labour and Welfare Integrated Network

CHO Kazuhiro/ YAMAGUCHI Takuya/ YAMATO Akira
Toshiba Solutions Corporation received an order for the Ministry of Health, Labour and Welfare Integrated Network in alliance with Softbank Telecom Corp. in October 2007. The purposes of this network are the reduction of communication expenses and integration of the ministry's network infrastructure, achieved by making the independently managed networks for the individual communication systems of each department shared on one integrated network. These purposes conform with the“ Optimization Plan" being promoted by the Ministry of Internal Affairs and Communications.
We have been engaged in the speedy construction and delivery of the network, taking advantage of both companies' technologies and design know-how to maintain high availability and operational support.


Pseudo-SOC Technology to Integrate Different Types of Devices

ONOZUKA Yutaka/ YAMADA Hiroshi/ ITAYA Kazuhiko
High-density heterogeneous integration technology has become increasingly important to realize high-performance multifunctional systems with smaller size and lighter weight for use in the ubiquitous society. However, system-in-package (SIP) technology, which can integrate different types of devices, has an integration density limitation, while the types of integrated devices are restricted in system-on-chip (SOC) technology.
To overcome these problems, Toshiba has developed a novel integration technology named pseudo-SOC technology. The feature of this technology is the interconnection of wiring using semiconductor processing technology after the large-scale integrations (LSIs) and various components are reconstructed into wafer shape with resin. Pseudo-SOC technology makes it possible not only to minimize device size but also to shorten the development period for new systems.


  Frontiers of Research & Development

Image Enhancement Technology for Large Screen Display Era
Interactive Document Classification System Reflecting User's Intentions