-
Language Weaver - machine translation systems
... using "a new, statistical/cryptographic approach to the automatic translation of human languages.
In contrast to current commercial machine translation systems, the statistical translation system uses techniques
that automatically learn how texts can be translated from one language into another."
-
Transparent Language -
AT&T Natural Voices Text-to-Speech Engine
online demo
>> data sheet -
Gist-In-Time
browser-based translation
- Alis Technologies
-
Language Learning & Technology (Web journal)
-
EDICT: Japanese-English Dictionary
-
CEDICT: Chinese-English Dictionary
-
ChineseComputing.com - "your complete guide to using Chinese on
computers"
-
Lexique
informatique officieux de la Commission ministérielle de
terminologie informatique
-
Language
Automation -
Machine Translation or
Mad Translation?
- WebPlexer
- Glossaries (by
language) -
List of Translation
Engines
-
See your name in Hieroglyphics -
See also: Tour Egypt -
Glyphs and Grammars - resource books
-
Travlang Online Translating Dictionaries -
Foreign Language for Travellers
-
ELL - The Encyclopedia of Language and Linguistics - Electronic Index
-
A toolkit for libraries developing multicultural/multilingual collections and services
- resources for libraries to assist in collection development, Web
design, language issues
-
GeoNative
- "Put minority and native languages on the map" (by Luistxo Fernandez)
-
Archimedes Project
>>
Presentation slides
-
The World's Top Twenty Spoken Languages
- Which languages are the most widely spoken?
- Languages with more than 30,000,000 Speakers (as of 1993)
- The World's Most Widely Spoken Languages
- Languages on the Brink
-
The 50 Most Widely Spoken Languages
-
The Elusive
Butterfly - "They noted the curious fact that the
word for "butterfly" was different for every European language,
including those most closely related--such as Spanish and Portuguese."
-
Language and the Scriptures -
a brief look at the question of translating and understanding the Bible.
-
SUMO - Suggested Upper Merged Ontology
- one of the
largest and most comprehensive, free, formal ontologies. This OntologyPortal site is dedicated to distributing open source
formal ontologies and tools, and supporting the developers and users
of those products.
-
A Flexible Open Source Framework for Speech Recognition
-
"Sphinx-4 is a flexible, modular and pluggable framework to help foster new innovations in the core
research of hidden Markov model (HMM) speech recognition systems. The design of Sphinx-4 is based on patterns that have emerged from the design
of past systems as well as new requirements based on areas that researchers currently want to explore.
To exercise this framework, and to provide researchers with a "research-ready"
system, Sphinx-4 also includes several implementations of both simple and state-of-the-art techniques.
The framework and the implementations are all freely available via open source."
- VoiceXML - Voice Extensible Markup Language 2.1 - W3C Working Draft 23 March 2004
- Introduction to VoiceXML 2.0
-
Australian Defence Science and Technology Organisation - DSTO
-
IBM WebSphere Voice Zone - enable voice interaction with
enterprise and Web applications.
Extending and Enterprise with IBM WebSphere Voice Application Access
-
IBM contributes Speech Software to Apache Software and Eclipse Foundations
[ Back to Top ]
- Globalization
(G18N) & Internationalization (I18N) ...
- See also (below):
CULTURES, ETIQUETTE, GLOBAL BUSINESS STRATEGIES
-
Unicode home page
-
How to Build a Multilingual Web Site -
"Web globalization is making a major comeback, driven by a rebounding global economy,
emerging markets and a stronger appreciation for the value of multilingual Web sites."
- How to Write for
a Multilingual Marketplace (Part 1) and
How to Write for a
Multilingual Marketplace (Part 2)
-
Developing Multilingual Web Applications Using JSP (JavaServer Pages) Technology
-
Global Development and Delivery in Practice: Experiences of the IBM Rational India Lab
-
Why You Need a Global Web Site -
five ways you can generate revenue and reduce costs by having localized content.
Local-language content can help deliver a more culturally relevant experience to your site visitors.
It not only optimizes the selling of products and services but also contributes to an organization's bottom line.
-
Which Spelling Should We Use in a Global Neighbourhood? -
"Which is the proper written English: that of the Queen or that of her most loutish of offspring, America?"
-
GeoBytes (content localization technology)
-
Welcome to the Global Village:
Some Considerations for Doing Usability in the Global Markets
- the importance of considering local psychology in developing globalized Internet sites
>> Hofstede’s Dimension of Culture Scales
-
Lost in Translation? Getting Your Message to an International
Audience
-
Is the Internet Suffering from Acute 'Englishitis'? -
"It has become a commonplace to say that the Internet was invented in the United States, and that English language and culture reign over it absolutely
and irreversibly. ... All these indicators show that the initial transitional period on the Internet, marked by the absolute domination of English, has ended.
Of course, English, as the preferred language of exchange in the scientific and commercial world, will maintain its very considerable advantage,
but its advantage will no longer be absolute, since only slightly more than 10% of human beings master English as their first or second language! ...
The advantage among Net surfers will soon no longer belong to English-speaking users but rather to those who master several languages.
Multilingualism is without a doubt the future of the network. ... Cyberspace is open to all languages and all cultures:
but We must produce content in our own language and which reflects our own culture, without feeling inferior or pessimistic!"
-
Some Humorous
Cross-Cultural Advertising Gaffes
-
Are You Cultured? (Part 1)
and
Part 2 -
the components that make up culture have huge implications for site globalization.
-
The Code Project
- Localizing Software Applications -
"When it comes time to make an application language independent is the time all the sales people cheer and developers hide.
To a salesman localized software means more markets and more sales, to a developer it equals a mind-numbing list of problems."
-
Fully Arabised net has some way to go
- Apart from the basic characters the Arabic script also contains a set of optional diacritical marks which can be crucial.
- The Bollywood Technique (for usability testing)
- In Asia it
is impolite to tell someone they have a bad design. It is
embarrassing to admit that you cannot find something. So it is
very hard to get feedback.
-
Your Passport to Proper Internationalization
- MultiLingual Computing & Technology -
"the leading industry magazine for Web site globalization, translation, international software development and language technology."
-
Internationalized Domain Names (IDNs) - "domain names or Web addresses, represented by local language characters.
They have the potential to transform the Internet into a truly global and multilingual tool by enabling Internet users to navigate
and communicate online in their preferred script."
-
IRMA - International Resource Management Application
-
"IRMA's goal is to provide translators with an easy-to-use environment in which the
Linspire OS in addition to Open Source software, can be translated efficiently. It does so by allocating
the responsibility of character encoding and file management to a web system that will do the work for translator."
-
IBM Speech User Interface Guide - To acquire new customers and to retain existing
customers, companies must give credence to customer satisfaction. Call centers with well crafted speech-enabled
applications significantly improve customer satisfaction as well as provide customers with additional services and 24x7
support. These speech-enabled applications rely on the development of pleasant and efficient Speech User Interfaces
(SUI). If the Speech User Interface is not properly design, constructed, and tuned pre and post deployment, unreasonable
and unnecessary expenses will be incurred that can lead to critical situations, increased problem activity, and
decreased customer satisfaction. In order to craft an effective SUI, proper and proven methodology (best
practices) should be followed. This IBM Redpaper will detail an effective methodology
that can be used to create and deliver high quality Speech User Interfaces to meet business needs,
using the IBM WebSphere Voice Toolkit V6.0.1 (a basic knowledge of VoiceXML development
using WebSphere Voice Server and Interactive Voice Response is assumed).
-
Culture: the next big thing in code -
an Intel Developers.net article about Technology and culture:
Despite the apparent universality of technology, it's a mistake to ignore differences,
not only in culture but also in age and gender, when working on UI and design.
[ Back to Top ]
- IBM Globalization
>>
Globalizing your e-business (home page)
Many interesting and useful topics, including:
-
Writing for international audiences
-
Ensure that your information and message is clear for all types of audiences
-
Terminology management -
Managing terminology can help you develop global applications
-
Culture Clashes - Test your knowledge of cultural norms in other countries -
These scenarios highlight common situations where another culture's norms or attitudes
may clash with yours.
-
Introduction to bidirectional languages -
How to handle languages that display text both right to left and left to right
-
The challenges of East Asian languages for software developers and architects -
large sets of characters used by East Asian languages present challenges for software developers
-
Chinese standard GB18030 -
This is the new standard for supporting Chinese characters
-
An introduction to the Thai language -
Thai belongs to the Sino-Tibetan language family. It is a tonal, uninflected and predominantly monosyllabic
language that has been influenced by both Pali and Sanskrit
-
Introduction to Indic languages -
Indic languages provide challenges to developers because of their complexity
-
Developing multilingual Web sites with Lotus Domino Global WorkBench 6,
Domino 6, and Notes 6
-
Machine translation - Fast but potentially error-prone text translations
-
Retrofit Globalization onto a Web Application
-
Developing global Web applications
-
"English is becoming just another dialect of the internet language -- you have to support
all languages to reach the growing number of visitors from different parts of the world."
-
Applying Globalization Architecture Imperatives to your business - "...
some fundamental principles for all development groups to follow, so that different applications
handle different languages consistently, can inter-operate, are translatable, and function well in an on-demand environment.
They will help companies move effectively and efficiently towards e-business across the globe."
-
Web services — Web services can streamline the path to software localization
-
IBM e-business Globalization Solution Design Guide - Getting Started
-
"With the worldwide growth of e-business, globalization is not only an add-on value
but a must for global e-business applications. In fact, globalization has become an architecture
in the realm of e-business."
-
Multilingual Information Management (MIM) solutions enable you to deploy the appropriate mix
of machine translation, localization (l10n), internationalization (i18n), and Professionally Edited Gist (PEG) within your business processes.
-
JIT localization — Just-in-Time localization makes application data easier
to transfer, manage and manipulate, and more accurately reflect user preferences.
-
Fonts and multilingual text - Successful globalized applications can process text
in any language, and accommodate multiple languages and complex text.
-
Implementation of Machine Translation Service for Sametime
-
Mock Translation Input Method Editor - provides programmers and testers
with an easy way to enter pseudo-translated multilingual test data into a Java application.
-
Developing J2EE Global Applications -
“Write Once, Run Anywhere”, and in any language!
-
Introduction to Mozilla globalization features
-
Internationalization with JavaScript
-
Globalizing your e-business RSS feeds - using RSS files,
a data feed can be created that supplies globalized headlines, links, and article summaries from your Web site.
-
Streaming Media development and globalization
-
Writing English for the English-speaking markets
-
Since the English-speaking countries remain the world’s largest economic bloc, having products work well
in the English-speaking markets is very important to many companies in the developing countries.
-
English language writing issues - irregularities, language variations,
ideomatic expressions, terminology, punctuation, political correctness, branding issues
-
References to Taiwan, Hong Kong and Macao - avoid culturally-sensitive issues.
-
Guide to designing international icons - make icons and graphic labels translatable;
avoid text-based graphics; avoid single-letter icons; localize punctuation
and commercial symbols; avoid puns; avoid hand and arm gestures; take care
in selecting animal icons; colors are local; using body parts as metaphors;
facial expressions are crude; sexual images can be effective or deadly;
symbols may be suitable only for local usage; respect reading direction;
avoid excessive detail; avoid racial, gender and religious stereotyping;
use political symbols appropriately; do not display national flags or maps,
especially those depicting national borders; use appropriate monetary symbols;
avoid trivializing violence; take care with mathematical and currency symbols ...
-
American and British English differences -
this Wikipedia
article outlines the differences between
American English, the form of the
English language spoken in the United States, and
Commonwealth English (often called
British English).
- Tandem Writing - Add your contribution to an
ever-growing story!
- IBM
Globalization webcasts on demand ...
-
And more ...
[ Back to Top ]
- Queen Mary University of London
-
Microsoft Terminology Translations -
contains over 9,000 English terms plus the translations of the terms for up to 45 different languages.
Microsoft provides the Microsoft terminology data to allow their customers, ISVs, and partners to have
a more consistent user experience across the products they are using and developing.
-
MSDN TV: The MSDN Global Home Page -
Learn how Microsoft's MSDN home page uses language and location detection to surface geo-aware localised content on the page.
Also, see how the MSDN home page geo-redirects users to any available, fully localised MSDN home page in the user's locale.
-
Voice as a User Interface: Case Study and Lessons Learned -
targeted at speech application developers and architects who want to design and develop voice portlet applications.
Includes a description with sample scenarios describing advanced functions available in voice portlet applications such as
messaging for portlet-to-portlet communications, single sign-on to access secure backend applications and Web Service access.
-
Teaching computers to read no simple task -
Creating algorithms to convert text so machines can learn.
-
Peculiarities in Pronunciation of English Words ...
-
Word2Word
-
Applied Language Solutions
-
The Scamseek Project -
Text Mining for Financial Scams on the Internet
(The project has a $2.2million budget to build a surveillance tool for identifying financial scams on the Internet.
It is funded by the Capital Markets CRC, The Australian Securities and Investment Commission and the participating universities.
This is Australia’s largest research project in language technology.)
-
Automatic Identification of Spoken Languages - Bibliography
-
Leeds Electronic Text Centre Documentation Pages
-
CL Research (develops computational lexicons)
>> Phonetic Databases -
Lists of words coded phonetically
-
The Anthropology Back Door
to the Web >>
Linguistics links-
NTT Intelligent Communication Laboratory >>
Machine Language Research
Group
-
Portable Document Formats - "Translations and other text work shouldn't be delivered in tool-dependent native formats such as Quark or Word but in independent exchange
formats like PostScript, HTML, LaTeX and PDF."
- ALT-J/E
...The Automatic Language Translator Japanese to English
-
Arcnet Online Translation and Dictionaries ... English / Japanese
/ Chinese / Malay
-
Stochastic Language Identifier
- will attempt to identify automatically the language of a sentence, phrase, or word of your choosing
( English, French, Spanish, German, Italian, Dutch, Afrikaans, Norwegian, Danish, Swedish, Portuguese, Icelandic, Latin)
-
Translation Journal - "A Publication for Translators by
Translators about Translators and Translation"
-
Translation Theory
- Traddutore, traditore! by
Rodney J. Decker
-
The British Computer Society
>>
Natural Language Translation Specialist Group
>>
Machine Translation Review
-
Materiales sobre lingüística y lengua españolas en la Web
-
International Journal of Computer Processing of Oriental Languages
-
THE IRN BULLETIN - International Interpretation Research Information Network
-
WIPO - World Intellectual Property Association -
Internationalized Domain Names - Intellectual Property Considerations
- Language Statistics on the Internet
- Multilingual Trademarks
- Multilingual
Domain Names
-
TM
Systems
- "The Digital Solution
for Language Translation, Dubbing and Subtitling
services to the international film
and television industry worldwide"
-
Language identification and IT: Addressing problems of linguistic diversity on a global scale
- CogPrints -
an electronic archive for papers in any area of Psychology, Neuroscience, and
Linguistics,
and many areas of Computer Science, Biology, Medicine, Anthropology, as well as any other portions
of the physical, social and mathematical sciences that are pertinent to the study of cognition
-
Using Computers in Linguistics: A Practical Guide
- The Nature of Linguistic Data and the Requirements of a Computing Environment for Linguistic Research
-
Linguistic Software for Doing Field Linguistics
- Language Understanding and the Emerging Alignment of Linguistics and Natural Language Processing
-
The Digitization of Oriental Languages
-
International Journal of the Chinese and Oriental Language Information Processing Society
-
Awangarda Central and Eastern European Translation Agency
-
Concordia Language Villages
- Arabic,
Chinese,
Danish,
English,
Finnish,
French,
German,
Italian,
Japanese,
Korean,
Norwegian,
Russian,
Spanish,
Swedish
[ Back to Top ]
[ Back to Top ]
|