API (application programming interface). A software interface that enables applications to communicate with each other. An API is the set of programming language constructs or statements that can be coded in an application program to obtain the specific functions and services provided by an underlying operating system or service program.
Bidirectional (writing system). A writing system in which text is generally flush right, and most characters are written from right to left, but some text is written left-to-right as well. Arabic and Hebrew are the only bidirectional writing systems in current use.
Bundle. A project manager may divide the text segments of a job into sections which are called bundles. Bundles can be allocated to different linguists.
Computer aided translation (CAT). Computer technology applications that assist in the act of translating text from one language to another.
Concordance. An alphabetical index of a word or phrase in the source text showing every contextual occurrence of the word or phrase in the text and in the available TM.
Content management system (CMS). A system used to store and subsequently find and retrieve large amounts of data. CMSs were not originally designed to synchronize translation and localization of content, so most have been partnered with globalization management systems (GMS).
DITA (Darwin Information Typing Architecture). An XML-based architecture for authoring, producing and delivering technical information. This architecture consists of a set of design principles for creating ―information typed‖ modules at a topic level and for using that content in delivery modes such as online help and product support portals on the web.
Domain. A knowledge domain that a user is interested in or is communicating about. A group of computers or devices that shares a common directory database and is administered as a unit.
Fuzzy match. Refers to the situation when a sentence or phrase in a translation memory (TM) is similar (but not a 100% match) to the sentence or phrase the translator is currently working on. The TM tool calculates the degree of similarity or ―fuzziness‖ as a percentage figure.
Globalization management system (GMS). Focuses on managing the translation and localization cycles and synchronizing those with source content management. Provides the capability of centralizing linguistic assets in the form of translation databases, leveraging glossaries and branding standards across global content.
GMX-V (Global information management Metrics eXchange – Volume). A word and character count standard for electronic documents. GMX-V is developed and maintained by OSCAR (Open Standards for Container/Content Allowing Re-use), a special interest group of LISA (Localization Industry Standards Association). GMX-V, one of the tripartite series of standards from LISA, deals with electronic document metrics. GMX is made up of the following standards: GMX-V — Volume; GMX-C — Complexity; and GMX-Q — Quality.
HTML. Hyper Text Markup Language (HTML) is a markup language that uses tags to structure text into headings, paragraphs, lists and links, and tells a web browser how to display text and images on a web page.
In context exact match (ICE match). Refers to the situation when a sentence or phrase in a translation memory (TM) is the same phrase in the same context as the sentence or phrase the translator is currently working on.
Internationalization (i18n). The process of generalizing a product so that it can handle multiple languages and cultural conventions without the need for redesign. In i18n, the common abbreviation for internationalization, the 18 refers to the eighteen letters between the i and the n.
Internationalization Tag Set (ITS). A technology to easily create XML which is internationalized and can be localized effectively.
Java. A programming language originally developed by Sun Microsystems and released in 1995 as a core component of Sun‘s Java platform. The language derives much of its syntax from C and C++ but has a simpler object model and fewer low-level facilities. Java applications are typically compiled to byte code that can run on any Java virtual machine regardless of computer architecture.
JavaScript. An open-source scripting language for design of interactive websites. JavaScript can interact with HTML source code, enabling web developers to use dynamic content. For example, JavaScript makes it easy to respond to user-initiated events (such as form input) without having to use common gateway interface.
Job. In the XTM Workflow the processing of one file from the source language to one target language is a job.
JSP. Java Server Pages (JSP) have dynamic scripting capability that works in tandem with HTML code, separating the page logic from the static elements — the actual design and display of the page — to help make the HTML more functional.
Leveraged match. A leveraged match is when a sentence or phrase in a translation memory (TM) is the same phrase in a different context as the sentence or phrase the translator is currently working on.
Localization (l10n). In this context, the process of adapting a product or software to a specific international language or culture so that it seems natural to that particular region. True localization considers language, culture, customs and the characteristics of the target locale. It frequently involves changes to the software‘s writing system and may change keyboard use and fonts as well as date, time and monetary formats. In l10n, the common abbreviation for localization, the 10 refers to the ten letters between the l and the n.
Machine translation (MT). A technology that translates text from one human language to another, using terminology glossaries and advanced grammatical, syntactic and semantic analysis techniques.
Metrics. A measure of the number of segments, words and characters in the source document, together with a summary of the different types of matching that are present after the translation memory has been applied. It also includes matching from within the document or repetitions.
Open Architecture for XML Authoring and Localization (OAXAL). A reference model of how to construct an effective and efficient system for XML authoring and Localization based on Open Standards.
Project. In XTM translating one or more files from one source language to one or more target languages is a project .
Project management (PM). Project management (PM) is the systematic planning, organizing and controlling of allocated resources to accomplish project cost, time and performance objectives. PM is normally reserved for focused, non-repetitive, time-limited activities with some degree of risk.
Project manager. A professional in the field of project management (PM). He or she has the responsibility of the planning, execution and closing of any project. Key PM responsibilities include creating clear and attainable project objectives, building the project requirements, and managing the triple constraint for projects — cost, time and scope.
Quality assurance (QA). The activity of providing evidence needed to establish confidence among all concerned that quality-related activities are being performed effectively. All those planned or systematic actions necessary to provide adequate confidence that a product or service will satisfy given requirements for quality. QA covers all activities from design, development, production and installation to servicing and documentation
Right-to-left languages. Languages such as Hebrew, Arabic, Urdu and Farsi are written primarily right-to-left. This text flow presents significant text and graphic layout implications.
Software as a service (SaaS) is software that is deployed over the internet. With SaaS, a provider licenses an application to customers as a service on demand, through a subscription, in a "pay-as-you-go" mode. The SaaS version of XTM is called XTM Cloud.
Segment. A unit of text that the CAT tool creates in order to match against the translation memory. The unit may be a single word, a phrase, one or more sentences, or even a larger unit.
Segmentation Rules eXchange (SRX). The vendor-neutral standard for describing how translation and other language processing tools segment text for processing. It allows translation memory and other linguistic tools to describe the language-specific processes by which text is broken into segments (usually sentences or paragraphs) for further processing.
Source language. The language of a text which is to be translated into another language
Step. The workflow in XTM is composed of steps; the available steps are: translate, correct and review.
Target language. The language into which a text written in another language is to be translated
Task. In XTM processing a bundle of segments in one step is a task.
Terminology management. Primarily concerned with manipulating terminological resources for specific purposes — for example, establishing repositories of terminological resources for publishing dictionaries, maintaining terminology databases, ad hoc problem solving in finding multilingual equivalences in translation work or creating new terms in technical writing. Terminology management software provides the translator a means of automatically searching a given terminology database for terms appearing in a document, either by automatically displaying terms in the translation memory software interface window or through the use of hotkeys to view the entry in the terminology database.
Terminology manager. A computer technology application tool that assists in the translation of text from one spoken language to another.
Translation. The process of converting all of the text or words from the source language to the target language. An understanding of the context or meaning of the source language must be established in order to convey the same message in the target language.
Translation memory (TM). A special database that stores previously translated sentences which can then be reused, in full or in part, on a sentence-by sentence basis. The database matches source to target language pairs.
Translation Memory eXchange (TMX). An open standard, based on XML, which has been designed for exchanging translation memories (TMs) between CAT tools.
Unicode. The Unicode Worldwide Character Standard (Unicode) is a character encoding standard used to represent text for computer processing. Originally designed to support 65,000 characters, it now has encoding forms to support more than 1,000,000 characters.
UNIX. A multiuser, multitasking operating system. It was one of the first operating systems to be written in a higher level programming language, thus making it hardware-independent.
UTF-8 (Unicode transfer format). An encoding form of Unicode that supports ASCII for backward compatibility and covers the characters for most languages in the world.
Web service. A collection of protocols and standards used for exchanging data between applications or systems.
XLIFF (XML Localization Interchange File Format). An XML-based format for exchanging localization data. Standardized by OASIS in April 2002 and aimed at the localization industry, XLIFF specifies elements and attributes to aid in localization. XLIFF could be used to exchange data between companies, such as a software publisher and a localization vendor, or between localization tools, such as translation memory systems and machine translation systems.
XML (eXtensible Markup Language). Extensible Markup Language (XML) is a simple, flexible text format originally designed to meet the challenges of large-scale electronic publishing. XML also plays an increasingly important role in the exchange of a wide variety of data on the Web and elsewhere.
xml:tm XML-based text memory, a LISA OSCAR standard for author and translation memory.
|