Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

127 results about "Transliteration" patented technology

Transliteration is a type of conversion of a text from one script to another that involves swapping letters (thus trans- + liter-) in predictable ways (such as α → a, д → d, χ → ch, ն → n or æ → ae). For instance, for the Modern Greek term "Ελληνική Δημοκρατία", which is usually translated as "Hellenic Republic", the usual transliteration to Latin script is "Ellēnikḗ Dēmokratía", and the name for Russia in Cyrillic script, "Россия", is usually transliterated as "Rossiya".

Foreign language abbreviation translation in an instant messaging system

A system for automatically providing foreign language abbreviation translation in an instant messaging system that identifies a foreign language abbreviation translation database based on a user indicated source culture. The foreign abbreviation translation database stores abbreviation translations for foreign language abbreviations frequently used by people from the user indicated source culture. The system locates a candidate term in an instant message and compares the candidate term to the foreign language abbreviations in the foreign language abbreviation translation database. In the event that the candidate term matches one of the foreign language abbreviations in the identified foreign language abbreviation translation database, the corresponding translation is retrieved and displayed. The comparison of the candidate term with the foreign language abbreviations may include automatically obtaining a transliteration of the candidate term. The disclosed system advantageously enables translation of foreign language abbreviations to be performed in real-time.
Owner:IBM CORP

Method for transliterating and suggesting arabic replacement for a given user input

A method for suggesting transliteration for user inputs, comprising: receiving an original user input composed of alpha-numeric characters; identifying the possibility of transliterating the input; determining at least one potential transliteration by performing at least one of the following (1) replacing a sequence of characters in the original input to a possible sequence of Arabic characters (2) determining the probabilities of the potential transliterated alternatives to the user input; and electing the most likely transliteration according to some predetermined criteria (3) verifying the suggested output against a validation repository, the validation repository having a large corpus of Arabic words.
Owner:SHERIKAT LINK LETATWEER ELBARMAGUEYAT E

Determining a compact model to transcribe the arabic language acoustically in a well defined basic phonetic study

In the development of an automatic speech recognition (ASR) system, an extensive study of the basic phonetic alphabet is performed to collect information regarding phonology and phonetics of the language or dialect in question (modern standard Arabic or MSA in this case). In addition, terminological and transcriptional problems are identified with respect to the language or dialect in question. Next, based on feature description (rather than symbol shapes), the symbols in the literature are mapped to a single or more recent phonetic alphabet. Lastly, from a maximal set containing all the phonemes, allophones, and transliteration symbols, a reduced set is created with a compact set of phonetic alphabets. Memory consumption is greatly reduced in a computer system by using this compact set of phonetic alphabets.
Owner:SAKHR SOFTWARE COMPANY

Method, system and computer program product for storing transliteration and/or phonetic spelling information in a text string class

A multi-field text string data structure is employed to encapsulating identification, meaning, and pronunciation information for a text string. A first field contains the Unicode characters for the text string in a language in which the text string is entered, which may be latin characters, characters which sound-map to latin characters, or one or more ideographs. A second field contains either the same characters or an intermediate representation of the text string, such as syllabary characters for a phonetic spelling of the characters within the first field. A third field contains either the same characters as the first field or a latin character phonetic spelling of the characters in the first field. The first field thus contains the text string in the language in which the text string was entered, while the second and third field contains information about the meaning and pronunciation of the text string. When the characters in the first field are unrecognizable to a user, or when the characters in the first field have more than one meaning or more than one pronunciation, the contents of the second and third fields allow the user to recognize the text string and / or perceive the correct meaning and pronunciation of the text string.
Owner:CERENCE OPERATING CO

Computer-implemented methods and systems for entering and searching for non-Roman-alphabet characters and related search systems

A computer-implemented method for selecting a desired Roman or non-Roman-alphabet character or objects from a set of non-Roman characters or objects may include steps of providing an association database that includes, for each non-Roman-alphabet character of the set, a Roman alphabet or other phonetic transliteration associated with each said non-Roman-alphabet character and a plurality of entries that are associated with each said non-Roman-alphabet character; receiving a phonetic transliteration of the desired non-Roman-alphabet character or data object and at least one associated entry that is associated with the desired non-Roman-alphabet character or other similar symbolic input; accessing the association database and identifying as candidate characters those characters of the set that are associated with the received phonetic transliteration and with the at least one received associated entry; if a number of candidate characters is greater than one, receiving additional associated entries and repeating the accessing and identifying step until a number of candidate characters is narrowed down to a single candidate character, and providing the single candidate character as the desired non-Roman-alphabet character. Also, derived from the principles described above, this invention includes a variety of methods for improving the efficiency of search engines through use of associations and other means of providing context for the item(s) being searched.
Owner:ORACLE INT CORP

Computer-implemented methods and systems for entering and searching for non-Roman-alphabet characters and related search systems

A computer-implemented method for selecting a desired Roman or non-Roman-alphabet character or objects from a set of non-Roman characters or objects may include steps of providing an association database that includes, for each non-Roman-alphabet character of the set, a Roman alphabet or other phonetic transliteration associated with each said non-Roman-alphabet character and a plurality of entries that are associated with each said non-Roman-alphabet character; receiving a phonetic transliteration of the desired non-Roman-alphabet character or data object and at least one associated entry that is associated with the desired non-Roman-alphabet character or other similar symbolic input; accessing the association database and identifying as candidate characters those characters of the set that are associated with the received phonetic transliteration and with the at least one received associated entry; if a number of candidate characters is greater than one, receiving additional associated entries and repeating the accessing and identifying step until a number of candidate characters is narrowed down to a single candidate character, and providing the single candidate character as the desired non-Roman-alphabet character. Also, derived from the principles described above, this invention includes a variety of methods for improving the efficiency of search engines through use of associations and other means of providing context for the item(s) being searched.
Owner:ORACLE INT CORP

Cyrillic to Latin script transliteration system and method

Embodiments of the present invention relate to methods, systems and computer-readable media for transliteration between Cyrillic and Latin script in a software product. An embodiment of this transliteration system and method comprises loading a text of characters and words in one of a Cyrillic or Latin script into a character transliteration module. This module converts each character in the one of a Cyrillic or Latin script into a corresponding opposite transliterated Cyrillic or Latin character. Then each word is examined in a word capitalization and exception module that compares each transliterated word against a set of predetermined grammatical rules to determine whether there are exceptions in capitalization. If there are, then appropriate internal capitalization of characters is added. Each word of the text to be transliterated is sequentially examined and converted until all words have been examined.
Owner:MICROSOFT TECH LICENSING LLC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products