US20210209185A1 - Dynamic language translation of web site content - Google Patents
Dynamic language translation of web site content Download PDFInfo
- Publication number
- US20210209185A1 US20210209185A1 US17/202,405 US202117202405A US2021209185A1 US 20210209185 A1 US20210209185 A1 US 20210209185A1 US 202117202405 A US202117202405 A US 202117202405A US 2021209185 A1 US2021209185 A1 US 2021209185A1
- Authority
- US
- United States
- Prior art keywords
- language
- content
- translated
- translation
- web
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000013519 translation Methods 0.000 title claims abstract description 385
- 238000000034 method Methods 0.000 claims abstract description 93
- 238000003860 storage Methods 0.000 claims abstract description 31
- 230000004044 response Effects 0.000 claims abstract description 29
- 238000004891 communication Methods 0.000 claims description 24
- 230000014616 translation Effects 0.000 description 378
- 230000008569 process Effects 0.000 description 65
- 238000010586 diagram Methods 0.000 description 61
- 239000003795 chemical substances by application Substances 0.000 description 55
- 241000239290 Araneae Species 0.000 description 49
- 239000000047 product Substances 0.000 description 47
- 235000014510 cooky Nutrition 0.000 description 30
- 230000015654 memory Effects 0.000 description 23
- 238000004590 computer program Methods 0.000 description 12
- 230000006870 function Effects 0.000 description 12
- 238000006243 chemical reaction Methods 0.000 description 11
- 230000008859 change Effects 0.000 description 10
- 230000000694 effects Effects 0.000 description 9
- 230000003287 optical effect Effects 0.000 description 8
- 238000005457 optimization Methods 0.000 description 8
- 230000009471 action Effects 0.000 description 7
- 230000006399 behavior Effects 0.000 description 7
- 238000012423 maintenance Methods 0.000 description 7
- 238000013507 mapping Methods 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 6
- 238000004422 calculation algorithm Methods 0.000 description 5
- 230000001419 dependent effect Effects 0.000 description 5
- 230000004807 localization Effects 0.000 description 5
- 230000014509 gene expression Effects 0.000 description 4
- 230000010354 integration Effects 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 238000000275 quality assurance Methods 0.000 description 4
- 238000012552 review Methods 0.000 description 4
- 230000003068 static effect Effects 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 3
- 230000010365 information processing Effects 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 230000008520 organization Effects 0.000 description 3
- 238000004513 sizing Methods 0.000 description 3
- 101000666896 Homo sapiens V-type immunoglobulin domain-containing suppressor of T-cell activation Proteins 0.000 description 2
- 102100038282 V-type immunoglobulin domain-containing suppressor of T-cell activation Human genes 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 230000009193 crawling Effects 0.000 description 2
- 239000000835 fiber Substances 0.000 description 2
- IJJVMEJXYNJXOJ-UHFFFAOYSA-N fluquinconazole Chemical compound C=1C=C(Cl)C=C(Cl)C=1N1C(=O)C2=CC(F)=CC=C2N=C1N1C=NC=N1 IJJVMEJXYNJXOJ-UHFFFAOYSA-N 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 241001522296 Erithacus rubecula Species 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 230000002688 persistence Effects 0.000 description 1
- 238000011176 pooling Methods 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 108020001568 subdomains Proteins 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/42—Data-driven translation
- G06F40/45—Example-based machine translation; Alignment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9537—Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/248—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/955—Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
- G06F16/9558—Details of hyperlinks; Management of linked annotations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/955—Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
- G06F16/9566—URL specific, e.g. using aliases, detecting broken or misspelled links
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/957—Browsing optimisation, e.g. caching or content distillation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
- G06F16/972—Access to data in other repository systems, e.g. legacy data or dynamic Web page generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/103—Formatting, i.e. changing of presentation of documents
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/103—Formatting, i.e. changing of presentation of documents
- G06F40/117—Tagging; Marking up; Designating a block; Setting of attributes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/14—Tree-structured documents
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/197—Version control
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/42—Data-driven translation
- G06F40/47—Machine-assisted translation, e.g. using translation memory
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/55—Rule-based translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/58—Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/02—Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/263—Language identification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/005—Language recognition
Definitions
- the present teaching generally relates to Internet applications, and more particularly relates to translation of web content.
- Another technique involves managing the translation process by deploying human translators and either maintaining multiple web sites for each language, or re-architecting the existing web site back-end technology to accommodate multiple languages. This requires significant resources in terms of time and cost, including a high level of complexity and duplication of effort.
- dynamic and e-commerce sites present other challenges as well, as the information to be translated resides in multiple places (e.g., a Structured Query Language database, static Hyper Text Markup Language pages and dynamic Hyper Text Markup Language page templates) and each translated site interfacing with the same e-commerce or back-end engine.
- a web site undergoes changes, it is important to handle ongoing maintenance properly.
- this approach may yield superior translations that are suitable for professional web sites of large organizations, it is at a great cost. Most organizations simply do not have, or do not want to invest in, the resources necessary to handle this task internally.
- FIG. 1 is a block diagram illustrating the system architecture of a conventional web site.
- the web site of FIG. 1 is presented in a first language, such as English.
- FIG. 1 shows a web server 112 connected to the Internet 116 via a web connection.
- a public user 118 such as a person using a computer with a web connection, can access the web server 112 via the Internet 116 and download information, such as a web page 114 , from the web server 112 for viewing.
- the web server 112 is operated by programming logic 110 , comprising instructions on how to retrieve, serve, and accept information for processing.
- the web server 112 further has access to a database 102 for storing information, as well as Hyper Text Markup Language (HTML) template files 104 , graphics files 106 and multimedia files 108 , all of which constitute the web site served by web server 112 .
- HTML Hyper Text Markup Language
- FIG. 2 is a block diagram illustrating the system architecture of a conventional web site presented in two languages.
- the web site of FIG. 2 is presented in a first language, such as English (as shown above for FIG. 1 ) and in a second language, such as Spanish.
- FIG. 2 shows the web server 112 and the other English language components described in FIG. 1 , including the database 102 of information, the HTML template files 104 , graphics files 106 , multimedia files 108 and programming logic 110 .
- FIG. 2 further shows the public user 118 accessing the web server 112 via the Internet 116 and downloading information, such as a web page 202 in English or Spanish language.
- FIG. 2 also includes components related to providing web content in Spanish language.
- FIG. 2 (has Spanish language components, including a database 208 of information, HTML template files 214 , graphics files 216 , multimedia files 210 and programming logic 212 .
- These Spanish language components are managed by a multi-lingual content manager 206 , which manages requests for information in the dual languages.
- FIG. 2 further shows that the web server 112 is re-engineered to serve multiple sets of content in different languages.
- the deployment of the Spanish language components 204 and multi-lingual content manager 206 of FIG. 2 requires a significant expenditure of effort and resources. Further, the deployment requires re-engineering the web server 112 , adding to the time and cost associated with the deployment. Additionally, once the Spanish language components 204 have been established, continuous synchronization with changes in the English language components results in a recurring cost.
- a system, method and computer readable medium in association with providing translated web content.
- a method, implemented on a computer having at least one processor, storage, and a communication platform for providing translated web content A request is first received from a user for content in a second language translated from content in a first language from a first Internet source.
- the content in the first language from the first Internet source is obtained.
- the content in the first language is divided into one or more translatable components, wherein a translatable component includes a segment of text. Whether the one or more translatable components have been previously translated, via at least one of machine translation, human translation, and a combination thereof, into the second language and stored as translated components in a storage is determined.
- the content is generated in the second language by modifying the content in the first language so that at least some translatable components are replaced with corresponding translated components, and the content is sent in the second language to the user as a response to the request.
- a method, implemented on a computer having at least one processor, storage, and a communication platform for providing translated web content A request is first received from a user for content in a second language translated from content in a first language accessible from a first Internet source. The content in the first language from the first Internet source is obtained. The content in the first language is divided into one or more translatable components, wherein a translatable component includes a segment of text. Whether any of the translatable components does not have a corresponding translated component stored in the storage and generated previously via at least one of machine translation, human translation, and a combination thereof is determined.
- Translation of the translatable components that do not have corresponding translated components via at least one of machine translation, human translation, and a combination thereof is scheduled to generate the corresponding translated components, wherein each segment of text is translated as a unit.
- the corresponding translated components are then stored in the storage.
- a method, implemented on a computer having at least one processor, storage, and a communication platform for managing language translation.
- Content in a first language is first accessed from an Internet source via a publicly available network path.
- a portion of the content in the first language that is not yet translated into a second language is identified.
- Translation of the portion of the content in the first language that is not yet translated into a second language using at least one of machine translation, human translation, and a combination thereof is then scheduled to produce corresponding content in the second language.
- FIG. 1 (PRIOR ART) is a block diagram illustrating the system architecture of a conventional web site
- FIG. 2 (PRIOR ART) is a block diagram illustrating the system architecture of a conventional web site presented in two languages;
- FIG. 3 is a block diagram illustrating an exemplary system architecture of a web site presented in two languages, in one embodiment of the present teaching
- FIG. 4 is a block diagram illustrating an exemplary system architecture of the present teaching, in one embodiment of the present teaching
- FIG. 5 is an operational flow diagram depicting an exemplary process of the translation server, according to one embodiment of the present teaching
- FIG. 6 is an operational flow diagram depicting an exemplary serving process of the translation server, according to one embodiment of the present teaching
- FIG. 7( a ) is a block diagram depicting an exemplary serving process in an ASP model of the translation server, according to one embodiment of the present teaching
- FIG. 7( b ) is a block diagram depicting an exemplary process in an ASP model of the translation server when the content to be translated is not present on the web site or is not delivered to the user via the web site, according to one embodiment of the present teaching;
- FIG. 8( a ) is a block diagram depicting an exemplary serving process in a web service model of the translation server, according to one embodiment of the present teaching
- FIG. 8( b ) is a block diagram depicting an exemplary serving process in a web service model of the translation server when the content to be translated is not present on the web site or is not delivered to the user via the web site, according to one embodiment of the present teaching;
- FIG. 9 is a screenshot of an exemplary WebCATT interface used for viewing web content for translation, in one embodiment of the present teaching.
- FIG. 10 is a screenshot of an exemplary WebCATT interface used for viewing a translatable image along with a corresponding translation, in one embodiment of the present teaching
- FIG. 11 is a screenshot of an exemplary WebCATT interface used for editing a translatable segment of text, in one embodiment of the present teaching
- FIG. 12 is a screenshot of an exemplary WebCATT interface used for viewing a translation queue, in one embodiment of the present teaching
- FIG. 13 is an operational flow diagram depicting an exemplary process of WebCATT, according to one embodiment of the present teaching
- FIG. 14 is an operational flow diagram depicting an exemplary process of the spider, according to one embodiment of the present teaching.
- FIG. 15 is an operational flow diagram depicting an exemplary synchronization process according to one embodiment of the present teaching.
- FIG. 16 is a block diagram showing a computer system useful for implementing the present teaching
- FIG. 17 is a screenshot of an exemplary Preference Selector pop-up window on the user agent (e.g., a browser), according to one embodiment of the present teaching
- FIG. 18 is an operational flow diagram depicting an exemplary process of loading Preference Selector, according to one embodiment of the present teaching
- FIG. 19 is an operational flow diagram depicting an exemplary process of the Preference Selector server-side application, according to one embodiment of the present teaching
- FIG. 20 is a block diagram depicting an exemplary process of the Preference Selector server-side application request and the response, according to one embodiment of the present teaching
- FIG. 21 is a block diagram illustrating an exemplary system architecture of the Content Localizer, according to one embodiment of the present teaching.
- FIG. 22 is an operational flow diagram depicting an exemplary process of the Content Localizer Server for generating localized content, according to one embodiment of the present teaching
- FIG. 23 is an operational flow diagram depicting an exemplary process of the Content Localizer Server for analyzing the request inputs against the conditions associated with a localized content to determine whether the conditions are met, according to one embodiment of the present teaching
- FIG. 24 is an operational flow diagram depicting an exemplary process of the Translation Server for recognizing the areas of the page to be localized, according to one embodiment of the present teaching.
- FIG. 25 is a block diagram depicting an exemplary process of the Content Localizer Server request and the response, according to one embodiment of the present teaching.
- the methods, systems, and medium, disclosed in accordance with present teaching overcome problems with the prior art by providing an efficient and easy-to-implement system and method for dynamic language translation of a web site.
- FIG. 3 is a block diagram illustrating an exemplary system architecture of a web site presented in two languages, according to one embodiment of the present teaching.
- the web site shown in FIG. 3 may be presented in a first language, such as English, and a second language, such as Spanish.
- FIG. 3 shows the web server 112 may be connected to the Internet 116 via a web connection.
- a public user 118 may access the web server 112 via the Internet 116 and download information, such as a web page, from the web server 112 for viewing.
- the user 118 may utilize a client application, such as a web browser, on a client computer to connect to the web site of via the network 116 .
- the user 118 may browse through the products or services offered by the web site by navigating through its web pages.
- the web server 112 is operated by programming logic 110 , and the web server 112 further has access to a database 102 of information, as well as HTML template files 104 , graphics files 106 and multimedia files 108 , all of which constitute the English components of the web site served by web server 112 .
- FIG. 3 further includes a translation server 300 situated apart from and existing independently from the web server 112 .
- the translation server 300 may embody the main functions of the present teaching, including the provision of a web site in a secondary language, such as Spanish.
- the translation server 300 may provide the secondary language components of a base web site, which is provided by web server 112 , without requiring integration with the base web site or re-configuring or re-engineering of the web server 112 .
- the deployment of the secondary language components FIG. 3 requires a significantly reduced expenditure of time and resources than the deployment of FIG. 2 . Further, in this example, the deployment of FIG. 3 does not require the re-engineering of the web server 112 . Additionally, once the secondary language components have been established by the translation server 300 , they are automatically kept synchronized with the English language components of the base web site. Thus, the system of the present teaching reduces the amount of time, effort and resources that are required to deploy a secondary language web site.
- FIG. 4 is a block diagram illustrating an exemplary system architecture of the present teaching, in one embodiment of the present teaching.
- FIG. 4 presents an alternative point of view of the system architecture of the present teaching.
- FIG. 4 shows a web site 414 representing a web site in a first language such as English that is connected to the Internet 412 via a web connection.
- FIG. 4 further shows a user 416 that utilizes a web connection to the Internet 412 to browse and navigate the web pages served by the web site 414 .
- FIG. 4 further shows a translation server 400 , corresponding to the translation server 300 of FIG. 3 , and a translation database 406 for use by the translation server 400 for storing translated components during the serving of web pages in a secondary language, such as Spanish.
- a secondary language such as Spanish.
- WebCATT Web Computer Aided Translation Tool
- FIG. 4 is also shown in FIG. 4 .
- the Web Computer Aided Translation Tool (WebCATT) which is a tool for aiding a human 418 or an admin 410 in translating the components of a web site in a first language.
- a spider 404 for use in synchronizing, analyzing and sizing a web site 414 .
- the translation server 400 , WebCATT tool 408 and spider 404 may be connected to a web server 402 , which is the conduit through which all web actions of the above tools are channeled.
- the translation server 400 , WebCATT tool 408 are described in greater detail below.
- the computer systems of translation server 400 , WebCATT tool 408 , spider 404 and web server 402 are one or more Personal Computers (PCs) (e.g., IBM or compatible PC workstations running the Microsoft Windows 95/98/2000/ME/CE/NT/XP/VISTA/7 operating system, Unix, Linux, Macintosh computers running the Mac OS operating system, ANDROID, or equivalent), Personal Digital Assistants (PDAs), tablets, smart phones, game consoles or any other information processing devices.
- PCs Personal Computers
- PDAs Personal Digital Assistants
- the computer systems of translation server 400 , WebCATT tool 408 , spider 404 and web server 402 are server systems (e.g., SUN Ultra workstations running the SunOS operating system or IBM RS/6000 workstations and servers running the AIX operating system).
- server systems e.g., SUN Ultra workstations running the SunOS operating system or IBM RS/6000 workstations and servers running the AIX operating system.
- Internet network 412 is a circuit switched network, such as the Public Service Telephone Network (PSTN).
- PSTN Public Service Telephone Network
- the network 412 is a packet switched network.
- the packet switched network includes a wide area network (WAN), such as the global Internet, a private WAN, a local area network (LAN), or any combination of the above-mentioned networks.
- WAN wide area network
- LAN local area network
- network 412 is a wired network, a wireless network, a broadcast network or a point-to-point network.
- network 412 is a communication path among different processes within the same physical hardware or memory space.
- network 412 is a combination of any of the above-mentioned networks.
- the translation server 400 is the application responsible for the conversion of web pages in one language to that in another language.
- the translation server 400 may parse each incoming HTML page into translatable components, substitute each incoming translatable component with an appropriate translated component, and return the translated web page back to the online user 416 .
- Page conversion may be performed on the fly each time an online user 416 requests a page in the second or alternate language.
- the translation server 400 will translate the page if enough translated content is available to meet a customer specified translation threshold. If this is not the case, then the page will be returned in the first or original language.
- a translatable component may include any one of a text segment, an image file with text to be translated, a multimedia file with text or audio to be translated, a file with text to be translated, a file with image with text to be translated, a file with audio to be translated, a file with video and with at least one of text and audio to be translated, or any other suitable file.
- a text segment may be a single word, a short phrase, a sentence, a paragraph or multiple paragraphs, or any other suitable segment.
- the page conversion process follows seven major steps, some of which may be optional.
- a first step for each text segment encountered, if a translation is available, the text segment may be replaced with the translated text segment. If no translation is available, either the text remains in the original language or a machine translation may be performed on the fly, depending on the customer's preference.
- a second step for each linked file (images, PDF files, Flash movies, etc.) encountered if a translated file is available, the HTML, link tag may be rewritten so that it points to the translated file. If a translated file is not available, the original link tag may be left untouched.
- any relative Universal Resource Locator (URL) found in the page may be converted to an absolute URL. This step may be necessary if the resolution of the relative URLs in the user agent (e.g., a browser) requires adjustment.
- URL Universal Resource Locator
- each JavaScript block may be parsed to identify translatable components, such as text or images, requiring translation.
- each link to another web page may be rewritten so that the original URL is redirected to the translation server 400 .
- the request then goes directly to the translation server 400 , and the page is in turn translated.
- This step may be necessary if resolution of relative URLs in the user agent (e.g., a browser) requires adjustment. This feature, which keeps the user in the alternate language as they browse the site, is called “implicit navigation”.
- a sixth step for each directive tag or attribute found, an appropriate instruction may be performed.
- the translation server 400 may automatically schedule the web page for translation by placing it in the WebCATT 408 translation queue, in the event that an available translation cannot be found for one or more text segments or linked files in the page.
- FIG. 5 is an operational flow diagram depicting an exemplary process of the translation server 400 , according to one embodiment of the present teaching.
- the operational flow diagram in FIG. 5 depicts how the translation server 400 responds to a user request for a web page in a secondary language.
- the operational flow diagram of FIG. 5 begins with step 502 and flows directly to step 504 .
- the translation server 400 may receive a request from a user 416 on a web site 414 , the web site 414 having a first web content in a first language, such as English.
- the request such as but not limited to an HTTP request or a Simple Mail Transfer Protocol (SMTP) request, may call for a second web content in a second language, such as Spanish.
- the second web content may be a human translation, machine translation, or human edited machine translation in a second language of the first web content.
- the first language includes any one of English, French, Spanish, German, Portuguese, Italian, Japanese, Chinese, Korean, Arabic, and any other suitable language
- the second language is different than the first language and includes any one of English, French, Spanish, German, Portuguese, Italian, Japanese, Chinese, Korean, Arabic, and any other suitable language.
- the translation server 400 may retrieve the first web content from the web site 414 .
- the translation server 400 may divide the first web content into one or more translatable components.
- the translation server 400 may identify one or more translated components of the second web content corresponding to one or more translatable components of the first web content.
- the translation server 400 may arrange or put the translated components of the second web content to preserve a format that corresponds to the first web content, including, for example, putting tags that are not visible in the first web content.
- the translation server 400 may provide the second web content in response to the request that was received.
- the control flow of FIG. 5 stops.
- FIG. 6 is an operational flow diagram depicting an exemplary serving process of the translation server 400 , according to an embodiment of the present teaching.
- the operational flow diagram of FIG. 6 depicts the process of the translation server 400 of providing a web page in a secondary language in response to a user request and provides more details of steps 508 - 514 of FIG. 5 .
- the operational flow diagram of FIG. 6 begins with step 601 and flows directly to step 602 .
- Step 601 begins with a source HTML, page or first web content of step 506 of FIG. 5 .
- step 602 at least one portion of the first web content may be parsed into translatable components.
- step 603 it may be determined whether the end of the file of the first web content is reached. If it is affirmative, then control flows to step 612 . Otherwise, control flows to step 604 .
- step 604 it may be determined whether the translatable component that was parsed in step 602 is a text segment. If it is affirmative, then control flows to step 606 . Otherwise, control flows to step 614 .
- a matching translated text segment may be looked up in a cache.
- step 607 it may be determined whether the matching translated text segment is found in the cache. If it is affirmative, then control flows to step 609 . Otherwise, control flows to step 618 .
- step 609 it may be determined whether translation of the text segment is suppressed or not yet translated. If it is affirmative, then control flows to step 621 . Otherwise, control flows to step 610 .
- the matching translated text segment may be set as a target segment.
- the current text segment may be set as the target segment.
- the target segment may be added to the output web content, or second web content (i.e., the translated HTML page or the output HTML page).
- the second web content may be output for provision to the user requesting the web page.
- step 612 it may be determined whether there is an incomplete translation of the current web page, i.e., the first web content. If it is affirmative, then control flows to step 613 . Otherwise, control flows to step 611 .
- step 613 the current web page may be scheduled for translation.
- step 611 the translation activity performed by the translation server 400 in servicing the current web page may be recorded in the translation database 406 .
- step 625 it may be determined whether the percentage of the current web page, i.e., the first web content, translated is above a threshold. If it is affirmative, then control flows to step 624 . Otherwise, control flows to step 626 .
- step 624 the second web content or translated HTML, page may be output for provision to the user requesting the web page.
- step 626 the current web page or first web content may be output unchanged for provision to the user requesting the web page.
- step 614 it may be determined whether the translatable component parsed in step 602 is a translatable file, such as a PDF file, an image file, etc. If it is affirmative, then control flows to step 615 . Otherwise, control flows to step 629 . In step 629 , it may be determined whether the translatable component parsed in step 602 is a link to another translatable page. If it is affirmative, then control flows to step 628 . Otherwise, control flows to step 627 . In step 627 , a tag may be added to the translated HTML page to indicate a link (this is described in greater detail below). In step 628 , the link may be modified to redirect the URL (this is described in greater detail below).
- a translatable file such as a PDF file, an image file, etc.
- a translated file corresponding to the translatable file may be looked up in a cache.
- step 616 it may be determined whether the translated file was found. If it is affirmative, then control flows to step 617 . Otherwise, control flows to step 633 .
- step 633 the translated file may be looked up in the translation database 406 .
- step 635 it may be determined whether the translated file was found. If it is affirmative, then control flows to step 634 . Otherwise, control flows to step 632 .
- the translated file that was found may be stored in the cache.
- step 632 an incomplete translation may be recorded in the translation database 406 .
- step 630 the original file may be set as the target file.
- the target file may be added to the translated HTML page.
- step 617 it may be determined whether translation is suppressed for the translatable file. If it is affirmative, then control flows to step 630 . Otherwise, control flows to step 636 .
- step 636 the translated file may be set as the target file.
- step 618 a matching translated text segment may be looked up in the translation database 406 .
- step 622 it may be determined whether the matching translated text segment is found in the database. If it is affirmative, then control flows to step 619 . Otherwise, control flows to step 637 .
- step 619 the translated segment that was found is stored in the cache.
- an incomplete translation may be recorded in the translation database 406 .
- step 638 it may be determined whether a machine translation of the text segment can be performed. If it is affirmative, then control flows to step 639 . Otherwise, control flows to step 621 . In step 639 , the machine translation may be set as the target segment.
- the translation server 400 can be presented in a variety of models. For example, in the Application Service Provider (ASP) model, the translation server 400 may convert full web pages or script files at a time and deliver them directly to the online user 416 . Under this model, all links in a web page may be redirected through the translation server 400 .
- ASP Application Service Provider
- Clicking on a link in a translated page results in the user agent (e.g., a browser) request being sent to the translation server 400 .
- the translation server 400 in turn may request the original language page from the original language web server 414 , convert it to the alternate language, and send it back to the user 416 .
- FIG. 7( a ) is a block diagram depicting an exemplary serving process in an ASP model of the translation server 400 , according to one embodiment of the present teaching.
- the user 416 may click on a link of a web page in a first language on the web site 414 .
- the link points to a page to be translated.
- the translation server 400 may receive the request and process it.
- the translation server 400 may forward the request to the web site 414 , and in a third step 706 , the web site 414 may provide the page to the translation server 400 for translation.
- the translation server 400 may translate the page using the translations in the translation database 406 and send the translated page to the user 416 .
- FIG. 7( b ) is a block diagram depicting an exemplary translation process of the translation server based on an ASP model when the content to be translated is not present on the customer web site 414 or is not delivered to the user via web site 414 , according to an embodiment of the present teaching.
- the content to be translated in this embodiment includes, but is not limited to, electronic mails and/or other types of messages (e.g., messages that use protocols and services such as SMTP, SMS and MMS).
- an application e.g., an electronic mail application or a text message application
- the Translation Server 400 may optionally store the content to be translated and schedule it for translation.
- the Translation Server 400 may send the translated content to the user 416 . In one example, step 3 may take place some time after step 2 .
- the translated content may not be delivered directly to the online user 416 .
- the customer's web site server 414 may issue the request for translation to the translation server 400 , which acts as a web translation service.
- the translation server 400 can convert full pages or just specific text segments and/or files. When directly translating text segments or files, multiple translation requests can be issued, one per segment or file, or multiple segments and files can be translated in a single batched request.
- FIG. 8( a ) is a block diagram depicting an exemplary serving process in a web service model of the translation server 400 , according to an embodiment of the present teaching.
- the user 416 may click on a link of a web page in a first language on the web site 414 .
- the link points to a page to be translated.
- the web site server 414 may receive the request and processes it.
- the web site 414 may provide the page to the translation server 400 for translation.
- the translation server 400 may provide the translated page to the web site 414 .
- the web site 414 may send the translated page to the user 416 .
- FIG. 8( b ) is a block diagram depicting an exemplary serving process in a web service model of the translation server when the content to be translated is not present on the web site 414 or is not delivered to the user via the web site 414 , according to an embodiment of the present teaching.
- the content to be translated in this operational mode includes, but is not limited to, electronic mails and other types of text messages (e.g., ones that use protocols and services such as SMTP, SMS and MMS).
- a customer application e.g., an electronic mail application or a text messaging service application
- a translation if a translation is not found for either all or a part of the content to be translated, the Translation Server 400 may optionally store the content and schedule it for translation.
- the Translation Server 400 may send the translated content back to the customer application running on web site 414 . In one example, step 3 may take place some time after step 2 .
- the customer application may send the translated content back to the user 416 .
- the hosting and management model may define who deploys and manages the hardware and operating system software in which the software components of the present teaching reside.
- the hosted and managed model may be a fully outsourced model in which one entity hosts the service and all translated data. Under this model, one entity may deploy the translation server 400 and WebCATT 408 software on its own hardware. All hardware and software may be provisioned and maintained by this entity, so the customer web site 414 has no responsibility for any hardware or software related to the service.
- the hosting entity may be responsible for: 1) provisioning, installing, configuring and maintaining all hardware, including communication to the Internet 412 , 2) installing, configuring and maintaining all operating system, web server and database server software, 3) installing, configuring and managing on an ongoing basis the translation server 400 and WebCATT 408 software, and 4) maintaining staff and subcontractors that use the WebCATT 408 software to perform the translations that maintain the alternate language site in sync with the original language site.
- the translation server 400 and WebCATT 408 software may be installed on the customer web site's hardware.
- the customer web site 414 maybe responsible for: 1) provisioning, installing, configuring and maintaining all hardware, including communication to the Internet 412 , and 2) installing, configuring and maintaining all operating system, web server and database server software.
- the managing entity may be responsible for: 1) installing, configuring and managing on an ongoing basis the translation server 400 and WebCATT 408 software, and 2) maintaining staff and subcontractors that use the WebCATT 408 software to perform the translations that maintain the alternate language site in sync with the original language site.
- the components of the present teaching can be deployed in dedicated or shared server environments.
- multiple customer web sites may share the same hardware.
- multiple translation servers 400 may be installed in the same web server 402 , which connects to a database server containing the database 406 of translated data.
- a single WebCATT 408 software installation may also be shared by multiple customers. This setup is cost efficient and can be used for small and medium size sites with low-to-moderate web site traffic.
- a dedicated environment all hardware may be dedicated to one customer web site 414 . This may be necessary for large organizations with heavy web site traffic and large amounts of text to be translated.
- a single web server 402 or a cluster of web servers may be dedicated to the customer.
- the database server normally may also be dedicated to the customer. Dedicated servers may be used to assure guaranteed bandwidth for the customer and simplify keeping track of bandwidth usage for management and billing purposes.
- the system of the present teaching may not save or maintain translated pages, except, e.g., in temporary caches for the purpose of improving response performance. Although, this may be useful for sites with static content, it becomes unmanageable for sites whose content is generated dynamically from database information in response to a user's request. Instead, the present teaching may be designed to store only those components within a web page that require translation, i.e., translatable components.
- a translatable component may include any one of a text segment, an image file with text to be translated, a multimedia file with text or audio to be translated, a file with text to be translated, a file with image with to be translated, a file with audio to be translated and a file with video and with at least one of text and audio to be translated.
- a text segment is a chunk of text on a page.
- a text segment can range from a single word to a paragraph or multiple paragraphs.
- a file is any type of external content that resides on a file, is linked from within the page, and may require translation. Typical types of linked files found in web pages include, but are not limited to, images, PDF files, MS Word documents, and Flash movies.
- the above example page may be parsed into the following six text segments: 1) ‘Widget Product Information’, 2) ‘Widget’, 3) ‘Model# 123’, 4) ‘This widget is very useful for many chores around the house’, 5) ‘Product photo’, and 6) ‘Click here to return to the home page’.
- the above example page would further be parsed into the following one file: img/widget_picture.gif.
- the parsing system may break-up text segments taking into consideration the surrounding HTML tags in the page.
- the sentence ‘Widget Model# 123’ was broken-up into two segments because there was an HTML bold tag ( ⁇ b>) in the middle of it.
- the parsing system may be flexible and allow defining, which HTML tags are formatting tags that do not break up text segments.
- the example page would instead be parsed into the following five text segments: 1) ‘Widget Product Information’, 2) ‘Widget ⁇ b>Model# 123 ⁇ /b>’, 3) ‘This widget is very useful for many chores around the house’, 4) ‘Product photo’, and 5) ‘Click here to return to the home page’.
- the translation server 400 may perform several changes to the page. Each text segment may be replaced with a corresponding translation. It is noted that the text of the image description (‘Product photo’) placed in the ‘all’ attribute of the image tag may be recognized as a text segment and translated.
- the translation server 400 can recognize text segments inside attributes of HTML tags, such as the text in buttons of a form.
- the URL of the image tag may be replaced to point to a translated image file.
- the translation server 400 may only execute this action if a translated file has been defined (since many images do not have text and thus do not require translation), otherwise it may not change the URL of the image (except to make the URL absolute if necessary).
- the ‘ES.sub.--24.gif’ image file was defined in WebCATT 408 as the translation for the ‘widget_picture.gif’ file.
- the URL of the home page link may be rewritten from ‘https://www.abcwidgets.com’ to ‘https://espanol.abcwidgets.com’ in order to redirect it to the translation server 400 .
- the online user clicks on the ‘Click here to return to the home page’ link the request may go directly to the translation server 400 , and the home page may also be translated. This process is called “implicit navigation”, and it is explained in more detail below.
- Implicit navigation is a translation server 400 feature that keeps an online user 416 in the alternate language as he/she browses a web site. Implicit navigation can be made automatically because the domain name of a translated site may be different from the domain name of the original language site, or if necessary may be implemented by rewriting the URLs in the applicable links inside a page as the page is being translated, so they are redirected to the translation server 400 . As a result, not only is the page translated, but also all applicable links to other translated pages within the page may be modified when needed if necessary, so that when the consumer clicks on the linked page, the translation is available.
- the translation server 400 may change the domain name in the original URL with the domain name of the translation server 400 .
- the request may go to the translation server 400 , which computes the original URL to be translated based on the path and/or its internal mappings and request the page to be translated from this URL.
- the translation server 400 then may convert the page received to the alternate language and deliver the translated page to the consumer directly.
- the scope of implicit navigation can be pre-defined by domain and/or URL patterns.
- domain and/or URL patterns In a typical scenario, only pages being served from a specific domain(s) may be translated.
- the implicit navigation domains are defined as abcwidgets.com and abcwidgets.net, then only URLs within those two domains will be rewritten.
- URL patterns can be used. For example, if ABC Widgets wishes not to translate the careers and investor relations sections of their site, then the following two example Exclude URL patterns could be used: 1) abcwidgets.com/careers/ and 2) abcwidgets.com/investor/.
- the system according to the present teaching enables translation and optimization of URLs in order to improve the ranking of the translated pages on search engine indexes.
- the original URLs on the customer web site 414 contain words or phrases in the first language that may be relevant or optimized for search engines
- such words and phrases can be translated by the Translation Server 400 into the second language to derive translated URLs on the translated web site. This allows the translated web site in the second language to maintain the search engine URL optimization of the customer web site 414 .
- the original URL representing the web content in the first language
- a translated URL component can be obtained through translation into the second language, so that such a translated URL component can be used to replace the corresponding translatable URL component in the original URL.
- a translated URL may then be derived once the relevant translatable URL component(s) is replaced with corresponding translated URL component(s).
- the translated URL components can be stored for future re-use. This URL translation process can be applied to both search engine optimized URLs or other URLs that have not been search engine optimized
- Search engines such as GOOGLE, place a great emphasis on keywords found on a URL versus keywords found within the content of a page. As a result, it would be beneficial for websites to place search keywords in the actual URL of the page and minimize the use of other, e.g., cryptic parameters.
- search engines such as GOOGLE
- this due to the restrictions of e-commerce engines and/or the great difficulty associated with changing the URL structure of a website, this is rarely done.
- the system according to the present teaching provides a solution to this problem by generating search engine optimized URLs in the second language that map to the customer web site's 414 non-optimized original URLs. For example, on a Spanish site the above ABC Widgets SONY BRAVIA 46′′ LCD HDTV original URL can be translated into:
- the above translated URL is optimized to contain keywords that include the category, manufacturer, brand model number, and short description of the product in Spanish, which makes the URL optimized for search engines.
- the Translation Server 400 may optimize an original URL by identifying (disclosed below) search engine relevant content already present on the page in the first language and placing the corresponding content in the second language on the URL of that translated page. For example, below is the HTML content that the ABC Widgets SONY BRAVIA 46′′ LCD HDTV original URL returns in the first language:
- the Translation Server 400 can automatically pick the most relevant element in the page based on which element better describes the content of the page, or such element can be manually predefined. Alternatively, the most relevant element may be determined in a semi-automated manner. For example, the Translation Server 400 may automatically detect candidates of relevant elements, and a human operator may then interact with the Translation Server 400 to select one or more candidates of relevant elements as the most relevant element to be used to generate optimized URL. The human operator may also manipulate or even edit some candidate relevant elements to make them, e.g., capitalized, boldfaced, highlighted, etc. In addition, any arbitrary content in the page can be flagged as the most relevant for URL optimization via the use of Directive Tags.
- the title of the document is identified as the most search engine relevant element in the page, which is shown again below:
- the Translation Server 400 may then look for a matching translation of the text of that element in the second language. Translation of this text typically occurs within the normal workflow of the translation of the page. For the given example, the Spanish translation of the above title is:
- the translation of the title is then converted into a URL friendly format to derive a translated search engine optimized path. In some embodiments, this is done by performing e.g., the following steps:
- the resulting translated search engine optimized path obtained from the translated title is:
- the Translation Server 400 may then use the translated search engine optimized path to generate a search engine optimized URL in the second language.
- the process to achieve that may start by breaking up the original URL on the customer web site into its origin host, path, and query string elements, as shown below:
- Each parameter may be examined to determine whether the value of the parameter contributes to an identification that uniquely identifies the content, in this case a product.
- all the parameters, except the session parameter are considered to contribute to an identification that can uniquely identify the product.
- a session parameter is specific to a user's session and may change over time.
- Parameters that contribute to an identification that uniquely identifies the content may be included in the search engine optimized URL and parameters that do not may be excluded. For the given example, the following parameters are either included or excluded:
- the origin path can be mapped to the translated search engine optimized path, which can then be used in place of the origin path and included parameters.
- the origin host may be replaced by the host name of the translated site.
- the excluded parameters may be added back to the translated URL after it has been optimized. For example, the resulting translated search engine optimized URL in the second language is shown below:
- the search engine optimized path can also be obtained from another part of the document instead of the title. This can include the H1 header, a meta-description, or any arbitrary content in the page identified by specific Directive Tags.
- the Translation Server 400 may automatically convert the search engine optimized URL into the equivalent non-optimized original URL representing the customer web site based on the above described mappings in order to retrieve the actual content for translation.
- the Translation Server 400 looks up the translated optimized path in the database and finds the corresponding origin path and included parameters. It then replaces the translated optimized path with the origin path and adds the included parameters to the query string.
- an identifier that uniquely identifies the mapping in the database may be added to the translated search engine optimized URL. For example, if the origin path and the included parameters are mapped to the translated search engine optimized path in the database using an identifier, e.g., a numeric identifier, then this identifier can be added to the translated search engine optimized URL. Using such an identifier in the translated search engine optimized URL improves the performance in looking up the mapping, making the lookup operation resilient to changes in the translated text incorporated in the URL.
- an identifier e.g., a numeric identifier
- the use of an identifier (rather than the translated text) to lookup the mapping ensures that the correct origin path and parameters are correctly retrieved from the database.
- the identifier in the URL may also be encoded to reduce the required space needed for the URL.
- the system of the present teaching may enable users to access the same original language e-commerce database in multiple languages. Since the translation server 400 may process web pages after they have left the customer web site 414 , but before they reach the user 416 , it may not affect a web server's e-commerce technology. As a result, the same web site 414 can be accessed in multiple languages, and all users may access the same e-commerce database simultaneously.
- an auction web site can allow users in different countries to bid on the same item. Each user can view the site and bid on the item in his/her native language. Since all bids from the different countries are actually hitting the same web site and the same e-commerce engine through the translation server, all bids occur in real time, and each user can see in real- time what all the other users in all other countries are bidding.
- the meaning of a word or phrase may change depending on the context in which it's being used. It is also possible that the translation itself may vary depending on the context or placement of a text segment, even if the original meaning does not change. As a result, it may be necessary to specify multiple translations for the same word or phrase, one for each usage context.
- the system of the present teaching allows translators to do this by providing the ability to “lock” text segments together. When two or more text segments are locked together they may be used only when the exact translation sequence is followed.
- the translation to Spanish of the text segment “Virtual Brochures” can vary, depending on where it is used. Below is this segment used in an English HTML sentence: ⁇ b>Virtual Brochures ⁇ /b>are great. The corresponding translation to Spanish is: ⁇ b>Los Folletos Virtuales ⁇ /b>son convenientes. Another example of a segment used in an English HTML sentence: There are many great ⁇ b>Virtual Brochures ⁇ /b>. The corresponding translation to Spanish is: Hay muchos permittedes ⁇ b>Folletos Virtuales ⁇ /b>
- the translation server 400 looks up a corresponding translated segment and gets back two potential matches: “Los Folletos Virtuales” and “Folletos Virtuales”. It then proceeds to look up a translated segment for the next segment “are great” and gets back “son substantiales”. Since “son substantiales” is locked to “Los Folletos Virtuales”, the translation server 400 is able to determine that “Los Folletos Virtuales” is the correct translation to the previous segment “Virtual Brochures”.
- the translation server 400 may transparently handle form submissions via GET or POST methods. This means that all form data may be forwarded to the original URL that processes the form and that the response page may be converted to the alternate language.
- the translation server 400 is capable of translating text segments and files located inside JavaScript code, VBScript code, CSS code, XML, AJAX messages, AMF code and many other complex web based technologies and formats by parsing the code or message and recognizing translatable components.
- a script included file may be downloaded by the user agent (e.g., a browser) in a separate HTTP request and included in the web page as if it had appeared within the page.
- Script included files may be handled in the same manner as implicit navigation in standard links within the page.
- the user agent may request the script included file from the translation server 400 , which will compute the URL of the original script included file and request it from its location.
- the translation server 400 then may read the file, perform the appropriate conversions, and deliver the modified file to the user agent for inclusion in the web page.
- Directive tags and directive attributes are special HTML tags and attributes that allow more granular control over the translation, implicit navigation and other translation server behavior within in a web page.
- Directive tags are special HTML comments tags that are ignored by the user agent (e.g., a browser), but provide specific instructions to the translation server 400 .
- Directive attributes are specially named attributes placed within an HTML tag that are also ignored by the user agent (e.g., a browser), but provide specific instructions to the translation server 400 that apply only to the tag in which the attribute is placed.
- Translation control tags and attributes can be used to specify sections on a web page that should not get translated.
- One application of translation control tags is to delimit personal information, such as a person's name, address, credit card numbers, etc. that may show up in a web page, but which may not need to be processed—it may simply pass through the translation server 400 without being translated or stored—for security and privacy issues.
- the directive tag “mp_trans_partial_start & mp_trans_partial_end” signals the start and end of a partial translation section. This tag may be used at the top of a web page in conjunction with section translate tags to selectively translate sections of a page.
- the directive tag “mp_trans_enable_start & mp_trans_enable_end” signals the start and end of a section to be translated within a partial translation section. All text and files within this section may be translated.
- the directive tag “mp_trans_disable_start & mp_trans_disable_end” signals the start and end of a section not to be translated when in normal translation mode.
- the directive tag “mp_trans_machine_start & mp_trans_machine_end” signals that any text segments enclosed within the tags may be machine translated in the event that a human translation is not available.
- the directive attribute “mpdistrans” disables translation of a file or of translatable text in a tag, such as alt, keywords or description meta-tag, or form buttons.
- the directive attribute “mpnav” enables implicit navigation for listed attributes in the tag. This attribute can be used for tags that do not normally contain URLs, but actually do contain URLs.
- the directive attribute “mpdisnav” disables implicit navigation for all attributes or only listed attributes of the tag.
- the directive attribute “mporgnav” forces original navigation for all attributes or only listed attributes of the tag. Original navigation may remove redirection to the translation server if found, otherwise it may leave the link intact. This directive attribute is discussed below with reference to one-link deployment.
- the translation server 400 may process the above page as follows:
- One aspect of the present teaching is to eliminate or minimize the workload of a customer web site's IT department in order to deploy an alternate language web site.
- One-link deployment may allow a customer to deploy the alternate language web site by simply placing one language-switching link in the home page, navigation menu, or any other appropriate area of the original language site.
- the one-link deployment may be a combination of two features: (1) automatic flipping of the language-switching link, and (2) implicit navigation to maintain the user in the alternate language.
- Automatic flipping of the language-switching link is specified by using the exemplary mporgnav directive attribute in the language-switching link.
- the mporgnav directive attribute may instruct the translation server 400 to rewrite the URL to support automatic language switching.
- a mirror Spanish language web site may be deployed by placing one link in the home page that redirects the home page to ABC Widget's translation server 400 .
- a mirror Spanish language web site may be deployed by placing one link in the home page that redirects the home page to ABC Widget's translation server 400 .
- the translation server 400 may return the home page translated, as shown below:
- the translation server 400 may also rewrite the URL in the language-switching link and perform implicit navigation of all other URLs in the page.
- the translation server 400 may rewrite the URL in the language-switching link so that the translation server 400 redirection is removed.
- the exemplary mporgnav directive attribute may be used to instruct the translation server 400 to do this.
- the link text ‘Click here to see this site in Spanish’ may be translated as ‘Haga project aqui para ver este sitio web en Ingles’ (which means ‘Click here to see this site in English’).
- This automatic and simultaneous change of both the URL and the text (or image) in the language-switching link by the translation server 400 is what allows the user to flip back-and-forth between English and Spanish.
- Implicit navigation may be also performed in all the links on the page. In the above example home page, it was performed on the widgets.jsp page. As a result, when a user clicks on this rewritten link, the widgets.jsp page is in turn translated and implicit navigation performed on all of its links within the abcwidgets.com domain. This process may be repeated so that the user is always navigating the site in the alternate language.
- the translation server 400 may allow delivering customized content according to the language and/or location in which a user is viewing the site.
- the translation server 400 when the translation server 400 requests a web page for translation, it sends two cookies to the original web server: one for language and another one for the country.
- the value of the language cookie is a 2 or 3-letter language code in compliance with the ISO 639 standard.
- the value of the country cookie is a 2-letter country code in compliance with the ISO 3166 standard.
- Web site server software can determine if a page is being viewed in an alternate language and/or a different country by checking for these cookies. For example, by checking that the language cookie exists, and that its value is ‘ES’, a web server can determine that a page is being served in Spanish and customize the content being served, such as sselling items that appeal more to Hispanics. In addition, if a company maintains operations in multiple countries, then it can use the country cookie to determine the country and show only products sold or shipped to that country.
- search engine may not be able to find any matching results, or might deliver incorrect results. This occurs because the web server search engine is matching the keyword(s) in the alternate language against a search index of keywords that are in the original language.
- the translation server 400 provides a solution to this problem by performing a real-time reverse machine translation on the search keyword(s) and forwarding the keyword(s) to the web server search engine in the original language.
- Reverse machine translation may be configured so it may be performed only on the specific keyword field(s) of the search form(s) in a web site.
- the system of the present teaching is compatible with all Internet search engines, such as GOOGLE or ALTAVISTA. These search engines utilize content from both the body and head of the HTML document to index a web page. To ensure transparent compatibility with Internet search engines, the system of the present teaching may translate all applicable text in the head of the document. This includes, but is not limited to the page title, the page description meta-tag, and the keywords meta-tag.
- the translation server 400 uses real-time machine translation in the event that a human translation is not (yet) available.
- machine translation can be used as input or starting point for human translation or human post-editing. In that case, a human translator or editor post-edits the translation generated by machine translation to improve the translation.
- frequently used data is cached in memory to minimize repeated access to the database 406 .
- the translation server 400 may make extensive use of memory caches to improve response performance. This includes, but is not limited to a text segment cache, a file cache, and a page cache.
- the translation server 400 may not require IT integration with an existing web site infrastructure.
- the present teaching may convert the outbound HTML stream after it has left the client web server 414 .
- Translated data may be managed and maintained by the WebCATT 408 software outside of the web site's database.
- the translation server 400 may also work with any client web server hardware and software technology infrastructure. Further, it allows for evolution of the existing client's hardware and software technology infrastructure. Moreover, deployment of the present teaching requires minimal effort as a reduced amount of client IT resources are required. One-link deployment allows the client to place one link on the web site 414 to provide access to the alternate language web site. Therefore, deployment is rapid and cost effective.
- the WebCATT (Web Computer Aided Translation Tool) 408 is a web based Graphical User Interface (GUI) application that is used to perform and manage human translations.
- GUI Graphical User Interface
- the tool may be built specifically translation of web content. It can be used by professional translators to translate web site translatable components and by managers to manage the translation process. Since WebCATT 408 is a web-based application that is accessed via the Internet 412 , translators and managers can be located in different geographical areas.
- WebCATT 408 may be similar to other computer aided translation tools used by professional translation service organizations. WebCATT 408 may support localization, text recognition, fuzzy matching, translation memory, internal repetitions, alignment, and a glossary/terminology database. WebCATT 408 may be designed for web site translation and include other features optimized for web translation, such as What You See Is What You Get (WYSIWYG) HTML previewing and support for image/graphic translation.
- WYSIWYG What You See Is What You Get
- WebCATT 408 may organize the translation workload into web pages.
- a web page may be, for example the HTML, XML, JavaScript, CSS or other type of web content generated by a specific URL address, regardless of whether that content is static (i.e., physically resides in the web server in a file), or dynamic (i.e., the content is generated dynamically by combining information from a database and HTML templates). Dynamic pages that are dependent on session information (i.e., a shopping cart checkout page) may be also supported.
- a text segment is a chunk of text on the page.
- a text segment can range from a single word to a paragraph or multiple paragraphs.
- a file is any type of external content that resides on a file, is linked from within the page, and may require translation. Typical types of files found in web pages include, but are not limited to images, PDF files, MS Word documents, and Flash movies.
- a file may be translated by uploading a replacement file that has all text and/or sounds translated.
- FIG. 9 is a screenshot of an exemplary WebCATT interface used for viewing the content of a web page, in one embodiment of the present teaching.
- FIG. 9 shows a display area 902 in which a web page including translatable component in a first language (in this case, English) is displayed.
- a section 904 including information associated with the web page displayed in display area 902 , such as page status, page URL, page ID, etc.
- a section 906 including statistics associated with the web site from which the displayed web page is garnered, such as the number of files translated, the number of segments translated, the number of translations suppressed, etc.
- FIG. 10 is a screenshot of an exemplary WebCATT interface used for viewing a translatable component along with a corresponding translation, in one embodiment of the present teaching.
- FIG. 10 shows a display area 1002 in which an original image file translatable component is displayed in a first language (in this case, English).
- FIG. 10 shows a display area 1004 in which a translated image file is displayed in a second language (in this case, Spanish).
- a section 1006 including information associated with the file displayed in display areas 1002 - 1004 , such as file status, file URL, file ID, etc.
- FIG. 10 shows how WebCATT 408 allows a user to view a translatable component alongside a corresponding translated component for comparison.
- FIG. 11 is a screenshot of an exemplary WebCATT interface used for editing a translatable component, in one embodiment of the present teaching.
- FIG. 11 shows a display area 1102 in which a web page including a translated component in a second language (in this case, Spanish) is displayed.
- the display area 1102 provides a WYSIWYG web page preview feature that allows viewing the translated web page as it is being translated. Translations can often result in a significant amount of word growth (e.g., approx. 20% from English to Spanish) or shrinkage, which can result in carefully formatted web page layouts being knocked out of alignment by the longer text.
- the WYSIWYG page preview feature allows translators to immediately see the translated web pages and quickly make adjustments in word choice in order to maintain the correct alignment and layout of the page when translated.
- a section 1104 including information associated with the web page displayed in display area 1102 , such as page status, page URL, page ID, etc.
- a section 1106 including statistics associated with the web site from which the displayed web page is garnered, such as the number of files translated, the number of segments translated, the number of translations suppressed, etc. In addition to each of those statistics, a breakdown of translated and not translated components is shown in both units and percentages.
- a section 1110 provides a text segment edit form that allows a translator to edit text segments in the order they appear on the page.
- This form features a fuzzy search feature that automatically shows and sorts existing segment matches in the database.
- the translator can copy an existing translation from the search results area to use as a starting translation.
- a section 1108 provides a file list form that allows a translator to preview all linked files on the page.
- the list form allows the translator to select all files that do not require translation (e.g., an image with no text) and quickly tag them as such. It also allows a translator to select individual files for translation via the file edit form.
- File translation may involve uploading a translated file and translating the file text description if present.
- the GUI as shown FIG. 11 enables a user to view the plurality of translated components placed into the format derived from the first, or source, content, thereby enabling a user to review how the translated components are rendered in the first content format.
- the GUI of FIG. 11 further allows a user to highlight any of the plurality of translatable components, which are not yet translated, differently from translated components when previewing the plurality of translated components in the first content format.
- the GUI of FIG. 11 further allows a user to display text when hovering over a translated component so as to view the first content corresponding to the translated component.
- the GUI as shown FIG. 11 further enables a user to select at least one of the translated components when previewing the plurality of translated components in the first content format so as to edit the translated component and store the translated component that has been revised with the corresponding unique identifier.
- the GUI of FIG. 11 further allows previewing in a multi-user environment so that more than one user can simultaneously view translated components rendered in the first content format.
- WebCATT 408 also provides complete management of the translation process.
- Web pages may be scheduled for translation either automatically by the translation server 400 , or manually by a manager via upload of web pages or other type of content to be translated.
- a web page When a web page is scheduled for translation, it may be placed in the translation queue of a specific customer.
- Pages to be translated may be scheduled for translation on a priority basis based on pre-defined priority information or using algorithms, such as ones based on the percentage of the page already translated and how often the page is being accessed on the original web server while it's in the translation queue. This allows the most important pages (e.g., most frequently accessed and those with smaller changes) to be translated first.
- a manager can assign them for translation to a specific translator or translation service subcontractor. If assigned to a subcontractor, a subcontractor manager can then assign them to specific translators within the subcontractor organization or even to freelancers that work with them. Proofers can also be assigned. A subcontractor can assign its own proofers to pages and managers can also assign proofers to check the work of translators or subcontractors.
- a web page may go through a series of status changes before it is available via the Internet.
- the status changes follow a translation workflow that allows translation, editing, proofing, and activation.
- only active pages may be made available via the Internet.
- the text and files within the page may maintain their own translation status.
- the status for text segments and files may be maintained both at the page level (i.e., one single overall status for all segments in the page and another one for all files in the page) and individually.
- the status of text segments and files may change following a translation workflow that allows translation, editing, proofing, and activation Translated segments and files may be available via the Internet only after their status is set to active.
- FIG. 12 is a screenshot of an exemplary WebCATT interface used for viewing a translation queue, in one embodiment of the present teaching.
- FIG. 12 shows a series of columns wherein a unit of information is provided for each page of the web site 414 listed on each row.
- FIG. 12 shows a first column 1202 including unique page identifiers.
- Column 1204 includes a URL for each page.
- Column 1206 includes receipt data for each page.
- Column 1208 includes a percentage statistic indicating the percentage of the page that has been translated.
- Column 1210 indicates a status for each page.
- Column 1212 indicates the contractor assigned to the page.
- FIG. 13 is an operational flow diagram depicting an exemplary process of WebCATT 408 , according to an embodiment of the present teaching.
- the operational flow diagram of FIG. 13 depicts the process by which WebCATT 408 , which provides a web based tool for managing language translations of content, queues, and translates components of a web site 414 .
- the operational flow diagram of FIG. 13 begins with step 1302 and flows directly to step 1304 .
- WebCATT 408 may retrieve a first content, or HTML, source page, in a first language from the web site 414 .
- WebCATT 408 may parse the first content into one or more translatable components.
- WebCATT 408 may queue the translatable components for human translation or human edited machine translation into a second language.
- step 1308 for each of the translatable components it may be determined whether to invoke machine translation. If it is affirmative, then control flows to step 1314 . Otherwise, control flows to step 1312 .
- WebCATT 408 may provide a translatable component for human translation into a second language.
- WebCATT 408 may perform machine translation on a translatable component into a second language.
- WebCATT 408 may provide the machine translated component for human post-editing.
- step 1318 for each of the translatable components, WebCATT 408 may store a translated component corresponding to the translatable component, thereby storing a plurality of translated components.
- step 1320 the control flow of FIG. 13 stops.
- WebCATT 408 allows translators to work directly with live pages off the web site 414 being translated. Thus, the client web site 414 need not send information to the translation server 400 for translation. Furthermore, all web pages in a web site may be automatically entered into the translation work queue by the WebCATT 408 and spider 404 , as described in greater detail below.
- WebCATT 408 WYSIWYG preview allows translators to see translated web pages, as they would appear on the live web site. This allows the translator to compensate for word growth or shrinkage that knocks a web page layout out of alignment.
- a translated preview page may be marked-up with special HTML & JavaScript to allow: 1) color coding of all text in the web page so the translator can see what is already translated, what remains to be translated and where the current text segment is located within the page, 2) clicking in text or a file to take the translator to a form to edit the translation for the text or file, and 3) hovering the mouse over a text or file to pop up a window showing the original wording or file.
- WebCATT 408 may parse pages into translatable components and translators only work with such translatable components, not a complex group of HTML files. All non-translatable content, such as HTML and script code, may be hidden when using WebCATT 408 . WebCATT 408 can be utilized via the ASP model and translators can access it via the web. Translated pages can be delivered via the translation server 400 or saved as static html pages to be sent to client, wherein links among pages are modified so they reference the translated pages.
- WebCATT 408 also allows management of the translation process. Multiple user access levels are supported: managers, proofers, translators & sub-contractors. Mangers can assign work in the translation queue to translators, proofers and/or subcontractors. Subcontractor managers can in turn sub-assign work to subcontractor translators and proofers. Managers can activate web pages before the translation server 400 can deliver them.
- a spider is a program that visits web sites and reads their pages and other information in order to create entries for an index such as a search engine index.
- a search engine index For example, the major search engines on the Internet all have such a program, which is also known as a “crawler” or a “bot.”
- Spiders are typically programmed to visit web sites that have been submitted by their owners as new or updated. Entire web sites or specific pages can be selectively visited and indexed. Spiders are named because they usually visit many web sites in parallel at the same time, their “legs” spanning a large area of the “web.” Spiders can crawl through a web site's pages in several ways.
- spiders for the major search engines on the Internet adhere to the rules of politeness for Web spiders that are specified in a standard for robot exclusion. This standard allows specifying files to be excluded from being indexed. The standard also proscribes a special algorithm for waiting between successive server requests so that the spider doesn't affect web site response time for other users.
- spiders The operations of a spider are in contrast with a normal web browser operated by a human that doesn't automatically follow links other than inline images and URL redirection.
- FIG. 4 shows a spider 404 for use in analyzing and sizing a web site 414 .
- the spider 404 is a tool that crawls specific web sites and performs any of a variety of actions.
- the spider 404 can crawl a web site in order to populate the WebCATT translation queue with new or updated information.
- the spider 404 may also gather content statistics that can be used to provide a monetary quote for deployment of the present teaching.
- FIG. 14 is an operational flow diagram depicting an exemplary process of spider 404 , according to an embodiment of the present teaching.
- the operational flow diagram of FIG. 14 depicts the process by which spider 404 , which provides a web based tool for sizing a web site for language translation, retrieves and indexes translatable components of a web site 414 .
- the operational flow diagram of FIG. 14 begins with step 1402 and flows directly to step 1404 .
- spider 404 may retrieve a first content, such as an HTML source page, in a first language from the web site 414 .
- the first content in a first language may be for translation into a second content in a second language.
- the second web content may be a human translation, or machine translation, or human edited machine translation in a second language of the first web content.
- spider 404 may parse the first content into one or more translatable components.
- a translatable component may include any one of a text segment, an image file with text to be translated, a multimedia file with text or audio to be translated, a file with text to be translated, a file with image with to be translated, a file with audio to be translated, and a file with video and with at least one of text and audio to be translated.
- spider 404 may store the translatable components in the database 406 for human translation, or machine translation, or human edited machine translation into the second language.
- spider 404 may queue the translatable components for human translation, or machine translation, or human edited machine translation into a second language.
- spider 404 may provide the translatable components to WebCATT 408 for human translation or human edited machine translation into a second language.
- spider 404 may generate statistics based on the translatable components retrieved from the web site 414 . The statistics generated may include, but are not limited to a file count, a page count, a translatable segment count, a unique text segment count, a unique text segment word count, and a word count.
- the spider 404 can further generate a web page having a link to each file of the web site 414 .
- the control flow of FIG. 14 stops.
- the spider 404 can be pre-configured for each customer web site so that the use of directive tags and/or attributes is eliminated or minimized. This minimizes the workload of the customer web site's IT personnel. Further, the spider 404 can be separately pre-defined by domain and/or by URL pattern. This allows specifying sections of a web site to be translated without the need for placing directive tags in each web page.
- the spider 404 can be used to update the WebCATT 408 translation work queue. Further, spider 404 can be used to gather statistics about a web site 414 in order to allow estimating the amount of work involved in translating the web site and pricing accordingly. Spider 404 can summarize word counts, segment counts, file counts and page counts of a web site 414 . The spider 404 may supplement the functions of WebCATT 408 by saving all unique text segments and file URLs in the database 406 for later translation into a second language. It can further create an HTML, page containing links to all files of web site 414 , so the files can reviewed for translation at a later time.
- the spider 404 can emulate a user agent (e.g., a browser) by saving and returning cookies when crawling a web site 414 .
- Spider 404 can further fill out and submit forms with pre-defined information and is able to establish a session and normalize session ID parameters for e-commerce sites.
- Spider 404 can further be configured to crawl only specific areas of a web site by defining include/exclude domains and URL patterns.
- Spider 404 can also be configured to send specific HTTP headers, such as the user-agent (i.e., type of browser).
- Spider 404 can be executed in a single computer or in distributed mode. In distributed mode, multiple machines work in conjunction to crawl the same web site simultaneously sharing the same database 406 .
- Automatic maintenance involves automated maintenance of the alternate language web site so as to be maintained in synchronization with the original site with no human intervention or little additional effort.
- Automatic maintenance may be based on the function of the translation server 400 that automatically schedules a web page for translation by placing it in the WebCATT 408 translation queue (described in more detail above) in the event a translation cannot be found for one or more text segments or linked files in the page.
- the act of viewing a never-before translated or a modified page in the alternate language enables the scheduling of the web page for translation.
- One way involves manual quality assurance review. If a new web page or an updated web page goes through a manual quality assurance process that involves a person reviewing the page before it is released to the live web site, then the quality assurance personnel may simply attempt to view the page in the alternate language during the review process. This will place the new web page in the WebCATT 408 translation queue for translation before the page goes into the production (live) web site.
- the spider agent 404 can be used to crawl a web site, or just portions of a web site, in the alternate language on a regular basis. Crawling the web site in the alternate language is equivalent to a user viewing the site in the alternate language, and thus results in any new or modified pages being placed in the WebCATT 408 translation queue.
- This technique can be used for regularly scheduled updates to a web site, which normally happens after hours. For example, if the ABC Widgets web site modifies its sale offerings twice a week, such as on Mondays and Fridays at 12 AM, then the spider agent 404 can be scheduled to crawl the relevant parts of the site shortly after (e.g., at 12:30 AM) on those days. Around-the- clock translators can then translate the new sale banners so that the alternate language web site is up to date sometime later that morning.
- the spider agent 404 can also be used to regularly (e.g., daily) crawl a web site even when changes are not regularly scheduled. This will guarantee that the alternate language site is in sync with the original language site after every crawl and subsequent translation.
- the alternate language web site may be still automatically maintained up to date over the long term. This is because the first online user that attempts to view a new or modified page in the alternate language may trigger the placement of that page into the WebCATT translation queue. In that case, the online user may see the page in the original language or may see a partially translated page. However, subsequent users that access the page may see the web page in the alternate language after it has been translated.
- the present teaching also supports manual maintenance of the alternate language web site so as to be maintained in synchronization with the original site.
- New information that needs translation can also be manually placed in the translation queue using WebCATT 408 . This can be useful to translate large amounts of data that is available in advance of it being on the live web site 414 . For example, if the ABC Widgets web site updates its web site with new product offerings every Thursday morning, and all product information is available by the previous Tuesday, then all new product data can be manually batched into the translation queue using WebCATT 408 as soon as it is available so it is fully translated by the time the new web pages go live. New information that needs translation may also be placed in the translation queue via the web service described in FIG. 8( a ) .
- Population of the WebCATT 408 translation queue can be performed either by URL or by content.
- Population by URL means that translation server 400 stores only the URL of the page in the queue. The content of the URL may be retrieved afterwards when a translator accesses the page to translate it using WebCATT 408 .
- Population by URL can present a problem if the content of the page is dependent on session information, such as a session ID present in a query parameter or stored in a cookie. In that case, the session ID in the query parameter may have expired or the session information stored in the cookie may not be present when viewing the page in WebCATT 408 .
- session dependent pages can be handled in different ways.
- a session dependent page can be handled by replicating the session state via cookies and/or updated session parameters or by populating the page by content.
- Replicating the session state allows the translator to manually re-acquire a session from the original site by entering the session data in WebCATT 408 . Once the session data is entered, it can be used for translating multiple pages.
- Population by content means that translation server 400 stores the full content of the page in the queue. This avoids the session dependence issue, but can result in outdated content. As a result, population by content may be used only for session dependent pages, and population by URL, which guarantees that the content being translated is the latest content, may be used for all other pages.
- Pages to be translated may be scheduled for translation on a priority basis based on pre-defined priority information or using algorithms, such as ones based on the percentage of the page already translated and how often the page is being accessed on the original web server while the page is in the translation queue. This allows the most important pages (e.g., most frequently accessed and those with smaller changes) to be translated first.
- a file change detection feature can be used to deal with files whose names have been changed.
- the translation server 400 and WebCATT 408 can match a file to be translated with its translated file by the URL of the original file. However, it is possible for a file to be changed while its name and location remain the same. In that case, it is possible that an outdated translated file is used for the translation.
- the translation server 400 computes a hash-code or checksum based on the binary content of the file and stores it with the URL.
- the translation server 400 or WebCATT 408 may re-compute the hash-code or checksum and compare it against the stored one. If they match, the file has not changed and the existing translated file can be used as replacement. However, if they do not match, the binary content of the file was changed and the existing file translation cannot be used. In that case, the file may be placed in the WebCATT 408 translation queue so it may be re-translated.
- FIG. 15 is an operational flow diagram depicting an exemplary synchronization process according to an embodiment of the present teaching.
- the operational flow diagram of FIG. 15 depicts the automated maintenance process of the alternate language web site so as to be maintained in synchronization with the original web site 414 .
- the operational flow diagram of FIG. 15 begins with step 1502 and flows directly to step 1504 .
- a first content in a first language such as an HTML, source page, may be retrieved from the web site 414 .
- the first content in a first language may be for translation into a second content in a second language.
- the second web content may be a human translation, or machine translation, or human edited machine translation in a second language of the first web content.
- the first content may be parsed into one or more translatable components.
- a corresponding translated component of the second web content may be identified or matched for each translatable component of the first web content. If a translatable component of the first web content is not matched to a translated component of the second web content, in step 1512 , the translatable component may be designated for translation into the second language. In optional step 1514 , the translatable components that did't matched may be queued for human translation, or machine translation, or human edited machine translation into a second language. In optional step 1516 , the translatable components that did't matched may be provided to WebCATT 408 for translation into a second language. In step 1518 , the control flow of FIG. 15 stops.
- FIG. 17 is an exemplary screenshot of how Preference Selector may be structured on a user agent (e.g., a browser), in one embodiment of the present teaching.
- a user agent e.g., a browser
- Preference Selector may pop-up only when it has been determined that a user 416 likely prefers to view web site 414 in a language other than the site's native language. Otherwise, Preference Selector may not pop-up.
- Preference Selector pops-up and the user 416 selects a preferred preferences, these preferences may be saved in one more cookies on the user's browser. Preference Selector can then automatically redirect the user 416 to the preferred alternate language site when the user visits the site again.
- Preference Selector can also be displayed on-demand to change these preferences at any time.
- Preference Selector can use the following information, which may be available in an HTTP request sent by the user agent (e.g., a browser) or via other means (e.g., cookies), as its inputs to control the subsequent operation(s):
- Preference Selector may be pre-configured with the following information, which may be used to control its operation based on the above inputs:
- Preference Selector may be implemented by inserting a link to a Preference Selector JavaScript file in the web site 414 . This eliminates or minimizes the effort from the IT personnel of a customer's web site. For instance, the code to be inserted to link to the JavaScript file can be provided to a customer as part of the “One-Link” Deployment language switching link.
- the Preference Selector JavaScript file may be provided to work in conjunction with server side logic to provide the pop-up and redirection behavior.
- FIG. 18 is an operational flow diagram depicting an exemplary process of loading Preference Selector, in one embodiment of the present teaching.
- the operational flow diagram of FIG. 18 begins with step 1802 and flows directly to step 1804 .
- the user agent e.g., a browser
- the Preference Selector JavaScript file logic may determine whether the Preference Selector cookie is present for the user 416 . If the Preference Selector cookie is present, then control flows to step 1808 . Otherwise, control flows to step 1814 .
- the value of the cookie may be inspected to determine whether the user 416 prefers an alternate language site. If it is affirmative, then control flows to step 1810 .
- step 1810 a configuration option that specifies immediate redirection may be checked to determine whether a redirection to the preferred translated site is to be performed. If it is affirmative, then control flows to step 1812 . Otherwise, control flows to step 1822 and processing stops.
- step 1812 a JavaScript client side redirection to the translated site may be performed, and the user 416 may be redirected to the preferred translated site. It is understood that other implementations other than a cookie may also be used to achieve the same function.
- the Preference Selector JavaScript file logic may generate the Preference Selector server-side URL to the Preference Selector application and instruct the user agent (e.g., a browser) to request the URL.
- the Preference Selector server-side application may execute its logic.
- the Preference Selector server-side application may analyze the inputs provided.
- the Preference Selector server-side application may generate a response.
- FIG. 20 is a block diagram depicting an exemplary process of the Preference Selector server-side application request and the response, which is also depicted in steps 1814 through 1820 in FIG. 18 , in one embodiment of the present teaching.
- the Preference Selector JavaScript file may generate the Preference Selector server-side URL to the Preference Selector Application and instruct the user agent (e.g., a browser) to request the URL.
- the user agent e.g., a browser
- the user agent may send the request to Preference Selector Application.
- Step 2 shows that the request may include the following additional information: (a) the user's IP address and/or geo-location information, (b) the user' demographic information, such as but not limited to ethnic information, (c) the user's online activity history information, such as but not limited to which language the user has been using to send emails, or what kind of products (e.g., books, CD, etc.) the user has been buying and which language those products are associated with, (d) various HTTP request headers, and (e) specific URL parameters.
- the Preference Selector Application may utilize the information included in the request and its pre-configured information to generate a response. The response may include displaying the Preference Selector pop-up, redirecting the user to a translated site, or performing no action.
- the Preference Selector response may be sent back to the user.
- FIG. 19 is an operational flow diagram depicting an exemplary process of the Preference Selector server-side application for analyzing the inputs against the pre-configured information to control its operation and generate a response, in one embodiment of the present teaching.
- the operational flow diagram of FIG. 19 begins with step 1902 when the application receives the request from the user agent (e.g., a browser) and flows directly to step 1904 .
- the user agent e.g., a browser
- step 1908 the presence of Preference Selector cookie may be checked and, if present, it may be determined based on the value of the cookie as to whether the user 416 prefers an alternate language site. If it is affirmative, then control flows to step 1910 . Otherwise, control flows to step 1912 .
- the Preference Selector application may respond with a server-side redirection to the translated language site.
- step 1912 the value of an Accept-Language user agent request header may be inspected to determine the user's preferred language and locale. If the first (or primary) language listed therein matches with a configured alternate language, then this primary language may be set as the Preference Selector default language and control flows to step 1914 . Otherwise, control flows to step 1916 .
- the Preference Selector default language may be compared against a configured list of affinity languages, and if a match is found, the mapping may be applied and control flows to step 1932 .
- the Preference Selector default language is French, and there is no French website available, but an affinity language has been defined that maps French to Spanish (because a French user 416 may prefer to read Spanish before English), then the Preference Selector default language is set to Spanish.
- the Preference Selector application may respond with a Preference Selector pop-up, e.g., a welcome pop-up, using the Preference Selector default language as the default selection in the user interface.
- step 1916 the value of domain name in the referrer user agent request header may be inspected and compared against a configured list of referrer domains to determine whether the user 416 comes from a website in a configured alternate language. If it is affirmative, then the Preference Selector default language is set to that alternate language and control flows to step 1914 . For example, if the referrer domain is “www.terra.com”, which is a well known Internet portal in Spanish, then the Preference Selector default language is set to Spanish. Otherwise, control flows to step 1918 .
- the value of the top level domain (TLD) of the domain name in the referrer user agent request header may be inspected and compared against a configured list of TLDs to determine whether the user 416 came from a website in a configured TLD. If it is affirmative, then the Preference Selector default language is set according to the language configured for that TLD and control flows to step 1914 . For example, if the referrer domain is “www.google.com.mx”, which is GOOGLE's website in Mexico, and the TLD “.mx” is mapped to Spanish, then the Preference Selector default language is set to Spanish. Otherwise, control flows to step 1920 .
- TLD top level domain
- step 1920 the value of the subdomain in the domain name in the referrer user agent request header may be inspected and compared against a configured list of subdomains to determine whether the user 416 came from a website in a configured subdomain. If it is affirmative, then the Preference Selector default language is set according to the language configured for that subdomain and control flows to step 1914 . For example, if the referrer domain is “espanol.yahoo.com”, which is YAHOO's portal website in Spanish, and the subdomain “espanol” is mapped to Spanish, then the Preference Selector default language is set to Spanish. Otherwise, control flows to step 1922 .
- step 1924 the value of the Accept-Language user agent request header may be re-inspected to determine the user's secondary language and locale. If a secondary language listed is matched against a configured alternate language, then this secondary language is set as the Preference Selector default language, and control flows to step 1914 . Otherwise, control flows to step 1926 .
- step 1926 the value of the user agent language may be inspected to determine the user's user agent language. If the user agent language is matched against a configured alternate language, then the user agent language is set as the Preference Selector default language, and control flows to step 1914 . Otherwise, control flows to step 1928 .
- the IP address of the user 416 may be inspected and a geo-location database used to determine the user's geographic location, such as the country, state/region, city, and zip code. If the user's geographic location is matched against a configured mapping of locations to languages, and the language corresponding to the user's location is matched against a configured alternate language, then the location language is set as the Preference Selector default language, and control flows to step 1914 . Otherwise, control flows to step 1930 .
- the user's demographic information and online activity history information may be inspected to determine the user's preferred language.
- the demographic information such as the ethnic information may be obtained and inspected. For example, if the user belongs to the Hispanic ethnic group, the preferred language of the user is likely to be Spanish.
- the user's online activities may be obtained and inspected.
- the language in which the user has been using to send and receive emails may be used to determine the user's preferred language.
- the user's online shopping history may be analyzed, for example, the language of the books or CDs that the user has been purchasing.
- the demographic information and the online activities history information may be obtained from various sources, such as but not limited to cookies, online commercial activities survey agencies, or any suitable sources where the user may supply his/her personal information or preference information (not shown in figures).
- step 1930 Preference Selector has been unable to find a default alternate language.
- the Preference Selector cookie may be set with a value for the native language of the site 414 . In that case, when the user 416 returns to the site 414 , steps 1804 , 1806 , 1808 and 1822 of FIG. 18 will be executed in succession resulting in the user 416 staying in the native site 414 without receiving the Preference Selector welcome pop-up, or being redirected to an alternate language site.
- the order in which the inputs are checked in the operational flow diagram of FIG. 18 is modified according to configuration information.
- the referrer search keyword checked in step 1922 can be checked before the domain, TLD, and subdomain of the referrer, which may alter the response.
- some of the inputs may not be checked.
- Preference Selector does not actually pop-up in a window in front of the native site 414 , but instead replaces an existing area in the page.
- Preference Selector allows the user 416 to select a preferred currency and geographic location (i.e., country or region where the user 416 is coming from or wants items shipped to). This is useful for websites that offer international service (e.g., global ecommerce, country specific offers or pricing, etc).
- Preference Selector pops-up for all users to a website, regardless of the value of the Preference Selector inputs.
- Preference Selector prompts all users 416 to choose a language, including those users that likely prefer the native language of the site 414 .
- Preference Selector may also prompt all users to choose a preferred currency and/or geographic location.
- Preference Selector shows the user 416 customized content according to one or more of the Preference Selector inputs.
- Preference Selector can display market specific messaging or offers by language or geographic location.
- Preference Selector may also show a customized offer when a user 416 came from a specific site (i.e., the referring site), or when the user 416 used specific search keyword(s) to land on the site.
- Preference Selector can redirect the user 416 to different sites depending on one or more of the Preference Selector inputs. For example, a customer may have two sites that offer the same service (e.g., purchasing train tickets), one for European users and the other for all other users coming from outside Europe. Both of these sites are available in a native language and several other alternate languages. Preference Selector may be configured to redirect the user 416 to the applicable language version of the appropriate site, depending on where the user 416 is coming from and the selected preferred language.
- Preference Selector may be configured to redirect the user 416 to the applicable language version of the appropriate site, depending on where the user 416 is coming from and the selected preferred language.
- Preference Selector collects data about user 416 behavior and learns about circumstances under which it should pop-up in the future. If a user 416 chooses an alternate language site via Preference Selector, Preference Selector records information on that user 416 that may include (1) the user's IP address and/or geo-location information, (2) the referring site URL and IP address, (3) the country/region of origin of the referring site, (4) the user' demographic information, and (5) the user's online activity history information.
- Preference Selector If over time a significant number of users coming from the same referring site select the same alternate language via Preference Selector, even if that referring site is not located in a country where that language is commonly used, it is added to Preference Selector's list of referrer sites for which to pop-up Preference Selector with a default selection of that alternate language. If over time a significant number of users located on the same city or region within a country (based on the user's IP address or geo-location information) select the same alternate language via Preference Selector, even if that city/region is not flagged for that alternate language, that city or region is added to Preference Selector's list of locations for which to pop-up Preference Selector with a default selection of that alternate language.
- Translating a web site to another language is an important first step in expanding an organization's reach to new foreign markets.
- Examples of localization include, for example customizing the format of numbers, dates and times; converting currency in accordance with the custom of the local market; and converting units of measurement in accordance with the custom of the local market.
- Such formatting and conversion capabilities may be performed by the Translation Server 400 at the time of converting pages from one language to another.
- customization can go beyond formatting and conversion in order to provide culturally relevant content to each targeted local market.
- Such localized content may include, but are not limited to marketing content, product variations, descriptions, and legal language specific to each target local market.
- the system of the present teaching includes a technology called Content Localizer that enables a web site operator to easily offer content specific to a local market to a user 416 .
- the Content Localizer may comprise two components: a Content Localizer Manager and a Content Localizer Server.
- the Content Localizer Manager is an application used to define localized content and to manage the process of content localization.
- the Content Localizer Server is an application responsible for serving the localized content to the user 416 .
- the Content Localizer Manager is a web based application with a Graphical User Interface (GUI) interface.
- GUI Graphical User Interface
- FIG. 21 is a block diagram illustrating an exemplary system architecture of the Content Localizer, in one embodiment of the present teaching.
- FIG. 21 shows a web site 2114 representing a web site in a first language such as English, corresponding to the web site 414 of FIG. 4 , which is connected to the Internet 2106 via a web connection.
- FIG. 21 also shows a Translation Server 2102 , corresponding to the Translation Server 400 of FIG. 4 , a Content Localizer Server 2110 , and a Content Localizer Manager 2116 .
- FIG. 21 further shows a localized content database 2100 for storing localized content and the associated conditions for use by the Translation Server 2102 , the Content Localizer Server 2110 , and the Content Localizer Manager 2116 .
- FIG. 21 also shows a user 2108 that utilizes a web connection to the Internet 2106 to browse and navigate the web pages served by the web site 2114 in a first language and by the Translation Server 2102 in a second language. Also shown in FIG. 21 is a content manager user 2120 , who utilizes the Content Localizer Manager 2116 to specify localized content with identifiers and associated conditions.
- the Translation Server 2102 , the Content Localizer Server 2110 and the Content Localizer Manager 2116 are each connected to web servers 2104 , 2112 , and 2118 , respectively, which are the conduits through which all web actions of the above tools are channeled.
- the computer systems for Translation Server 2102 , Content Localizer Server 2110 , Content Localizer Manager 2116 , and web servers 2104 , 2112 and 2118 are one or more Personal Computers (PCs) (e.g., IBM or compatible PC workstations running the Microsoft Windows 95/98/2000/2008/ME/CE/NT/XP/VISTA/7 operating system, Unix, Linux, Macintosh computers running the Mac OS operating system, ANDROID, or equivalent), Personal Digital Assistants (PDAs), tablets, smart phones, game consoles or any other information processing devices.
- PCs Personal Computers
- PDAs Personal Digital Assistants
- the computer systems of Translation Server 2102 , Content Localizer Server 2110 , Content Localizer Manager 2116 , and web servers 2104 , 2112 and 2118 are server systems (e.g., SUN Ultra workstations running the SunOS operating system or IBM RS/6000 workstations and servers running the AIX operating system).
- server systems e.g., SUN Ultra workstations running the SunOS operating system or IBM RS/6000 workstations and servers running the AIX operating system.
- the Content Localizer Manager 2116 may be utilized by users whose role involves managing the content on the web site 2114 to define localized content for some target markets.
- the Content Localizer Manager 2116 allows a user to upload or specify localized content and associate such content with an identifier and a variety of conditions that need to be satisfied before the localized content is to be displayed.
- the localized content together with its identifier and associated conditions are stored in the localized content database 2100 .
- localized content can include text, one or more graphics, flash files, videos, a chunk of HTML, or JavaScript code, etc.
- the identifier may be used to determine where on the site the localized content is to be placed. Different versions of localized content can be associated with the same identifier, but may have different conditions. This allows different versions of the content to be displayed on the same area of the site depending on which conditions are met. A default localized content may also be specified, which may be used when none of the pre-defined conditions are met.
- Examples of conditions to be satisfied in order for the content to be displayed may include:
- Referrer TLD Top Level Domain
- Referrer TLD Top Level Domain
- the Content Localized content can be defined to support testing of two or more different versions of localized content associated with the same identifier and the same conditions.
- the Content Localizer Server 2110 may apply the different versions of the localized content among users that meet the associated conditions.
- round robin approach may be applied and in other embodiments, the selection of a particular version for a particular user may be made randomly or based on some conditions. In other embodiments, the selection may be based on a specified allocation algorithm.
- Special requirements such as session persistence may be factored in so that once a user is assigned a specific version of the localized content, that version is continuously applied to the same user by, e.g., saving the information in a Content Localizer cookie, so that on subsequent visits the user is shown the same localized content.
- the Content Localizer Server 2110 is responsible for serving the appropriate localized content whenever the conditions are met.
- FIG. 22 is an operational flow diagram depicting an exemplary process of the Content Localizer Server 2110 for generating localized content, in one embodiment of the present teaching.
- the operational flow diagram of FIG. 22 begins with step 2202 and flows directly to step 2204 .
- the Content Localizer Server 2110 may receive a request, such as an HTTP request, from a user agent (e.g., a browser) for localized content matching a specific identifier.
- a user agent e.g., a browser
- the request may include the identifier and some or all of the following information, which is used as inputs to determine which localized content satisfies the conditions for display:
- the Content Localizer Server 2110 may retrieve all localized content and associated conditions from the database 2100 that match the identifier sent in the request. In step 2208 , the Content Localizer Server 2110 may inspect each of the retrieved localized contents and conditions. In step 2210 , the Content Localizer Server 2110 may determine whether the conditions specified in the retrieved localized content match those of the inputs included in the request.
- step 2214 the Content Localizer Server 2110 may send the matching localized content as response to the request.
- step 2212 the Content Localizer Server 2110 may check whether a default localized content is defined for the identifier. If it is affirmative, then control flows to step 2216 . Otherwise, control flows to step 2218 .
- step 2216 the Content Localizer Server 2110 may send the default localized content as response to the request.
- step 2218 the Content Localizer Server 2110 may not send localized content as response to the request.
- step 2220 the control flow of FIG. 22 stops.
- FIG. 23 is an operational flow diagram depicting an exemplary process of the Content Localizer Server 2110 for analyzing the request inputs against the conditions associated with a localized content to determine whether the conditions are met, in one embodiment of the present teaching. If the conditions are met, the process returns an affirmative response, such as true or yes, to signal that the localized content is to be displayed to the user. Otherwise it returns a negative response, such as no or false, to signal that the localized content is not to be displayed.
- the operational flow diagram of FIG. 23 specifies in detail the process used to arrive at the determination described in step 2210 of FIG. 22 .
- step 2302 The operational flow diagram of FIG. 23 begins with step 2302 when a retrieved localized content (already matched by its identifier) and its associated conditions is inspected, and flows directly to step 2304 .
- step 2304 the publication date and time, expiration date and time, and local time conditions specified in the localized content may be checked against the date and time of the request, and a determination may be made whether these conditions are applicable to the request. If these conditions are not specified or they are applicable, then control flows to step 2308 . Otherwise control flows to step 2306 . In step 2306 a negative response is returned.
- step 2308 the language condition specified in the localized content may be checked against the language in which the user is viewing the site, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user is viewing the site in a language specified in this condition, then control flows to step 2310 . Otherwise control flows to step 2306 .
- step 2310 the user location condition specified in the localized content may be checked against the actual location of the user (which may be determined by the user's IP address, geo-location information or a pre-stored cookie with location information), and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user is in a location specified in this condition, then control flows to step 2312 .
- step 2306 the stored cookie condition specified in the localized content may be checked against the cookies present in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user has a cookie specified in this condition, then control flows to step 2314 . Otherwise control flows to step 2306 .
- step 2314 the referrer domain condition specified in the localized content may be checked against the domain of the referring site in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user comes from a referring site domain specified in this condition, then control flows to step 2316 . Otherwise control flows to step 2306 .
- step 2316 the referrer TLD condition specified in the localized content may be checked against the TLD of the referring site in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user comes from a referring site TLD specified in this condition, then control flows to step 2318 . Otherwise control flows to step 2306 .
- the referrer sub-domain condition specified in the localized content may be checked against the sub-domain of the referring site in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user comes from a referring site sub-domain specified in this condition, then control flows to step 2320 . Otherwise control flows to step 2306 .
- the referrer keyword or parameter condition specified in the localized content may be checked against the URL of the referring site in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or URL of the referring site contains a keyword or parameter specified in this condition, then control flows to step 2322 . Otherwise control flows to step 2306 .
- the Accept-language header condition specified in the localized content may be checked against the Accept-language header sent in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the Accept-language header contains a value specified in this condition, then control flows to step 2324 . Otherwise control flows to step 2306 .
- the user agent default language condition specified in the localized content may be checked against the user agent default language sent in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user agent default language contains a value specified in this condition, then control flows to step 2326 . Otherwise control flows to step 2306 .
- step 2326 the user agent, operating system or device condition specified in the localized content may be checked against the user agent, operating system or device information sent in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user agent, operating system or device contain a value specified in this condition, then control flows to step 2328 . Otherwise control flows to step 2306 . In step 2328 an affirmative response is returned.
- the Translation Server 2102 may work in conjunction with the Content Localizer Server 2110 to generate the localized content.
- Each area of a page on the web site 2114 that contains localized content can be identified via the use of the localized content identifier. This identifier is matched with the identifier defined for the content via the Content Localizer Manager 2116 and stored in the localized content database 2100 .
- the above page is on a web site 2114 that is based in the US and the special offer is targeted to users within the US.
- the web site 2114 can also be translated to Spanish to serve Spanish speaking communities in the USA and abroad.
- the special offer may be localized by defining two different versions of the localized offers via the Content Localizer Manager 2116 , as shown in the example below:
- both localized offers share the same identifier (“special-offer- 100 ”), but the actual content of the offer and the conditions differ.
- the Content Localizer Server 2110 replaces the US version of the offer (i.e., “Buy a SONY BRAVIA XBR before the end of the month and get free shipping!”) with the Mexican version of the offer (i.e., “Buy a SONY BRAVIA XBR before the end of the month and get a free mounting bracket!”) when a user from Mexico is viewing the site in Spanish.
- Content Localizer Server replaces the US offer with the Spain offer (i.e., “Buy a SONY BRAVIA XBR before the end of the month and get a free SONY MP3 player!”).
- the Content Localizer Server 2110 may be designed to support different ways to achieve that. In some embodiments, this can be done by wrapping a span or div tag, or another tag, around the content to be localized, which references the identifier assigned to the localized content via the Content Localizer Manager 2116 .
- the example below shows the use of a span tag that wraps the text of the offer to be localized.
- the span tag contains an “id” attribute whose value (“special-offer- 100 ”) is the identifier assigned to the offer via the Content Localizer Manager, allowing it to be matched with the corresponding localized content stored in the localized content database 2100 .
- the area of the page may also be identified via an exemplary Directive Tag called “mp trans localize.”
- mp trans localize an exemplary Directive Tag
- other means instead of span, div or Directive Tags, may be used to identify the content to be localized on a page.
- the localized content can be associated with existing text, or an existing graphic, flash or video file, on a page via the Content Localizer Manager 2116 and the localized version of the content can be replacement text or a replacement graphic, flash or video file, or even a different type of content that fits in the same area.
- the content to be localized on a page may be identified via a Document Object Model (DOM) traversal syntax, such as XPath.
- DOM Document Object Model
- the tags that enclose the content to be localized are defined via their location within the DOM tree, and there is no need to use span, div or Directive Tags.
- XPath syntax can be used to define the location of the area containing the offer to be localized for the above example product page:
- the above XPath can be associated with the identifier “special-offer- 100 ” without the need to insert span, div or directive tags containing the identifier in the product page
- content to be localized on a page can be identified by pattern matching the content in the page against pre-defined patterns of content within the page, using a pattern matching syntax, such as regular expressions.
- a pattern matching syntax such as regular expressions.
- the Content Localizer Manager 2116 may provide a user interface to allow a user to select an area of a page to be customized. Once an area is selected, the Content Localizer Manager 2116 may then identify the actual HTML code that produces the content within the area and generate a DOM traversal path or a pattern match expression that identifies the area within the page.
- the Translation Server 2102 may be made capable of recognizing these areas at the time that it parses the page during the process of page conversion from one language to another.
- the Content Localizer Server 2110 is a separate application whose primary function is to serve localized content.
- the Translation Server 2102 recognizes an area to be localized at page conversion time the Translation Server 2102 replaces the content to be localized with HTML code, and/or JavaScript code, and/or other code that is executed on the user agent and generates an HTTP request to the Content Localizer Server 2110 that includes the identifier of the localized content and other request inputs listed in the description of FIG. 22 .
- the Content Localizer Server 2110 then returns the appropriate localized content which may include additional JavaScript or other code executed on the user agent (e.g., a browser) to dynamically insert the localized content in the page.
- FIG. 24 is an operational flow diagram depicting an exemplary process of the Translation Server 2102 for recognizing the areas of the page to be localized.
- FIG. 24 describes a Translation Server alternate process flow of FIG. 6 for recognizing areas to be localized in a page.
- the operational flow diagram of FIG. 24 begins with step 601 and flows directly to step 602 .
- Steps 601 , 602 , 603 , 604 , 614 , 623 , 627 and 629 may be identical to the same numbered steps described in FIG. 6 .
- Steps 630 and 631 represent the normal process flow described in FIG. 6 when the determination of steps 603 , 604 , 614 and 629 is affirmative.
- step 633 the content following the start area tag may be parsed.
- step 634 it may be determined whether the component being parsed is the localized content area end tag. If it is affirmative, then control flows to step 635 . Otherwise, control flows to back to step 633 for further parsing and the component is ignored (i.e., all content parsed within the start and end tags is ignored and it is not output to the translated page).
- the JavaScript code or other code to be executed on the user agent (e.g., a browser) to generate the request to the Content Localizer Server 2110 may be added to the translated HTML page.
- This code may include sending in the request the identifier and all other information necessary for the Content Localizer Server 2110 to determine which localized content to serve.
- FIG. 25 is a block diagram depicting an exemplary process of the Content Localizer Server 2110 request and the response, in one embodiment of the present teaching.
- the JavaScript code or other code added by the Translation Server 2102 in step 635 of FIG. 24 may be executed on the user agent (e.g., a browser) of the user 2108 to generate a request to the Content Localizer Server 2110 .
- the user agent may send the request to the Content Localizer Server.
- the request may include the localized content identifier, and may also include the following additional information: (a) the user's IP address and/or geo-location information, (b) various HTTP request headers, and (c) specific URL parameters.
- the Content Localizer Server 2110 may utilize the information included in the request to generate localized content, as described in FIG. 22 and FIG. 23 .
- the Content Localizer Server 2110 response may be sent back to the user.
- the Content Localizer Server 2110 may be part of the functionality of the Translation Server 2102 .
- the Translation Server 2102 may perform the process flows described in FIGS. 22 and 23 so that when the conditions are met, the Translation Server 2102 may replace the content to be localized with the localized content in each page at the same time it is converting the page to another language.
- the content to be localized in the above example is a string of text.
- the content to be localized can be anything within a page, including text, one or more graphics, flash files, videos, a chunk of HTML code, JavaScript code, CSS code, XML, etc.
- the Content Localizer Manager 2116 may restrict the output of the localized content to the specified dimensions. It is also possible to specify the dimensions of the area the content occupies on the page using the span, div tag, Directive Tag, or other tag, that wraps the content to be localized. For example:
- the localized content may be uploaded or entered in the native language of the web site 2114 , or in the language of the target audience. If the content is specified in the native language of the web site 2114 , then the content will automatically be entered into the translation workflow of the WebCATT tool 408 , so it can be translated into the language of the target audience. This is useful when the localized content is generated by users in the native country of the web site 2114 , which is the US in this example.
- the localized content may also be specified in the language of the target audience, in which case there is no need for the content to be translated. This is useful when the localized content is generated by users who reside in the country of the target audience. In our example, a user in Mexico whose responsibility includes managing the local content shown to users in Mexico, may directly upload or enter the localized offer for Mexico in Spanish in the Content Localizer Manager 2116 .
- the present teaching is also useful for a web site 2114 that has product assortment requirements for different local markets, such as manufacturer restrictions on products that can only be sold in certain countries.
- the Content Localizer Server 2110 can accept a periodic data feed with product assortment information for the targeted local markets. This feed may include a list of the all products offered, where each product is flagged with any applicable restrictions, such as shipping restrictions.
- the Content Localizer Server 2110 and the Translation Server 2102 can then use the information from the product feed to perform product specific localizations, which may include:
- the Translation Server 2102 and the Content Localizer Server 2110 can store a Content Localizer cookie in the user's user agent (e.g., a browser) that contains information that identify the user for localization purposes and includes information on the referring URL, user geo-location data, the conditions that were satisfied by the user and other user preferences and behavior information.
- a Content Localizer cookie in the user's user agent (e.g., a browser) that contains information that identify the user for localization purposes and includes information on the referring URL, user geo-location data, the conditions that were satisfied by the user and other user preferences and behavior information.
- the present teaching may be realized in hardware, software, firmware, or any combination thereof.
- a system according to one embodiment of the present teaching can be realized in a centralized fashion in one computer system or in a distributed fashion where different elements are spread across several interconnected computer systems. Any kind of computer system—or other apparatus adapted for carrying out the methods described herein—is suited.
- a typical combination of hardware, software, and firmware could be a general-purpose computer system with a computer program that, when being loaded and executed, controls the computer system such that it carries out the methods described herein.
- An embodiment of the present teaching can also be embedded in a computer program product, which comprises all the features enabling the implementation of the methods described herein, and which—when loaded in a computer system—is able to carry out these methods.
- Computer program means or computer program as used in the present teaching indicates any expression, in any language, code or notation, of a set of instructions intended to cause a system having an information processing capability to perform a particular function either directly or after either or both of the following a) conversion to another language, code or, notation; and b) reproduction in a different material form.
- a computer system may include, inter alia, one or more computers and at least a computer readable medium, allowing a computer system, to read data, instructions, messages or message packets, and other computer readable information from the computer readable medium.
- the computer readable medium may include non-volatile memory, such as ROM, Flash memory, Disk drive memory, CD-ROM, and other permanent storage. Additionally, a computer readable medium may include, for example, volatile storage such as RAM, buffers, cache memory, and network circuits. Furthermore, the computer readable medium may comprise computer readable information in a transitory state medium such as a network link and/or a network interface, including a wired network or a wireless network that allow a computer system to read such computer readable information.
- FIG. 16 is a block diagram of an exemplary computer system useful for implementing the different aspects of the present teaching, such as translation server, preference selector, content localizer, URL translation and optimization, E-mail translation server, human machine cooperated translation, WebCATT, TransScope, TransSync, etc.
- the computer system includes one or more processors, such as processor 1604 .
- the processor 1604 is connected to a communication infrastructure 1602 (e.g., a communications bus, cross-over bar, or network).
- a communication infrastructure 1602 e.g., a communications bus, cross-over bar, or network.
- the computer system can include a display interface 1608 that forwards graphics, text, and other data from the communication infrastructure 1602 (or from a frame buffer not shown) for display on the display unit 1610 .
- the computer system also includes a main memory 1606 , preferably random access memory (RAM), and may also include a secondary memory 1612 .
- the secondary memory 1612 may include, for example, a hard disk drive 1614 and/or a removable storage drive 1616 , representing a floppy disk drive, a magnetic tape drive, an optical disk drive, etc.
- the removable storage drive 1616 reads from and/or writes to a removable storage unit 1618 in a manner well known to those having ordinary skill in the art.
- Removable storage unit 1618 represents a floppy disk, magnetic tape, optical disk, etc. which is read by and written to by removable storage drive 1616 .
- the removable storage unit 1618 includes a computer usable storage medium having stored therein computer software and/or data.
- the secondary memory 1612 may include other similar means for allowing computer programs or other instructions to be loaded into the computer system.
- Such means may include, for example, a removable storage unit 1622 and an interface 1620 .
- Examples of such may include a program cartridge and cartridge interface (such as that found in video game devices), a removable memory chip (such as an EPROM, or PROM) and associated socket, and other removable storage units 1622 and interfaces 1620 which allow software and data to be transferred from the removable storage unit 1622 to the computer system.
- the computer system may also include a communications interface 1624 .
- Communications interface 1624 allows software and data to be transferred between the computer system and external devices. Examples of communications interface 1624 may include a modem, a network interface (such as an Ethernet card), a communications port, a PCMCIA slot and card, etc.
- Software and data transferred via communications interface 1624 are in the form of signals which may be, for example, electronic, electromagnetic, optical, or other signals capable of being received by communications interface 1624 . These signals are provided to communications interface 1624 via a communications path (i.e., channel) 1626 .
- This channel 1626 carries signals and may be implemented using wire or cable, fiber optics, a phone line, a cellular phone link, an RF link, and/or other communications channels.
- the terms “computer program medium,” “computer usable medium,” and “computer readable medium” are used to generally refer to media such as main memory 1606 and secondary memory 1612 , removable storage drive 1616 , a hard disk installed in hard disk drive 1614 , and signals. These computer program products are means for providing software to the computer system.
- the computer readable medium allows the computer system to read data, instructions, messages or message packets, and other computer readable information from the computer readable medium.
- the computer readable medium may include non-volatile memory, such as Floppy, ROM, Flash memory, Disk drive memory, CD-ROM, and other permanent storage. It is useful, for example, for transporting information, such as data and computer instructions, between computer systems.
- the computer readable medium may comprise computer readable information in a transitory state medium such as a network link and/or a network interface, including a wired network or a wireless network, that allow a computer to read such computer readable information.
- Computer programs are stored in main memory 1606 and/or secondary memory 1612 . Computer programs may also be received via communications interface 1624 . Such computer programs, when executed, enable the computer system to perform the features of the present teaching as discussed herein. In particular, the computer programs, when executed, enable the processor 1604 to perform the features of the computer system. Accordingly, such computer programs represent controllers of the computer system.
- a software product in accord with this concept, includes at least one machine-readable medium and information carried by the medium.
- the information carried by the medium may be executable program code data regarding web content translation and operational parameters. When such information carried by the medium is read by a machine, it causes the machine to perform programmed functions.
- a translation server located connected with the Internet executes instructions recorded on a medium and is capable of receiving a request for content translation, to obtain content in a first language from a publicly accessible source, analyzing the content in the first language, performing necessary translation based on the analysis, and forwarding, via a network, the translated content in a second language to a party that requesting it.
- Tangible non-transitory “storage” type media include any or all of the memory of the computers, processors or the like, or associated modules thereof, such as various semiconductor memories, tape drives, disk drives and the like, which may provide storage at any time for the software programming.
- All or portions of the software may at times be communicated through the Internet or various other telecommunication networks. Such communications, for example, may enable loading of the software from one computer or processor into another, for example, from a management server or host computer of the network operator or carrier into the platform of the message server or other device implementing a message server or similar functionality.
- another type of media that may bear the software elements includes optical, electrical and electromagnetic waves, such as used across physical interfaces between local devices, through wired and optical landline networks and over various air-links.
- the physical elements that carry such waves, such as wired or wireless links, optical links or the like, also may be considered as media bearing the software.
- terms such as computer or machine “readable medium” refer to any medium that participates in providing instructions to a processor for execution.
- Non-volatile storage media include, for example, optical or magnetic disks, such as any of the storage devices in any computer(s) or the like, such as may be used to implement the data aggregator, the customer communication system, etc. shown in the drawings.
- Volatile storage media include dynamic memory, such as main memory of such a computer platform.
- Tangible transmission media include coaxial cables; copper wire and fiber optics, including the wires that comprise a bus within a computer system.
- Carrier-wave transmission media can take the form of electric or electromagnetic signals, or acoustic or light waves such as those generated during radio frequency (RF) and infrared (IR) data communications.
- RF radio frequency
- IR infrared
- Common forms of computer-readable media therefore include for example: a floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, DVD or DVD-ROM, any other optical medium, punch cards paper tape, any other physical storage medium with patterns of holes, a RAM, a PROM and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a carrier wave transporting data or instructions, cables or links transporting such a carrier wave, or any other medium from which a computer can read programming code and/or data.
- Many of these forms of computer readable media may be involved in carrying one or more sequences of one or more instructions to a processor for execution.
- message server implementation described above is embodied in a hardware device, it can also be implemented as a software only solution—e.g., requiring installation on an existing server.
- a message server or a bind pooling mechanism as disclosed herein can also be implemented as a firmware, firmware/software combination, firmware/hardware combination, or hardware/firmware/software combination.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Artificial Intelligence (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Information Transfer Between Computers (AREA)
- Machine Translation (AREA)
Abstract
Description
- The present application is a continuation of U.S. patent application Ser. No. 16/459,842 filed Jul. 2, 2019, which is a continuation of U.S. patent application Ser. No. 15/821,607 filed Nov. 22, 2017, which is a continuation of U.S. patent application Ser. No. 14/817,343 filed Aug. 4, 2015, which is a continuation of U.S. patent application Ser. No. 13/182,059, filed Jul. 13, 2011, which claims the benefit of U.S. Provisional Patent Application No. 61/363,804, filed Jul. 13, 2010, the contents of all of which are incorporated herein by reference in their entireties.
- The present teaching generally relates to Internet applications, and more particularly relates to translation of web content.
- The Internet and the world-wide web have allowed consumers to complete business transactions with organizations or individuals located across continents from the comfort of their own desk. In an increasingly global marketplace, it is becoming imperative for businesses/organizations to provide web site content in multiple languages in order to expand their customer base beyond their home countries. In addition, as the demographics of a country change to include foreign language speakers, it is increasingly important to communicate with existing customers and/or potential customers in their native language. For example, several large U.S. retailers have announced that serving the Hispanic segment is now a very high priority. Some U.S. retailers have even hired Hispanic advertisement agencies to start marketing to the Hispanic market in their native language—Spanish.
- Traditionally, an organization that wants to translate its web site to another language can choose from several techniques, each having significant drawbacks. One technique involves purchasing machine translation technology. Machine translation is sometimes useful to get a rough idea as to the meaning of the content on a web site, but it is far from ideal. For most organizations, this type of translation, although convenient, is not practical because the quality of the translation from machines is simply not good enough to be posted on their web sites.
- Another technique involves managing the translation process by deploying human translators and either maintaining multiple web sites for each language, or re-architecting the existing web site back-end technology to accommodate multiple languages. This requires significant resources in terms of time and cost, including a high level of complexity and duplication of effort. In addition, dynamic and e-commerce sites present other challenges as well, as the information to be translated resides in multiple places (e.g., a Structured Query Language database, static Hyper Text Markup Language pages and dynamic Hyper Text Markup Language page templates) and each translated site interfacing with the same e-commerce or back-end engine. Further, as a web site undergoes changes, it is important to handle ongoing maintenance properly. Although, this approach may yield superior translations that are suitable for professional web sites of large organizations, it is at a great cost. Most organizations simply do not have, or do not want to invest in, the resources necessary to handle this task internally.
- For example,
FIG. 1 (PRIOR ART) is a block diagram illustrating the system architecture of a conventional web site. The web site ofFIG. 1 is presented in a first language, such as English.FIG. 1 shows aweb server 112 connected to the Internet 116 via a web connection. Apublic user 118, such as a person using a computer with a web connection, can access theweb server 112 via the Internet 116 and download information, such as a web page 114, from theweb server 112 for viewing. Theweb server 112 is operated byprogramming logic 110, comprising instructions on how to retrieve, serve, and accept information for processing. Theweb server 112 further has access to adatabase 102 for storing information, as well as Hyper Text Markup Language (HTML)template files 104,graphics files 106 andmultimedia files 108, all of which constitute the web site served byweb server 112. -
FIG. 2 (PRIOR ART) is a block diagram illustrating the system architecture of a conventional web site presented in two languages. The web site ofFIG. 2 is presented in a first language, such as English (as shown above forFIG. 1 ) and in a second language, such as Spanish.FIG. 2 shows theweb server 112 and the other English language components described inFIG. 1 , including thedatabase 102 of information, the HTMLtemplate files 104,graphics files 106,multimedia files 108 andprogramming logic 110.FIG. 2 further shows thepublic user 118 accessing theweb server 112 via the Internet 116 and downloading information, such as aweb page 202 in English or Spanish language. -
FIG. 2 also includes components related to providing web content in Spanish language. For example,FIG. 2 (has Spanish language components, including adatabase 208 of information, HTMLtemplate files 214,graphics files 216,multimedia files 210 andprogramming logic 212. These Spanish language components are managed by amulti-lingual content manager 206, which manages requests for information in the dual languages.FIG. 2 further shows that theweb server 112 is re-engineered to serve multiple sets of content in different languages. - As can be seen in the difference between
FIG. 1 andFIG. 2 , the deployment of theSpanish language components 204 andmulti-lingual content manager 206 ofFIG. 2 requires a significant expenditure of effort and resources. Further, the deployment requires re-engineering theweb server 112, adding to the time and cost associated with the deployment. Additionally, once theSpanish language components 204 have been established, continuous synchronization with changes in the English language components results in a recurring cost. - Therefore a need exists to overcome the problems with the prior art as discussed above.
- Briefly, in accordance with the present teaching, disclosed is a system, method and computer readable medium in association with providing translated web content.
- In one example, a method, implemented on a computer having at least one processor, storage, and a communication platform for providing translated web content. A request is first received from a user for content in a second language translated from content in a first language from a first Internet source. The content in the first language from the first Internet source is obtained. The content in the first language is divided into one or more translatable components, wherein a translatable component includes a segment of text. Whether the one or more translatable components have been previously translated, via at least one of machine translation, human translation, and a combination thereof, into the second language and stored as translated components in a storage is determined. If there are one or more translatable components previously translated and stored as translated components, the content is generated in the second language by modifying the content in the first language so that at least some translatable components are replaced with corresponding translated components, and the content is sent in the second language to the user as a response to the request.
- In another example, a method, implemented on a computer having at least one processor, storage, and a communication platform for providing translated web content. A request is first received from a user for content in a second language translated from content in a first language accessible from a first Internet source. The content in the first language from the first Internet source is obtained. The content in the first language is divided into one or more translatable components, wherein a translatable component includes a segment of text. Whether any of the translatable components does not have a corresponding translated component stored in the storage and generated previously via at least one of machine translation, human translation, and a combination thereof is determined. Translation of the translatable components that do not have corresponding translated components via at least one of machine translation, human translation, and a combination thereof is scheduled to generate the corresponding translated components, wherein each segment of text is translated as a unit. The corresponding translated components are then stored in the storage.
- In a different example, a method, implemented on a computer having at least one processor, storage, and a communication platform for managing language translation. Content in a first language is first accessed from an Internet source via a publicly available network path. A portion of the content in the first language that is not yet translated into a second language is identified. Translation of the portion of the content in the first language that is not yet translated into a second language using at least one of machine translation, human translation, and a combination thereof is then scheduled to produce corresponding content in the second language.
- Additional advantages and novel features will be set forth in part in the description which follows, and in part will become apparent to those skilled in the art upon examination of the following and the accompanying drawings or may be learned by production or operation of the examples. The advantages of the present teachings may be realized and attained by practice or use of various aspects of the methodologies, instrumentalities and combinations set forth in the detailed examples discussed below.
-
FIG. 1 (PRIOR ART) is a block diagram illustrating the system architecture of a conventional web site; -
FIG. 2 (PRIOR ART) is a block diagram illustrating the system architecture of a conventional web site presented in two languages; -
FIG. 3 is a block diagram illustrating an exemplary system architecture of a web site presented in two languages, in one embodiment of the present teaching; -
FIG. 4 is a block diagram illustrating an exemplary system architecture of the present teaching, in one embodiment of the present teaching; -
FIG. 5 is an operational flow diagram depicting an exemplary process of the translation server, according to one embodiment of the present teaching; -
FIG. 6 is an operational flow diagram depicting an exemplary serving process of the translation server, according to one embodiment of the present teaching; -
FIG. 7(a) is a block diagram depicting an exemplary serving process in an ASP model of the translation server, according to one embodiment of the present teaching; -
FIG. 7(b) is a block diagram depicting an exemplary process in an ASP model of the translation server when the content to be translated is not present on the web site or is not delivered to the user via the web site, according to one embodiment of the present teaching; -
FIG. 8(a) is a block diagram depicting an exemplary serving process in a web service model of the translation server, according to one embodiment of the present teaching; -
FIG. 8(b) is a block diagram depicting an exemplary serving process in a web service model of the translation server when the content to be translated is not present on the web site or is not delivered to the user via the web site, according to one embodiment of the present teaching; -
FIG. 9 is a screenshot of an exemplary WebCATT interface used for viewing web content for translation, in one embodiment of the present teaching; -
FIG. 10 is a screenshot of an exemplary WebCATT interface used for viewing a translatable image along with a corresponding translation, in one embodiment of the present teaching; -
FIG. 11 is a screenshot of an exemplary WebCATT interface used for editing a translatable segment of text, in one embodiment of the present teaching; -
FIG. 12 is a screenshot of an exemplary WebCATT interface used for viewing a translation queue, in one embodiment of the present teaching; -
FIG. 13 is an operational flow diagram depicting an exemplary process of WebCATT, according to one embodiment of the present teaching; -
FIG. 14 is an operational flow diagram depicting an exemplary process of the spider, according to one embodiment of the present teaching; -
FIG. 15 is an operational flow diagram depicting an exemplary synchronization process according to one embodiment of the present teaching; -
FIG. 16 is a block diagram showing a computer system useful for implementing the present teaching; -
FIG. 17 is a screenshot of an exemplary Preference Selector pop-up window on the user agent (e.g., a browser), according to one embodiment of the present teaching; -
FIG. 18 is an operational flow diagram depicting an exemplary process of loading Preference Selector, according to one embodiment of the present teaching; -
FIG. 19 is an operational flow diagram depicting an exemplary process of the Preference Selector server-side application, according to one embodiment of the present teaching; -
FIG. 20 is a block diagram depicting an exemplary process of the Preference Selector server-side application request and the response, according to one embodiment of the present teaching; -
FIG. 21 is a block diagram illustrating an exemplary system architecture of the Content Localizer, according to one embodiment of the present teaching; -
FIG. 22 is an operational flow diagram depicting an exemplary process of the Content Localizer Server for generating localized content, according to one embodiment of the present teaching; -
FIG. 23 is an operational flow diagram depicting an exemplary process of the Content Localizer Server for analyzing the request inputs against the conditions associated with a localized content to determine whether the conditions are met, according to one embodiment of the present teaching; -
FIG. 24 is an operational flow diagram depicting an exemplary process of the Translation Server for recognizing the areas of the page to be localized, according to one embodiment of the present teaching; and -
FIG. 25 is a block diagram depicting an exemplary process of the Content Localizer Server request and the response, according to one embodiment of the present teaching. - The methods, systems, and medium, disclosed in accordance with present teaching, overcome problems with the prior art by providing an efficient and easy-to-implement system and method for dynamic language translation of a web site.
-
FIG. 3 is a block diagram illustrating an exemplary system architecture of a web site presented in two languages, according to one embodiment of the present teaching. The web site shown inFIG. 3 may be presented in a first language, such as English, and a second language, such as Spanish.FIG. 3 shows theweb server 112 may be connected to theInternet 116 via a web connection. Apublic user 118 may access theweb server 112 via theInternet 116 and download information, such as a web page, from theweb server 112 for viewing. Theuser 118 may utilize a client application, such as a web browser, on a client computer to connect to the web site of via thenetwork 116. Once connected to the web site, theuser 118 may browse through the products or services offered by the web site by navigating through its web pages. - In this example, the
web server 112 is operated byprogramming logic 110, and theweb server 112 further has access to adatabase 102 of information, as well as HTML template files 104, graphics files 106 andmultimedia files 108, all of which constitute the English components of the web site served byweb server 112. -
FIG. 3 further includes atranslation server 300 situated apart from and existing independently from theweb server 112. Thetranslation server 300 may embody the main functions of the present teaching, including the provision of a web site in a secondary language, such as Spanish. Thetranslation server 300 may provide the secondary language components of a base web site, which is provided byweb server 112, without requiring integration with the base web site or re-configuring or re-engineering of theweb server 112. - As can be seen in the difference between
FIG. 2 andFIG. 3 , the deployment of the secondary language componentsFIG. 3 requires a significantly reduced expenditure of time and resources than the deployment ofFIG. 2 . Further, in this example, the deployment ofFIG. 3 does not require the re-engineering of theweb server 112. Additionally, once the secondary language components have been established by thetranslation server 300, they are automatically kept synchronized with the English language components of the base web site. Thus, the system of the present teaching reduces the amount of time, effort and resources that are required to deploy a secondary language web site. -
FIG. 4 is a block diagram illustrating an exemplary system architecture of the present teaching, in one embodiment of the present teaching.FIG. 4 presents an alternative point of view of the system architecture of the present teaching.FIG. 4 shows aweb site 414 representing a web site in a first language such as English that is connected to theInternet 412 via a web connection.FIG. 4 further shows auser 416 that utilizes a web connection to theInternet 412 to browse and navigate the web pages served by theweb site 414. -
FIG. 4 further shows atranslation server 400, corresponding to thetranslation server 300 ofFIG. 3 , and atranslation database 406 for use by thetranslation server 400 for storing translated components during the serving of web pages in a secondary language, such as Spanish. This process is described in greater detail below. Also shown inFIG. 4 is the Web Computer Aided Translation Tool (WebCATT), which is a tool for aiding a human 418 or anadmin 410 in translating the components of a web site in a first language. Further shown is aspider 404 for use in synchronizing, analyzing and sizing aweb site 414. Thetranslation server 400,WebCATT tool 408 andspider 404 may be connected to aweb server 402, which is the conduit through which all web actions of the above tools are channeled. Thetranslation server 400,WebCATT tool 408 are described in greater detail below. - In an embodiment of the present teaching, the computer systems of
translation server 400,WebCATT tool 408,spider 404 andweb server 402 are one or more Personal Computers (PCs) (e.g., IBM or compatible PC workstations running the Microsoft Windows 95/98/2000/ME/CE/NT/XP/VISTA/7 operating system, Unix, Linux, Macintosh computers running the Mac OS operating system, ANDROID, or equivalent), Personal Digital Assistants (PDAs), tablets, smart phones, game consoles or any other information processing devices. In another embodiment of the present teaching, the computer systems oftranslation server 400,WebCATT tool 408,spider 404 andweb server 402 are server systems (e.g., SUN Ultra workstations running the SunOS operating system or IBM RS/6000 workstations and servers running the AIX operating system). - In one embodiment of the present teaching,
Internet network 412 is a circuit switched network, such as the Public Service Telephone Network (PSTN). In another embodiment of the present teaching, thenetwork 412 is a packet switched network. The packet switched network includes a wide area network (WAN), such as the global Internet, a private WAN, a local area network (LAN), or any combination of the above-mentioned networks. In another embodiment of the present teaching,network 412 is a wired network, a wireless network, a broadcast network or a point-to-point network. In another embodiment of the present teaching,network 412 is a communication path among different processes within the same physical hardware or memory space. In another embodiment of the present teaching,network 412 is a combination of any of the above-mentioned networks. - The
translation server 400 is the application responsible for the conversion of web pages in one language to that in another language. Thetranslation server 400 may parse each incoming HTML page into translatable components, substitute each incoming translatable component with an appropriate translated component, and return the translated web page back to theonline user 416. Page conversion may be performed on the fly each time anonline user 416 requests a page in the second or alternate language. In one embodiment, when a web page is received for conversion, thetranslation server 400 will translate the page if enough translated content is available to meet a customer specified translation threshold. If this is not the case, then the page will be returned in the first or original language. - A translatable component may include any one of a text segment, an image file with text to be translated, a multimedia file with text or audio to be translated, a file with text to be translated, a file with image with text to be translated, a file with audio to be translated, a file with video and with at least one of text and audio to be translated, or any other suitable file. A text segment may be a single word, a short phrase, a sentence, a paragraph or multiple paragraphs, or any other suitable segment.
- In this example, the page conversion process follows seven major steps, some of which may be optional. In a first step, for each text segment encountered, if a translation is available, the text segment may be replaced with the translated text segment. If no translation is available, either the text remains in the original language or a machine translation may be performed on the fly, depending on the customer's preference. In a second step, for each linked file (images, PDF files, Flash movies, etc.) encountered if a translated file is available, the HTML, link tag may be rewritten so that it points to the translated file. If a translated file is not available, the original link tag may be left untouched. In a third step, any relative Universal Resource Locator (URL) found in the page may be converted to an absolute URL. This step may be necessary if the resolution of the relative URLs in the user agent (e.g., a browser) requires adjustment.
- In a fourth step, each JavaScript block may be parsed to identify translatable components, such as text or images, requiring translation. In a fifth step, each link to another web page may be rewritten so that the original URL is redirected to the
translation server 400. For example, when an online user clicks on a rewritten link, the request then goes directly to thetranslation server 400, and the page is in turn translated. This step may be necessary if resolution of relative URLs in the user agent (e.g., a browser) requires adjustment. This feature, which keeps the user in the alternate language as they browse the site, is called “implicit navigation”. - In a sixth step, for each directive tag or attribute found, an appropriate instruction may be performed. In a seventh step, the
translation server 400 may automatically schedule the web page for translation by placing it in theWebCATT 408 translation queue, in the event that an available translation cannot be found for one or more text segments or linked files in the page. -
FIG. 5 is an operational flow diagram depicting an exemplary process of thetranslation server 400, according to one embodiment of the present teaching. The operational flow diagram inFIG. 5 depicts how thetranslation server 400 responds to a user request for a web page in a secondary language. The operational flow diagram ofFIG. 5 begins withstep 502 and flows directly to step 504. - In
step 504, thetranslation server 400 may receive a request from auser 416 on aweb site 414, theweb site 414 having a first web content in a first language, such as English. The request, such as but not limited to an HTTP request or a Simple Mail Transfer Protocol (SMTP) request, may call for a second web content in a second language, such as Spanish. The second web content may be a human translation, machine translation, or human edited machine translation in a second language of the first web content. The first language includes any one of English, French, Spanish, German, Portuguese, Italian, Japanese, Chinese, Korean, Arabic, and any other suitable language, and the second language is different than the first language and includes any one of English, French, Spanish, German, Portuguese, Italian, Japanese, Chinese, Korean, Arabic, and any other suitable language. - In
step 506, thetranslation server 400 may retrieve the first web content from theweb site 414. Instep 508, thetranslation server 400 may divide the first web content into one or more translatable components. - In
step 512, thetranslation server 400 may identify one or more translated components of the second web content corresponding to one or more translatable components of the first web content. Instep 514, thetranslation server 400 may arrange or put the translated components of the second web content to preserve a format that corresponds to the first web content, including, for example, putting tags that are not visible in the first web content. Instep 516, thetranslation server 400 may provide the second web content in response to the request that was received. Instep 518, the control flow ofFIG. 5 stops. -
FIG. 6 is an operational flow diagram depicting an exemplary serving process of thetranslation server 400, according to an embodiment of the present teaching. The operational flow diagram ofFIG. 6 depicts the process of thetranslation server 400 of providing a web page in a secondary language in response to a user request and provides more details of steps 508-514 ofFIG. 5 . The operational flow diagram ofFIG. 6 begins withstep 601 and flows directly to step 602. - Step 601 begins with a source HTML, page or first web content of
step 506 ofFIG. 5 . Instep 602, at least one portion of the first web content may be parsed into translatable components. Instep 603, it may be determined whether the end of the file of the first web content is reached. If it is affirmative, then control flows to step 612. Otherwise, control flows to step 604. Instep 604, it may be determined whether the translatable component that was parsed instep 602 is a text segment. If it is affirmative, then control flows to step 606. Otherwise, control flows to step 614. - In
step 606, a matching translated text segment may be looked up in a cache. Instep 607, it may be determined whether the matching translated text segment is found in the cache. If it is affirmative, then control flows to step 609. Otherwise, control flows to step 618. Instep 609, it may be determined whether translation of the text segment is suppressed or not yet translated. If it is affirmative, then control flows to step 621. Otherwise, control flows to step 610. - In
step 610, the matching translated text segment may be set as a target segment. Instep 621, the current text segment may be set as the target segment. Instep 640, the target segment may be added to the output web content, or second web content (i.e., the translated HTML page or the output HTML page). Instep 623, the second web content may be output for provision to the user requesting the web page. - In
step 612, it may be determined whether there is an incomplete translation of the current web page, i.e., the first web content. If it is affirmative, then control flows to step 613. Otherwise, control flows to step 611. Instep 613, the current web page may be scheduled for translation. Instep 611, the translation activity performed by thetranslation server 400 in servicing the current web page may be recorded in thetranslation database 406. Instep 625, it may be determined whether the percentage of the current web page, i.e., the first web content, translated is above a threshold. If it is affirmative, then control flows to step 624. Otherwise, control flows to step 626. Instep 624, the second web content or translated HTML, page may be output for provision to the user requesting the web page. Instep 626, the current web page or first web content may be output unchanged for provision to the user requesting the web page. - In
step 614, it may be determined whether the translatable component parsed instep 602 is a translatable file, such as a PDF file, an image file, etc. If it is affirmative, then control flows to step 615. Otherwise, control flows to step 629. Instep 629, it may be determined whether the translatable component parsed instep 602 is a link to another translatable page. If it is affirmative, then control flows to step 628. Otherwise, control flows to step 627. Instep 627, a tag may be added to the translated HTML page to indicate a link (this is described in greater detail below). Instep 628, the link may be modified to redirect the URL (this is described in greater detail below). - In
step 615, a translated file corresponding to the translatable file may be looked up in a cache. Instep 616, it may be determined whether the translated file was found. If it is affirmative, then control flows to step 617. Otherwise, control flows to step 633. Instep 633, the translated file may be looked up in thetranslation database 406. Instep 635, it may be determined whether the translated file was found. If it is affirmative, then control flows to step 634. Otherwise, control flows to step 632. Instep 634, the translated file that was found may be stored in the cache. Instep 632, an incomplete translation may be recorded in thetranslation database 406. Instep 630, the original file may be set as the target file. Instep 631, the target file may be added to the translated HTML page. - In
step 617, it may be determined whether translation is suppressed for the translatable file. If it is affirmative, then control flows to step 630. Otherwise, control flows to step 636. Instep 636, the translated file may be set as the target file. Instep 618, a matching translated text segment may be looked up in thetranslation database 406. Instep 622, it may be determined whether the matching translated text segment is found in the database. If it is affirmative, then control flows to step 619. Otherwise, control flows to step 637. Instep 619, the translated segment that was found is stored in the cache. Instep 637, an incomplete translation may be recorded in thetranslation database 406. - In
step 638, it may be determined whether a machine translation of the text segment can be performed. If it is affirmative, then control flows to step 639. Otherwise, control flows to step 621. Instep 639, the machine translation may be set as the target segment. - The
translation server 400 can be presented in a variety of models. For example, in the Application Service Provider (ASP) model, thetranslation server 400 may convert full web pages or script files at a time and deliver them directly to theonline user 416. Under this model, all links in a web page may be redirected through thetranslation server 400. - Clicking on a link in a translated page results in the user agent (e.g., a browser) request being sent to the
translation server 400. Thetranslation server 400 in turn may request the original language page from the originallanguage web server 414, convert it to the alternate language, and send it back to theuser 416. -
FIG. 7(a) is a block diagram depicting an exemplary serving process in an ASP model of thetranslation server 400, according to one embodiment of the present teaching. In afirst step 702, theuser 416 may click on a link of a web page in a first language on theweb site 414. The link points to a page to be translated. Thetranslation server 400 may receive the request and process it. In asecond step 704, thetranslation server 400 may forward the request to theweb site 414, and in athird step 706, theweb site 414 may provide the page to thetranslation server 400 for translation. In afourth step 708, thetranslation server 400 may translate the page using the translations in thetranslation database 406 and send the translated page to theuser 416. -
FIG. 7(b) is a block diagram depicting an exemplary translation process of the translation server based on an ASP model when the content to be translated is not present on thecustomer web site 414 or is not delivered to the user viaweb site 414, according to an embodiment of the present teaching. The content to be translated in this embodiment includes, but is not limited to, electronic mails and/or other types of messages (e.g., messages that use protocols and services such as SMTP, SMS and MMS). In afirst step 704, an application (e.g., an electronic mail application or a text message application) running onweb site 414, may send a request to theTranslation Server 400 for translation of text content generated or delivered by the application. In asecond step 706, if a translation is not found for all or a part of the message, theTranslation Server 400 may optionally store the content to be translated and schedule it for translation. In athird step 708, theTranslation Server 400 may send the translated content to theuser 416. In one example,step 3 may take place some time afterstep 2. - In the web service model, the translated content may not be delivered directly to the
online user 416. Instead, the customer'sweb site server 414 may issue the request for translation to thetranslation server 400, which acts as a web translation service. Under this model, thetranslation server 400 can convert full pages or just specific text segments and/or files. When directly translating text segments or files, multiple translation requests can be issued, one per segment or file, or multiple segments and files can be translated in a single batched request. -
FIG. 8(a) is a block diagram depicting an exemplary serving process in a web service model of thetranslation server 400, according to an embodiment of the present teaching. In afirst step 802, theuser 416 may click on a link of a web page in a first language on theweb site 414. The link points to a page to be translated. Theweb site server 414 may receive the request and processes it. In asecond step 804, theweb site 414 may provide the page to thetranslation server 400 for translation. In athird step 806, thetranslation server 400 may provide the translated page to theweb site 414. In afourth step 808, theweb site 414 may send the translated page to theuser 416. -
FIG. 8(b) is a block diagram depicting an exemplary serving process in a web service model of the translation server when the content to be translated is not present on theweb site 414 or is not delivered to the user via theweb site 414, according to an embodiment of the present teaching. The content to be translated in this operational mode includes, but is not limited to, electronic mails and other types of text messages (e.g., ones that use protocols and services such as SMTP, SMS and MMS). In afirst step 804, a customer application (e.g., an electronic mail application or a text messaging service application) running onweb site 414, may send content to be translated (e.g., an email or a message) to theTranslation Server 400 for translation. In asecond step 806, if a translation is not found for either all or a part of the content to be translated, theTranslation Server 400 may optionally store the content and schedule it for translation. In athird step 808, theTranslation Server 400 may send the translated content back to the customer application running onweb site 414. In one example,step 3 may take place some time afterstep 2. In afourth step 810, the customer application may send the translated content back to theuser 416. - The hosting and management model may define who deploys and manages the hardware and operating system software in which the software components of the present teaching reside. There are two hosting and management models: hosted & managed, and managed only. Alternately, the software can be directly to the customer, and the customer is responsible for both the hosting and management.
- The hosted and managed model may be a fully outsourced model in which one entity hosts the service and all translated data. Under this model, one entity may deploy the
translation server 400 andWebCATT 408 software on its own hardware. All hardware and software may be provisioned and maintained by this entity, so thecustomer web site 414 has no responsibility for any hardware or software related to the service. In this model, the hosting entity may be responsible for: 1) provisioning, installing, configuring and maintaining all hardware, including communication to theInternet 412, 2) installing, configuring and maintaining all operating system, web server and database server software, 3) installing, configuring and managing on an ongoing basis thetranslation server 400 andWebCATT 408 software, and 4) maintaining staff and subcontractors that use theWebCATT 408 software to perform the translations that maintain the alternate language site in sync with the original language site. - In the managed only model, the
translation server 400 andWebCATT 408 software may be installed on the customer web site's hardware. In this model thecustomer web site 414 maybe responsible for: 1) provisioning, installing, configuring and maintaining all hardware, including communication to theInternet 412, and 2) installing, configuring and maintaining all operating system, web server and database server software. The managing entity may be responsible for: 1) installing, configuring and managing on an ongoing basis thetranslation server 400 andWebCATT 408 software, and 2) maintaining staff and subcontractors that use theWebCATT 408 software to perform the translations that maintain the alternate language site in sync with the original language site. - Dedicated Vs. Shared Servers
- The components of the present teaching can be deployed in dedicated or shared server environments. In a shared environment multiple customer web sites may share the same hardware. In a typical scenario,
multiple translation servers 400 may be installed in thesame web server 402, which connects to a database server containing thedatabase 406 of translated data. Asingle WebCATT 408 software installation may also be shared by multiple customers. This setup is cost efficient and can be used for small and medium size sites with low-to-moderate web site traffic. - In a dedicated environment all hardware may be dedicated to one
customer web site 414. This may be necessary for large organizations with heavy web site traffic and large amounts of text to be translated. In this case, either asingle web server 402 or a cluster of web servers may be dedicated to the customer. The database server normally may also be dedicated to the customer. Dedicated servers may be used to assure guaranteed bandwidth for the customer and simplify keeping track of bandwidth usage for management and billing purposes. - The system of the present teaching may not save or maintain translated pages, except, e.g., in temporary caches for the purpose of improving response performance. Although, this may be useful for sites with static content, it becomes unmanageable for sites whose content is generated dynamically from database information in response to a user's request. Instead, the present teaching may be designed to store only those components within a web page that require translation, i.e., translatable components.
- Parsing is the process of breaking-up an HTML, page submitted for translation into its translatable and non-translatable components. Non-translatable components simply pass through the system unchanged (except for URLs that need rewriting). Translatable components are processed and replaced by their translated counterparts if available. There are generally two types of translatable components in a web page: text segments and files. A translatable component may include any one of a text segment, an image file with text to be translated, a multimedia file with text or audio to be translated, a file with text to be translated, a file with image with to be translated, a file with audio to be translated and a file with video and with at least one of text and audio to be translated.
- A text segment is a chunk of text on a page. A text segment can range from a single word to a paragraph or multiple paragraphs. A file is any type of external content that resides on a file, is linked from within the page, and may require translation. Typical types of linked files found in web pages include, but are not limited to, images, PDF files, MS Word documents, and Flash movies.
- Below is an example of a very simple HTML page:
-
<html><head><title>Widget Product Information</title></head> <body>Widget<b>Model# 123</b> <p>This widget is very useful for many chores around the house. <p><img src=“img/widget_picture.gif” alt=“Product photo”> <p><a href=“https://www.abcwidgets.com”>Click here to return to the home page</a></body></html> - The above example page may be parsed into the following six text segments: 1) ‘Widget Product Information’, 2) ‘Widget’, 3) ‘Model# 123’, 4) ‘This widget is very useful for many chores around the house’, 5) ‘Product photo’, and 6) ‘Click here to return to the home page’. The above example page would further be parsed into the following one file: img/widget_picture.gif.
- By default the parsing system may break-up text segments taking into consideration the surrounding HTML tags in the page. In the above example, the sentence ‘Widget Model# 123’ was broken-up into two segments because there was an HTML bold tag (<b>) in the middle of it. However, the parsing system may be flexible and allow defining, which HTML tags are formatting tags that do not break up text segments. So, if we define the bold tag as a formatting tag, then the example page would instead be parsed into the following five text segments: 1) ‘Widget Product Information’, 2) ‘Widget <b>Model# 123</b>’, 3) ‘This widget is very useful for many chores around the house’, 4) ‘Product photo’, and 5) ‘Click here to return to the home page’.
- The bold tags now became part of the second text segment, allowing the translator to place them in the correct location in the alternate language. For example, translating the text segment ‘Widget <b>Model# 123</b>’ to Spanish will result in flipping the order of the ‘Widget’ and ‘Model’ words within the sentence. Since the bold tag is part of the text segment, it can be moved to still bold the word ‘Model’, as shown: <b>Modelo No. 123</b> de Artefacto
- Below is an example of how the example page is converted to Spanish by the translation server 400:
-
<html><head><title>Informacion del Artefacto</title></head> <body> <b>Modelo No. 123</b>del Artefacto <p>Este artefacto es muy útil para todo tipo de trabajos en la casa. <p><img src=“https://espanol.abcwidgets.com/img/ ES_24.gif” alt= “Foto del Producto”> <p><a href=“https://espanol.abcwidgets.com”>Haga clic aqui para regresar a la pagina principal</a></body></html> - In order to convert the page, the
translation server 400 may perform several changes to the page. Each text segment may be replaced with a corresponding translation. It is noted that the text of the image description (‘Product photo’) placed in the ‘all’ attribute of the image tag may be recognized as a text segment and translated. Thetranslation server 400 can recognize text segments inside attributes of HTML tags, such as the text in buttons of a form. - Further, the URL of the image tag may be replaced to point to a translated image file. The
translation server 400 may only execute this action if a translated file has been defined (since many images do not have text and thus do not require translation), otherwise it may not change the URL of the image (except to make the URL absolute if necessary). In this example, it is assumed that the ‘ES.sub.--24.gif’ image file was defined inWebCATT 408 as the translation for the ‘widget_picture.gif’ file. - The URL of the home page link may be rewritten from ‘https://www.abcwidgets.com’ to ‘https://espanol.abcwidgets.com’ in order to redirect it to the
translation server 400. When the online user clicks on the ‘Click here to return to the home page’ link, the request may go directly to thetranslation server 400, and the home page may also be translated. This process is called “implicit navigation”, and it is explained in more detail below. - Implicit navigation is a
translation server 400 feature that keeps anonline user 416 in the alternate language as he/she browses a web site. Implicit navigation can be made automatically because the domain name of a translated site may be different from the domain name of the original language site, or if necessary may be implemented by rewriting the URLs in the applicable links inside a page as the page is being translated, so they are redirected to thetranslation server 400. As a result, not only is the page translated, but also all applicable links to other translated pages within the page may be modified when needed if necessary, so that when the consumer clicks on the linked page, the translation is available. - To rewrite a link, the
translation server 400 may change the domain name in the original URL with the domain name of thetranslation server 400. When a rewritten link is clicked, the request may go to thetranslation server 400, which computes the original URL to be translated based on the path and/or its internal mappings and request the page to be translated from this URL. Thetranslation server 400 then may convert the page received to the alternate language and deliver the translated page to the consumer directly. - The scope of implicit navigation can be pre-defined by domain and/or URL patterns. In a typical scenario, only pages being served from a specific domain(s) may be translated. In the ABC Widgets example, if the implicit navigation domains are defined as abcwidgets.com and abcwidgets.net, then only URLs within those two domains will be rewritten. If a more granular translation is required, such as when translating only part of a web site, then URL patterns can be used. For example, if ABC Widgets wishes not to translate the careers and investor relations sections of their site, then the following two example Exclude URL patterns could be used: 1) abcwidgets.com/careers/ and 2) abcwidgets.com/investor/.
- Any URLs for pages residing within the above two paths may not be rewritten and thus never translated. On the other hand, if ABC Widgets wishes only to translate its online product catalog, then the following example Include URL pattern could be used: abcwidgets.com/catalog/.
- In that case, only pages residing within the abcwidgets.com/catalog/path are rewritten and thus translated. Include and Exclude URL patterns may be combined to better define the scope of the translation. Implicit navigation can also be controlled from within the HTML to be translated through the use of directive tags or directive attributes. These are explained in detail in below.
- The system according to the present teaching enables translation and optimization of URLs in order to improve the ranking of the translated pages on search engine indexes. In the case that the original URLs on the
customer web site 414 contain words or phrases in the first language that may be relevant or optimized for search engines, such words and phrases can be translated by theTranslation Server 400 into the second language to derive translated URLs on the translated web site. This allows the translated web site in the second language to maintain the search engine URL optimization of thecustomer web site 414. - In some embodiments, to achieve such URL translation, the original URL, representing the web content in the first language, is processed to identify translatable URL component(s) and for each such translatable URL component, a translated URL component can be obtained through translation into the second language, so that such a translated URL component can be used to replace the corresponding translatable URL component in the original URL. A translated URL may then be derived once the relevant translatable URL component(s) is replaced with corresponding translated URL component(s). In some embodiments, the translated URL components can be stored for future re-use. This URL translation process can be applied to both search engine optimized URLs or other URLs that have not been search engine optimized
- However, quite often, dynamic and e-commerce websites use URLs that are not search engine optimized. It is common for e-commerce sites to use cryptic, generic or repetitive page names combined with parameters. Below is an example of such a URL which displays information about SONY'S product BRAVIA 46″ LCD HDTV:
- https://www. abcwidgets.com/site/olspage.jsp?skuId=9276286&productCategoryId=abcat0 101001&type=product&id=1218073534751&session=12345
- Search engines, such as GOOGLE, place a great emphasis on keywords found on a URL versus keywords found within the content of a page. As a result, it would be beneficial for websites to place search keywords in the actual URL of the page and minimize the use of other, e.g., cryptic parameters. However, due to the restrictions of e-commerce engines and/or the great difficulty associated with changing the URL structure of a website, this is rarely done.
- The system according to the present teaching provides a solution to this problem by generating search engine optimized URLs in the second language that map to the customer web site's 414 non-optimized original URLs. For example, on a Spanish site the above ABC Widgets SONY BRAVIA 46″ LCD HDTV original URL can be translated into:
- https://espanol.abcwidgets.com/televisiones-sony-bravia-xbr-clase-46-1080p-240hz-1cd-hdtv-kd1-46xbr9/?session=12345
- The above translated URL is optimized to contain keywords that include the category, manufacturer, brand model number, and short description of the product in Spanish, which makes the URL optimized for search engines. The
Translation Server 400 may optimize an original URL by identifying (disclosed below) search engine relevant content already present on the page in the first language and placing the corresponding content in the second language on the URL of that translated page. For example, below is the HTML content that the ABC Widgets SONY BRAVIA 46″ LCD HDTV original URL returns in the first language: -
<html> <head> <title>Televisions - Sony - BRAVIA XBR 46″ Class / 1080p / 240Hz / LCD HDTV - KDL- 46XBR9</title> <meta name=“keywords” content=“SONY, BRAVIA XBR 46″ Class / 1080p / 240Hz / LCD HDTV, KDL-46XBR9, LCD Televisions, Televisions”> <meta name=“description” content=“ SONY BRAVIA XBR 46″ Class / 1080p / 240Hz / LCD HDTV: 4 HDMI inputs; Ethernet port; black cabinet; 16:9 aspect ratio”> </head> <body> <h1>Sony - BRAVIA XBR 46″ Class / 1080p / 240Hz / LCD HDTV</h1> <p>Product Description <p>With 4 HDMI inputs, a USB port and Ethernet connectivity, the Sony BRAVIA XBR 46″ flat-panel LCD HDTV provides an ideal centerpiece for your multimedia home theater system. <p>Get access to great Instant Content on this LCD HDTV. Connect to the Internet and you will have instant access to stream movies, listen to music and a wide variety of content through your HDTV. </body> </html> - In the above page, there are several elements that are search engine relevant and which can be placed in the URL for optimization. These elements include the document title, the text within the H1 tags, the meta-description, the meta-keywords and other text within the body. The
Translation Server 400 can automatically pick the most relevant element in the page based on which element better describes the content of the page, or such element can be manually predefined. Alternatively, the most relevant element may be determined in a semi-automated manner. For example, theTranslation Server 400 may automatically detect candidates of relevant elements, and a human operator may then interact with theTranslation Server 400 to select one or more candidates of relevant elements as the most relevant element to be used to generate optimized URL. The human operator may also manipulate or even edit some candidate relevant elements to make them, e.g., capitalized, boldfaced, highlighted, etc. In addition, any arbitrary content in the page can be flagged as the most relevant for URL optimization via the use of Directive Tags. - For instance, in this example, the title of the document is identified as the most search engine relevant element in the page, which is shown again below:
- <Title>Televisions-Sony-BRAVIA XBR 46″ Class/1080p/240 Hz/LCD HDTV-KDL-46XBR9</title>
- Once the most relevant element is identified, the
Translation Server 400 may then look for a matching translation of the text of that element in the second language. Translation of this text typically occurs within the normal workflow of the translation of the page. For the given example, the Spanish translation of the above title is: - <title>Televisiones-Sony-BRAVIA XBR Clase 46″/1080p/240 Hz/LCD HDTV-KDL-46XBR9</title>
- The translation of the title is then converted into a URL friendly format to derive a translated search engine optimized path. In some embodiments, this is done by performing e.g., the following steps:
- 1. Replacing spaces, underscores and any other characters separating words with dashes or another character that search engines consider as a separator of words in a URL.
- 2. Removing all characters that are not search engine friendly, such as numbers and symbols.
- 3. Optionally lowercasing all letters.
- 4. Optionally shrinking the size to a maximum size at the closest word boundary.
- 5. Optionally removing or adding a string of text.
- For this example, the resulting translated search engine optimized path obtained from the translated title is:
- /televisiones-sony-bravia-xbr-clase-46-1080p-240 hz-lcd-hdtv-kdl-46xbr9
- The
Translation Server 400 may then use the translated search engine optimized path to generate a search engine optimized URL in the second language. In some embodiments, the process to achieve that may start by breaking up the original URL on the customer web site into its origin host, path, and query string elements, as shown below: - 1. Origin Host: https://www.abcwidgets.com
- 2. Origin Path: /site/olspage.jsp
- 3. Origin-Query-String:
- skuId=9276286&productCategoryId=abcat0101001&type=product&id=121807353475-1&session=12345
- The query string is then further split into a number of parameters, each of which may be a name=value pair, e.g.,:
- skuId=9276286
- productCategoryId=abcat0101001
- type=product
- id=1218073534751
- session=12345
- Each parameter may be examined to determine whether the value of the parameter contributes to an identification that uniquely identifies the content, in this case a product. In the given example, all the parameters, except the session parameter, are considered to contribute to an identification that can uniquely identify the product. A session parameter is specific to a user's session and may change over time. Parameters that contribute to an identification that uniquely identifies the content may be included in the search engine optimized URL and parameters that do not may be excluded. For the given example, the following parameters are either included or excluded:
- Included Parameters:
- skuId=9276286
- productCategoryId=abcat0101001
- type=product
- id=1218073534751
- Excluded Parameters:
- session=12345
- The origin path, together with the included parameters, can be mapped to the translated search engine optimized path, which can then be used in place of the origin path and included parameters. The origin host may be replaced by the host name of the translated site. Finally the excluded parameters may be added back to the translated URL after it has been optimized. For example, the resulting translated search engine optimized URL in the second language is shown below:
- https://espanol.abcwidgets.com/televisiones-sony-bravia-xbr-clase-46-1080p-240 hz-lcd-hdtv-kdl-46xbr9?session=12345
- As mentioned before, the search engine optimized path can also be obtained from another part of the document instead of the title. This can include the H1 header, a meta-description, or any arbitrary content in the page identified by specific Directive Tags.
- In this example, if the search engine optimized path is obtained from the H1 header, then the resulting translated SEO optimized URL would be:
- https://espanol.abcwidgets.com/sony-bravia-xbr-clase-46-1080p-240 hz-lcd-hdtv?session=12345
- Below is an example where arbitrary content in the page is defined as the most search engine relevant for URL optimization via exemplary “mp_trans_seo_url_title” Directive Tags. In this example, the tags are used around the product description:
- <!--mp_trans_seo_url_title_start-->
- <p>With 4 HDMI inputs, a USB port and Ethernet connectivity, the Sony BRAVIA XBR 46″ flat-panel LCD HDTV provides an ideal centerpiece for your multimedia home theater system.
- <!-- mp_trans_seo_url_title_end-->
- With the above tags, the resulting translated search engine optimized URL becomes:
- http: //espanol.abcwidgets.com/con-4-entradas-hdmi-un-puerto-usb-y-conectividad-ethernet-la-television-sony-bravia-xbr-46-flat-panel-lcd-hdtv-le-brinda-un-perfecto-centro-de-atencion-para-su-sistema-de-teatro?session=12345
- When the
Translation Server 400 receives a request containing a search engine optimized URL, theTranslation Server 400 may automatically convert the search engine optimized URL into the equivalent non-optimized original URL representing the customer web site based on the above described mappings in order to retrieve the actual content for translation. To convert the translated URL back to the original URL, in some embodiments, theTranslation Server 400 looks up the translated optimized path in the database and finds the corresponding origin path and included parameters. It then replaces the translated optimized path with the origin path and adds the included parameters to the query string. - To aid this process, in some embodiments, an identifier that uniquely identifies the mapping in the database may be added to the translated search engine optimized URL. For example, if the origin path and the included parameters are mapped to the translated search engine optimized path in the database using an identifier, e.g., a numeric identifier, then this identifier can be added to the translated search engine optimized URL. Using such an identifier in the translated search engine optimized URL improves the performance in looking up the mapping, making the lookup operation resilient to changes in the translated text incorporated in the URL. Below is an example that shows a numeric identifier of 100 added to the end of the translated search engine optimized URL in the second language:
- https://espanol.abcwidgets.com/televisiones-sony-bravia-xbr-clase-46-1080p-240 hz-lcd-hdtv-kdl-46xbr9/100/?session=12345
- In the above example, even if the translation for “televisions” is changed from “televisiones” to “tvs” in the database, the use of an identifier (rather than the translated text) to lookup the mapping ensures that the correct origin path and parameters are correctly retrieved from the database. The identifier in the URL may also be encoded to reduce the required space needed for the URL.
- The system of the present teaching may enable users to access the same original language e-commerce database in multiple languages. Since the
translation server 400 may process web pages after they have left thecustomer web site 414, but before they reach theuser 416, it may not affect a web server's e-commerce technology. As a result, thesame web site 414 can be accessed in multiple languages, and all users may access the same e-commerce database simultaneously. - For example, an auction web site can allow users in different countries to bid on the same item. Each user can view the site and bid on the item in his/her native language. Since all bids from the different countries are actually hitting the same web site and the same e-commerce engine through the translation server, all bids occur in real time, and each user can see in real- time what all the other users in all other countries are bidding.
- Occasionally, the meaning of a word or phrase may change depending on the context in which it's being used. It is also possible that the translation itself may vary depending on the context or placement of a text segment, even if the original meaning does not change. As a result, it may be necessary to specify multiple translations for the same word or phrase, one for each usage context. The system of the present teaching allows translators to do this by providing the ability to “lock” text segments together. When two or more text segments are locked together they may be used only when the exact translation sequence is followed.
- For example, the translation to Spanish of the text segment “Virtual Brochures” can vary, depending on where it is used. Below is this segment used in an English HTML sentence: <b>Virtual Brochures</b>are great. The corresponding translation to Spanish is: <b>Los Folletos Virtuales</b>son excelentes. Another example of a segment used in an English HTML sentence: There are many great <b>Virtual Brochures</b>. The corresponding translation to Spanish is: Hay muchos excelentes <b>Folletos Virtuales</b>
- For this example, it is assumed that the HTML bold (<b>) tag is not defined as a formatting tag and, therefore, forces each sentence above to be broken up into two text segments each. As a result, the phrase “Virtual Brochures” becomes a separate text segment that requires a different translation for each case. Using the text segment locking feature in
WebCATT 408, the translator locks the “Los Folletos Virtuales” translated segment with the “son excelentes” translated segment in the first sentence, and the “Hay muchos excelentes” translated segment with the “Folletos Virtuales” translated segment in the second sentence. - At conversion time, when the
translation server 400 encounters the “Virtual Brochures” segment in the first sentence, it looks up a corresponding translated segment and gets back two potential matches: “Los Folletos Virtuales” and “Folletos Virtuales”. It then proceeds to look up a translated segment for the next segment “are great” and gets back “son excelentes”. Since “son excelentes” is locked to “Los Folletos Virtuales”, thetranslation server 400 is able to determine that “Los Folletos Virtuales” is the correct translation to the previous segment “Virtual Brochures”. - The
translation server 400 may transparently handle form submissions via GET or POST methods. This means that all form data may be forwarded to the original URL that processes the form and that the response page may be converted to the alternate language. - The
translation server 400 is capable of translating text segments and files located inside JavaScript code, VBScript code, CSS code, XML, AJAX messages, AMF code and many other complex web based technologies and formats by parsing the code or message and recognizing translatable components. - Translation of content inside files, such as JavaScript, CSS and VBScript, is also supported. A script included file may be downloaded by the user agent (e.g., a browser) in a separate HTTP request and included in the web page as if it had appeared within the page. Script included files may be handled in the same manner as implicit navigation in standard links within the page. The user agent may request the script included file from the
translation server 400, which will compute the URL of the original script included file and request it from its location. Thetranslation server 400 then may read the file, perform the appropriate conversions, and deliver the modified file to the user agent for inclusion in the web page. - JavaScript included files may be specified using the source (src) attribute in the <SCRIPT>tag, as shown: <script language=“javascript” src=“menthjs”></script>
- Shown is an example of how the above script tag is rewritten so the content inside the JavaScript include file is translated: <script language=“javascript” src=“https://espanol.abcwidgets.com/menu.js”></script>
- Directive tags and directive attributes are special HTML tags and attributes that allow more granular control over the translation, implicit navigation and other translation server behavior within in a web page. Directive tags are special HTML comments tags that are ignored by the user agent (e.g., a browser), but provide specific instructions to the
translation server 400. Directive attributes are specially named attributes placed within an HTML tag that are also ignored by the user agent (e.g., a browser), but provide specific instructions to thetranslation server 400 that apply only to the tag in which the attribute is placed. - Translation control tags and attributes can be used to specify sections on a web page that should not get translated. One application of translation control tags is to delimit personal information, such as a person's name, address, credit card numbers, etc. that may show up in a web page, but which may not need to be processed—it may simply pass through the
translation server 400 without being translated or stored—for security and privacy issues. - Following is an exemplary list of directive tags. The directive tag “mp_trans_partial_start & mp_trans_partial_end” signals the start and end of a partial translation section. This tag may be used at the top of a web page in conjunction with section translate tags to selectively translate sections of a page. The directive tag “mp_trans_enable_start & mp_trans_enable_end” signals the start and end of a section to be translated within a partial translation section. All text and files within this section may be translated. The directive tag “mp_trans_disable_start & mp_trans_disable_end” signals the start and end of a section not to be translated when in normal translation mode. The directive tag “mp_trans_machine_start & mp_trans_machine_end” signals that any text segments enclosed within the tags may be machine translated in the event that a human translation is not available.
- Following is an exemplary list of directive attributes. The directive attribute “mpdistrans” disables translation of a file or of translatable text in a tag, such as alt, keywords or description meta-tag, or form buttons.
- Below is an example of usage of translation control directive tags and attributes:
- <html><head>
- <meta name=“description” content=“This page description is translated”><meta mpdistrans name=“keywords” content=“These keywords are not translated, keyword1, keyword2, keyword3, keyword4, keyword5”>
- <title>This title is translated</title></head><body>
- This text and the image widget1.gif below are translated.
- <img src=“img/widget1.gif”alt=“This image description is translated”>
- <p><img mpdistrans src=“img/widget2.gif” alt=“This image and this description are NOT translated because of the mpdistrans attribute”>
- <!--mp_trans_disable_start-->
- This text and the image widget3.gif below are NOT translated because they are inside a translation disabled section.
- <img src=“img/widget3.gif.gif” alt=“This image description is NOT translated”>
- <!--mp_trans_disable_end-->This text is translated.
- <!--mp_trans_partial_start-->This text is NOT translated because it is inside a partially translated section and not specifically designated as translatable content.
- <!--mp_trans_enable_start-->This text is translated because it is inside a partially translated section and it is specifically designated as translatable content.
- <!--mp_trans_enable_end-->This text is NOT translated because it is inside a partially translated section and not specifically designated as translatable content.
- <!--mp_trans_partial_end-->This text is translated.</body></html>
- Following is an exemplary list of directive attributes for implicit navigation control. The directive attribute “mpnav” enables implicit navigation for listed attributes in the tag. This attribute can be used for tags that do not normally contain URLs, but actually do contain URLs. The directive attribute “mpdisnav” disables implicit navigation for all attributes or only listed attributes of the tag. The directive attribute “mporgnav” forces original navigation for all attributes or only listed attributes of the tag. Original navigation may remove redirection to the translation server if found, otherwise it may leave the link intact. This directive attribute is discussed below with reference to one-link deployment.
- Below is an example of usage of implicit navigation control directive attributes.
-
<html><body>ABC Widgets Home Page <p><a href=“widgetsjsp”>See all useful widgets</a> <p><a mpdisnav href=“uselesswidgetsjsp>See useless widgets</a> <p><form action=“showwidget.jsp” method=“post”> <select name=“WidgetSer”> <option value SELECTED>Select a widget to view:</option> <option mpnav=“value” value=“widget1.jsp”> Widget 1</option><option mpnav=“value” value=“widget2.jsp”> Widget 2</option></select></form></body></html> - The
translation server 400 may process the above page as follows: -
<html><body>Pagina Principal de ABC Widgets <a href=“https://espanol.abcwidgets.com/widgets.jsp”>Ver artefactos útiles</a> <p><a mpdisnav href=“https://www.abcwidgets.com/uselesswidgets.jsp>Ver artefactos inútiles</a> <p><form action=“https://espanol.abcwidgets.com/showwidget.jsp” method=“pose”> <select name=“WidgetSel”> <option value SELECTED>Escoga un artefacto para verlo:</option> <option mpnav=“value” value=“https://espanol.abcwidgets.com/ widget1.jsp”> Artefacto 1</option> <option mpnav=“value” value=“https://espanol.abcwidgets.com/ widget2.jsp”> Artefacto 2</option> </select></form></body></html> - It can be seen above that implicit navigation was not performed for the anchor (<A>) tag with the mpdisnav attribute. As a result, when the user clicks on the ‘Ver artefactos intútiles’ link, the uselesswidgets.jsp web page is not redirected to the
translation server 400 and therefore, it is not translated. Furthermore, the mpnav attribute placed in the two <OPTION> tags instructed thetranslation server 400 to perform implicit navigation on the URL specified in the value attribute of each tag. - One aspect of the present teaching is to eliminate or minimize the workload of a customer web site's IT department in order to deploy an alternate language web site. One-link deployment may allow a customer to deploy the alternate language web site by simply placing one language-switching link in the home page, navigation menu, or any other appropriate area of the original language site.
- In some embodiments, the one-link deployment may be a combination of two features: (1) automatic flipping of the language-switching link, and (2) implicit navigation to maintain the user in the alternate language. Automatic flipping of the language-switching link is specified by using the exemplary mporgnav directive attribute in the language-switching link. The mporgnav directive attribute may instruct the
translation server 400 to rewrite the URL to support automatic language switching. - Below is an example of a very simple home page:
-
<html><body>Welcome to the ABC Widgets Home Page <p><a href=“widgetsjsp”>Click here to see all widgets we sell<a> </body></html> - In some embodiments, a mirror Spanish language web site may be deployed by placing one link in the home page that redirects the home page to ABC Widget's
translation server 400. Below is an example of the above home page with the new language-switching link added: -
<html><body>Welcome to the ABC Widgets Home Page<p> <a mporgnav href=“https://espanol.abcwidgets.com”> Click here to see this site in Spanish</a> <p><a href=“widgets.jsp”>Click here to see all widgets we sell</a> </body></html> - When a user clicks the ‘Click here to see this site in Spanish’ language-switching link, the
translation server 400 may return the home page translated, as shown below: -
<html><body>Bienvenidos a la Pagina Principal de ABC Widgets<p> <a mporgnav href=“https://www.abcwidgets.com”>Haga clic aqui para ver este sitio web en Ingles</a><p> <a href=“https://espanol.abcwidgets.com/widgets.jsp”>Haga clic aqui para ver todos los artefactos que vendemos</a></body></html> - As shown above, in addition to translating the page, the
translation server 400 may also rewrite the URL in the language-switching link and perform implicit navigation of all other URLs in the page. Thetranslation server 400 may rewrite the URL in the language-switching link so that thetranslation server 400 redirection is removed. The exemplary mporgnav directive attribute may be used to instruct thetranslation server 400 to do this. In addition, the link text ‘Click here to see this site in Spanish’ may be translated as ‘Haga clic aqui para ver este sitio web en Ingles’ (which means ‘Click here to see this site in English’). This automatic and simultaneous change of both the URL and the text (or image) in the language-switching link by thetranslation server 400 is what allows the user to flip back-and-forth between English and Spanish. - Implicit navigation may be also performed in all the links on the page. In the above example home page, it was performed on the widgets.jsp page. As a result, when a user clicks on this rewritten link, the widgets.jsp page is in turn translated and implicit navigation performed on all of its links within the abcwidgets.com domain. This process may be repeated so that the user is always navigating the site in the alternate language.
- The
translation server 400 may allow delivering customized content according to the language and/or location in which a user is viewing the site. In some embodiments, when thetranslation server 400 requests a web page for translation, it sends two cookies to the original web server: one for language and another one for the country. The value of the language cookie is a 2 or 3-letter language code in compliance with theISO 639 standard. The value of the country cookie is a 2-letter country code in compliance with the ISO 3166 standard. - Web site server software can determine if a page is being viewed in an alternate language and/or a different country by checking for these cookies. For example, by checking that the language cookie exists, and that its value is ‘ES’, a web server can determine that a page is being served in Spanish and customize the content being served, such as showcasing items that appeal more to Hispanics. In addition, if a company maintains operations in multiple countries, then it can use the country cookie to determine the country and show only products sold or shipped to that country.
- When an
online user 416 who is viewing aweb site 414 in an alternate language performs an internal site search, it is natural for the user to enter the search keyword(s) in the alternate language. When thetranslation server 400 forwards the search keyword(s) to the original web site, the search engine may not be able to find any matching results, or might deliver incorrect results. This occurs because the web server search engine is matching the keyword(s) in the alternate language against a search index of keywords that are in the original language. - The
translation server 400 provides a solution to this problem by performing a real-time reverse machine translation on the search keyword(s) and forwarding the keyword(s) to the web server search engine in the original language. Reverse machine translation may be configured so it may be performed only on the specific keyword field(s) of the search form(s) in a web site. - The system of the present teaching is compatible with all Internet search engines, such as GOOGLE or ALTAVISTA. These search engines utilize content from both the body and head of the HTML document to index a web page. To ensure transparent compatibility with Internet search engines, the system of the present teaching may translate all applicable text in the head of the document. This includes, but is not limited to the page title, the page description meta-tag, and the keywords meta-tag.
- Integration with Machine Translation
- In some embodiments, the
translation server 400 uses real-time machine translation in the event that a human translation is not (yet) available. In addition, machine translation can be used as input or starting point for human translation or human post-editing. In that case, a human translator or editor post-edits the translation generated by machine translation to improve the translation. - In some embodiments, frequently used data is cached in memory to minimize repeated access to the
database 406. Thetranslation server 400 may make extensive use of memory caches to improve response performance. This includes, but is not limited to a text segment cache, a file cache, and a page cache. - As discussed herein, the
translation server 400 may not require IT integration with an existing web site infrastructure. The present teaching may convert the outbound HTML stream after it has left theclient web server 414. Thus, there is no need to re-architect an existing web site or build a separate web site for alternate language. Further, there is no client storage or management of translated data required. Translated data may be managed and maintained by theWebCATT 408 software outside of the web site's database. - The
translation server 400 may also work with any client web server hardware and software technology infrastructure. Further, it allows for evolution of the existing client's hardware and software technology infrastructure. Moreover, deployment of the present teaching requires minimal effort as a reduced amount of client IT resources are required. One-link deployment allows the client to place one link on theweb site 414 to provide access to the alternate language web site. Therefore, deployment is rapid and cost effective. - The WebCATT (Web Computer Aided Translation Tool) 408 is a web based Graphical User Interface (GUI) application that is used to perform and manage human translations. The tool may be built specifically translation of web content. It can be used by professional translators to translate web site translatable components and by managers to manage the translation process. Since
WebCATT 408 is a web-based application that is accessed via theInternet 412, translators and managers can be located in different geographical areas. -
WebCATT 408 may be similar to other computer aided translation tools used by professional translation service organizations.WebCATT 408 may support localization, text recognition, fuzzy matching, translation memory, internal repetitions, alignment, and a glossary/terminology database.WebCATT 408 may be designed for web site translation and include other features optimized for web translation, such as What You See Is What You Get (WYSIWYG) HTML previewing and support for image/graphic translation. -
WebCATT 408 may organize the translation workload into web pages. A web page may be, for example the HTML, XML, JavaScript, CSS or other type of web content generated by a specific URL address, regardless of whether that content is static (i.e., physically resides in the web server in a file), or dynamic (i.e., the content is generated dynamically by combining information from a database and HTML templates). Dynamic pages that are dependent on session information (i.e., a shopping cart checkout page) may be also supported. - Within a web page there are two types of units of translation that translators work with: text segments and files. A text segment is a chunk of text on the page. A text segment can range from a single word to a paragraph or multiple paragraphs. A file is any type of external content that resides on a file, is linked from within the page, and may require translation. Typical types of files found in web pages include, but are not limited to images, PDF files, MS Word documents, and Flash movies. A file may be translated by uploading a replacement file that has all text and/or sounds translated.
-
FIG. 9 is a screenshot of an exemplary WebCATT interface used for viewing the content of a web page, in one embodiment of the present teaching.FIG. 9 shows adisplay area 902 in which a web page including translatable component in a first language (in this case, English) is displayed. Also shown inFIG. 9 is asection 904 including information associated with the web page displayed indisplay area 902, such as page status, page URL, page ID, etc. Further shown inFIG. 9 is asection 906 including statistics associated with the web site from which the displayed web page is garnered, such as the number of files translated, the number of segments translated, the number of translations suppressed, etc. -
FIG. 10 is a screenshot of an exemplary WebCATT interface used for viewing a translatable component along with a corresponding translation, in one embodiment of the present teaching.FIG. 10 shows adisplay area 1002 in which an original image file translatable component is displayed in a first language (in this case, English).FIG. 10 shows adisplay area 1004 in which a translated image file is displayed in a second language (in this case, Spanish). Also shown inFIG. 10 is asection 1006 including information associated with the file displayed in display areas 1002-1004, such as file status, file URL, file ID, etc.FIG. 10 shows howWebCATT 408 allows a user to view a translatable component alongside a corresponding translated component for comparison. -
FIG. 11 is a screenshot of an exemplary WebCATT interface used for editing a translatable component, in one embodiment of the present teaching.FIG. 11 shows adisplay area 1102 in which a web page including a translated component in a second language (in this case, Spanish) is displayed. Thedisplay area 1102 provides a WYSIWYG web page preview feature that allows viewing the translated web page as it is being translated. Translations can often result in a significant amount of word growth (e.g., approx. 20% from English to Spanish) or shrinkage, which can result in carefully formatted web page layouts being knocked out of alignment by the longer text. The WYSIWYG page preview feature allows translators to immediately see the translated web pages and quickly make adjustments in word choice in order to maintain the correct alignment and layout of the page when translated. - Also shown in
FIG. 11 is asection 1104 including information associated with the web page displayed indisplay area 1102, such as page status, page URL, page ID, etc. Further shown inFIG. 11 is asection 1106 including statistics associated with the web site from which the displayed web page is garnered, such as the number of files translated, the number of segments translated, the number of translations suppressed, etc. In addition to each of those statistics, a breakdown of translated and not translated components is shown in both units and percentages. - A
section 1110 provides a text segment edit form that allows a translator to edit text segments in the order they appear on the page. This form features a fuzzy search feature that automatically shows and sorts existing segment matches in the database. The translator can copy an existing translation from the search results area to use as a starting translation. - A
section 1108 provides a file list form that allows a translator to preview all linked files on the page. The list form allows the translator to select all files that do not require translation (e.g., an image with no text) and quickly tag them as such. It also allows a translator to select individual files for translation via the file edit form. File translation may involve uploading a translated file and translating the file text description if present. - The GUI as shown
FIG. 11 enables a user to view the plurality of translated components placed into the format derived from the first, or source, content, thereby enabling a user to review how the translated components are rendered in the first content format. The GUI of FIG. 11 further allows a user to highlight any of the plurality of translatable components, which are not yet translated, differently from translated components when previewing the plurality of translated components in the first content format. The GUI ofFIG. 11 further allows a user to display text when hovering over a translated component so as to view the first content corresponding to the translated component. - The GUI as shown
FIG. 11 further enables a user to select at least one of the translated components when previewing the plurality of translated components in the first content format so as to edit the translated component and store the translated component that has been revised with the corresponding unique identifier. The GUI ofFIG. 11 further allows previewing in a multi-user environment so that more than one user can simultaneously view translated components rendered in the first content format. -
WebCATT 408 also provides complete management of the translation process. Web pages may be scheduled for translation either automatically by thetranslation server 400, or manually by a manager via upload of web pages or other type of content to be translated. When a web page is scheduled for translation, it may be placed in the translation queue of a specific customer. Pages to be translated may be scheduled for translation on a priority basis based on pre-defined priority information or using algorithms, such as ones based on the percentage of the page already translated and how often the page is being accessed on the original web server while it's in the translation queue. This allows the most important pages (e.g., most frequently accessed and those with smaller changes) to be translated first. - Once pages are in the queue, a manager can assign them for translation to a specific translator or translation service subcontractor. If assigned to a subcontractor, a subcontractor manager can then assign them to specific translators within the subcontractor organization or even to freelancers that work with them. Proofers can also be assigned. A subcontractor can assign its own proofers to pages and managers can also assign proofers to check the work of translators or subcontractors.
- A web page may go through a series of status changes before it is available via the Internet. The status changes follow a translation workflow that allows translation, editing, proofing, and activation. In some embodiments, only active pages may be made available via the Internet.
- In addition to the page statuses, the text and files within the page may maintain their own translation status. The status for text segments and files may be maintained both at the page level (i.e., one single overall status for all segments in the page and another one for all files in the page) and individually. The status of text segments and files may change following a translation workflow that allows translation, editing, proofing, and activation Translated segments and files may be available via the Internet only after their status is set to active.
-
FIG. 12 is a screenshot of an exemplary WebCATT interface used for viewing a translation queue, in one embodiment of the present teaching.FIG. 12 shows a series of columns wherein a unit of information is provided for each page of theweb site 414 listed on each row.FIG. 12 shows afirst column 1202 including unique page identifiers.Column 1204 includes a URL for each page.Column 1206 includes receipt data for each page.Column 1208 includes a percentage statistic indicating the percentage of the page that has been translated.Column 1210 indicates a status for each page.Column 1212 indicates the contractor assigned to the page. -
FIG. 13 is an operational flow diagram depicting an exemplary process ofWebCATT 408, according to an embodiment of the present teaching. The operational flow diagram ofFIG. 13 depicts the process by whichWebCATT 408, which provides a web based tool for managing language translations of content, queues, and translates components of aweb site 414. The operational flow diagram ofFIG. 13 begins withstep 1302 and flows directly to step 1304. - In
step 1304,WebCATT 408 may retrieve a first content, or HTML, source page, in a first language from theweb site 414. Instep 1306,WebCATT 408 may parse the first content into one or more translatable components. Instep 1308,WebCATT 408 may queue the translatable components for human translation or human edited machine translation into a second language. - In
step 1308, for each of the translatable components it may be determined whether to invoke machine translation. If it is affirmative, then control flows to step 1314. Otherwise, control flows to step 1312. Instep 1312,WebCATT 408 may provide a translatable component for human translation into a second language. Instep 1314,WebCATT 408 may perform machine translation on a translatable component into a second language. Instep 1316,WebCATT 408 may provide the machine translated component for human post-editing. Instep 1318, for each of the translatable components,WebCATT 408 may store a translated component corresponding to the translatable component, thereby storing a plurality of translated components Instep 1320, the control flow ofFIG. 13 stops. -
WebCATT 408 allows translators to work directly with live pages off theweb site 414 being translated. Thus, theclient web site 414 need not send information to thetranslation server 400 for translation. Furthermore, all web pages in a web site may be automatically entered into the translation work queue by theWebCATT 408 andspider 404, as described in greater detail below. -
WebCATT 408 WYSIWYG preview allows translators to see translated web pages, as they would appear on the live web site. This allows the translator to compensate for word growth or shrinkage that knocks a web page layout out of alignment. Furthermore, in some embodiments a translated preview page may be marked-up with special HTML & JavaScript to allow: 1) color coding of all text in the web page so the translator can see what is already translated, what remains to be translated and where the current text segment is located within the page, 2) clicking in text or a file to take the translator to a form to edit the translation for the text or file, and 3) hovering the mouse over a text or file to pop up a window showing the original wording or file. -
WebCATT 408 may parse pages into translatable components and translators only work with such translatable components, not a complex group of HTML files. All non-translatable content, such as HTML and script code, may be hidden when usingWebCATT 408.WebCATT 408 can be utilized via the ASP model and translators can access it via the web. Translated pages can be delivered via thetranslation server 400 or saved as static html pages to be sent to client, wherein links among pages are modified so they reference the translated pages. -
WebCATT 408 also allows management of the translation process. Multiple user access levels are supported: managers, proofers, translators & sub-contractors. Mangers can assign work in the translation queue to translators, proofers and/or subcontractors. Subcontractor managers can in turn sub-assign work to subcontractor translators and proofers. Managers can activate web pages before thetranslation server 400 can deliver them. - A spider is a program that visits web sites and reads their pages and other information in order to create entries for an index such as a search engine index. For example, the major search engines on the Internet all have such a program, which is also known as a “crawler” or a “bot.” Spiders are typically programmed to visit web sites that have been submitted by their owners as new or updated. Entire web sites or specific pages can be selectively visited and indexed. Spiders are named because they usually visit many web sites in parallel at the same time, their “legs” spanning a large area of the “web.” Spiders can crawl through a web site's pages in several ways.
- One way a spider can crawl through a web site is to follow all the hypertext links in each page until all the pages have been read. The spiders for the major search engines on the Internet adhere to the rules of politeness for Web spiders that are specified in a standard for robot exclusion. This standard allows specifying files to be excluded from being indexed. The standard also proscribes a special algorithm for waiting between successive server requests so that the spider doesn't affect web site response time for other users.
- The operations of a spider are in contrast with a normal web browser operated by a human that doesn't automatically follow links other than inline images and URL redirection. The algorithm used by spiders to pick which references to follow strongly depends on the spider's purpose. Index-building spiders usually retrieve a significant proportion of the references. The other extreme is spiders that try to validate the references in a set of documents. These spiders usually do not retrieve any of the links apart from redirections.
-
FIG. 4 shows aspider 404 for use in analyzing and sizing aweb site 414. Thespider 404 is a tool that crawls specific web sites and performs any of a variety of actions. Thespider 404 can crawl a web site in order to populate the WebCATT translation queue with new or updated information. Thespider 404 may also gather content statistics that can be used to provide a monetary quote for deployment of the present teaching. -
FIG. 14 is an operational flow diagram depicting an exemplary process ofspider 404, according to an embodiment of the present teaching. The operational flow diagram ofFIG. 14 depicts the process by whichspider 404, which provides a web based tool for sizing a web site for language translation, retrieves and indexes translatable components of aweb site 414. The operational flow diagram ofFIG. 14 begins withstep 1402 and flows directly to step 1404. - In
step 1404,spider 404 may retrieve a first content, such as an HTML source page, in a first language from theweb site 414. The first content in a first language may be for translation into a second content in a second language. The second web content may be a human translation, or machine translation, or human edited machine translation in a second language of the first web content. Instep 1406,spider 404 may parse the first content into one or more translatable components. A translatable component may include any one of a text segment, an image file with text to be translated, a multimedia file with text or audio to be translated, a file with text to be translated, a file with image with to be translated, a file with audio to be translated, and a file with video and with at least one of text and audio to be translated. - In
step 1410,spider 404 may store the translatable components in thedatabase 406 for human translation, or machine translation, or human edited machine translation into the second language. - In
optional step 1412,spider 404 may queue the translatable components for human translation, or machine translation, or human edited machine translation into a second language. In optional step 1414,spider 404 may provide the translatable components toWebCATT 408 for human translation or human edited machine translation into a second language. Instep 1416,spider 404 may generate statistics based on the translatable components retrieved from theweb site 414. The statistics generated may include, but are not limited to a file count, a page count, a translatable segment count, a unique text segment count, a unique text segment word count, and a word count. Thespider 404 can further generate a web page having a link to each file of theweb site 414. Instep 1418, the control flow ofFIG. 14 stops. - The
spider 404 can be pre-configured for each customer web site so that the use of directive tags and/or attributes is eliminated or minimized. This minimizes the workload of the customer web site's IT personnel. Further, thespider 404 can be separately pre-defined by domain and/or by URL pattern. This allows specifying sections of a web site to be translated without the need for placing directive tags in each web page. - The
spider 404 can be used to update theWebCATT 408 translation work queue. Further,spider 404 can be used to gather statistics about aweb site 414 in order to allow estimating the amount of work involved in translating the web site and pricing accordingly.Spider 404 can summarize word counts, segment counts, file counts and page counts of aweb site 414. Thespider 404 may supplement the functions ofWebCATT 408 by saving all unique text segments and file URLs in thedatabase 406 for later translation into a second language. It can further create an HTML, page containing links to all files ofweb site 414, so the files can reviewed for translation at a later time. - The
spider 404 can emulate a user agent (e.g., a browser) by saving and returning cookies when crawling aweb site 414.Spider 404 can further fill out and submit forms with pre-defined information and is able to establish a session and normalize session ID parameters for e-commerce sites.Spider 404 can further be configured to crawl only specific areas of a web site by defining include/exclude domains and URL patterns.Spider 404 can also be configured to send specific HTTP headers, such as the user-agent (i.e., type of browser).Spider 404 can be executed in a single computer or in distributed mode. In distributed mode, multiple machines work in conjunction to crawl the same web site simultaneously sharing thesame database 406. - Most web sites are continuously updated with new information, but maintaining an alternate language web site up to date presents a challenge when using traditional methods. The system of the present teaching provides various methods to maintain an alternate language web site up to date.
- Automatic maintenance involves automated maintenance of the alternate language web site so as to be maintained in synchronization with the original site with no human intervention or little additional effort. Automatic maintenance may be based on the function of the
translation server 400 that automatically schedules a web page for translation by placing it in theWebCATT 408 translation queue (described in more detail above) in the event a translation cannot be found for one or more text segments or linked files in the page. Thus, the act of viewing a never-before translated or a modified page in the alternate language enables the scheduling of the web page for translation. - There are several ways to take leverage the auto-scheduling function of the
translation server 400. One way involves manual quality assurance review. If a new web page or an updated web page goes through a manual quality assurance process that involves a person reviewing the page before it is released to the live web site, then the quality assurance personnel may simply attempt to view the page in the alternate language during the review process. This will place the new web page in theWebCATT 408 translation queue for translation before the page goes into the production (live) web site. - Another way to take leverage the auto-scheduling function of the
translation server 400 involves thespider agent 404. Thespider agent 404 can be used to crawl a web site, or just portions of a web site, in the alternate language on a regular basis. Crawling the web site in the alternate language is equivalent to a user viewing the site in the alternate language, and thus results in any new or modified pages being placed in theWebCATT 408 translation queue. - This technique can be used for regularly scheduled updates to a web site, which normally happens after hours. For example, if the ABC Widgets web site modifies its sale offerings twice a week, such as on Mondays and Fridays at 12 AM, then the
spider agent 404 can be scheduled to crawl the relevant parts of the site shortly after (e.g., at 12:30 AM) on those days. Around-the- clock translators can then translate the new sale banners so that the alternate language web site is up to date sometime later that morning. - The
spider agent 404 can also be used to regularly (e.g., daily) crawl a web site even when changes are not regularly scheduled. This will guarantee that the alternate language site is in sync with the original language site after every crawl and subsequent translation. - Another way to take leverage the auto-scheduling function of the
translation server 400 involves user access. Even if no manual quality assurance reviews or scheduledspider agent 404 crawls are performed, the alternate language web site may be still automatically maintained up to date over the long term. This is because the first online user that attempts to view a new or modified page in the alternate language may trigger the placement of that page into the WebCATT translation queue. In that case, the online user may see the page in the original language or may see a partially translated page. However, subsequent users that access the page may see the web page in the alternate language after it has been translated. - In addition to automatic maintenance, the present teaching also supports manual maintenance of the alternate language web site so as to be maintained in synchronization with the original site. New information that needs translation can also be manually placed in the translation
queue using WebCATT 408. This can be useful to translate large amounts of data that is available in advance of it being on thelive web site 414. For example, if the ABC Widgets web site updates its web site with new product offerings every Thursday morning, and all product information is available by the previous Tuesday, then all new product data can be manually batched into the translationqueue using WebCATT 408 as soon as it is available so it is fully translated by the time the new web pages go live. New information that needs translation may also be placed in the translation queue via the web service described inFIG. 8(a) . - Population of the
WebCATT 408 translation queue can be performed either by URL or by content. Population by URL means thattranslation server 400 stores only the URL of the page in the queue. The content of the URL may be retrieved afterwards when a translator accesses the page to translate it usingWebCATT 408. Population by URL can present a problem if the content of the page is dependent on session information, such as a session ID present in a query parameter or stored in a cookie. In that case, the session ID in the query parameter may have expired or the session information stored in the cookie may not be present when viewing the page inWebCATT 408. - In some embodiments, session dependent pages can be handled in different ways. For example, a session dependent page can be handled by replicating the session state via cookies and/or updated session parameters or by populating the page by content. Replicating the session state allows the translator to manually re-acquire a session from the original site by entering the session data in
WebCATT 408. Once the session data is entered, it can be used for translating multiple pages. Population by content means thattranslation server 400 stores the full content of the page in the queue. This avoids the session dependence issue, but can result in outdated content. As a result, population by content may be used only for session dependent pages, and population by URL, which guarantees that the content being translated is the latest content, may be used for all other pages. - Access to the
WebCATT 408 translation queue is segmented by customer and prioritized. Pages to be translated may be scheduled for translation on a priority basis based on pre-defined priority information or using algorithms, such as ones based on the percentage of the page already translated and how often the page is being accessed on the original web server while the page is in the translation queue. This allows the most important pages (e.g., most frequently accessed and those with smaller changes) to be translated first. - A file change detection feature can be used to deal with files whose names have been changed. The
translation server 400 andWebCATT 408 can match a file to be translated with its translated file by the URL of the original file. However, it is possible for a file to be changed while its name and location remain the same. In that case, it is possible that an outdated translated file is used for the translation. - To overcome this issue, in some embodiments the
translation server 400 computes a hash-code or checksum based on the binary content of the file and stores it with the URL. Each time a file is presented for translation or at certain intervals, thetranslation server 400 orWebCATT 408 may re-compute the hash-code or checksum and compare it against the stored one. If they match, the file has not changed and the existing translated file can be used as replacement. However, if they do not match, the binary content of the file was changed and the existing file translation cannot be used. In that case, the file may be placed in theWebCATT 408 translation queue so it may be re-translated. -
FIG. 15 is an operational flow diagram depicting an exemplary synchronization process according to an embodiment of the present teaching. The operational flow diagram ofFIG. 15 depicts the automated maintenance process of the alternate language web site so as to be maintained in synchronization with theoriginal web site 414. The operational flow diagram ofFIG. 15 begins withstep 1502 and flows directly to step 1504. - In
step 1504, a first content in a first language, such as an HTML, source page, may be retrieved from theweb site 414. The first content in a first language may be for translation into a second content in a second language. The second web content may be a human translation, or machine translation, or human edited machine translation in a second language of the first web content. Instep 1506, the first content may be parsed into one or more translatable components. - In
step 1510, a corresponding translated component of the second web content may be identified or matched for each translatable component of the first web content. If a translatable component of the first web content is not matched to a translated component of the second web content, instep 1512, the translatable component may be designated for translation into the second language. Inoptional step 1514, the translatable components that weren't matched may be queued for human translation, or machine translation, or human edited machine translation into a second language. Inoptional step 1516, the translatable components that weren't matched may be provided toWebCATT 408 for translation into a second language. Instep 1518, the control flow ofFIG. 15 stops. - A translated website creates value only when a potential customer visits it. Unfortunately, users sometimes fail to notice the alternate language links on the
web site 414. Even when users do see these links, they may be reluctant to click because they believe the experience will be inconsistent or inferior to theorigin web site 414. The system of the present teaching provides a solution to this problem called Preference Selector, which provides different ways to prompt auser 416, whose likely language, country, or currency preference may not be consistent with the web site's native language, country, or currency, to confirm this likely preference when entering a web site.FIG. 17 is an exemplary screenshot of how Preference Selector may be structured on a user agent (e.g., a browser), in one embodiment of the present teaching. Through Preference Selector, auser 416 onweb site 414 can be routed to his/her preferred online experience. As a result, user trust-levels increase and the probability of the user carrying out a transaction on the web site also increases. - In one embodiment of the present teaching, Preference Selector may pop-up only when it has been determined that a
user 416 likely prefers to viewweb site 414 in a language other than the site's native language. Otherwise, Preference Selector may not pop-up. When Preference Selector pops-up and theuser 416 selects a preferred preferences, these preferences may be saved in one more cookies on the user's browser. Preference Selector can then automatically redirect theuser 416 to the preferred alternate language site when the user visits the site again. In addition to facilitating the initial language selection process, Preference Selector can also be displayed on-demand to change these preferences at any time. - In some embodiments, Preference Selector can use the following information, which may be available in an HTTP request sent by the user agent (e.g., a browser) or via other means (e.g., cookies), as its inputs to control the subsequent operation(s):
-
- Request URL
- Referrer URL
- User Agent “Accept-language” header
- User Agent language
- User's IP address
- User's geo-location information
- User's demographic information
- User's online activities history information
- User Agent Language cookie, if previously visited the site
- Preference Selector may be pre-configured with the following information, which may be used to control its operation based on the above inputs:
-
- List of customer domain names to enable Preference Selector for
- List of languages, countries and/or currencies to enable Preference Selector for
- List of referrer domains for each alternate language (e.g., www.terra.com for Spanish)
- List of referrer TLDs (Top Level Domains) for each alternate language (e.g., “.mx” in “google.com.mx”—GOOGLE Mexico—for Spanish)
- List of referrer subdomains for each alternate language (e.g., “espanol” in “espanol.yahoo.com” for Spanish)
- List of referrer keywords or parameters for each alternate language (e.g., the search term “lavadora”—Spanish for “washer”—in the GOOGLE URL www.google.com/search/?h1=en&q=lavadora for Spanish)
-
- List of languages by country, region, or city
- List of affinity languages. (e.g., a French user may prefer to read Spanish before English)
- Preference Selector may be implemented by inserting a link to a Preference Selector JavaScript file in the
web site 414. This eliminates or minimizes the effort from the IT personnel of a customer's web site. For instance, the code to be inserted to link to the JavaScript file can be provided to a customer as part of the “One-Link” Deployment language switching link. The Preference Selector JavaScript file may be provided to work in conjunction with server side logic to provide the pop-up and redirection behavior. -
FIG. 18 is an operational flow diagram depicting an exemplary process of loading Preference Selector, in one embodiment of the present teaching. The operational flow diagram ofFIG. 18 begins withstep 1802 and flows directly to step 1804. Instep 1804, the user agent (e.g., a browser) may load the Preference Selector JavaScript file and execute its logic. Instep 1806, the Preference Selector JavaScript file logic may determine whether the Preference Selector cookie is present for theuser 416. If the Preference Selector cookie is present, then control flows to step 1808. Otherwise, control flows to step 1814. Instep 1808, the value of the cookie may be inspected to determine whether theuser 416 prefers an alternate language site. If it is affirmative, then control flows to step 1810. Otherwise, control flows to step 1822 and the processing stops. Instep 1810, a configuration option that specifies immediate redirection may be checked to determine whether a redirection to the preferred translated site is to be performed. If it is affirmative, then control flows to step 1812. Otherwise, control flows to step 1822 and processing stops. Instep 1812, a JavaScript client side redirection to the translated site may be performed, and theuser 416 may be redirected to the preferred translated site. It is understood that other implementations other than a cookie may also be used to achieve the same function. - In
step 1814, the Preference Selector JavaScript file logic may generate the Preference Selector server-side URL to the Preference Selector application and instruct the user agent (e.g., a browser) to request the URL. Instep 1816, the Preference Selector server-side application may execute its logic. Instep 1818, the Preference Selector server-side application may analyze the inputs provided. Instep 1820, the Preference Selector server-side application may generate a response. -
FIG. 20 is a block diagram depicting an exemplary process of the Preference Selector server-side application request and the response, which is also depicted insteps 1814 through 1820 inFIG. 18 , in one embodiment of the present teaching. Instep 1, the Preference Selector JavaScript file may generate the Preference Selector server-side URL to the Preference Selector Application and instruct the user agent (e.g., a browser) to request the URL. Instep 2, the user agent may send the request to Preference Selector Application.Step 2 shows that the request may include the following additional information: (a) the user's IP address and/or geo-location information, (b) the user' demographic information, such as but not limited to ethnic information, (c) the user's online activity history information, such as but not limited to which language the user has been using to send emails, or what kind of products (e.g., books, CD, etc.) the user has been buying and which language those products are associated with, (d) various HTTP request headers, and (e) specific URL parameters. Instep 3, the Preference Selector Application may utilize the information included in the request and its pre-configured information to generate a response. The response may include displaying the Preference Selector pop-up, redirecting the user to a translated site, or performing no action. Instep 4, the Preference Selector response may be sent back to the user. -
FIG. 19 is an operational flow diagram depicting an exemplary process of the Preference Selector server-side application for analyzing the inputs against the pre-configured information to control its operation and generate a response, in one embodiment of the present teaching. The operational flow diagram ofFIG. 19 begins withstep 1902 when the application receives the request from the user agent (e.g., a browser) and flows directly to step 1904. Instep 1904, it may be determined whether the request comes from a valid Preference Selector domain. If it is affirmative, then control flows to step 1908. Otherwise, control flows to step 1906 and the Preference Selector application may not return any content in this case. - In
step 1908, the presence of Preference Selector cookie may be checked and, if present, it may be determined based on the value of the cookie as to whether theuser 416 prefers an alternate language site. If it is affirmative, then control flows to step 1910. Otherwise, control flows to step 1912. Instep 1910, the Preference Selector application may respond with a server-side redirection to the translated language site. - In
step 1912, the value of an Accept-Language user agent request header may be inspected to determine the user's preferred language and locale. If the first (or primary) language listed therein matches with a configured alternate language, then this primary language may be set as the Preference Selector default language and control flows to step 1914. Otherwise, control flows to step 1916. - In
step 1914, the Preference Selector default language may be compared against a configured list of affinity languages, and if a match is found, the mapping may be applied and control flows to step 1932. For example, if the Preference Selector default language is French, and there is no French website available, but an affinity language has been defined that maps French to Spanish (because aFrench user 416 may prefer to read Spanish before English), then the Preference Selector default language is set to Spanish. Instep 1932, the Preference Selector application may respond with a Preference Selector pop-up, e.g., a welcome pop-up, using the Preference Selector default language as the default selection in the user interface. - In
step 1916, the value of domain name in the referrer user agent request header may be inspected and compared against a configured list of referrer domains to determine whether theuser 416 comes from a website in a configured alternate language. If it is affirmative, then the Preference Selector default language is set to that alternate language and control flows to step 1914. For example, if the referrer domain is “www.terra.com”, which is a well known Internet portal in Spanish, then the Preference Selector default language is set to Spanish. Otherwise, control flows to step 1918. - In
step 1918, the value of the top level domain (TLD) of the domain name in the referrer user agent request header may be inspected and compared against a configured list of TLDs to determine whether theuser 416 came from a website in a configured TLD. If it is affirmative, then the Preference Selector default language is set according to the language configured for that TLD and control flows to step 1914. For example, if the referrer domain is “www.google.com.mx”, which is GOOGLE's website in Mexico, and the TLD “.mx” is mapped to Spanish, then the Preference Selector default language is set to Spanish. Otherwise, control flows to step 1920. - In
step 1920, the value of the subdomain in the domain name in the referrer user agent request header may be inspected and compared against a configured list of subdomains to determine whether theuser 416 came from a website in a configured subdomain. If it is affirmative, then the Preference Selector default language is set according to the language configured for that subdomain and control flows to step 1914. For example, if the referrer domain is “espanol.yahoo.com”, which is YAHOO's portal website in Spanish, and the subdomain “espanol” is mapped to Spanish, then the Preference Selector default language is set to Spanish. Otherwise, control flows to step 1922. - In
step 1922, the value of a keyword or parameter in the in the referrer user agent request header may be inspected and compared against a configured list of keywords or parameters to determine whether theuser 416 is using keywords or parameters associated with an alternate language. If it is affirmative, then the Preference Selector default language is set according to the language configured for that keyword or parameter and control flows to step 1914. For example, if the referrer URL is “https://www.google.com/search?h1=en&q=lavadora” and the search term “lavadora” (which is Spanish for “washer”) in the URL is recognized as a Spanish keyword, then the Preference Selector default language is set to Spanish. Otherwise, control flows to step 1924. - In
step 1924, the value of the Accept-Language user agent request header may be re-inspected to determine the user's secondary language and locale. If a secondary language listed is matched against a configured alternate language, then this secondary language is set as the Preference Selector default language, and control flows to step 1914. Otherwise, control flows to step 1926. - In
step 1926, the value of the user agent language may be inspected to determine the user's user agent language. If the user agent language is matched against a configured alternate language, then the user agent language is set as the Preference Selector default language, and control flows to step 1914. Otherwise, control flows to step 1928. - In
step 1928, the IP address of theuser 416 may be inspected and a geo-location database used to determine the user's geographic location, such as the country, state/region, city, and zip code. If the user's geographic location is matched against a configured mapping of locations to languages, and the language corresponding to the user's location is matched against a configured alternate language, then the location language is set as the Preference Selector default language, and control flows to step 1914. Otherwise, control flows to step 1930. - In addition to the IP address and the geo-location, the user's demographic information and online activity history information may be inspected to determine the user's preferred language. In one embodiment, the demographic information such as the ethnic information may be obtained and inspected. For example, if the user belongs to the Hispanic ethnic group, the preferred language of the user is likely to be Spanish. In another embodiment, the user's online activities may be obtained and inspected. In one example, the language in which the user has been using to send and receive emails may be used to determine the user's preferred language. In another example, the user's online shopping history may be analyzed, for example, the language of the books or CDs that the user has been purchasing. The demographic information and the online activities history information may be obtained from various sources, such as but not limited to cookies, online commercial activities survey agencies, or any suitable sources where the user may supply his/her personal information or preference information (not shown in figures).
- When control flows to step 1930, Preference Selector has been unable to find a default alternate language. In
step 1930, the Preference Selector cookie may be set with a value for the native language of thesite 414. In that case, when theuser 416 returns to thesite 414,steps FIG. 18 will be executed in succession resulting in theuser 416 staying in thenative site 414 without receiving the Preference Selector welcome pop-up, or being redirected to an alternate language site. - In another embodiment of the present teaching, the order in which the inputs are checked in the operational flow diagram of
FIG. 18 is modified according to configuration information. For example, the referrer search keyword checked instep 1922 can be checked before the domain, TLD, and subdomain of the referrer, which may alter the response. In yet another embodiment of the present teaching, some of the inputs may not be checked. - In another embodiment of the present teaching, Preference Selector does not actually pop-up in a window in front of the
native site 414, but instead replaces an existing area in the page. - In another embodiment of the present teaching, Preference Selector allows the
user 416 to select a preferred currency and geographic location (i.e., country or region where theuser 416 is coming from or wants items shipped to). This is useful for websites that offer international service (e.g., global ecommerce, country specific offers or pricing, etc). - In another embodiment of the present teaching, Preference Selector pops-up for all users to a website, regardless of the value of the Preference Selector inputs. In this case, Preference Selector prompts all
users 416 to choose a language, including those users that likely prefer the native language of thesite 414. Preference Selector may also prompt all users to choose a preferred currency and/or geographic location. - In another embodiment of the present teaching, Preference Selector shows the
user 416 customized content according to one or more of the Preference Selector inputs. For example, Preference Selector can display market specific messaging or offers by language or geographic location. Preference Selector may also show a customized offer when auser 416 came from a specific site (i.e., the referring site), or when theuser 416 used specific search keyword(s) to land on the site. - In another embodiment of the present teaching, Preference Selector can redirect the
user 416 to different sites depending on one or more of the Preference Selector inputs. For example, a customer may have two sites that offer the same service (e.g., purchasing train tickets), one for European users and the other for all other users coming from outside Europe. Both of these sites are available in a native language and several other alternate languages. Preference Selector may be configured to redirect theuser 416 to the applicable language version of the appropriate site, depending on where theuser 416 is coming from and the selected preferred language. - In another embodiment of the present teaching, Preference Selector collects data about
user 416 behavior and learns about circumstances under which it should pop-up in the future. If auser 416 chooses an alternate language site via Preference Selector, Preference Selector records information on thatuser 416 that may include (1) the user's IP address and/or geo-location information, (2) the referring site URL and IP address, (3) the country/region of origin of the referring site, (4) the user' demographic information, and (5) the user's online activity history information. If over time a significant number of users coming from the same referring site select the same alternate language via Preference Selector, even if that referring site is not located in a country where that language is commonly used, it is added to Preference Selector's list of referrer sites for which to pop-up Preference Selector with a default selection of that alternate language. If over time a significant number of users located on the same city or region within a country (based on the user's IP address or geo-location information) select the same alternate language via Preference Selector, even if that city/region is not flagged for that alternate language, that city or region is added to Preference Selector's list of locations for which to pop-up Preference Selector with a default selection of that alternate language. - Translating a web site to another language is an important first step in expanding an organization's reach to new foreign markets. However, in order to make a web site culturally suitable to a desired target audience, it is essential that the web site's content is customized, or localized, according to the culture and requirements associated with the targeted audience. Examples of localization include, for example customizing the format of numbers, dates and times; converting currency in accordance with the custom of the local market; and converting units of measurement in accordance with the custom of the local market. Such formatting and conversion capabilities may be performed by the
Translation Server 400 at the time of converting pages from one language to another. In addition, customization can go beyond formatting and conversion in order to provide culturally relevant content to each targeted local market. Such localized content may include, but are not limited to marketing content, product variations, descriptions, and legal language specific to each target local market. The system of the present teaching includes a technology called Content Localizer that enables a web site operator to easily offer content specific to a local market to auser 416. - Content Localizer may comprise two components: a Content Localizer Manager and a Content Localizer Server. The Content Localizer Manager is an application used to define localized content and to manage the process of content localization. The Content Localizer Server is an application responsible for serving the localized content to the
user 416. In some embodiments of the present teaching, the Content Localizer Manager is a web based application with a Graphical User Interface (GUI) interface. -
FIG. 21 is a block diagram illustrating an exemplary system architecture of the Content Localizer, in one embodiment of the present teaching.FIG. 21 shows aweb site 2114 representing a web site in a first language such as English, corresponding to theweb site 414 ofFIG. 4 , which is connected to theInternet 2106 via a web connection.FIG. 21 also shows aTranslation Server 2102, corresponding to theTranslation Server 400 ofFIG. 4 , aContent Localizer Server 2110, and aContent Localizer Manager 2116.FIG. 21 further shows alocalized content database 2100 for storing localized content and the associated conditions for use by theTranslation Server 2102, theContent Localizer Server 2110, and theContent Localizer Manager 2116. -
FIG. 21 also shows auser 2108 that utilizes a web connection to theInternet 2106 to browse and navigate the web pages served by theweb site 2114 in a first language and by theTranslation Server 2102 in a second language. Also shown inFIG. 21 is acontent manager user 2120, who utilizes theContent Localizer Manager 2116 to specify localized content with identifiers and associated conditions. TheTranslation Server 2102, theContent Localizer Server 2110 and theContent Localizer Manager 2116 are each connected toweb servers - In some embodiments, the computer systems for
Translation Server 2102,Content Localizer Server 2110,Content Localizer Manager 2116, andweb servers Translation Server 2102,Content Localizer Server 2110,Content Localizer Manager 2116, andweb servers - The
Content Localizer Manager 2116 may be utilized by users whose role involves managing the content on theweb site 2114 to define localized content for some target markets. TheContent Localizer Manager 2116 allows a user to upload or specify localized content and associate such content with an identifier and a variety of conditions that need to be satisfied before the localized content is to be displayed. The localized content together with its identifier and associated conditions are stored in thelocalized content database 2100. In some embodiments, localized content can include text, one or more graphics, flash files, videos, a chunk of HTML, or JavaScript code, etc. - The identifier may be used to determine where on the site the localized content is to be placed. Different versions of localized content can be associated with the same identifier, but may have different conditions. This allows different versions of the content to be displayed on the same area of the site depending on which conditions are met. A default localized content may also be specified, which may be used when none of the pre-defined conditions are met.
- Examples of conditions to be satisfied in order for the content to be displayed may include:
-
- Publication date and time, which restricts display of content to only on or after the publication date and time
- Expiration date and time, which restricts display of content to only before the expiration date and time
- Local Time, which restricts display of content to only at some specific time of the day, such as in the evening
- Browser, Operating System or Device, which restricts display of content to users in an environment involving specified user agents (e.g., a browser), operating systems (e.g., Windows) or devices (e.g., a smart phone)
- Language, which restricts display of content to users viewing the site in a specific language, such as French
- User's Location, which restricts display of content to users being recognized as coming from a specific location, e.g., a specified country, region, city or postal code, detected based on, e.g., the user's IP address or geo-location information
- Referrer domain, which restricts display of content to users coming from a specific set referring site domains, such as “www.terra.com”, a well-known portal in Spanish
- Referrer TLD (Top Level Domain), which restricts display of content to users coming from a specific set of referring site TLDs, such as “mx” in “google.com.mx” for GOOGLE Mexico
-
- Referrer sub-domain, which restricts display of content to users coming from a specific set of referring site sub-domains, such as “espanol” in “espanol.yahoo.com”
- Referrer keyword or parameter, which restricts display of content to users who used specific keywords or parameters in the referring site URL, such as the search term “lavadora” (Spanish for “washer”) in the Google referring URL www.google.com/search?h1=en&q=lavadora
- URL or content viewed, which restricts display of content to users that view specific URLs within the site, or that view a page that has specific content, such as a specific page title
- User behavior, which restricts display of content to users that exhibit a specific behavior while visiting the site, such as a user browsing for video cameras on a retailer's site
- Search keywords, which restricts display of content to users who perform on-site searches using specific search keywords
- Stored cookie, which restricts display of content to users who have a cookie that specify that they have previously visited the site, or other related sites, and may have specific preferences or have exhibited specific behavior in the past while on the site
- Accept-language header, which restricts display of content to users with a specific set of values in the user agent “Accept-language” header, such as “Accept-Language: es-ve” for a user whose default user agent language and country is Spanish-Venezuela
- Browser default language, which restricts display of content to users with a specific set of values in the user agent default language
- User's demographic information
- User's online activities history information.
- Localized content can be defined to support testing of two or more different versions of localized content associated with the same identifier and the same conditions. In this case, the
Content Localizer Server 2110 may apply the different versions of the localized content among users that meet the associated conditions. In some embodiments, round robin approach may be applied and in other embodiments, the selection of a particular version for a particular user may be made randomly or based on some conditions. In other embodiments, the selection may be based on a specified allocation algorithm. Special requirements such as session persistence may be factored in so that once a user is assigned a specific version of the localized content, that version is continuously applied to the same user by, e.g., saving the information in a Content Localizer cookie, so that on subsequent visits the user is shown the same localized content. - Once localized content is defined via the
Content Localizer Manager 2116 and stored in thelocalized content database 2100, theContent Localizer Server 2110 is responsible for serving the appropriate localized content whenever the conditions are met. -
FIG. 22 is an operational flow diagram depicting an exemplary process of theContent Localizer Server 2110 for generating localized content, in one embodiment of the present teaching. The operational flow diagram ofFIG. 22 begins withstep 2202 and flows directly to step 2204. Instep 2204, theContent Localizer Server 2110 may receive a request, such as an HTTP request, from a user agent (e.g., a browser) for localized content matching a specific identifier. As described later inFIG. 25 , the request may include the identifier and some or all of the following information, which is used as inputs to determine which localized content satisfies the conditions for display: -
- Request URL
- Referrer URL
- User Agent “Accept-language” header
- User Agent default language
- User's IP address
- User's geo-location information
- Language the user is viewing the site in
- One or more cookies associated with the user
- In
step 2206, theContent Localizer Server 2110 may retrieve all localized content and associated conditions from thedatabase 2100 that match the identifier sent in the request. Instep 2208, theContent Localizer Server 2110 may inspect each of the retrieved localized contents and conditions. Instep 2210, theContent Localizer Server 2110 may determine whether the conditions specified in the retrieved localized content match those of the inputs included in the request. - If it is affirmative, then control flows to step 2214. Otherwise, control flows to step 2212. In
step 2214, theContent Localizer Server 2110 may send the matching localized content as response to the request. Instep 2212, theContent Localizer Server 2110 may check whether a default localized content is defined for the identifier. If it is affirmative, then control flows to step 2216. Otherwise, control flows to step 2218. Instep 2216, theContent Localizer Server 2110 may send the default localized content as response to the request. Instep 2218, theContent Localizer Server 2110 may not send localized content as response to the request. Instep 2220, the control flow ofFIG. 22 stops. -
FIG. 23 is an operational flow diagram depicting an exemplary process of theContent Localizer Server 2110 for analyzing the request inputs against the conditions associated with a localized content to determine whether the conditions are met, in one embodiment of the present teaching. If the conditions are met, the process returns an affirmative response, such as true or yes, to signal that the localized content is to be displayed to the user. Otherwise it returns a negative response, such as no or false, to signal that the localized content is not to be displayed. The operational flow diagram ofFIG. 23 specifies in detail the process used to arrive at the determination described instep 2210 ofFIG. 22 . - The operational flow diagram of
FIG. 23 begins withstep 2302 when a retrieved localized content (already matched by its identifier) and its associated conditions is inspected, and flows directly to step 2304. Instep 2304, the publication date and time, expiration date and time, and local time conditions specified in the localized content may be checked against the date and time of the request, and a determination may be made whether these conditions are applicable to the request. If these conditions are not specified or they are applicable, then control flows to step 2308. Otherwise control flows to step 2306. In step 2306 a negative response is returned. - In
step 2308, the language condition specified in the localized content may be checked against the language in which the user is viewing the site, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user is viewing the site in a language specified in this condition, then control flows to step 2310. Otherwise control flows to step 2306. Instep 2310, the user location condition specified in the localized content may be checked against the actual location of the user (which may be determined by the user's IP address, geo-location information or a pre-stored cookie with location information), and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user is in a location specified in this condition, then control flows to step 2312. Otherwise control flows to step 2306. Instep 2312, the stored cookie condition specified in the localized content may be checked against the cookies present in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user has a cookie specified in this condition, then control flows to step 2314. Otherwise control flows to step 2306. - In
step 2314, the referrer domain condition specified in the localized content may be checked against the domain of the referring site in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user comes from a referring site domain specified in this condition, then control flows to step 2316. Otherwise control flows to step 2306. Instep 2316, the referrer TLD condition specified in the localized content may be checked against the TLD of the referring site in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user comes from a referring site TLD specified in this condition, then control flows to step 2318. Otherwise control flows to step 2306. Instep 2318, the referrer sub-domain condition specified in the localized content may be checked against the sub-domain of the referring site in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user comes from a referring site sub-domain specified in this condition, then control flows to step 2320. Otherwise control flows to step 2306. Instep 2320, the referrer keyword or parameter condition specified in the localized content may be checked against the URL of the referring site in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or URL of the referring site contains a keyword or parameter specified in this condition, then control flows to step 2322. Otherwise control flows to step 2306. - In
step 2322, the Accept-language header condition specified in the localized content may be checked against the Accept-language header sent in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the Accept-language header contains a value specified in this condition, then control flows to step 2324. Otherwise control flows to step 2306. Instep 2324, the user agent default language condition specified in the localized content may be checked against the user agent default language sent in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user agent default language contains a value specified in this condition, then control flows to step 2326. Otherwise control flows to step 2306. Instep 2326, the user agent, operating system or device condition specified in the localized content may be checked against the user agent, operating system or device information sent in the request, and a determination may be made whether this condition is applicable to the request. If this condition is not specified or the user agent, operating system or device contain a value specified in this condition, then control flows to step 2328. Otherwise control flows to step 2306. Instep 2328 an affirmative response is returned. - The
Translation Server 2102 may work in conjunction with theContent Localizer Server 2110 to generate the localized content. Each area of a page on theweb site 2114 that contains localized content can be identified via the use of the localized content identifier. This identifier is matched with the identifier defined for the content via theContent Localizer Manager 2116 and stored in thelocalized content database 2100. - For example, below is the HTML of a page containing information about a SONY 46″ television that contains a special offer:
-
<html> <body> <h1>Sony - BRAVIA XBR 46″ Class / 1080p / 240Hz / LCD HDTV</h1> <p>The Sony BRAVIA XBR 46″ flat-panel LCD HDTV provides an ideal centerpiece for yourmultimedia home theater system. </p> <p>Special Offer</p> <p id=“offer”>Buy a Sony BRAVIA XBR before the end of the month and get free shipping! </p> </body> </html> - The above page is on a
web site 2114 that is based in the US and the special offer is targeted to users within the US. However, theweb site 2114 can also be translated to Spanish to serve Spanish speaking communities in the USA and abroad. To be effective in marketing, it is beneficial to localize the product offer to users, e.g., coming from Mexico or Spain who are viewing the site in Spanish. To do so, in some embodiments, the special offer may be localized by defining two different versions of the localized offers via theContent Localizer Manager 2116, as shown in the example below: -
-
- Identifier: special-offer-100
- Content: Buy a SONY BRAVIA XBR before the end of the month and get a free mounting bracket!
- Conditions: Show only to users viewing site in Spanish and coming from Mexico
-
-
- Identifier: special-offer-100
- Content: Buy a SONY BRAVIA XBR before the end of the month and get a free Sony MP3 player!
- Conditions: Show only to users viewing site in Spanish and coming from Spain
- In this illustrated example, both localized offers share the same identifier (“special-offer-100”), but the actual content of the offer and the conditions differ. In this example, the
Content Localizer Server 2110 replaces the US version of the offer (i.e., “Buy a SONY BRAVIA XBR before the end of the month and get free shipping!”) with the Mexican version of the offer (i.e., “Buy a SONY BRAVIA XBR before the end of the month and get a free mounting bracket!”) when a user from Mexico is viewing the site in Spanish. Or if a user from Spain is viewing the site in Spanish, then Content Localizer Server replaces the US offer with the Spain offer (i.e., “Buy a SONY BRAVIA XBR before the end of the month and get a free SONY MP3 player!”). - In order to be able to place localized content on a page, the area of a page that contains the content to be localized may need to be identified. The
Content Localizer Server 2110 may be designed to support different ways to achieve that. In some embodiments, this can be done by wrapping a span or div tag, or another tag, around the content to be localized, which references the identifier assigned to the localized content via theContent Localizer Manager 2116. - The example below shows the use of a span tag that wraps the text of the offer to be localized. The span tag contains an “id” attribute whose value (“special-offer-100”) is the identifier assigned to the offer via the Content Localizer Manager, allowing it to be matched with the corresponding localized content stored in the
localized content database 2100. -
<html> <body> <h1>SONY - BRAVIA XBR 46″ Class / 1080p / 240Hz / LCD HDTV</h1> <p>The SONY BRAVIA XBR 46″ flat-panel LCD HDTV provides an ideal centerpiece for your multimedia home theater system.</p> <p>Special Offer</p> <span id=“localize:special-offer-100”> <p id=“offer”>Buy a Sony BRAVIA XBR before the end of the month and get free shipping! </p> </span> </body> </html> - In other embodiments, the area of the page may also be identified via an exemplary Directive Tag called “mp trans localize.” Below is an example of its use for the above special offer:
-
<!-- mp_trans_localize_start id=“special-offer-100” --> <p id=“offer”>Buy a SONY BRAVIA XBR before the end of the month and get a free mounting bracket! </p> <!-- mp_trans_localize_end --> - In some embodiments, other means, instead of span, div or Directive Tags, may be used to identify the content to be localized on a page. In one embodiment, the localized content can be associated with existing text, or an existing graphic, flash or video file, on a page via the
Content Localizer Manager 2116 and the localized version of the content can be replacement text or a replacement graphic, flash or video file, or even a different type of content that fits in the same area. - In another embodiment, the content to be localized on a page may be identified via a Document Object Model (DOM) traversal syntax, such as XPath. In this case, the tags that enclose the content to be localized are defined via their location within the DOM tree, and there is no need to use span, div or Directive Tags. Below is an example of how the XPath syntax can be used to define the location of the area containing the offer to be localized for the above example product page:
- /html/body/p[id=“offer”]
- The above XPath can be associated with the identifier “special-offer-100” without the need to insert span, div or directive tags containing the identifier in the product page
- And in another embodiment of the present teaching, content to be localized on a page can be identified by pattern matching the content in the page against pre-defined patterns of content within the page, using a pattern matching syntax, such as regular expressions. Below is an example of how a regular expression can be used to define the location of the area containing the offer to be localized for the above example product page:
- <p id=“offer”>(.+)</p>
- To facilitate these different embodiments disclosed herein, the
Content Localizer Manager 2116 may provide a user interface to allow a user to select an area of a page to be customized. Once an area is selected, theContent Localizer Manager 2116 may then identify the actual HTML code that produces the content within the area and generate a DOM traversal path or a pattern match expression that identifies the area within the page. - When areas of the page to be localized are identified, the
Translation Server 2102 may be made capable of recognizing these areas at the time that it parses the page during the process of page conversion from one language to another. - In one embodiment of the present teaching, the
Content Localizer Server 2110 is a separate application whose primary function is to serve localized content. In this case, when theTranslation Server 2102 recognizes an area to be localized at page conversion time theTranslation Server 2102 replaces the content to be localized with HTML code, and/or JavaScript code, and/or other code that is executed on the user agent and generates an HTTP request to theContent Localizer Server 2110 that includes the identifier of the localized content and other request inputs listed in the description ofFIG. 22 . TheContent Localizer Server 2110 then returns the appropriate localized content which may include additional JavaScript or other code executed on the user agent (e.g., a browser) to dynamically insert the localized content in the page. -
FIG. 24 is an operational flow diagram depicting an exemplary process of theTranslation Server 2102 for recognizing the areas of the page to be localized.FIG. 24 describes a Translation Server alternate process flow ofFIG. 6 for recognizing areas to be localized in a page. The operational flow diagram ofFIG. 24 begins withstep 601 and flows directly to step 602.Steps FIG. 6 .Steps FIG. 6 when the determination ofsteps step 632 after the determination ofsteps step 632, it may be determined whether the current component being parsed is a tag or another element that defines the start of an area to be localized, such as the <span id=“localize:special-offer-100”>tag of the above example. - If it is affirmative, then control flows to step 633. Otherwise, control flows to step 627. Step 627 is identical to the same numbered step described in
FIG. 6 . Instep 633, the content following the start area tag may be parsed. Instep 634, it may be determined whether the component being parsed is the localized content area end tag. If it is affirmative, then control flows to step 635. Otherwise, control flows to back to step 633 for further parsing and the component is ignored (i.e., all content parsed within the start and end tags is ignored and it is not output to the translated page). Instep 635, the JavaScript code or other code to be executed on the user agent (e.g., a browser) to generate the request to theContent Localizer Server 2110 may be added to the translated HTML page. This code may include sending in the request the identifier and all other information necessary for theContent Localizer Server 2110 to determine which localized content to serve. -
FIG. 25 is a block diagram depicting an exemplary process of theContent Localizer Server 2110 request and the response, in one embodiment of the present teaching. Instep 1, the JavaScript code or other code added by theTranslation Server 2102 instep 635 ofFIG. 24 may be executed on the user agent (e.g., a browser) of theuser 2108 to generate a request to theContent Localizer Server 2110. Instep 2, the user agent may send the request to the Content Localizer Server.Step 2 shows that the request may include the localized content identifier, and may also include the following additional information: (a) the user's IP address and/or geo-location information, (b) various HTTP request headers, and (c) specific URL parameters. Instep 3, theContent Localizer Server 2110 may utilize the information included in the request to generate localized content, as described inFIG. 22 andFIG. 23 . Instep 4, theContent Localizer Server 2110 response may be sent back to the user. - In another embodiment of the present teaching, the
Content Localizer Server 2110 may be part of the functionality of theTranslation Server 2102. In this case, theTranslation Server 2102 may perform the process flows described inFIGS. 22 and 23 so that when the conditions are met, theTranslation Server 2102 may replace the content to be localized with the localized content in each page at the same time it is converting the page to another language. - For simplicity purposes, the content to be localized in the above example is a string of text. However, as previously mentioned, the content to be localized can be anything within a page, including text, one or more graphics, flash files, videos, a chunk of HTML code, JavaScript code, CSS code, XML, etc. When uploading or entering localized content via the
Content Localizer Manager 2116, it is possible to specify the dimensions of the area the content occupies on the page, which is typically done in pixels. In that case, theContent Localizer Server 2110 may restrict the output of the localized content to the specified dimensions. It is also possible to specify the dimensions of the area the content occupies on the page using the span, div tag, Directive Tag, or other tag, that wraps the content to be localized. For example: - <!--mp_trans_localize_start id=“special-offer-100” width=“900” height=“200”-->
- <p id=“offer”>Buy a SONY BRAVIA XBR before the end of the month and get a free mounting bracket! </p>
- <!--mp_trans_localize_end-->
- The localized content may be uploaded or entered in the native language of the
web site 2114, or in the language of the target audience. If the content is specified in the native language of theweb site 2114, then the content will automatically be entered into the translation workflow of theWebCATT tool 408, so it can be translated into the language of the target audience. This is useful when the localized content is generated by users in the native country of theweb site 2114, which is the US in this example. The localized content may also be specified in the language of the target audience, in which case there is no need for the content to be translated. This is useful when the localized content is generated by users who reside in the country of the target audience. In our example, a user in Mexico whose responsibility includes managing the local content shown to users in Mexico, may directly upload or enter the localized offer for Mexico in Spanish in theContent Localizer Manager 2116. - The present teaching is also useful for a
web site 2114 that has product assortment requirements for different local markets, such as manufacturer restrictions on products that can only be sold in certain countries. In this case, theContent Localizer Server 2110 can accept a periodic data feed with product assortment information for the targeted local markets. This feed may include a list of the all products offered, where each product is flagged with any applicable restrictions, such as shipping restrictions. TheContent Localizer Server 2110 and theTranslation Server 2102 can then use the information from the product feed to perform product specific localizations, which may include: -
- Place a product specific message to inform the
user 416 of the restrictions, such as the message: “This product cannot be shipped to Mexico” - Disable a specific function on the page, such as graying out or removing an “Add to Cart” button when displaying product information for a product that cannot be shipped to a particular region
- Remove all information for a product that cannot be offered for sale in a particular region from a product listing, a product category landing page, or a product search results page for a
user 416 in that particular region
- Place a product specific message to inform the
- The
Translation Server 2102 and theContent Localizer Server 2110 can store a Content Localizer cookie in the user's user agent (e.g., a browser) that contains information that identify the user for localization purposes and includes information on the referring URL, user geo-location data, the conditions that were satisfied by the user and other user preferences and behavior information. - The present teaching may be realized in hardware, software, firmware, or any combination thereof. A system according to one embodiment of the present teaching can be realized in a centralized fashion in one computer system or in a distributed fashion where different elements are spread across several interconnected computer systems. Any kind of computer system—or other apparatus adapted for carrying out the methods described herein—is suited. A typical combination of hardware, software, and firmware could be a general-purpose computer system with a computer program that, when being loaded and executed, controls the computer system such that it carries out the methods described herein.
- An embodiment of the present teaching can also be embedded in a computer program product, which comprises all the features enabling the implementation of the methods described herein, and which—when loaded in a computer system—is able to carry out these methods. Computer program means or computer program as used in the present teaching indicates any expression, in any language, code or notation, of a set of instructions intended to cause a system having an information processing capability to perform a particular function either directly or after either or both of the following a) conversion to another language, code or, notation; and b) reproduction in a different material form.
- A computer system may include, inter alia, one or more computers and at least a computer readable medium, allowing a computer system, to read data, instructions, messages or message packets, and other computer readable information from the computer readable medium. The computer readable medium may include non-volatile memory, such as ROM, Flash memory, Disk drive memory, CD-ROM, and other permanent storage. Additionally, a computer readable medium may include, for example, volatile storage such as RAM, buffers, cache memory, and network circuits. Furthermore, the computer readable medium may comprise computer readable information in a transitory state medium such as a network link and/or a network interface, including a wired network or a wireless network that allow a computer system to read such computer readable information.
-
FIG. 16 is a block diagram of an exemplary computer system useful for implementing the different aspects of the present teaching, such as translation server, preference selector, content localizer, URL translation and optimization, E-mail translation server, human machine cooperated translation, WebCATT, TransScope, TransSync, etc. The computer system includes one or more processors, such asprocessor 1604. Theprocessor 1604 is connected to a communication infrastructure 1602 (e.g., a communications bus, cross-over bar, or network). Various software embodiments are described in terms of this exemplary computer system. After reading this description, it will become apparent to a person of ordinary skill in the relevant art(s) how to implement the teaching using other computer systems and/or computer architectures. - The computer system can include a
display interface 1608 that forwards graphics, text, and other data from the communication infrastructure 1602 (or from a frame buffer not shown) for display on thedisplay unit 1610. The computer system also includes amain memory 1606, preferably random access memory (RAM), and may also include asecondary memory 1612. Thesecondary memory 1612 may include, for example, a hard disk drive 1614 and/or aremovable storage drive 1616, representing a floppy disk drive, a magnetic tape drive, an optical disk drive, etc. Theremovable storage drive 1616 reads from and/or writes to aremovable storage unit 1618 in a manner well known to those having ordinary skill in the art.Removable storage unit 1618, represents a floppy disk, magnetic tape, optical disk, etc. which is read by and written to byremovable storage drive 1616. As will be appreciated, theremovable storage unit 1618 includes a computer usable storage medium having stored therein computer software and/or data. - In alternative embodiments, the
secondary memory 1612 may include other similar means for allowing computer programs or other instructions to be loaded into the computer system. Such means may include, for example, aremovable storage unit 1622 and aninterface 1620. Examples of such may include a program cartridge and cartridge interface (such as that found in video game devices), a removable memory chip (such as an EPROM, or PROM) and associated socket, and otherremovable storage units 1622 andinterfaces 1620 which allow software and data to be transferred from theremovable storage unit 1622 to the computer system. - The computer system may also include a
communications interface 1624.Communications interface 1624 allows software and data to be transferred between the computer system and external devices. Examples ofcommunications interface 1624 may include a modem, a network interface (such as an Ethernet card), a communications port, a PCMCIA slot and card, etc. Software and data transferred viacommunications interface 1624 are in the form of signals which may be, for example, electronic, electromagnetic, optical, or other signals capable of being received bycommunications interface 1624. These signals are provided tocommunications interface 1624 via a communications path (i.e., channel) 1626. This channel 1626 carries signals and may be implemented using wire or cable, fiber optics, a phone line, a cellular phone link, an RF link, and/or other communications channels. - In this document, the terms “computer program medium,” “computer usable medium,” and “computer readable medium” are used to generally refer to media such as
main memory 1606 andsecondary memory 1612,removable storage drive 1616, a hard disk installed in hard disk drive 1614, and signals. These computer program products are means for providing software to the computer system. The computer readable medium allows the computer system to read data, instructions, messages or message packets, and other computer readable information from the computer readable medium. The computer readable medium, for example, may include non-volatile memory, such as Floppy, ROM, Flash memory, Disk drive memory, CD-ROM, and other permanent storage. It is useful, for example, for transporting information, such as data and computer instructions, between computer systems. Furthermore, the computer readable medium may comprise computer readable information in a transitory state medium such as a network link and/or a network interface, including a wired network or a wireless network, that allow a computer to read such computer readable information. - Computer programs (also called computer control logic) are stored in
main memory 1606 and/orsecondary memory 1612. Computer programs may also be received viacommunications interface 1624. Such computer programs, when executed, enable the computer system to perform the features of the present teaching as discussed herein. In particular, the computer programs, when executed, enable theprocessor 1604 to perform the features of the computer system. Accordingly, such computer programs represent controllers of the computer system. - Although specific embodiments of the teaching have been disclosed, those having ordinary skill in the art will understand that changes can be made to the specific embodiments without departing from the spirit and scope of the teaching. The scope of the teaching is not to be restricted, therefore, to the specific embodiments.
- Other concepts relate to unique software for implementing the different aspects of the present teaching, such as translation server, preference selector, content localizer, URL translation and optimization, E-mail translation server, human machine cooperated translation, WebCATT, TransScope, TransSync, etc. A software product, in accord with this concept, includes at least one machine-readable medium and information carried by the medium. The information carried by the medium may be executable program code data regarding web content translation and operational parameters. When such information carried by the medium is read by a machine, it causes the machine to perform programmed functions. In one example, a translation server located connected with the Internet executes instructions recorded on a medium and is capable of receiving a request for content translation, to obtain content in a first language from a publicly accessible source, analyzing the content in the first language, performing necessary translation based on the analysis, and forwarding, via a network, the translated content in a second language to a party that requesting it.
- The hardware elements, operating systems and programming languages of such translation servers are conventional in nature, and it is presumed that those skilled in the art are adequately familiar therewith. Of course, the server functions may be implemented in a distributed fashion on a number of similar or even different platforms, to distribute the processing load. Hence, aspects of the methods of receiving web content translation requests through a common communication port in a server or network device from a variety of client applications, as outlined above, may be embodied in programming.
- Program aspects of the technology may be thought of as “products” or “articles of manufacture” typically in the form of executable code and/or associated data that is carried on or embodied in a type of machine readable medium. Tangible non-transitory “storage” type media include any or all of the memory of the computers, processors or the like, or associated modules thereof, such as various semiconductor memories, tape drives, disk drives and the like, which may provide storage at any time for the software programming.
- All or portions of the software may at times be communicated through the Internet or various other telecommunication networks. Such communications, for example, may enable loading of the software from one computer or processor into another, for example, from a management server or host computer of the network operator or carrier into the platform of the message server or other device implementing a message server or similar functionality. Thus, another type of media that may bear the software elements includes optical, electrical and electromagnetic waves, such as used across physical interfaces between local devices, through wired and optical landline networks and over various air-links. The physical elements that carry such waves, such as wired or wireless links, optical links or the like, also may be considered as media bearing the software. As used herein, unless restricted to tangible “storage” media, terms such as computer or machine “readable medium” refer to any medium that participates in providing instructions to a processor for execution.
- Hence, a machine readable medium may take many forms, including but not limited to, a tangible storage medium, a carrier wave medium or physical transmission medium. Non-volatile storage media include, for example, optical or magnetic disks, such as any of the storage devices in any computer(s) or the like, such as may be used to implement the data aggregator, the customer communication system, etc. shown in the drawings. Volatile storage media include dynamic memory, such as main memory of such a computer platform. Tangible transmission media include coaxial cables; copper wire and fiber optics, including the wires that comprise a bus within a computer system. Carrier-wave transmission media can take the form of electric or electromagnetic signals, or acoustic or light waves such as those generated during radio frequency (RF) and infrared (IR) data communications. Common forms of computer-readable media therefore include for example: a floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, DVD or DVD-ROM, any other optical medium, punch cards paper tape, any other physical storage medium with patterns of holes, a RAM, a PROM and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a carrier wave transporting data or instructions, cables or links transporting such a carrier wave, or any other medium from which a computer can read programming code and/or data. Many of these forms of computer readable media may be involved in carrying one or more sequences of one or more instructions to a processor for execution.
- Those skilled in the art will recognize that the present teachings are amenable to a variety of modifications and/or enhancements. For example, although the message server implementation described above is embodied in a hardware device, it can also be implemented as a software only solution—e.g., requiring installation on an existing server. In addition, a message server or a bind pooling mechanism as disclosed herein can also be implemented as a firmware, firmware/software combination, firmware/hardware combination, or hardware/firmware/software combination.
Claims (12)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/202,405 US20210209185A1 (en) | 2010-07-13 | 2021-03-16 | Dynamic language translation of web site content |
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US36380410P | 2010-07-13 | 2010-07-13 | |
US13/182,059 US9128918B2 (en) | 2010-07-13 | 2011-07-13 | Dynamic language translation of web site content |
US14/817,343 US9858347B2 (en) | 2010-07-13 | 2015-08-04 | Dynamic language translation of web site content |
US15/821,607 US10387517B2 (en) | 2010-07-13 | 2017-11-22 | Dynamic language translation of web site content |
US16/459,842 US10977329B2 (en) | 2010-07-13 | 2019-07-02 | Dynamic language translation of web site content |
US17/202,405 US20210209185A1 (en) | 2010-07-13 | 2021-03-16 | Dynamic language translation of web site content |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/459,842 Continuation US10977329B2 (en) | 2010-07-13 | 2019-07-02 | Dynamic language translation of web site content |
Publications (1)
Publication Number | Publication Date |
---|---|
US20210209185A1 true US20210209185A1 (en) | 2021-07-08 |
Family
ID=44628956
Family Applications (24)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/182,120 Active 2033-07-22 US9213685B2 (en) | 2010-07-13 | 2011-07-13 | Dynamic language translation of web site content |
US13/182,059 Active 2033-06-08 US9128918B2 (en) | 2010-07-13 | 2011-07-13 | Dynamic language translation of web site content |
US13/182,139 Active 2033-08-04 US9465782B2 (en) | 2010-07-13 | 2011-07-13 | Dynamic language translation of web site content |
US13/182,118 Active 2031-08-24 US9411793B2 (en) | 2010-07-13 | 2011-07-13 | Dynamic language translation of web site content |
US13/182,080 Active US9864809B2 (en) | 2010-07-13 | 2011-07-13 | Dynamic language translation of web site content |
US13/944,356 Active US9311287B2 (en) | 2010-07-13 | 2013-07-17 | Dynamic language translation of web site content |
US14/322,016 Active 2033-07-09 US10089400B2 (en) | 2010-07-13 | 2014-07-02 | Dynamic language translation of web site content |
US14/817,343 Active 2031-07-22 US9858347B2 (en) | 2010-07-13 | 2015-08-04 | Dynamic language translation of web site content |
US14/932,025 Active US10210271B2 (en) | 2010-07-13 | 2015-11-04 | Dynamic language translation of web site content |
US15/058,257 Active 2032-01-27 US10146884B2 (en) | 2010-07-13 | 2016-03-02 | Dynamic language translation of web site content |
US15/189,081 Active 2032-05-27 US10296651B2 (en) | 2010-07-13 | 2016-06-22 | Dynamic language translation of web site content |
US15/252,810 Active US10073917B2 (en) | 2010-07-13 | 2016-08-31 | Dynamic language translation of web site content |
US15/821,607 Active US10387517B2 (en) | 2010-07-13 | 2017-11-22 | Dynamic language translation of web site content |
US15/827,018 Abandoned US20180081890A1 (en) | 2010-07-13 | 2017-11-30 | Dynamic Language Translation of Web Site Content |
US16/047,111 Active US10936690B2 (en) | 2010-07-13 | 2018-07-27 | Dynamic language translation of web site content |
US16/051,944 Active US10922373B2 (en) | 2010-07-13 | 2018-08-01 | Dynamic language translation of web site content |
US16/164,994 Active 2031-12-30 US11030267B2 (en) | 2010-07-13 | 2018-10-19 | Dynamic language translation of web site content |
US16/220,300 Abandoned US20190121831A1 (en) | 2010-07-13 | 2018-12-14 | Dynamic language translation of web site content |
US16/373,821 Active 2032-03-23 US11157581B2 (en) | 2010-07-13 | 2019-04-03 | Dynamic language translation of web site content |
US16/459,842 Active US10977329B2 (en) | 2010-07-13 | 2019-07-02 | Dynamic language translation of web site content |
US17/141,455 Active 2031-08-12 US11409828B2 (en) | 2010-07-13 | 2021-01-05 | Dynamic language translation of web site content |
US17/141,401 Active US11481463B2 (en) | 2010-07-13 | 2021-01-05 | Dynamic language translation of web site content |
US17/202,405 Abandoned US20210209185A1 (en) | 2010-07-13 | 2021-03-16 | Dynamic language translation of web site content |
US17/313,171 Abandoned US20210256082A1 (en) | 2010-07-13 | 2021-05-06 | Dynamic language translation of web site content |
Family Applications Before (22)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/182,120 Active 2033-07-22 US9213685B2 (en) | 2010-07-13 | 2011-07-13 | Dynamic language translation of web site content |
US13/182,059 Active 2033-06-08 US9128918B2 (en) | 2010-07-13 | 2011-07-13 | Dynamic language translation of web site content |
US13/182,139 Active 2033-08-04 US9465782B2 (en) | 2010-07-13 | 2011-07-13 | Dynamic language translation of web site content |
US13/182,118 Active 2031-08-24 US9411793B2 (en) | 2010-07-13 | 2011-07-13 | Dynamic language translation of web site content |
US13/182,080 Active US9864809B2 (en) | 2010-07-13 | 2011-07-13 | Dynamic language translation of web site content |
US13/944,356 Active US9311287B2 (en) | 2010-07-13 | 2013-07-17 | Dynamic language translation of web site content |
US14/322,016 Active 2033-07-09 US10089400B2 (en) | 2010-07-13 | 2014-07-02 | Dynamic language translation of web site content |
US14/817,343 Active 2031-07-22 US9858347B2 (en) | 2010-07-13 | 2015-08-04 | Dynamic language translation of web site content |
US14/932,025 Active US10210271B2 (en) | 2010-07-13 | 2015-11-04 | Dynamic language translation of web site content |
US15/058,257 Active 2032-01-27 US10146884B2 (en) | 2010-07-13 | 2016-03-02 | Dynamic language translation of web site content |
US15/189,081 Active 2032-05-27 US10296651B2 (en) | 2010-07-13 | 2016-06-22 | Dynamic language translation of web site content |
US15/252,810 Active US10073917B2 (en) | 2010-07-13 | 2016-08-31 | Dynamic language translation of web site content |
US15/821,607 Active US10387517B2 (en) | 2010-07-13 | 2017-11-22 | Dynamic language translation of web site content |
US15/827,018 Abandoned US20180081890A1 (en) | 2010-07-13 | 2017-11-30 | Dynamic Language Translation of Web Site Content |
US16/047,111 Active US10936690B2 (en) | 2010-07-13 | 2018-07-27 | Dynamic language translation of web site content |
US16/051,944 Active US10922373B2 (en) | 2010-07-13 | 2018-08-01 | Dynamic language translation of web site content |
US16/164,994 Active 2031-12-30 US11030267B2 (en) | 2010-07-13 | 2018-10-19 | Dynamic language translation of web site content |
US16/220,300 Abandoned US20190121831A1 (en) | 2010-07-13 | 2018-12-14 | Dynamic language translation of web site content |
US16/373,821 Active 2032-03-23 US11157581B2 (en) | 2010-07-13 | 2019-04-03 | Dynamic language translation of web site content |
US16/459,842 Active US10977329B2 (en) | 2010-07-13 | 2019-07-02 | Dynamic language translation of web site content |
US17/141,455 Active 2031-08-12 US11409828B2 (en) | 2010-07-13 | 2021-01-05 | Dynamic language translation of web site content |
US17/141,401 Active US11481463B2 (en) | 2010-07-13 | 2021-01-05 | Dynamic language translation of web site content |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/313,171 Abandoned US20210256082A1 (en) | 2010-07-13 | 2021-05-06 | Dynamic language translation of web site content |
Country Status (3)
Country | Link |
---|---|
US (24) | US9213685B2 (en) |
EP (6) | EP2593884A2 (en) |
WO (1) | WO2012009441A2 (en) |
Families Citing this family (202)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8335300B2 (en) | 2003-06-26 | 2012-12-18 | International Business Machines Corporation | Personalizing computerized customer service |
US10319252B2 (en) * | 2005-11-09 | 2019-06-11 | Sdl Inc. | Language capability assessment and training apparatus and techniques |
US9122674B1 (en) | 2006-12-15 | 2015-09-01 | Language Weaver, Inc. | Use of annotations in statistical machine translation |
US8831928B2 (en) * | 2007-04-04 | 2014-09-09 | Language Weaver, Inc. | Customizable machine translation service |
US8825466B1 (en) | 2007-06-08 | 2014-09-02 | Language Weaver, Inc. | Modification of annotated bilingual segment pairs in syntax-based machine translation |
US20120284015A1 (en) * | 2008-01-28 | 2012-11-08 | William Drewes | Method for Increasing the Accuracy of Subject-Specific Statistical Machine Translation (SMT) |
US8793614B2 (en) * | 2008-05-23 | 2014-07-29 | Aol Inc. | History-based tracking of user preference settings |
US9798720B2 (en) | 2008-10-24 | 2017-10-24 | Ebay Inc. | Hybrid machine translation |
US8990064B2 (en) | 2009-07-28 | 2015-03-24 | Language Weaver, Inc. | Translating documents based on content |
US10417646B2 (en) | 2010-03-09 | 2019-09-17 | Sdl Inc. | Predicting the cost associated with translating textual content |
US9213685B2 (en) * | 2010-07-13 | 2015-12-15 | Motionpoint Corporation | Dynamic language translation of web site content |
US9081864B2 (en) * | 2010-08-04 | 2015-07-14 | Microsoft Technology Licensing, Llc | Late resource localization binding for web services |
US8924883B2 (en) * | 2010-09-28 | 2014-12-30 | Lenovo Enterprise Solutions (Singapore) Pte. Ltd. | Content presentation utilizing moveable fly-over on-demand user interfaces |
US8533051B2 (en) | 2010-10-27 | 2013-09-10 | Nir Platek | Multi-language multi-platform E-commerce management system |
US9164988B2 (en) * | 2011-01-14 | 2015-10-20 | Lionbridge Technologies, Inc. | Methods and systems for the dynamic creation of a translated website |
US9547626B2 (en) | 2011-01-29 | 2017-01-17 | Sdl Plc | Systems, methods, and media for managing ambient adaptability of web applications and web services |
US10657540B2 (en) | 2011-01-29 | 2020-05-19 | Sdl Netherlands B.V. | Systems, methods, and media for web content management |
NL2006294C2 (en) * | 2011-02-24 | 2012-08-27 | Exvo Com Group B V | Website translator, system, and method. |
US10580015B2 (en) | 2011-02-25 | 2020-03-03 | Sdl Netherlands B.V. | Systems, methods, and media for executing and optimizing online marketing initiatives |
US9015030B2 (en) * | 2011-04-15 | 2015-04-21 | International Business Machines Corporation | Translating prompt and user input |
US11003838B2 (en) | 2011-04-18 | 2021-05-11 | Sdl Inc. | Systems and methods for monitoring post translation editing |
EP2704025B1 (en) * | 2011-04-28 | 2017-12-27 | Rakuten, Inc. | Browsing system, terminal, image server, program, computer-readable recording medium recording said program, and method |
US8732569B2 (en) | 2011-05-04 | 2014-05-20 | Google Inc. | Predicting user navigation events |
US8600733B1 (en) | 2011-05-31 | 2013-12-03 | Google Inc. | Language selection using language indicators |
US9769285B2 (en) | 2011-06-14 | 2017-09-19 | Google Inc. | Access to network content |
US8788711B2 (en) | 2011-06-14 | 2014-07-22 | Google Inc. | Redacting content and inserting hypertext transfer protocol (HTTP) error codes in place thereof |
US9104744B2 (en) | 2011-06-30 | 2015-08-11 | Google Inc. | Cluster-based language detection |
US8928591B2 (en) | 2011-06-30 | 2015-01-06 | Google Inc. | Techniques for providing a user interface having bi-directional writing tools |
US8788259B1 (en) | 2011-06-30 | 2014-07-22 | Google Inc. | Rules-based language detection |
US9298698B2 (en) * | 2011-06-30 | 2016-03-29 | Google Inc. | Language detection based upon a social graph |
US8838437B1 (en) * | 2011-06-30 | 2014-09-16 | Google Inc. | Language classifiers for language detection |
US8745212B2 (en) | 2011-07-01 | 2014-06-03 | Google Inc. | Access to network content |
US8650139B2 (en) | 2011-07-01 | 2014-02-11 | Google Inc. | Predicting user navigation events |
US8566696B1 (en) * | 2011-07-14 | 2013-10-22 | Google Inc. | Predicting user navigation events |
US8744988B1 (en) | 2011-07-15 | 2014-06-03 | Google Inc. | Predicting user navigation events in an internet browser |
US9800657B2 (en) * | 2011-08-16 | 2017-10-24 | Empire Technology Development Llc | Allocating data to plurality storage devices |
US9223891B2 (en) * | 2011-09-15 | 2015-12-29 | Citicorp Credit Services, Inc. (Usa) | Methods and systems for dynamically generating and reusing dynamic web content |
US8655819B1 (en) | 2011-09-15 | 2014-02-18 | Google Inc. | Predicting user navigation events based on chronological history data |
US9525900B2 (en) * | 2011-09-15 | 2016-12-20 | Google Inc. | Video management system |
US8600921B2 (en) | 2011-09-15 | 2013-12-03 | Google Inc. | Predicting user navigation events in a browser using directed graphs |
US9213686B2 (en) * | 2011-10-04 | 2015-12-15 | Wfh Properties Llc | System and method for managing a form completion process |
WO2013052601A1 (en) * | 2011-10-04 | 2013-04-11 | Chegg, Inc. | Electronic content management and delivery platform |
US9104664B1 (en) | 2011-10-07 | 2015-08-11 | Google Inc. | Access to search results |
US9465799B2 (en) * | 2011-10-10 | 2016-10-11 | Red Hat, Inc. | Server-side internationalization and localization of web applications using a scripting language |
US8886515B2 (en) | 2011-10-19 | 2014-11-11 | Language Weaver, Inc. | Systems and methods for enhancing machine translation post edit review processes |
US8781811B1 (en) * | 2011-10-21 | 2014-07-15 | Google Inc. | Cross-application centralized language preferences |
US8805987B1 (en) * | 2011-11-29 | 2014-08-12 | Google Inc. | Ensuring a cookie-less namespace |
US9584579B2 (en) | 2011-12-01 | 2017-02-28 | Google Inc. | Method and system for providing page visibility information |
US8700691B2 (en) * | 2011-12-05 | 2014-04-15 | Microsoft Corporation | Minimal download and simulated page navigation features |
US9342615B2 (en) | 2011-12-07 | 2016-05-17 | Google Inc. | Reducing redirects |
US9069732B2 (en) | 2011-12-29 | 2015-06-30 | Chegg, Inc. | Automated document conversion testing |
US8769404B2 (en) * | 2012-01-03 | 2014-07-01 | International Business Machines Corporation | Rule-based locale definition generation for a new or customized locale support |
US10289743B2 (en) * | 2012-01-19 | 2019-05-14 | Microsoft Technology Licensing, Llc | Client-side minimal download and simulated page navigation features |
US8793235B2 (en) | 2012-01-19 | 2014-07-29 | Google Inc. | System and method for improving access to search results |
US9846605B2 (en) | 2012-01-19 | 2017-12-19 | Microsoft Technology Licensing, Llc | Server-side minimal download and error failover |
US9658998B2 (en) * | 2012-02-24 | 2017-05-23 | American Express Travel Related Services Company, Inc. | Systems and methods for internationalization and localization |
US9251223B2 (en) * | 2012-02-29 | 2016-02-02 | Google Inc. | Alternative web pages suggestion based on language |
US8942973B2 (en) * | 2012-03-09 | 2015-01-27 | Language Weaver, Inc. | Content page URL translation |
US20150161112A1 (en) * | 2012-04-13 | 2015-06-11 | Google Inc. | One click localization techniques |
WO2013163964A1 (en) * | 2012-05-02 | 2013-11-07 | Xcremeno Ltd | Website translation delivery and manipulation |
US9773270B2 (en) | 2012-05-11 | 2017-09-26 | Fredhopper B.V. | Method and system for recommending products based on a ranking cocktail |
US9946792B2 (en) | 2012-05-15 | 2018-04-17 | Google Llc | Access to network content |
US10261994B2 (en) | 2012-05-25 | 2019-04-16 | Sdl Inc. | Method and system for automatic management of reputation of translators |
US20130326347A1 (en) * | 2012-05-31 | 2013-12-05 | Microsoft Corporation | Application language libraries for managing computing environment languages |
US9639676B2 (en) | 2012-05-31 | 2017-05-02 | Microsoft Technology Licensing, Llc | Login interface selection for computing environment user login |
CN102693322B (en) | 2012-06-01 | 2014-10-22 | 杭州海康威视数字技术股份有限公司 | Multi-language supporting webpage processing method, webpage loading method and systems |
US9672209B2 (en) | 2012-06-21 | 2017-06-06 | International Business Machines Corporation | Dynamic translation substitution |
CN103532995B (en) * | 2012-07-03 | 2017-07-25 | 百度在线网络技术(北京)有限公司 | Renewal of the page based reminding method, system and device |
US20140039871A1 (en) * | 2012-08-02 | 2014-02-06 | Richard Henry Dana Crawford | Synchronous Texts |
US8887239B1 (en) | 2012-08-08 | 2014-11-11 | Google Inc. | Access to network content |
US9569410B2 (en) | 2012-08-13 | 2017-02-14 | Chegg, Inc. | Multilayered document distribution in multiscreen systems |
US9304990B2 (en) * | 2012-08-20 | 2016-04-05 | International Business Machines Corporation | Translation of text into multiple languages |
US9021536B2 (en) * | 2012-09-06 | 2015-04-28 | Stream Translations, Ltd. | Process for subtitling streaming video content |
US10452740B2 (en) | 2012-09-14 | 2019-10-22 | Sdl Netherlands B.V. | External content libraries |
US11308528B2 (en) * | 2012-09-14 | 2022-04-19 | Sdl Netherlands B.V. | Blueprinting of multimedia assets |
US11386186B2 (en) | 2012-09-14 | 2022-07-12 | Sdl Netherlands B.V. | External content library connector systems and methods |
US20140081618A1 (en) * | 2012-09-17 | 2014-03-20 | Salesforce.Com, Inc. | Designing a website to be displayed in multiple languages |
US9141722B2 (en) | 2012-10-02 | 2015-09-22 | Google Inc. | Access to network content |
JP2014089637A (en) * | 2012-10-31 | 2014-05-15 | International Business Maschines Corporation | Method, computer, and computer program for determining translations corresponding to words or phrases in image data to be translated differently |
US9330402B2 (en) | 2012-11-02 | 2016-05-03 | Intuit Inc. | Method and system for providing a payroll preparation platform with user contribution-based plug-ins |
CA2851585C (en) | 2012-11-06 | 2020-09-01 | Lance Saleme | Stack-based adaptive localization and internationalization of applications |
US20140136948A1 (en) * | 2012-11-09 | 2014-05-15 | Microsoft Corporation | Taxonomy Driven Page Model |
US20140142917A1 (en) * | 2012-11-19 | 2014-05-22 | Lindsay D'Penha | Routing of machine language translation to human language translator |
US9152622B2 (en) | 2012-11-26 | 2015-10-06 | Language Weaver, Inc. | Personalized machine translation via online adaptation |
WO2014087704A1 (en) * | 2012-12-06 | 2014-06-12 | 楽天株式会社 | Input support device, input support method, and input support program |
US10296968B2 (en) | 2012-12-07 | 2019-05-21 | United Parcel Service Of America, Inc. | Website augmentation including conversion of regional content |
US10410257B1 (en) * | 2012-12-18 | 2019-09-10 | Nativo, Inc. | Native online ad creation |
US11222362B2 (en) * | 2013-01-15 | 2022-01-11 | Motionpoint Corporation | Dynamic determination of localization source for web site content |
TW201430593A (en) * | 2013-01-25 | 2014-08-01 | Hon Hai Prec Ind Co Ltd | System and method for converting multi-language webpage |
US9916295B1 (en) * | 2013-03-15 | 2018-03-13 | Richard Henry Dana Crawford | Synchronous context alignments |
US10356461B2 (en) | 2013-03-15 | 2019-07-16 | adRise, Inc. | Adaptive multi-device content generation based on associated internet protocol addressing |
US10887421B2 (en) * | 2013-03-15 | 2021-01-05 | Tubi, Inc. | Relevant secondary-device content generation based on associated internet protocol addressing |
US9069759B2 (en) * | 2013-03-15 | 2015-06-30 | One Hour Translation, Ltd. | System and method for website tranlsations |
US10594763B2 (en) | 2013-03-15 | 2020-03-17 | adRise, Inc. | Platform-independent content generation for thin client applications |
US20150039599A1 (en) * | 2013-08-01 | 2015-02-05 | Go Daddy Operating Company, LLC | Methods and systems for recommending top level and second level domains |
US9922351B2 (en) | 2013-08-29 | 2018-03-20 | Intuit Inc. | Location-based adaptation of financial management system |
US9372672B1 (en) * | 2013-09-04 | 2016-06-21 | Tg, Llc | Translation in visual context |
US9547641B2 (en) | 2013-09-26 | 2017-01-17 | International Business Machines Corporation | Domain specific salient point translation |
US10496709B2 (en) * | 2013-10-01 | 2019-12-03 | AsterionDB, Inc. | Systems, methods and program instructions for calling a database function with a URL |
US9213694B2 (en) | 2013-10-10 | 2015-12-15 | Language Weaver, Inc. | Efficient online domain adaptation |
WO2015077894A1 (en) * | 2013-11-29 | 2015-06-04 | 1033759 Alberta Ltd. | System and method for generating and publishing electronic content from predetermined templates |
US9639526B2 (en) * | 2014-01-10 | 2017-05-02 | Microsoft Technology Licensing, Llc | Mobile language translation of web content |
US9530161B2 (en) * | 2014-02-28 | 2016-12-27 | Ebay Inc. | Automatic extraction of multilingual dictionary items from non-parallel, multilingual, semi-structured data |
US20150261880A1 (en) * | 2014-03-15 | 2015-09-17 | Google Inc. | Techniques for translating user interfaces of web-based applications |
US10140627B2 (en) | 2014-03-26 | 2018-11-27 | Excalibur Ip, Llc | Xpath related and other techniques for use in native advertisement placement |
US10269048B2 (en) * | 2014-03-26 | 2019-04-23 | Excalibur Ip, Llc | Xpath related and other techniques for use in native advertisement placement |
US9361635B2 (en) | 2014-04-14 | 2016-06-07 | Yahoo! Inc. | Frequent markup techniques for use in native advertisement placement |
US9130882B1 (en) * | 2014-05-05 | 2015-09-08 | Priceline.Com Llc | Dynamic assignment of a target web page based on request context |
US9690780B2 (en) | 2014-05-23 | 2017-06-27 | International Business Machines Corporation | Document translation based on predictive use |
US9906621B2 (en) | 2014-06-03 | 2018-02-27 | Google Llc | Providing language recommendations |
US10127244B2 (en) * | 2014-06-04 | 2018-11-13 | Harris Corporation | Systems and methods for dynamic data storage |
US20160004783A1 (en) * | 2014-07-01 | 2016-01-07 | EveryMundo, LLC | Automated generation of web site entry pages |
US9965466B2 (en) * | 2014-07-16 | 2018-05-08 | United Parcel Service Of America, Inc. | Language content translation |
KR102365160B1 (en) * | 2014-07-31 | 2022-02-21 | 삼성전자주식회사 | Method, apparatus and system for providing translated contents |
WO2016018004A1 (en) * | 2014-07-31 | 2016-02-04 | Samsung Electronics Co., Ltd. | Method, apparatus, and system for providing translated content |
US10949904B2 (en) * | 2014-10-04 | 2021-03-16 | Proz.Com | Knowledgebase with work products of service providers and processing thereof |
GB2532763A (en) | 2014-11-27 | 2016-06-01 | Ibm | Displaying an application in the graphical user interface of a computer display |
US9798716B2 (en) * | 2014-12-10 | 2017-10-24 | James E. Niles | Internet of things language setting system |
US10261996B2 (en) * | 2014-12-19 | 2019-04-16 | Dropbox, Inc. | Content localization using fallback translations |
US10452786B2 (en) * | 2014-12-29 | 2019-10-22 | Paypal, Inc. | Use of statistical flow data for machine translations between different languages |
US9448776B1 (en) * | 2015-01-08 | 2016-09-20 | AppNotch LLC | Method and apparatus for converting a website into a native mobile application |
CN107646191B (en) * | 2015-01-30 | 2020-08-25 | Idac控股公司 | Method and system for anchoring hypertext transfer protocol (HTTP) -level services in information-centric networking (ICN) |
EP3281100A4 (en) * | 2015-04-08 | 2018-10-17 | Lisuto KK | Data transformation system and method |
CN104820680B (en) * | 2015-04-17 | 2018-04-06 | 南京大学 | A kind of universal distributed reptile scheduling system |
US10409810B2 (en) | 2015-05-08 | 2019-09-10 | International Business Machines Corporation | Generating multilingual queries |
US20160366234A1 (en) * | 2015-06-10 | 2016-12-15 | Ricoh Company, Ltd. | Data process system, data process apparatus, and data process method |
DE112015006710T5 (en) * | 2015-07-15 | 2018-04-12 | Mitsubishi Electric Corporation | Display control device and display control method |
US10929857B2 (en) * | 2015-08-28 | 2021-02-23 | Zig-Zag, Inc. | Assistance method for assisting in provision of EC abroad, and program or assistance server for assistance method |
US9922028B2 (en) * | 2015-09-17 | 2018-03-20 | Oslabs Pte. Ltd. | System and method for translation and localization of content in digital applications |
US10075482B2 (en) | 2015-09-25 | 2018-09-11 | International Business Machines Corporation | Multiplexed, multimodal conferencing |
KR20180069813A (en) * | 2015-10-16 | 2018-06-25 | 알리바바 그룹 홀딩 리미티드 | Title display method and apparatus |
US10614167B2 (en) * | 2015-10-30 | 2020-04-07 | Sdl Plc | Translation review workflow systems and methods |
US9851871B2 (en) * | 2015-11-25 | 2017-12-26 | International Business Machines Corporation | Browser bookmarking for multiple environments |
US9767011B2 (en) | 2015-12-01 | 2017-09-19 | International Business Machines Corporation | Globalization testing management using a set of globalization testing operations |
US9740601B2 (en) * | 2015-12-01 | 2017-08-22 | International Business Machines Corporation | Globalization testing management service configuration |
JP2017120563A (en) * | 2015-12-28 | 2017-07-06 | 青島海爾洗衣机有限公司QingDao Haier Washing Machine Co.,Ltd. | Laundry system |
US9659010B1 (en) * | 2015-12-28 | 2017-05-23 | International Business Machines Corporation | Multiple language screen capture |
US9824332B1 (en) * | 2017-04-12 | 2017-11-21 | eTorch Inc. | Email data collection compliance enforcement |
WO2017158538A1 (en) * | 2016-03-16 | 2017-09-21 | Patil Mrityunjay | A method and a system for enabling an user to consume a video or audio content understandable with respect to a preferred language |
US10412162B2 (en) | 2016-04-15 | 2019-09-10 | Ebay Inc. | Adopting data across different sites |
CN105825851B (en) * | 2016-05-17 | 2020-07-21 | Tcl科技集团股份有限公司 | Voice control method and system based on Android system |
US10409623B2 (en) * | 2016-05-27 | 2019-09-10 | Microsoft Technology Licensing, Llc | Graphical user interface for localizing a computer program using context data captured from the computer program |
US10762099B2 (en) * | 2016-06-07 | 2020-09-01 | International Business Machines Corporation | Syntactical transformation of database interaction statements |
CN107547671A (en) * | 2016-06-29 | 2018-01-05 | 中兴通讯股份有限公司 | A kind of URL matching process and device |
US10303777B2 (en) * | 2016-08-08 | 2019-05-28 | Netflix, Inc. | Localization platform that leverages previously translated content |
WO2018052906A1 (en) * | 2016-09-13 | 2018-03-22 | Sophistio, Inc. | Automatic wearable item classification systems and methods based upon normalized depictions |
US10275459B1 (en) | 2016-09-28 | 2019-04-30 | Amazon Technologies, Inc. | Source language content scoring for localizability |
US10229113B1 (en) * | 2016-09-28 | 2019-03-12 | Amazon Technologies, Inc. | Leveraging content dimensions during the translation of human-readable languages |
US10223356B1 (en) | 2016-09-28 | 2019-03-05 | Amazon Technologies, Inc. | Abstraction of syntax in localization through pre-rendering |
US10235362B1 (en) | 2016-09-28 | 2019-03-19 | Amazon Technologies, Inc. | Continuous translation refinement with automated delivery of re-translated content |
US10261995B1 (en) | 2016-09-28 | 2019-04-16 | Amazon Technologies, Inc. | Semantic and natural language processing for content categorization and routing |
US10217453B2 (en) * | 2016-10-14 | 2019-02-26 | Soundhound, Inc. | Virtual assistant configured by selection of wake-up phrase |
CN106484687A (en) * | 2016-11-01 | 2017-03-08 | 深圳市歪果仁科技有限公司 | A kind of translation on line system and method based on mobile Internet |
GB2558062A (en) * | 2016-11-18 | 2018-07-04 | Lionbridge Tech Inc | Collection strategies that facilitate arranging portions of documents into content collections |
US10489267B2 (en) * | 2016-11-21 | 2019-11-26 | Vmware, Inc. | Taking an action in response to detecting an unsupported language in a log |
US10325027B2 (en) * | 2017-02-07 | 2019-06-18 | International Business Machines Corporation | Changing a language for a user session replay |
WO2018167960A1 (en) * | 2017-03-17 | 2018-09-20 | ヤマハ株式会社 | Speech processing device, speech processing system, speech processing method, and speech processing program |
US10268674B2 (en) * | 2017-04-10 | 2019-04-23 | Dell Products L.P. | Linguistic intelligence using language validator |
US10795799B2 (en) | 2017-04-18 | 2020-10-06 | Salesforce.Com, Inc. | Website debugger for natural language translation and localization |
US10489513B2 (en) | 2017-04-19 | 2019-11-26 | Salesforce.Com, Inc. | Web application localization |
US10437935B2 (en) * | 2017-04-18 | 2019-10-08 | Salesforce.Com, Inc. | Natural language translation and localization |
US10652622B2 (en) * | 2017-06-27 | 2020-05-12 | At&T Intellectual Property I, L.P. | Method and apparatus for providing content based upon a selected language |
US11605114B2 (en) * | 2017-08-16 | 2023-03-14 | Zig-Zag, Inc. | Method, medium, and system for supporting provision of EC to overseas and device using same |
JP7027757B2 (en) * | 2017-09-21 | 2022-03-02 | 富士フイルムビジネスイノベーション株式会社 | Information processing equipment and information processing programs |
US12093261B2 (en) * | 2017-09-29 | 2024-09-17 | Oracle International Corporation | Storage formats for in-memory caches |
US11328130B2 (en) * | 2017-11-06 | 2022-05-10 | Orion Labs, Inc. | Translational bot for group communication |
US10409583B2 (en) | 2017-11-27 | 2019-09-10 | Salesforce.Com, Inc. | Content deployment system having a content publishing engine with a filter module for selectively extracting content items provided from content sources for integration into a specific release and methods for implementing the same |
US10684847B2 (en) * | 2017-11-27 | 2020-06-16 | Salesforce.Com, Inc. | Content deployment system having a proxy for continuously providing selected content items to a content publishing engine for integration into a specific release and methods for implementing the same |
US10803257B2 (en) | 2018-03-22 | 2020-10-13 | Microsoft Technology Licensing, Llc | Machine translation locking using sequence-based lock/unlock classification |
US10430512B1 (en) * | 2018-05-24 | 2019-10-01 | Slack Technologies, Inc. | Methods, apparatuses and computer program products for formatting messages in a messaging user interface within a group-based communication system |
CN109299405B (en) * | 2018-09-28 | 2021-01-29 | 北京小米移动软件有限公司 | Information pushing method and device and storage medium |
CN113302642A (en) * | 2018-11-22 | 2021-08-24 | Y·尹 | Evaluation system based on multi-language label |
KR102214990B1 (en) * | 2018-11-26 | 2021-02-15 | 김준 | System for providing bookmark management and information searching service and method for providing bookmark management and information searching service using it |
US11361169B2 (en) | 2019-02-28 | 2022-06-14 | Yandex Europe Ag | Method and server for training a machine learning algorithm for translation |
US11347381B2 (en) * | 2019-06-13 | 2022-05-31 | International Business Machines Corporation | Dynamic synchronized image text localization |
US11227101B2 (en) * | 2019-07-05 | 2022-01-18 | Open Text Sa Ulc | System and method for document translation in a format agnostic document viewer |
CN110381371B (en) * | 2019-07-30 | 2021-08-31 | 维沃移动通信有限公司 | Video editing method and electronic equipment |
US11461559B2 (en) * | 2020-01-28 | 2022-10-04 | Salesforce.Com, Inc. | Mechanism to facilitate image translation |
US11861313B2 (en) * | 2020-02-02 | 2024-01-02 | International Business Machines Corporation | Multi-level linguistic alignment in specific user targeted messaging |
US11494567B2 (en) * | 2020-03-03 | 2022-11-08 | Dell Products L.P. | Content adaptation techniques for localization of content presentation |
US11443122B2 (en) * | 2020-03-03 | 2022-09-13 | Dell Products L.P. | Image analysis-based adaptation techniques for localization of content presentation |
US11455456B2 (en) | 2020-03-03 | 2022-09-27 | Dell Products L.P. | Content design structure adaptation techniques for localization of content presentation |
US11449688B2 (en) | 2020-03-13 | 2022-09-20 | Sap Se | Processing of translation-resistant content in web application |
WO2021184249A1 (en) * | 2020-03-18 | 2021-09-23 | Citrix Systems, Inc. | Machine translation of digital content |
US11687732B2 (en) * | 2020-04-06 | 2023-06-27 | Open Text Holdings, Inc. | Content management systems for providing automated translation of content items |
US11373657B2 (en) * | 2020-05-01 | 2022-06-28 | Raytheon Applied Signal Technology, Inc. | System and method for speaker identification in audio data |
CN111563223B (en) * | 2020-05-12 | 2023-09-19 | 北京飞漫软件技术有限公司 | Webpage localization method and device |
US11586834B2 (en) | 2020-06-30 | 2023-02-21 | Roblox Corporation | Automatic localization of dynamic content |
US11315545B2 (en) * | 2020-07-09 | 2022-04-26 | Raytheon Applied Signal Technology, Inc. | System and method for language identification in audio data |
US12020697B2 (en) | 2020-07-15 | 2024-06-25 | Raytheon Applied Signal Technology, Inc. | Systems and methods for fast filtering of audio keyword search |
JP7561537B2 (en) * | 2020-08-04 | 2024-10-04 | キヤノン株式会社 | Information processing system, control method, and program |
US11418622B2 (en) | 2020-08-18 | 2022-08-16 | Baton Simulations | System and methods for web-based software application translation |
US11321412B1 (en) | 2020-11-04 | 2022-05-03 | Capital One Services, Llc | Customized navigation flow |
CN114625434B (en) * | 2020-12-10 | 2024-04-23 | 华为技术有限公司 | Address acquisition method and equipment |
US20220253882A1 (en) * | 2021-01-29 | 2022-08-11 | KwikClick, LLC | Hyperlinks incorporating products in international-scale multi-level marketing system |
US11328113B1 (en) | 2021-03-03 | 2022-05-10 | Micro Focus Llc | Dynamic localization using color |
KR102382316B1 (en) * | 2021-04-14 | 2022-04-04 | 주식회사 브링코 | Method and system for providing electronic commerce service using partnership service cart realized by api in shopping mall |
US11962817B2 (en) | 2021-06-21 | 2024-04-16 | Tubi, Inc. | Machine learning techniques for advanced frequency management |
CN113238756B (en) * | 2021-07-08 | 2021-10-22 | 北京达佳互联信息技术有限公司 | Live broadcast service processing method and device, electronic equipment and storage medium |
TWI827984B (en) * | 2021-10-05 | 2024-01-01 | 台灣大哥大股份有限公司 | System and method for website classification |
EP4430515A1 (en) * | 2021-11-08 | 2024-09-18 | AIRBNB, Inc. | Selective pre-translation of web content |
US20230297764A1 (en) * | 2022-03-15 | 2023-09-21 | Salesforce.Com, Inc. | Non-obtrusive markup augmentation for website localization |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080162114A1 (en) * | 2007-01-03 | 2008-07-03 | Vistaprint Technologies Limited | Translation processing using a translation memory |
US8145472B2 (en) * | 2005-12-12 | 2012-03-27 | John Shore | Language translation using a hybrid network of human and machine translators |
US8244519B2 (en) * | 2008-12-03 | 2012-08-14 | Xerox Corporation | Dynamic translation memory using statistical machine translation |
Family Cites Families (223)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS58101365A (en) | 1981-12-14 | 1983-06-16 | Hitachi Ltd | Text display calibration system in machine translation system |
JP2831647B2 (en) | 1988-03-31 | 1998-12-02 | 株式会社東芝 | Machine translation system |
EP0366142B1 (en) | 1988-10-28 | 1997-08-06 | Kabushiki Kaisha Toshiba | Method and apparatus of machine translation |
NZ299101A (en) | 1992-09-04 | 1997-06-24 | Caterpillar Inc | Computer-based document development system: includes text editor and language editor enforcing lexical and grammatical constraints |
US5608622A (en) | 1992-09-11 | 1997-03-04 | Lucent Technologies Inc. | System for analyzing translations |
USH2098H1 (en) | 1994-02-22 | 2004-03-02 | The United States Of America As Represented By The Secretary Of The Navy | Multilingual communications device |
US5987402A (en) * | 1995-01-31 | 1999-11-16 | Oki Electric Industry Co., Ltd. | System and method for efficiently retrieving and translating source documents in different languages, and other displaying the translated documents at a client device |
US5715466A (en) | 1995-02-14 | 1998-02-03 | Compuserve Incorporated | System for parallel foreign language communication over a computer network |
US5802539A (en) | 1995-05-05 | 1998-09-01 | Apple Computer, Inc. | Method and apparatus for managing text objects for providing text to be interpreted across computer operating systems using different human languages |
AU5969896A (en) | 1995-06-07 | 1996-12-30 | International Language Engineering Corporation | Machine assisted translation tools |
GB9513379D0 (en) | 1995-06-30 | 1995-09-06 | Jonhig Ltd | Electronic purse system |
US6073143A (en) | 1995-10-20 | 2000-06-06 | Sanyo Electric Co., Ltd. | Document conversion system including data monitoring means that adds tag information to hyperlink information and translates a document when such tag information is included in a document retrieval request |
US6993471B1 (en) | 1995-11-13 | 2006-01-31 | America Online, Inc. | Integrated multilingual browser |
US5835192A (en) | 1995-12-21 | 1998-11-10 | Johnson & Johnson Vision Products, Inc. | Contact lenses and method of fitting contact lenses |
US5974372A (en) | 1996-02-12 | 1999-10-26 | Dst Systems, Inc. | Graphical user interface (GUI) language translator |
US5855020A (en) | 1996-02-21 | 1998-12-29 | Infoseek Corporation | Web scan process |
US5864852A (en) | 1996-04-26 | 1999-01-26 | Netscape Communications Corporation | Proxy server caching mechanism that provides a file directory structure and a mapping mechanism within the file directory structure |
US5944790A (en) * | 1996-07-19 | 1999-08-31 | Lucent Technologies Inc. | Method and apparatus for providing a web site having a home page that automatically adapts to user language and customs |
JP3121548B2 (en) * | 1996-10-15 | 2001-01-09 | インターナショナル・ビジネス・マシーンズ・コーポレ−ション | Machine translation method and apparatus |
US5956740A (en) | 1996-10-23 | 1999-09-21 | Iti, Inc. | Document searching system for multilingual documents |
KR19980055170A (en) | 1996-12-28 | 1998-09-25 | 김영귀 | Vehicle Hazard Warning Device |
KR19980055170U (en) | 1996-12-31 | 1998-10-07 | 박병재 | Input shaft oil ring tear prevention structure of automobile transmission |
US6065026A (en) | 1997-01-09 | 2000-05-16 | Document.Com, Inc. | Multi-user electronic document authoring system with prompted updating of shared language |
US5898836A (en) | 1997-01-14 | 1999-04-27 | Netmind Services, Inc. | Change-detection tool indicating degree and location of change of internet documents by comparison of cyclic-redundancy-check(CRC) signatures |
US6336033B1 (en) | 1997-02-06 | 2002-01-01 | Ntt Mobile Communication Network Inc. | Adaptive array antenna |
EP0867815A3 (en) | 1997-03-26 | 2000-05-31 | Kabushiki Kaisha Toshiba | Translation service providing method and translation service system |
IL121071A0 (en) | 1997-03-27 | 1997-11-20 | El Mar Software Ltd | Automatic conversion server |
EP0952532A1 (en) | 1997-03-31 | 1999-10-27 | Sanyo Electric Co., Ltd. | Document preparation method and machine translation device |
US5991710A (en) | 1997-05-20 | 1999-11-23 | International Business Machines Corporation | Statistical translation system with features based on phrases or groups of words |
US6112240A (en) | 1997-09-03 | 2000-08-29 | International Business Machines Corporation | Web site client information tracker |
US6161082A (en) | 1997-11-18 | 2000-12-12 | At&T Corp | Network based language translation system |
US6349275B1 (en) | 1997-11-24 | 2002-02-19 | International Business Machines Corporation | Multiple concurrent language support system for electronic catalogue using a concept based knowledge representation |
IL123129A (en) | 1998-01-30 | 2010-12-30 | Aviv Refuah | Www addressing |
US6122666A (en) | 1998-02-23 | 2000-09-19 | International Business Machines Corporation | Method for collaborative transformation and caching of web objects in a proxy network |
US6526426B1 (en) | 1998-02-23 | 2003-02-25 | David Lakritz | Translation management system |
US8489980B2 (en) * | 1998-02-23 | 2013-07-16 | Transperfect Global, Inc. | Translation management system |
US6623529B1 (en) * | 1998-02-23 | 2003-09-23 | David Lakritz | Multilingual electronic document translation, management, and delivery system |
US10541974B2 (en) * | 1998-02-23 | 2020-01-21 | Transperfect Global, Inc. | Intercepting web server requests and localizing content |
US6076108A (en) | 1998-03-06 | 2000-06-13 | I2 Technologies, Inc. | System and method for maintaining a state for a user session using a web system having a global session server |
US6959318B1 (en) | 1998-03-06 | 2005-10-25 | Intel Corporation | Method of proxy-assisted predictive pre-fetching with transcoding |
US6163765A (en) | 1998-03-30 | 2000-12-19 | Motorola, Inc. | Subband normalization, transformation, and voiceness to recognize phonemes for text messaging in a radio communication system |
US7020601B1 (en) | 1998-05-04 | 2006-03-28 | Trados Incorporated | Method and apparatus for processing source information based on source placeable elements |
US6345243B1 (en) | 1998-05-27 | 2002-02-05 | Lionbridge Technologies, Inc. | System, method, and product for dynamically propagating translations in a translation-memory system |
US6526416B1 (en) | 1998-06-30 | 2003-02-25 | Microsoft Corporation | Compensating resource managers |
JP2002521751A (en) | 1998-07-23 | 2002-07-16 | ロゴヴィスタ株式会社 | Modular language translation system |
US6826593B1 (en) | 1998-09-01 | 2004-11-30 | Lucent Technologies Inc. | Computer implemented method and apparatus for fulfilling a request for information content with a user-selectable version of a file containing that information content |
US6738827B1 (en) | 1998-09-29 | 2004-05-18 | Eli Abir | Method and system for alternate internet resource identifiers and addresses |
US6349276B1 (en) | 1998-10-29 | 2002-02-19 | International Business Machines Corporation | Multilingual information retrieval with a transfer corpus |
US6347316B1 (en) | 1998-12-14 | 2002-02-12 | International Business Machines Corporation | National language proxy file save and incremental cache translation option for world wide web documents |
KR20000039748A (en) | 1998-12-15 | 2000-07-05 | 정선종 | Apparatus for translating web documents written in multi-languages and method for translating service using the apparatus |
US6901367B1 (en) | 1999-01-28 | 2005-05-31 | International Business Machines Corporation | Front end translation mechanism for received communication |
US6446036B1 (en) | 1999-04-20 | 2002-09-03 | Alis Technologies, Inc. | System and method for enhancing document translatability |
US6338033B1 (en) | 1999-04-20 | 2002-01-08 | Alis Technologies, Inc. | System and method for network-based teletranslation from one natural language to another |
US6286006B1 (en) | 1999-05-07 | 2001-09-04 | Alta Vista Company | Method and apparatus for finding mirrored hosts by analyzing urls |
US7607085B1 (en) | 1999-05-11 | 2009-10-20 | Microsoft Corporation | Client side localizations on the world wide web |
AUPQ141999A0 (en) | 1999-07-05 | 1999-07-29 | Worldlingo.Com Pty Ltd | Communication processing system |
US6418402B1 (en) | 1999-07-27 | 2002-07-09 | International Business Machines Corporation | Method and system for utilizing machine translation as input correction |
CN1176432C (en) | 1999-07-28 | 2004-11-17 | 国际商业机器公司 | Method and system for providing national language inquiry service |
US7110938B1 (en) | 1999-09-17 | 2006-09-19 | Trados, Inc. | E-services translation portal system |
CN1173282C (en) | 1999-09-20 | 2004-10-27 | 国际商业机器公司 | Method and system for dynamically increasiing new functions for www. page |
US6662233B1 (en) | 1999-09-23 | 2003-12-09 | Intel Corporation | System dynamically translates translation information corresponding to a version of a content element having a bandwidth corresponding to bandwidth capability of a recipient |
US6393389B1 (en) | 1999-09-23 | 2002-05-21 | Xerox Corporation | Using ranked translation choices to obtain sequences indicating meaning of multi-token expressions |
US7016977B1 (en) | 1999-11-05 | 2006-03-21 | International Business Machines Corporation | Method and system for multilingual web server |
US7383320B1 (en) | 1999-11-05 | 2008-06-03 | Idom Technologies, Incorporated | Method and apparatus for automatically updating website content |
US7039722B1 (en) | 1999-11-12 | 2006-05-02 | Fuisz Richard C | Method and apparatus for translating web addresses and using numerically entered web addresses |
JP2001167092A (en) | 1999-12-13 | 2001-06-22 | Nec Corp | Translation server system |
JP2001175683A (en) | 1999-12-21 | 2001-06-29 | Nec Corp | Translation server system |
AUPQ539700A0 (en) | 2000-02-02 | 2000-02-24 | Worldlingo.Com Pty Ltd | Translation ordering system |
US7149964B1 (en) | 2000-02-09 | 2006-12-12 | Microsoft Corporation | Creation and delivery of customized content |
US7216072B2 (en) | 2000-02-29 | 2007-05-08 | Fujitsu Limited | Relay device, server device, terminal device, and translation server system utilizing these devices |
WO2001080036A1 (en) | 2000-03-10 | 2001-10-25 | The One.Com | System and method for providing interactive translation of information in a communication network |
GB0006153D0 (en) | 2000-03-14 | 2000-05-03 | Inpharmatica Ltd | Database |
US20020002452A1 (en) | 2000-03-28 | 2002-01-03 | Christy Samuel T. | Network-based text composition, translation, and document searching |
AU2001249777A1 (en) | 2000-03-31 | 2001-10-15 | Amikai, Inc. | Method and apparatus for providing multilingual translation over a network |
EP1139231A1 (en) | 2000-03-31 | 2001-10-04 | Fujitsu Limited | Document processing apparatus and method |
JP2001282732A (en) | 2000-04-03 | 2001-10-12 | Komatsu Ltd | Method and system for providing service to distant user through inter-computer communication |
US7509397B1 (en) | 2000-04-06 | 2009-03-24 | Yahoo! Inc. | Web portholes: using web proxies to capture and enhance display real estate |
US6604101B1 (en) | 2000-06-28 | 2003-08-05 | Qnaturally Systems, Inc. | Method and system for translingual translation of query and search and retrieval of multilingual information on a computer network |
US6865716B1 (en) | 2000-05-05 | 2005-03-08 | Aspect Communication Corporation | Method and apparatus for dynamic localization of documents |
FR2809509B1 (en) | 2000-05-26 | 2003-09-12 | Bull Sa | SYSTEM AND METHOD FOR INTERNATIONALIZING THE CONTENT OF TAGGED DOCUMENTS IN A COMPUTER SYSTEM |
WO2001093089A1 (en) | 2000-05-26 | 2001-12-06 | Theone.Com | System and method for providing interactive translation of information in a communication network |
JP2001344169A (en) * | 2000-06-01 | 2001-12-14 | Internatl Business Mach Corp <Ibm> | Network system, server, web server, web page, data processing method, storage medium, and program transmitting device |
AU2001268642A1 (en) | 2000-06-23 | 2002-01-08 | Advisortech Corporation | Apparatus and method of providing multilingual content in an online environment |
JP4011268B2 (en) | 2000-07-05 | 2007-11-21 | 株式会社アイアイエス | Multilingual translation system |
US6667751B1 (en) | 2000-07-13 | 2003-12-23 | International Business Machines Corporation | Linear web browser history viewer |
US7389221B1 (en) | 2000-07-17 | 2008-06-17 | Globalenglish Corporation | System and method for interactive translation |
US7571217B1 (en) | 2000-08-16 | 2009-08-04 | Parallel Networks, Llc | Method and system for uniform resource locator transformation |
US20020111787A1 (en) | 2000-10-13 | 2002-08-15 | Iko Knyphausen | Client-driven workload environment |
EP1327214A1 (en) | 2000-10-16 | 2003-07-16 | IIS Inc | Method for offering multilingual information translated in many languages through a communication network |
US20020065946A1 (en) | 2000-10-17 | 2002-05-30 | Shankar Narayan | Synchronized computing with internet widgets |
US20020083029A1 (en) * | 2000-10-23 | 2002-06-27 | Chun Won Ho | Virtual domain name system using the user's preferred language for the internet |
US20020083068A1 (en) | 2000-10-30 | 2002-06-27 | Quass Dallan W. | Method and apparatus for filling out electronic forms |
US6980953B1 (en) | 2000-10-31 | 2005-12-27 | International Business Machines Corp. | Real-time remote transcription or translation service |
US6859820B1 (en) | 2000-11-01 | 2005-02-22 | Microsoft Corporation | System and method for providing language localization for server-based applications |
US7139898B1 (en) | 2000-11-03 | 2006-11-21 | Mips Technologies, Inc. | Fetch and dispatch disassociation apparatus for multistreaming processors |
US7185044B2 (en) * | 2000-11-06 | 2007-02-27 | The Weather Channel | Weather information delivery systems and methods providing planning functionality and navigational tools |
US8677505B2 (en) * | 2000-11-13 | 2014-03-18 | Digital Doors, Inc. | Security system with extraction, reconstruction and secure recovery and storage of data |
US7418390B1 (en) | 2000-11-20 | 2008-08-26 | Yahoo! Inc. | Multi-language system for online communications |
US6665642B2 (en) | 2000-11-29 | 2003-12-16 | Ibm Corporation | Transcoding system and method for improved access by users with special needs |
US8452850B2 (en) | 2000-12-14 | 2013-05-28 | International Business Machines Corporation | Method, apparatus and computer program product to crawl a web site |
US7657640B2 (en) | 2000-12-21 | 2010-02-02 | Hewlett-Packard Development Company, L.P. | Method and system for efficient routing of customer and contact e-mail messages |
US20020091509A1 (en) | 2001-01-02 | 2002-07-11 | Yacov Zoarez | Method and system for translating text |
JP2002215621A (en) | 2001-01-19 | 2002-08-02 | Nec Corp | Translation server, translation method and program |
US20030115040A1 (en) * | 2001-02-09 | 2003-06-19 | Yue Xing | International (multiple language/non-english) domain name and email user account ID services system |
US6964014B1 (en) | 2001-02-15 | 2005-11-08 | Networks Associates Technology, Inc. | Method and system for localizing Web pages |
AUPR329501A0 (en) | 2001-02-22 | 2001-03-22 | Worldlingo, Inc | Translation information segment |
US20020123879A1 (en) | 2001-03-01 | 2002-09-05 | Donald Spector | Translation system & method |
AUPR360701A0 (en) | 2001-03-06 | 2001-04-05 | Worldlingo, Inc | Seamless translation system |
WO2002073464A1 (en) | 2001-03-09 | 2002-09-19 | The One.Com | System and method for providing efficient and accurate translation of information in a communication network |
US20020133523A1 (en) | 2001-03-16 | 2002-09-19 | Anthony Ambler | Multilingual graphic user interface system and method |
US20020165885A1 (en) | 2001-05-03 | 2002-11-07 | International Business Machines Corporation | Method and system for verifying translation of localized messages for an internationalized application |
US20020188670A1 (en) | 2001-06-08 | 2002-12-12 | Stringham Gary G. | Method and apparatus that enables language translation of an electronic mail message |
WO2002103997A2 (en) * | 2001-06-14 | 2002-12-27 | Dizpersion Group, L.L.C. | Method and system for providing network based target advertising |
US8538803B2 (en) | 2001-06-14 | 2013-09-17 | Frank C. Nicholas | Method and system for providing network based target advertising and encapsulation |
US20030004703A1 (en) | 2001-06-28 | 2003-01-02 | Arvind Prabhakar | Method and system for localizing a markup language document |
EP1421044B1 (en) | 2001-07-02 | 2007-03-07 | Exxonmobil Chemical Patents Inc. | Inhibiting catalyst coke formation in the manufacture of an olefin |
US7793326B2 (en) | 2001-08-03 | 2010-09-07 | Comcast Ip Holdings I, Llc | Video and digital multimedia aggregator |
EP1288793A1 (en) | 2001-08-27 | 2003-03-05 | Sony NetServices GmbH | Translation text management system |
US6993473B2 (en) | 2001-08-31 | 2006-01-31 | Equality Translation Services | Productivity tool for language translators |
US20030084401A1 (en) | 2001-10-16 | 2003-05-01 | Abel Todd J. | Efficient web page localization |
EP1306775A1 (en) * | 2001-10-29 | 2003-05-02 | BRITISH TELECOMMUNICATIONS public limited company | Machine translation |
US7447624B2 (en) | 2001-11-27 | 2008-11-04 | Sun Microsystems, Inc. | Generation of localized software applications |
US20030105621A1 (en) | 2001-12-04 | 2003-06-05 | Philippe Mercier | Method for computer-assisted translation |
US7007026B2 (en) * | 2001-12-14 | 2006-02-28 | Sun Microsystems, Inc. | System for controlling access to and generation of localized application values |
US20030120478A1 (en) | 2001-12-21 | 2003-06-26 | Robert Palmquist | Network-based translation system |
US6869820B2 (en) | 2002-01-30 | 2005-03-22 | United Epitaxy Co., Ltd. | High efficiency light emitting diode and method of making the same |
US7412374B1 (en) | 2002-01-30 | 2008-08-12 | Novell, Inc. | Method to dynamically determine a user's language for a network |
US20030154071A1 (en) | 2002-02-11 | 2003-08-14 | Shreve Gregory M. | Process for the document management and computer-assisted translation of documents utilizing document corpora constructed by intelligent agents |
JP3809863B2 (en) | 2002-02-28 | 2006-08-16 | インターナショナル・ビジネス・マシーンズ・コーポレーション | server |
JP2003296223A (en) | 2002-03-29 | 2003-10-17 | Fuji Xerox Co Ltd | Method and device, and program for providing web page information |
US20030204573A1 (en) | 2002-04-30 | 2003-10-30 | Andre Beck | Method of providing a web user with additional context-specific information |
US7110937B1 (en) | 2002-06-20 | 2006-09-19 | Siebel Systems, Inc. | Translation leveraging |
US7308399B2 (en) | 2002-06-20 | 2007-12-11 | Siebel Systems, Inc. | Searching for and updating translations in a terminology database |
US7313511B2 (en) | 2002-08-21 | 2007-12-25 | California Institute Of Technology | Method and apparatus for computer simulation of flight test beds |
US7113960B2 (en) | 2002-08-22 | 2006-09-26 | International Business Machines Corporation | Search on and search for functions in applications with varying data types |
US20040049374A1 (en) | 2002-09-05 | 2004-03-11 | International Business Machines Corporation | Translation aid for multilingual Web sites |
US6996520B2 (en) | 2002-11-22 | 2006-02-07 | Transclick, Inc. | Language translation system and method using specialized dictionaries |
US7634728B2 (en) | 2002-12-28 | 2009-12-15 | International Business Machines Corporation | System and method for providing a runtime environment for active web based document resources |
US7627817B2 (en) | 2003-02-21 | 2009-12-01 | Motionpoint Corporation | Analyzing web site for translation |
US8170863B2 (en) | 2003-04-01 | 2012-05-01 | International Business Machines Corporation | System, method and program product for portlet-based translation of web content |
US7444590B2 (en) * | 2003-06-25 | 2008-10-28 | Microsoft Corporation | Systems and methods for declarative localization of web services |
CA2433512C (en) | 2003-06-26 | 2008-01-15 | Ibm Canada Limited - Ibm Canada Limitee | File translation |
US7313587B1 (en) * | 2003-07-14 | 2007-12-25 | Microsoft Corporation | Method and apparatus for localizing Web applications |
JP2005301817A (en) | 2004-04-14 | 2005-10-27 | Ricoh Co Ltd | Translation support system |
EP1749096B1 (en) | 2004-05-28 | 2013-07-17 | Mologen AG | Method for the production of suitable dna constructs for specific inhibition of gene expression by rna interference |
JP4384939B2 (en) | 2004-05-31 | 2009-12-16 | 株式会社インパルスジャパン | Language discrimination device, translation device, translation server, language discrimination method, and translation processing method |
JP4048188B2 (en) | 2004-06-07 | 2008-02-13 | 株式会社インパルスジャパン | WEB page translation apparatus and WEB page translation method |
US20050283473A1 (en) | 2004-06-17 | 2005-12-22 | Armand Rousso | Apparatus, method and system of artificial intelligence for data searching applications |
CN100568230C (en) | 2004-07-30 | 2009-12-09 | 国际商业机器公司 | Multilingual network information search method and system based on hypertext |
US8230328B2 (en) * | 2004-10-08 | 2012-07-24 | Sharp Laboratories Of America, Inc. | Methods and systems for distributing localized display elements to an imaging device |
US7509318B2 (en) | 2005-01-28 | 2009-03-24 | Microsoft Corporation | Automatic resource translation |
US7536640B2 (en) | 2005-01-28 | 2009-05-19 | Oracle International Corporation | Advanced translation context via web pages embedded with resource information |
US7958446B2 (en) | 2005-05-17 | 2011-06-07 | Yahoo! Inc. | Systems and methods for language translation in network browsing applications |
US8249854B2 (en) | 2005-05-26 | 2012-08-21 | Microsoft Corporation | Integrated native language translation |
US20060294199A1 (en) | 2005-06-24 | 2006-12-28 | The Zeppo Network, Inc. | Systems and Methods for Providing A Foundational Web Platform |
US7543189B2 (en) | 2005-06-29 | 2009-06-02 | International Business Machines Corporation | Automated multilingual software testing method and apparatus |
US7636656B1 (en) * | 2005-07-29 | 2009-12-22 | Sun Microsystems, Inc. | Method and apparatus for synthesizing multiple localizable formats into a canonical format |
US20070033520A1 (en) | 2005-08-08 | 2007-02-08 | Kimzey Ann M | System and method for web page localization |
CA2619773C (en) * | 2005-08-19 | 2016-01-26 | Biap Systems, Inc. | System and method for recommending items of interest to a user |
US20080214152A1 (en) * | 2005-09-14 | 2008-09-04 | Jorey Ramer | Methods and systems of mobile dynamic content presentation |
EP2527990B1 (en) | 2006-02-17 | 2020-01-15 | Google LLC | Using distributed models for machine translation |
US8069182B2 (en) * | 2006-04-24 | 2011-11-29 | Working Research, Inc. | Relevancy-based domain classification |
US7612712B2 (en) | 2006-04-25 | 2009-11-03 | Rx Networks Inc. | Distributed orbit modeling and propagation method for a predicted and real-time assisted GPS system |
US20070255554A1 (en) | 2006-04-26 | 2007-11-01 | Lucent Technologies Inc. | Language translation service for text message communications |
WO2007133625A2 (en) | 2006-05-12 | 2007-11-22 | Eij Group Llc | Multi-lingual information retrieval |
WO2007139910A2 (en) | 2006-05-26 | 2007-12-06 | Laden Sondrah S | System and method of language translation |
US20100287049A1 (en) * | 2006-06-07 | 2010-11-11 | Armand Rousso | Apparatuses, Methods and Systems for Language Neutral Search |
WO2008021863A2 (en) | 2006-08-08 | 2008-02-21 | Wayport, Inc. | Automated acquisition and maintenance of web-servable content via enhanced '404: not found' handler |
US8181157B2 (en) * | 2006-09-29 | 2012-05-15 | Rockwell Automation Technologies, Inc. | Custom language support for project documentation and editing |
US9852430B2 (en) | 2006-10-03 | 2017-12-26 | Microsoft Technology Licensing, Llc | Dynamic generation of advertisement text |
US20080133342A1 (en) | 2006-12-01 | 2008-06-05 | Nathalie Criou | Determining Advertising Effectiveness |
US8347205B2 (en) | 2006-12-04 | 2013-01-01 | Integrated Software, Llc | Automated generation of multiple versions of a publication |
US8606606B2 (en) | 2007-01-03 | 2013-12-10 | Vistaprint Schweiz Gmbh | System and method for translation processing |
US20080168049A1 (en) | 2007-01-08 | 2008-07-10 | Microsoft Corporation | Automatic acquisition of a parallel corpus from a network |
US8831928B2 (en) | 2007-04-04 | 2014-09-09 | Language Weaver, Inc. | Customizable machine translation service |
US7877251B2 (en) | 2007-05-07 | 2011-01-25 | Microsoft Corporation | Document translation system |
US8799307B2 (en) | 2007-05-16 | 2014-08-05 | Google Inc. | Cross-language information retrieval |
JP4877831B2 (en) | 2007-06-27 | 2012-02-15 | 久美子 石井 | Confirmation system, information provision system, and program |
US7890525B2 (en) | 2007-11-14 | 2011-02-15 | International Business Machines Corporation | Foreign language abbreviation translation in an instant messaging system |
US20090138379A1 (en) * | 2007-11-15 | 2009-05-28 | Ronald Scheman | System and method for international internet shopping |
US20090132230A1 (en) | 2007-11-15 | 2009-05-21 | Dimitri Kanevsky | Multi-hop natural language translation |
JP5102593B2 (en) * | 2007-11-30 | 2012-12-19 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Apparatus and method for controlling display of document data |
US7974832B2 (en) | 2007-12-12 | 2011-07-05 | Microsoft Corporation | Web translation provider |
US9418061B2 (en) | 2007-12-14 | 2016-08-16 | International Business Machines Corporation | Prioritized incremental asynchronous machine translation of structured documents |
US8090885B2 (en) | 2008-01-14 | 2012-01-03 | Microsoft Corporation | Automatically configuring computer devices wherein customization parameters of the computer devices are adjusted based on detected removable key-pad input devices |
US8370234B2 (en) | 2008-01-23 | 2013-02-05 | Super Derivatives, Inc. | Device, system, and method of generating a customized trade article |
US8612469B2 (en) * | 2008-02-21 | 2013-12-17 | Globalenglish Corporation | Network-accessible collaborative annotation tool |
US8010628B2 (en) | 2008-03-03 | 2011-08-30 | Bookit.Com, Inc. | Software to provide geographically relevant website content |
US20090234633A1 (en) | 2008-03-17 | 2009-09-17 | Virginia Chao-Suren | Systems and methods for enabling inter-language communications |
US20090231361A1 (en) * | 2008-03-17 | 2009-09-17 | Sensormatic Electronics Corporation | Rapid localized language development for video matrix switching system |
WO2009119251A1 (en) | 2008-03-26 | 2009-10-01 | 日本碍子株式会社 | Device and method for producing sealed honeycomb structure |
US8515729B2 (en) | 2008-03-31 | 2013-08-20 | Microsoft Corporation | User translated sites after provisioning |
US8249858B2 (en) | 2008-04-24 | 2012-08-21 | International Business Machines Corporation | Multilingual administration of enterprise data with default target languages |
US8265990B2 (en) | 2008-05-15 | 2012-09-11 | Utrom Processing Co. L.L.C. | Method and system for selecting and delivering media content via the internet |
US8250083B2 (en) | 2008-05-16 | 2012-08-21 | Enpulz, Llc | Support for international search terms—translate as you crawl |
US8464150B2 (en) | 2008-06-07 | 2013-06-11 | Apple Inc. | Automatic language identification for dynamic text processing |
US9246708B2 (en) * | 2008-08-06 | 2016-01-26 | Bindu Rama Rao | Social networking website system with automatic registration based on location information |
US9262409B2 (en) | 2008-08-06 | 2016-02-16 | Abbyy Infopoisk Llc | Translation of a selected text fragment of a screen |
US8458589B2 (en) | 2008-09-18 | 2013-06-04 | Apple Inc. | Localized label user interface control |
US8438310B2 (en) * | 2008-10-01 | 2013-05-07 | Adp Dealer Services, Inc. | Systems and methods for configuring a website having a plurality of operational modes |
DE102008061480A1 (en) | 2008-10-06 | 2010-04-08 | Siemens Aktiengesellschaft | Method and apparatus for exchanging a component of a computer system |
US20100107114A1 (en) | 2008-10-28 | 2010-04-29 | Zachcial Slawomir | In context web page localization |
JP5305083B2 (en) * | 2008-11-20 | 2013-10-02 | 富士通株式会社 | Call control system, communication control method, and communication control program |
US20100235762A1 (en) | 2009-03-10 | 2010-09-16 | Nokia Corporation | Method and apparatus of providing a widget service for content sharing |
JP5897456B2 (en) | 2009-03-18 | 2016-03-30 | グーグル インコーポレイテッド | Web translation using display replacement |
US9342508B2 (en) * | 2009-03-19 | 2016-05-17 | Microsoft Technology Licensing, Llc | Data localization templates and parsing |
WO2010120929A2 (en) * | 2009-04-15 | 2010-10-21 | Evri Inc. | Generating user-customized search results and building a semantics-enhanced search engine |
US20100274553A1 (en) * | 2009-04-27 | 2010-10-28 | Netanel Raisch | Multi-Languages IDN System |
US8478579B2 (en) | 2009-05-05 | 2013-07-02 | Google Inc. | Conditional translation header for translation of web documents |
US20100293448A1 (en) | 2009-05-15 | 2010-11-18 | Infonow Corporation | Centralized website local content customization |
US20100313255A1 (en) | 2009-06-03 | 2010-12-09 | Exling, Llc | Web Browser and Web Page Plug-In Language Translation Method and System |
US8990064B2 (en) * | 2009-07-28 | 2015-03-24 | Language Weaver, Inc. | Translating documents based on content |
AU2010328326B2 (en) | 2009-12-07 | 2016-12-01 | Robert Buffone | System and method for website performance optimization and internet traffic processing |
US9336320B2 (en) | 2010-02-19 | 2016-05-10 | Nokia Technologies Oy | Method and apparatus for navigating services |
US8386233B2 (en) | 2010-05-13 | 2013-02-26 | Exling, Llc | Electronic multi-language-to-multi-language translation method and system |
US20110307243A1 (en) * | 2010-06-10 | 2011-12-15 | Microsoft Corporation | Multilingual runtime rendering of metadata |
US20110307240A1 (en) | 2010-06-10 | 2011-12-15 | Microsoft Corporation | Data modeling of multilingual taxonomical hierarchies |
US8635205B1 (en) | 2010-06-18 | 2014-01-21 | Google Inc. | Displaying local site name information with search results |
EP2587388A4 (en) * | 2010-06-25 | 2018-01-03 | Rakuten, Inc. | Machine translation system and method of machine translation |
US9213685B2 (en) | 2010-07-13 | 2015-12-15 | Motionpoint Corporation | Dynamic language translation of web site content |
US8977624B2 (en) | 2010-08-30 | 2015-03-10 | Microsoft Technology Licensing, Llc | Enhancing search-result relevance ranking using uniform resource locators for queries containing non-encoding characters |
US9164988B2 (en) | 2011-01-14 | 2015-10-20 | Lionbridge Technologies, Inc. | Methods and systems for the dynamic creation of a translated website |
US8600733B1 (en) * | 2011-05-31 | 2013-12-03 | Google Inc. | Language selection using language indicators |
-
2011
- 2011-07-13 US US13/182,120 patent/US9213685B2/en active Active
- 2011-07-13 EP EP11735940.6A patent/EP2593884A2/en not_active Ceased
- 2011-07-13 EP EP13004263.3A patent/EP2680159B1/en active Active
- 2011-07-13 US US13/182,059 patent/US9128918B2/en active Active
- 2011-07-13 EP EP20130004291 patent/EP2680162A1/en not_active Ceased
- 2011-07-13 EP EP20130004292 patent/EP2682875A1/en not_active Ceased
- 2011-07-13 EP EP13004288.0A patent/EP2680161B1/en active Active
- 2011-07-13 WO PCT/US2011/043865 patent/WO2012009441A2/en active Application Filing
- 2011-07-13 EP EP13004264.1A patent/EP2680160B1/en active Active
- 2011-07-13 US US13/182,139 patent/US9465782B2/en active Active
- 2011-07-13 US US13/182,118 patent/US9411793B2/en active Active
- 2011-07-13 US US13/182,080 patent/US9864809B2/en active Active
-
2013
- 2013-07-17 US US13/944,356 patent/US9311287B2/en active Active
-
2014
- 2014-07-02 US US14/322,016 patent/US10089400B2/en active Active
-
2015
- 2015-08-04 US US14/817,343 patent/US9858347B2/en active Active
- 2015-11-04 US US14/932,025 patent/US10210271B2/en active Active
-
2016
- 2016-03-02 US US15/058,257 patent/US10146884B2/en active Active
- 2016-06-22 US US15/189,081 patent/US10296651B2/en active Active
- 2016-08-31 US US15/252,810 patent/US10073917B2/en active Active
-
2017
- 2017-11-22 US US15/821,607 patent/US10387517B2/en active Active
- 2017-11-30 US US15/827,018 patent/US20180081890A1/en not_active Abandoned
-
2018
- 2018-07-27 US US16/047,111 patent/US10936690B2/en active Active
- 2018-08-01 US US16/051,944 patent/US10922373B2/en active Active
- 2018-10-19 US US16/164,994 patent/US11030267B2/en active Active
- 2018-12-14 US US16/220,300 patent/US20190121831A1/en not_active Abandoned
-
2019
- 2019-04-03 US US16/373,821 patent/US11157581B2/en active Active
- 2019-07-02 US US16/459,842 patent/US10977329B2/en active Active
-
2021
- 2021-01-05 US US17/141,455 patent/US11409828B2/en active Active
- 2021-01-05 US US17/141,401 patent/US11481463B2/en active Active
- 2021-03-16 US US17/202,405 patent/US20210209185A1/en not_active Abandoned
- 2021-05-06 US US17/313,171 patent/US20210256082A1/en not_active Abandoned
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8145472B2 (en) * | 2005-12-12 | 2012-03-27 | John Shore | Language translation using a hybrid network of human and machine translators |
US20080162114A1 (en) * | 2007-01-03 | 2008-07-03 | Vistaprint Technologies Limited | Translation processing using a translation memory |
US8244519B2 (en) * | 2008-12-03 | 2012-08-14 | Xerox Corporation | Dynamic translation memory using statistical machine translation |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11409828B2 (en) | Dynamic language translation of web site content |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: MOTIONPOINT CORPORATION, FLORIDA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:TRAVIESO, ENRIQUE;RUBENSTEIN, ADAM;FLEMING, WILLIAM;AND OTHERS;REEL/FRAME:055599/0819 Effective date: 20110916 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: APPLICATION DISPATCHED FROM PREEXAM, NOT YET DOCKETED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |