JP7393361B2

JP7393361B2 - Digital supplementary association and search for visual search

Info

Publication number: JP7393361B2
Application number: JP2020570146A
Authority: JP
Inventors: ジョイス，アラン; チュン，エドガー; ヤン，ヂョー; メサ，イアン; オルソン，ジョセフ
Original assignee: Google LLC
Current assignee: Google LLC
Priority date: 2018-06-21
Filing date: 2019-06-21
Publication date: 2023-12-06
Anticipated expiration: 2039-06-21
Also published as: JP2024112912A; CN112020712A; JP2021522614A; CN112020712B; JP2022110057A; KR20230003388A; KR20200136030A; WO2019245801A1; EP3811238A1

Description

関連出願の参照
本出願は、２０１８年６月２１日に出願された「DIGITAL SUPPLEMENT ASSOCIATION AND RETRIEVAL FOR VISUAL SEARCH」と題する米国非仮特許出願第１６／０１４，５２０号の継続出願であり、その優先権を主張するものであり、その開示は参照によりその全体が本明細書に組み込まれる。 REFERENCES TO RELATED APPLICATIONS This application is a continuation of U.S. Nonprovisional Patent Application No. 16/014,520, filed on June 21, 2018, entitled “DIGITAL SUPPLEMENT ASSOCIATION AND RETRIEVAL FOR VISUAL SEARCH,” and has priority thereto. , the disclosure of which is incorporated herein by reference in its entirety.

背景
スマートフォンのようなモバイルコンピューティングデバイスは、多くの場合、カメラを含む。これらのカメラは、当該コンピューティングデバイスの周囲の環境におけるエンティティの画像を取得するために使用され得る。これらのエンティティに関連する様々なタイプのコンテンツまたは経験は、当該モバイルコンピューティングデバイスを介してユーザに利用可能であり得る。 Background Mobile computing devices, such as smartphones, often include a camera. These cameras may be used to capture images of entities in the environment surrounding the computing device. Various types of content or experiences related to these entities may be available to the user via the mobile computing device.

概要
本開示は、ビジュアルサーチのためのデジタル補足関連付けおよび検索のためのシステムおよび方法を説明する。たとえば、本明細書で説明されるシステムおよび技術は、ビジュアルサーチに応答する、拡張現実（ＡＲ）コンテンツまたは経験などのデジタル補足を提供するために使用され得る。ビジュアルサーチは、例えば、画像または画像内で識別されたエンティティに基づき得る。デジタル補足は、例えば、画像に関連付けられた情報または機能を提供することを含み得る。 Overview This disclosure describes systems and methods for digital complementary association and retrieval for visual search. For example, the systems and techniques described herein may be used to provide digital supplements, such as augmented reality (AR) content or experiences, in response to visual searches. Visual searches may be based on images or entities identified within images, for example. Digital supplementation may include, for example, providing information or functionality associated with an image.

一局面は、デジタル補足を指定するデータを受信することを含むコンピュータで実行される方法であり、当該データは、デジタル補足と、当該デジタル補足を画像コンテンツに関連付けるための補足アンカーとを識別する。当該方法はまた、デジタル補足および補足アンカーを指定するデータ構造インスタンスを生成することを含む。当該方法は、データ構造インスタンスを生成した後に、少なくともデータ構造インスタンスを複数の他のデータ構造インスタンスを含むデータベースに格納することに基づいて、画像によるデジタル補足のトリガを有効化することをさらに含む。他のデータ構造インスタンスの各々は、デジタル補足と、１つまたは複数の補足アンカーとを指定する。 One aspect is a computer-implemented method that includes receiving data specifying a digital supplement, the data identifying a digital supplement and a supplement anchor for associating the digital supplement with image content. The method also includes generating a data structure instance that specifies the digital supplement and the supplement anchor. The method further includes, after generating the data structure instance, enabling triggering of digital supplementation with the image based on storing at least the data structure instance in a database that includes a plurality of other data structure instances. Each of the other data structure instances specifies a digital supplement and one or more supplement anchors.

別の局面は、少なくとも１つのプロセッサと、命令を記憶するメモリとを含むコンピューティングデバイスである。当該命令は、当該少なくとも１つのプロセッサによって実行されると、当該コンピューティングデバイスに、デジタル補足を指定するデータを受信させ、当該データは、デジタル補足と、当該デジタル補足を画像コンテンツに関連付けるための補足アンカーと、コンテキスト情報とを受信させる。当該命令はまた、当該コンピューティングデバイスに、デジタル補足と補足アンカーとコンテキスト情報とを指定するデータ構造インスタンスを生成させる。当該命令はさらに、当該コンピューティングデバイスに、データ構造インスタンスを生成した後に、当該データ構造インスタンスを複数の他のデータ構造インスタンスを含むデータベースに格納することに少なくとも基づいて、画像によるデジタル補足のトリガを有効化する。当該他のデータ構造インスタンスの各々は、デジタル補足と、１つまたは複数の補足アンカーとを指定する。 Another aspect is a computing device that includes at least one processor and memory that stores instructions. The instructions, when executed by the at least one processor, cause the computing device to receive data specifying a digital supplement, the data including a digital supplement and a supplement for associating the digital supplement with image content. An anchor and context information are received. The instructions also cause the computing device to generate a data structure instance that specifies the digital supplement, the supplement anchor, and the context information. The instructions further cause the computing device to trigger digital supplementation with an image based at least on: generating the data structure instance and then storing the data structure instance in a database that includes a plurality of other data structure instances. Activate. Each of the other data structure instances specifies a digital supplement and one or more supplement anchors.

さらに別の局面は、コンピューティングデバイスから画像コンテンツクエリを受信することと、当該画像コンテンツクエリに基づいて補足アンカーを識別することとを含む、コンピュータで実行される方法である。当該方法はまた、識別された補足アンカーに基づいてデジタル補足の順序付けられたリストを生成することと、当該順序付けられたリストをクライアントコンピューティングデバイスに送信することとを含む。 Yet another aspect is a computer-implemented method that includes receiving an image content query from a computing device and identifying a supplemental anchor based on the image content query. The method also includes generating an ordered list of digital supplements based on the identified supplement anchors and transmitting the ordered list to the client computing device.

１つまたは複数の実施形態の詳細は、添付の図面および以下の説明に記載される。他の特徴は、当該説明および図面ならびに特許請求の範囲から明らかとなるであろう。 The details of one or more embodiments are set forth in the accompanying drawings and the description below. Other features will be apparent from the description and drawings, and from the claims.

例示的な実施形態に従うシステムを示すブロック図である。1 is a block diagram illustrating a system according to an example embodiment. FIG. 図１のクライアントコンピューティングデバイスの実施形態がデジタル補足にアクセスしている例示的な物理的空間の三人称視点である。2 is a third-person view of an exemplary physical space in which an embodiment of the client computing device of FIG. 1 is accessing a digital supplement; FIG. 本明細書で説明される実施形態に従う、デジタル補足のトリガを有効化する例示的な方法の図である。FIG. 3 is an illustration of an example method for enabling a trigger for digital supplementation, according to embodiments described herein. 本明細書で説明される実施形態に従う、デジタル補足のトリガを有効化する例示的な方法の図である。FIG. 3 is an illustration of an example method for enabling a trigger for digital supplementation, according to embodiments described herein. 本明細書で説明される実施形態に従う、デジタル補足を検索して提示する例示的な方法の図である。FIG. 2 is an illustration of an example method for searching and presenting digital supplements in accordance with embodiments described herein. 本明細書で説明される実施形態に従う、画像に基づいてデジタル補足を識別して提示する例示的な方法の図である。FIG. 2 is an illustration of an example method for identifying and presenting digital supplements based on images, according to embodiments described herein. 画像コンテンツ検索を行なってデジタル補足を表示するために、図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by the embodiment of the client computing device of FIG. 1 for performing image content searches and displaying digital supplements; FIG. 画像コンテンツ検索を行なってデジタル補足を表示するために、図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by the embodiment of the client computing device of FIG. 1 for performing image content searches and displaying digital supplements; FIG. 画像コンテンツ検索を行なってデジタル補足を表示するために、図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by the embodiment of the client computing device of FIG. 1 for performing image content searches and displaying digital supplements; FIG. 画像コンテンツ検索を行なってデジタル補足を表示するために、図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by the embodiment of the client computing device of FIG. 1 for performing image content searches and displaying digital supplements; FIG. 画像コンテンツ検索を行なってデジタル補足を表示するために、図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by the embodiment of the client computing device of FIG. 1 for performing image content searches and displaying digital supplements; FIG. 画像コンテンツ検索を行なってデジタル補足を表示するために、図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by the embodiment of the client computing device of FIG. 1 for performing image content searches and displaying digital supplements; FIG. 画像コンテンツ検索を行なってデジタル補足を表示するために、図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by the embodiment of the client computing device of FIG. 1 for performing image content searches and displaying digital supplements; FIG. 画像コンテンツ検索を行なってデジタル補足を表示するために、図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by the embodiment of the client computing device of FIG. 1 for performing image content searches and displaying digital supplements; FIG. 画像コンテンツ検索を行なってデジタル補足を表示するために、図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by the embodiment of the client computing device of FIG. 1 for performing image content searches and displaying digital supplements; FIG. 画像コンテンツ検索を行なってデジタル補足を表示するために、図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by the embodiment of the client computing device of FIG. 1 for performing image content searches and displaying digital supplements; FIG. 画像コンテンツ検索を行なってデジタル補足を表示するために、図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by the embodiment of the client computing device of FIG. 1 for performing image content searches and displaying digital supplements; FIG. 店内で様々な画像コンテンツ検索を行うために、図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by the embodiment of the client computing device of FIG. 1 to perform various image content searches within a store; FIG. 店内で様々な画像コンテンツ検索を行うために、図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by the embodiment of the client computing device of FIG. 1 to perform various image content searches within a store; FIG. 店内で様々な画像コンテンツ検索を行うために、図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by the embodiment of the client computing device of FIG. 1 to perform various image content searches within a store; FIG. 様々な画像コンテンツ検索中に図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by an embodiment of the client computing device of FIG. 1 during various image content searches; FIG. 様々な画像コンテンツ検索中に図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by an embodiment of the client computing device of FIG. 1 during various image content searches; FIG. 様々な画像コンテンツ検索中に図１のクライアントコンピューティングデバイスの実施形態によって表示されるユーザインターフェイス画面の概略図である。2 is a schematic diagram of a user interface screen displayed by an embodiment of the client computing device of FIG. 1 during various image content searches; FIG. 本明細書で説明される技術を実施するために使用され得るコンピュータデバイスおよびモバイルコンピュータデバイスの一例の概略図である。1 is a schematic illustration of an example of a computing device and a mobile computing device that may be used to implement the techniques described herein.

詳細な説明
本開示は、画像コンテンツに基づくデジタル補足の識別および提示を単純化する技術的改善を記載する。本明細書で説明される技術のある実施形態は、特定のタイプの画像コンテンツに関連のあるデジタル補足のインデックスを生成し、クライアントコンピューティングデバイスから受信された画像コンテンツクエリに応答してそれらのデジタル補足を提供する。このインデックスにより、ユーザは、世界中に配置されたネットワークアクセス可能なリソース（例えば、ウェブページ）によって提供される、関連のあるデジタル補足にアクセスできる。このことは、情報のより効率的な検索を可能にする機能的なデータ構造を提供し得る。 DETAILED DESCRIPTION The present disclosure describes technical improvements that simplify the identification and presentation of digital supplements based on image content. Certain embodiments of the techniques described herein generate an index of digital supplements related to particular types of image content and index those digital supplements in response to an image content query received from a client computing device. Provide supplements. This index allows users to access relevant digital supplements provided by network-accessible resources (eg, web pages) located around the world. This may provide a functional data structure that allows more efficient retrieval of information.

例えば、スマートフォンのようなクライアントコンピューティングデバイスは、エンティティのような、補足アンカーの画像を取得し得る。次いで、クライアントコンピューティングデバイスは、識別された補足アンカーに関連付けられるデジタル補足を検索するために、画像に基づく画像コンテンツクエリをサーバコンピューティングデバイスに送信し得る。ある実施形態では、補足アンカーは、クライアントコンピューティングデバイス周囲の物理的環境に基づき、デジタル補足は、物理的環境におけるユーザの経験を補足し得る仮想コンテンツである。 For example, a client computing device, such as a smartphone, may obtain images of supplementary anchors, such as entities. The client computing device may then send an image content query based on the image to the server computing device to retrieve digital supplements associated with the identified supplement anchors. In some embodiments, the supplemental anchor is based on the physical environment surrounding the client computing device, and the digital supplement is virtual content that may supplement the user's experience in the physical environment.

画像コンテンツクエリは、（例えば、識別された補足アンカーのインジケータのような）画像から決定される画像またはデータを含み得る。画像から決定されるデータの例は、例えば、光学的文字認識を使用して画像から抽出されるテキストである。画像から抽出されるデータの他の例は、当該画像、エンティティの識別子または説明、製品、または、当該画像で識別されるエンティティのタイプにある、バーコード、ＱＲ（登録商標）コード等を含む。 The image content query may include images or data determined from the image (eg, indicators of identified supplemental anchors). An example of data determined from an image is text extracted from the image using, for example, optical character recognition. Other examples of data extracted from images include barcodes, QR codes, etc. on the image, an identifier or description of the entity, the product, or the type of entity identified in the image.

エンティティ、製品、またはエンティティタイプは、たとえば、畳み込みニューラルネットワークシステムのようなニューラルネットワークシステムを使用して画像内で識別され得る。エンティティ、製品、またはエンティティタイプの識別子または説明は、エンティティ、製品、またはエンティティタイプに関連するデータベース内のレコードへのメタデータまたは参照を含み得る。エンティティの非限定的な例は、建物、技術の作品、製品、書籍、ポスター、写真、カタログ、サイン、文書（例えば、名刺、レシート、クーポン、カタログ）、人々、および身体部分を含む。 An entity, product, or entity type may be identified within an image using a neural network system, such as a convolutional neural network system, for example. An identifier or description of an entity, product, or entity type may include metadata or a reference to a record in a database that is associated with the entity, product, or entity type. Non-limiting examples of entities include buildings, works of technology, products, books, posters, photographs, catalogs, signs, documents (eg, business cards, receipts, coupons, catalogs), people, and body parts.

補足アンカーに関連付けられる様々なタイプのデジタル補足が利用可能であり得る。デジタル補足は、インターネット上で利用可能なウェブページのような、ネットワークアクセス可能なリソースによって提供され得る。画像コンテンツクエリに応答して、これらのデジタル補足の位置を特定して提供する方法が必要である。ある実施形態は、画像コンテンツクエリに応答する際に使用するエンティティに関連付けられたデジタル補足のインデックスを生成して維持する。インデックスは、例えば、ネットワークアクセス可能なリソースをクローリングして当該ネットワークアクセス可能なリソースがデジタル補足を含むかまたは提供するか否かを判断し、それらのデジタル補足に関連付けられる補足アンカーを決定することによって、追加され得る。 Various types of digital supplements associated with supplemental anchors may be available. Digital supplements may be provided by network accessible resources, such as web pages available on the Internet. A method is needed to locate and provide these digital supplements in response to image content queries. Certain embodiments generate and maintain an index of digital supplements associated with entities for use in responding to image content queries. The index may be configured, for example, by crawling network-accessible resources to determine whether the network-accessible resources include or provide digital supplements and determining supplementary anchors associated with those digital supplements. , may be added.

例えば、ネットワークアクセス可能なリソースは、デジタル補足が関連付けられる補足アンカー（例えば、テキスト、コード、エンティティ、またはエンティティの種類）を識別するメタデータを含み得る。メタデータは、ハイパーテキスト転送プロトコル（ＨＴＴＰ）要求に応答してネットワークアクセス可能なリソースによって含められ得る。メタデータは、ＸＭＬ（extensible markup language）、JavaScript Object Notation（ＪＳＯＮ）、または他のフォーマットのような、様々なフォーマットで提供され得る。 For example, a network-accessible resource may include metadata that identifies a supplement anchor (eg, text, code, entity, or type of entity) with which a digital supplement is associated. Metadata may be included by a network-accessible resource in response to a Hypertext Transfer Protocol (HTTP) request. Metadata may be provided in various formats, such as extensible markup language (XML), JavaScript Object Notation (JSON), or other formats.

デジタル補足のためのメタデータは、以下のうちの１つまたは複数、すなわち、タイプインジケータ、アンカーインジケータ、名前、説明、コンテンツのスニペット（すなわち、コンテンツの一部分の抜粋またはプレビュー）、関連画像、デジタル補足へのＵＲＬのようなリンク、および、当該デジタル補足に関連付けられるアプリケーションの識別子、を含み得る。メタデータはまた、デジタル補足の発行者に関する情報を含み得る。たとえば、メタデータは、発行者名、発行者の説明、および、当該発行者に関連付けられる画像またはアイコンのうちの１つまたは複数を含み得る。ある実施形態では、メタデータは、デジタル補足を提供することに関連付けられるコンテキスト情報を含む。例えば、メタデータはまた、デジタル補足を提供すること又はデジタル捕捉にアクセスすることに関連付けられる条件（例えば、地理的条件、必要なアプリケーション）を含み得る。 Metadata for a digital supplement may include one or more of the following: type indicator, anchor indicator, name, description, snippet of content (i.e., an excerpt or preview of a portion of the content), associated images, digital supplement and an identifier of the application associated with the digital supplement. The metadata may also include information about the digital supplement's publisher. For example, the metadata may include one or more of a publisher name, a publisher description, and an image or icon associated with the publisher. In some embodiments, the metadata includes contextual information associated with providing digital supplements. For example, metadata may also include conditions associated with providing a digital supplement or accessing a digital capture (eg, geographic conditions, required applications).

識別されたデジタル補足は、メモリに記憶されるインデックスに追加され得る。少なくともある実施形態では、デジタル補足のための関連付けられた補足アンカーは、インデックスに対するキーとして使用される。デジタル補足はまた、様々なスコアに関連付けられ得る。例えば、デジタル補足は、当該デジタル補足または当該デジタル補足に関連付けられるネットワークアクセス可能なリソースを参照する他のリンクが（例えば、ネットワークアクセス可能なリソースをクローリングしている間に）どれだけ見つかったかと、それらのリンクを提供するネットワークアクセス可能なリソースの名声とに基づく名声スコアに関連付けられ得る。別の例として、デジタル補足は、特定のアンカーに対するデジタル補足（または関連付けられるネットワークアクセス可能なリソース）の関連性に対応する１つまたは複数の関連性スコアに関連付けられ得る。関連性スコアはまた、キーワードまたは主題に関連付けられ得る。関連性スコアは、デジタル補足の内容、ネットワークアクセス可能なリソースの内容、ネットワークアクセス可能なリソースにリンクするサイトの内容、およびネットワークアクセス可能なリソースへのリンクの内容（たとえばテキスト）のうちの１つまたは複数に基づいて決定され得る。 The identified digital supplements may be added to an index stored in memory. In at least some embodiments, the associated supplementary anchor for the digital supplement is used as a key to the index. Digital supplements may also be associated with various scores. For example, a digital supplement can determine how many other links that refer to the digital supplement or network-accessible resources associated with the digital supplement are found (e.g., while crawling network-accessible resources); These links may be associated with a prestige score based on the prestige of the network-accessible resource providing the link. As another example, a digital supplement may be associated with one or more relevance scores that correspond to the relevance of the digital supplement (or associated network-accessible resource) to a particular anchor. Relevance scores may also be associated with keywords or subjects. The relevance score is one of the following: the content of a digital supplement, the content of a network-accessible resource, the content of a site that links to a network-accessible resource, and the content (e.g., text) of a link to a network-accessible resource. or more than one.

図１は、例示的な実施形態に従うシステム１００を示すブロック図である。システム１００は、デジタル補足をエンティティまたはエンティティタイプに関連付けることができ、ビジュアルサーチに応答してデジタル補足を取り出すことができる。ビジュアルサーチは、画像コンテンツに基づく検索である。例えば、ビジュアルサーチは、画像コンテンツクエリに基づいて実行され得る。画像コンテンツクエリは、画像または他の画像コンテンツに基づくクエリである。例えば、画像コンテンツクエリは、画像を含み得る。ある実施形態では、画像コンテンツクエリは、画像に基づくテキストまたはデータを含み得る。例えば、テキストまたはデータは、画像内の１つまたは複数のエンティティを認識することによって生成され得る。一部の画像コンテンツクエリは、画像を含まない（例えば、画像コンテンツクエリは、画像から生成されたデータまたはテキストのみを含み得る）。ある実施形態では、システム１００は、クライアントコンピューティングデバイス１０２と、検索サーバ１５２と、デジタル補足サーバ１７２とを含む。また、クライアントコンピューティングデバイス１０２、検索サーバ１５２、およびデジタル補足サーバ１７２が通信し得るネットワーク１９０も示されている。 FIG. 1 is a block diagram illustrating a system 100 according to an example embodiment. System 100 can associate digital supplements with entities or entity types and can retrieve digital supplements in response to visual searches. Visual search is a search based on image content. For example, a visual search may be performed based on an image content query. Image content queries are queries based on images or other image content. For example, an image content query may include images. In some embodiments, the image content query may include text or data based on the image. For example, text or data may be generated by recognizing one or more entities within an image. Some image content queries do not include images (eg, an image content query may include only data or text generated from images). In some embodiments, system 100 includes client computing device 102, search server 152, and digital supplementation server 172. Also shown is a network 190 with which client computing device 102, search server 152, and digital supplementary server 172 may communicate.

クライアントコンピューティングデバイス１０２は、プロセッサアセンブリ１０４、通信モジュール１０６、センサシステム１１０、およびメモリ１２０を含み得る。センサシステム１１０は、カメラアセンブリ１１２、慣性運動ユニット（ＩＭＵ）１１４、および全地球測位システム（ＧＰＳ）受信機１１６のような、様々なセンサを含み得る。センサシステム１１０の実施形態はまた、例えば、光センサ、音声センサ、画像センサ、距離および／または近接センサ、容量センサなどの接触センサ、タイマ、および／または、他のセンサ、および／または、センサの異なる組み合せを含む他のセンサを含み得る。ある実施形態では、クライアントコンピューティングデバイス１０２は、モバイルデバイス（たとえば、スマートフォン）である。 Client computing device 102 may include a processor assembly 104, a communications module 106, a sensor system 110, and memory 120. Sensor system 110 may include various sensors, such as a camera assembly 112, an inertial motion unit (IMU) 114, and a global positioning system (GPS) receiver 116. Embodiments of sensor system 110 may also include, for example, light sensors, audio sensors, image sensors, distance and/or proximity sensors, contact sensors such as capacitive sensors, timers, and/or other sensors; Other sensors may be included including different combinations. In some embodiments, client computing device 102 is a mobile device (eg, a smartphone).

カメラアセンブリ１１２は、クライアントコンピューティングデバイス１０２の周りの物理的空間の画像または映像を取得する。カメラアセンブリ１１２は、１つまたは複数のカメラを含み得る。カメラアセンブリ１１２はまた、赤外線カメラを含み得る。カメラアセンブリ１１２で取得された画像は、アンカーを補足するために、および画像コンテンツクエリを形成するために識別するために、使用され得る。 Camera assembly 112 captures images or video of the physical space around client computing device 102 . Camera assembly 112 may include one or more cameras. Camera assembly 112 may also include an infrared camera. Images captured with camera assembly 112 may be used to supplement anchors and identify them to form image content queries.

ある実施形態では、カメラアセンブリ１１２によって取得された画像はまた、メモリ１２０または外部コンピューティングデバイスから受信された物理的空間の表現に基づいて、内部空間のような物理的空間内のクライアントコンピューティングデバイス１０２の位置および向きを決定するために使用され得る。ある実施形態では、物理的空間の表現は、当該物理的空間の視覚的特徴（例えば、現実空間の画像から抽出される特徴）を含み得る。当該表現はまた、それらの特徴に関連付けられて、物理的空間の１つまたは複数の画像に基づいて物理的空間内の位置および／または場所を決定するために視覚的位置決めシステムによって使用され得る位置決定データを含み得る。当該表現はまた、物理的空間内の少なくともいくつかの構造の三次元モデルを含み得る。ある実施形態では、当該表現は、物理的空間の３次元モデルを含まない。 In some embodiments, the images captured by camera assembly 112 also capture images of a client computing device within a physical space, such as an interior space, based on a representation of the physical space received from memory 120 or an external computing device. 102 can be used to determine the position and orientation of 102. In some embodiments, a representation of a physical space may include visual features of the physical space (eg, features extracted from an image of the real space). The representation may also be associated with a position that can be used by a visual positioning system to determine a position and/or location within a physical space based on one or more images of the physical space. May include decision data. The representation may also include a three-dimensional model of at least some structure in physical space. In some embodiments, the representation does not include a three-dimensional model of the physical space.

ＩＭＵ１１４は、クライアントコンピューティングデバイスの動作、動き、および／または加速度を検出し得る。ＩＭＵ１１４は、たとえば、加速度計、ジャイロスコープ、磁力計、および他のそのようなセンサのような様々な異なるタイプのセンサを含み得る。クライアントコンピューティングデバイス１０２の向きは、ＩＭＵ１１４またはＧＰＳ受信機１１６によって提供されるデータに基づいて検出および追跡され得る。 IMU 114 may detect motion, motion, and/or acceleration of the client computing device. IMU 114 may include a variety of different types of sensors, such as, for example, accelerometers, gyroscopes, magnetometers, and other such sensors. The orientation of client computing device 102 may be detected and tracked based on data provided by IMU 114 or GPS receiver 116.

ＧＰＳ受信機１１６は、ＧＰＳ衛星によって発信される信号を受信し得る。信号は、衛星の時間および位置を含む。いくつかの（例えば、少なくとも４つの）衛星から信号を受信することに基づいて、ＧＰＳ受信機１１６は、クライアントコンピューティングデバイス１０２のグローバル位置を決定し得る。 GPS receiver 116 may receive signals transmitted by GPS satellites. The signal includes the time and location of the satellite. Based on receiving signals from a number (eg, at least four) of satellites, GPS receiver 116 may determine the global location of client computing device 102.

メモリ１２０は、アプリケーション１２２、他のアプリケーション１４０、およびデバイス測位システム１４２を含み得る。他のアプリケーション１４０は、クライアントコンピューティングデバイス１０２上で実行するためにインストールされるかまたはそうでなければ利用可能である任意の他のアプリケーションを含む。ある実施形態では、アプリケーション１２２は、他のアプリケーション１４０のうちの１つに、デジタル補足を提供させ得る。ある実施形態では、いくつかのデジタル補足は、他のアプリケーション１４０が、デジタル補足に関連付けられ又は当該デジタル補足を提供するために必要とされる特定のアプリケーションを含む場合にのみ、利用可能であり得る。 Memory 120 may include applications 122, other applications 140, and device positioning system 142. Other applications 140 include any other applications installed or otherwise available for execution on client computing device 102. In some embodiments, application 122 may cause one of the other applications 140 to provide digital supplements. In some embodiments, some digital supplements may be available only if other applications 140 include the particular applications associated with or required to provide the digital supplements. .

デバイス測位システム１４２は、クライアントコンピューティングデバイス１０２の位置を決定する。デバイス測位システム１４２は、クライアントコンピューティングデバイス１０２の位置および向きをグローバルにまたは物理的空間内で決定するために、センサシステム１１０を使用し得る。ある実施形態では、デバイス測位システム１４２は、たとえば、セルラー三角測量に基づいて、クライアントコンピューティングデバイス１０２の位置を決定する。 Device positioning system 142 determines the location of client computing device 102 . Device positioning system 142 may use sensor system 110 to determine the location and orientation of client computing device 102 globally or within physical space. In some embodiments, device positioning system 142 determines the location of client computing device 102 based on, for example, cellular triangulation.

ある実施形態では、クライアントコンピューティングデバイス１０２は、カメラアセンブリ１１２によって取得された画像（またはそれらの画像から抽出される特徴）を、物理的空間の表現内の特徴の既知の配置と比較し、物理的空間内のクライアントコンピューティングデバイス１０２の６自由度の姿勢（たとえば、位置および向き）を決定する視覚的測位システムを含み得る。 In some embodiments, client computing device 102 compares images captured by camera assembly 112 (or features extracted from those images) to known placements of features within a representation of physical space, and The system may include a visual positioning system that determines a six-degree-of-freedom pose (eg, position and orientation) of client computing device 102 in virtual space.

アプリケーション１２２は、補足アンカー識別エンジン１２４、デジタル補足検索エンジン１２６、デジタル補足提示エンジン１２８、および、ユーザインターフェイスエンジン１３０を含み得る。アプリケーション１２２のある実施形態は、より少ない、追加の、または他の構成要素を含み得る。 Application 122 may include a supplemental anchor identification engine 124, a digital supplementary search engine 126, a digital supplementary presentation engine 128, and a user interface engine 130. Certain embodiments of application 122 may include fewer, additional, or other components.

補足アンカー識別エンジン１２４は、例えば、カメラアセンブリ１１２で取得された画像に基づいて補足アンカーを識別する。ある実施形態では、補足アンカー識別エンジン１２４は、テキストを識別するために画像を分析する。当該テキストは、次いで、アンカーを識別するために使用され得る。例えば、テキストは、知識グラフ内のノードにマッピングされ得る。たとえば、テキストは、人、場所、製品、建物、アートワーク、映画、または他のタイプのエンティティのような、エンティティの名前として認識され得る。ある実施形態では、テキストは、特定のエンティティに一般に関連付けられるフレーズとして、または、特定のエンティティを記述するフレーズとして認識され得る。例えば、テキストは、次いで、当該特定のエンティティに関連付けられたアンカーとして認識され得る。 Supplemental anchor identification engine 124 identifies supplemental anchors based on images captured with camera assembly 112, for example. In some embodiments, supplemental anchor identification engine 124 analyzes images to identify text. That text can then be used to identify the anchor. For example, text may be mapped to nodes in a knowledge graph. For example, the text may be recognized as the name of an entity, such as a person, place, product, building, artwork, movie, or other type of entity. In some embodiments, the text may be recognized as phrases commonly associated with or describing a particular entity. For example, the text may then be recognized as an anchor associated with that particular entity.

ある実施形態では、補足アンカー識別エンジン１２４は、画像内でバーコード、ＱＲコード（登録商標）、または別のタイプのコードのような、１つまたは複数のコードを識別する。次いで、コードは、補足アンカーにマッピングされ得る。 In some embodiments, supplemental anchor identification engine 124 identifies one or more codes within the image, such as a barcode, QR code, or another type of code. The codes may then be mapped to supplementary anchors.

補足アンカー識別エンジン１２４は、画像内の少なくともいくつかのタイプのエンティティを認識できる機械学習モジュールを含み得る。例えば、機械学習モジュールは、ニューラルネットワークシステムを含み得る。ニューラルネットワークは、機械学習に使用される計算モデルであり、重み付き接続を有する層に編成されたノードから構成される。ニューラルネットワークをトレーニングすることは、複数のトレーニング例を使用し、各例は入力および所望の出力であり、一連の反復ラウンドにわたって、ニューラルネットワークが所与の入力に対して所望の出力を提供する可能性を増加させる層間の接続の重み値を決定する。各トレーニングラウンドの間、当該重みは、誤った出力値に対処するように調整される。トレーニングされると、ニューラルネットワークは、提供された入力に基づいて出力を予測するために使用され得る。 Supplemental anchor identification engine 124 may include a machine learning module that can recognize at least some types of entities within an image. For example, a machine learning module may include a neural network system. A neural network is a computational model used for machine learning, consisting of nodes organized into layers with weighted connections. Training a neural network uses multiple training examples, each example being an input and a desired output, and over a series of iterative rounds, it is possible for the neural network to provide the desired output for a given input. Determine the weight value of the connection between the layers to increase the During each training round, the weights are adjusted to account for erroneous output values. Once trained, a neural network can be used to predict output based on the input provided.

ある実施形態では、ニューラルネットワークシステムは、畳み込みニューラルネットワーク（ＣＮＮ）を含む。畳み込みニューラルネットワーク（ＣＮＮ）は、ニューラルネットワークの層のうちの少なくとも１つが畳み込み層であるニューラルネットワークである。畳み込み層は、前の層の値のサブセットにカーネル関数を適用することに基づいて層の値が計算される層である。ニューラルネットワークをトレーニングすることは、トレーニング例に基づいてカーネル関数の重みを調整することを含み得る。典型的には、同じカーネル関数が、畳み込み層における各値を計算するために使用され得る。したがって、畳み込み層をトレーニングしながら学習しなければならない重みは、ニューラルネットワークにおいて完全に接続された層（例えば、ある層内の各値が、前の層内の各値の独立して調整されて重み付けされた組み合わせとして計算される層）よりもはるかに少ない。畳み込み層の重みは通常より少ないため、畳み込み層をトレーニングすることおよび使用することは、同等の完全に接続された層よりも少ないメモリ、プロセッササイクル、および時間を必要とし得る。 In some embodiments, the neural network system includes a convolutional neural network (CNN). A convolutional neural network (CNN) is a neural network in which at least one of the layers of the neural network is a convolutional layer. A convolutional layer is a layer in which the values of a layer are calculated based on applying a kernel function to a subset of the values of previous layers. Training a neural network may include adjusting weights of a kernel function based on training examples. Typically, the same kernel function may be used to compute each value in the convolutional layer. Therefore, the weights that have to be learned while training a convolutional layer can be applied to fully connected layers in a neural network (e.g., each value in one layer is adjusted independently of each value in the previous layer). (layers computed as a weighted combination). Because convolutional layers typically have fewer weights, training and using a convolutional layer may require less memory, processor cycles, and time than an equivalent fully connected layer.

補足アンカー識別エンジン１２４が画像内のエンティティまたはエンティティタイプを認識した後、エンティティまたはエンティティタイプのテキスト記述が生成され得る。加えて、エンティティまたはエンティティタイプは、補足アンカーにマッピングされ得る。ある実施形態では、補足アンカーは、１つまたは複数のデジタル補足に関連付けられる。 After supplemental anchor identification engine 124 recognizes an entity or entity type in the image, a textual description of the entity or entity type may be generated. Additionally, entities or entity types may be mapped to supplemental anchors. In some embodiments, a supplemental anchor is associated with one or more digital supplements.

ある実施形態では、補足アンカー識別エンジン１２４は、認識されたアンカーの信頼スコアを決定する。より高い信頼スコアは、より低い信頼スコアが決定された場合よりも、画像からのコンテンツ（例えば、画像、抽出されたテキスト、バーコード、ＱＲコード（登録商標））が、決定されたアンカーに関連付けられる可能性が高いことを示し得る。 In some embodiments, supplemental anchor identification engine 124 determines confidence scores for recognized anchors. A higher confidence score indicates that content from the image (e.g., image, extracted text, barcode, QR code) is more associated with the determined anchor than if a lower confidence score were determined. This can indicate that there is a high possibility that the

図１の例は、クライアントコンピューティングデバイス１０２上のアプリケーション１２２の構成要素として補足アンカー識別エンジン１２４を示すが、ある実施形態は、検索サーバ１５２上の補足アンカー識別エンジンを含む。例えば、クライアントコンピューティングデバイス１０２は、カメラアセンブリ１１２によって取得された画像を検索サーバ１５２に送信することができ、検索サーバ１５２は、画像内の補足アンカーを識別し得る。 Although the example of FIG. 1 shows supplemental anchor identification engine 124 as a component of application 122 on client computing device 102, certain embodiments include a supplemental anchor identification engine on search server 152. For example, client computing device 102 may send an image captured by camera assembly 112 to search server 152, and search server 152 may identify supplemental anchors within the image.

ある実施形態では、補足アンカー識別エンジン１２４は、潜在的な補足アンカーを識別する。例えば、補足アンカー識別エンジン１２４は、画像内の様々なエンティティを識別（認識）し得る。次いで、認識されたエンティティの識別子は、検索サーバ１５２に送信されることができ、検索サーバ１５２は、エンティティのいずれかが任意の補足アンカーに関連付けられているかどうかを決定し得る。ある実施形態では、検索サーバ１５２は、識別されたエンティティが補足アンカーでない場合であっても、識別されたエンティティをコンテキスト情報として使用し得る。 In some embodiments, supplemental anchor identification engine 124 identifies potential supplemental anchors. For example, supplemental anchor identification engine 124 may identify (recognize) various entities within an image. Identifiers of the recognized entities may then be sent to search server 152, which may determine whether any of the entities are associated with any supplemental anchors. In some embodiments, search server 152 may use the identified entity as context information even if the identified entity is not a supplemental anchor.

デジタル補足検索エンジン１２６は、デジタル補足を検索する。例えば、デジタル補足検索エンジン１２６は、補足アンカー識別エンジン１２４によって識別された補足アンカーに関連付けられるデジタル補足を検索し得る。ある実施形態では、デジタル補足検索エンジン１２６は、検索サーバ１５２またはデジタル補足サーバ１７２からデジタル補足を検索する。 Digital supplement search engine 126 searches digital supplements. For example, digital supplement search engine 126 may search for digital supplements associated with supplemental anchors identified by supplemental anchor identification engine 124. In some embodiments, digital supplement search engine 126 searches for digital supplements from search server 152 or digital supplement server 172.

例えば、補足アンカーが識別された後、デジタル補足検索エンジン１２６は、識別された補足アンカーに関連付けられる１つまたは複数のデジタル補足を検索し得る。デジタル補足検索エンジン１２６は、画像（または画像内の補足アンカーまたはエンティティの識別子）を含む画像コンテンツクエリを生成し得、その画像コンテンツクエリを検索サーバ１５２に送信し得る。画像コンテンツクエリはまた、クライアントコンピューティングデバイス１０２の位置のようなコンテキスト情報を含み得る。ある実施形態では、名前、画像、または説明のような、デジタル補足に関連するデータが取り出されて、（例えば、ユーザインターフェイスエンジン１３０によって）ユーザに提示される。複数のデジタル補足が提示される場合、ユーザは、ユーザインターフェイスエンジン１３０によって生成されたユーザインターフェイスを介して、デジタル補足のうちの１つを選択し得る。 For example, after a supplemental anchor is identified, digital supplement search engine 126 may search for one or more digital supplements associated with the identified supplemental anchor. Digital supplementary search engine 126 may generate an image content query that includes an image (or an identifier of a supplementary anchor or entity within the image) and send the image content query to search server 152. The image content query may also include contextual information such as the location of client computing device 102. In some embodiments, data associated with the digital supplement, such as a name, image, or description, is retrieved and presented to the user (eg, by user interface engine 130). If multiple digital supplements are presented, the user may select one of the digital supplements via a user interface generated by user interface engine 130.

デジタル補足提示エンジン１２８は、デジタル補足をクライアントコンピューティングデバイス１０２上に提示するか、または提示させる。ある実施形態では、デジタル補足提示エンジン１２８は、クライアントコンピューティングデバイスに他のアプリケーション１４０のうちの１つを開始させる。ある実施形態では、デジタル補足提示エンジン１２８は、情報またはコンテンツを表示させる。例えば、デジタル補足提示エンジン１２８は、ユーザインターフェイスエンジン１３０に、クライアントコンピューティングデバイス１０２によって表示されるデジタル補足からの情報またはコンテンツを含むユーザインターフェイスを生成させ得る。ある実施形態では、デジタル補足提示エンジン１２８は、デジタル補足検索エンジン１２６がデジタル補足を検索することによってトリガされる。次いで、デジタル補足提示エンジン１２８は、デジタル補足に関連付けられるコンテンツを表示するために、ディスプレイデバイス１０８をトリガし得る。ある実施形態では、デジタル補足提示エンジン１２８は、デジタル補足検索エンジン１２６がデジタル補足を検索する時とは異なる時にデジタル補足を表示させる。例えば、デジタル補足は、第１の時点で画像コンテンツクエリに応答して検索されてもよく、また、デジタル補足は、第２の時点で提示され得る。例えば、デジタル補足は、第１の時間（例えば、ユーザがカタログを見ている間、または店にいる間である）にカタログまたは店からの家財道具または家具の画像に基づく画像コンテンツクエリに応答して検索され得る。家財道具または家具のＡＲコンテンツを含むデジタル補足は、第２の時間（例えば、ユーザが家財道具または家具が設置され得る部屋にいる間）に提示され得る。 Digital supplement presentation engine 128 presents or causes digital supplements to be presented on client computing device 102. In some embodiments, the digital supplement presentation engine 128 causes the client computing device to start one of the other applications 140. In some embodiments, digital supplemental presentation engine 128 causes information or content to be displayed. For example, digital supplement presentation engine 128 may cause user interface engine 130 to generate a user interface that includes information or content from the digital supplement displayed by client computing device 102. In some embodiments, digital supplement presentation engine 128 is triggered by digital supplement search engine 126 searching for digital supplements. Digital supplement presentation engine 128 may then trigger display device 108 to display content associated with the digital supplement. In some embodiments, digital supplement presentation engine 128 causes digital supplements to be displayed at different times than when digital supplement search engine 126 searches for digital supplements. For example, the digital supplement may be retrieved in response to an image content query at a first point in time, and the digital supplement may be presented at a second point in time. For example, the digital supplement responds to an image content query based on images of household goods or furniture from a catalog or store at a first time (e.g., while the user is viewing a catalog or while in a store). can be searched. A digital supplement that includes AR content of household items or furniture may be presented at a second time (e.g., while the user is in a room where the household items or furniture may be installed).

ユーザインターフェイスエンジン１３０は、ユーザインターフェイスを生成する。ユーザインターフェイスエンジン１３０はまた、クライアントコンピューティングデバイス１０２に、生成されたユーザインターフェイスを表示させることもできる。生成されたユーザインターフェイスは、例えば、デジタル補足からの情報またはコンテンツを表示し得る。ある実施形態では、ユーザインターフェイスエンジン１３０は、各々がデジタル補足に関連付けられる複数のユーザ作動可能な制御を含むユーザインターフェイスを生成する。例えば、ユーザは、（例えば、タッチスクリーン上のコントロールに触れること、マウスまたは別の入力装置を使用してコントロールをクリックすること、またはそうでなければコントロールを作動させることによって）、ユーザ作動可能な制御のうちの１つを作動させ得る。 User interface engine 130 generates a user interface. User interface engine 130 may also cause client computing device 102 to display the generated user interface. The generated user interface may display information or content from the digital supplement, for example. In some embodiments, user interface engine 130 generates a user interface that includes a plurality of user actuatable controls, each associated with a digital supplement. For example, the user can perform a user-actuatable One of the controls may be activated.

検索サーバ１５２は、コンピューティングデバイスである。検索サーバ１５２は、画像コンテンツクエリのような検索要求に応答し得る。当該応答は、画像コンテンツクエリに潜在的に関連する１つまたは複数のデジタル補足を含み得る。ある実施形態では、検索サーバ１５２は、メモリ１６０と、プロセッサアセンブリ１５４と、通信モジュール１５６とを含む。メモリ１６０は、コンテンツクローラ１６２、デジタル補足検索エンジン１６４、および、デジタル補足データ記憶部１６６を含み得る。 Search server 152 is a computing device. Search server 152 may respond to search requests, such as image content queries. The response may include one or more digital supplements potentially related to the image content query. In some embodiments, search server 152 includes memory 160, processor assembly 154, and communication module 156. Memory 160 may include a content crawler 162, a digital supplementary search engine 164, and a digital supplementary data storage 166.

コンテンツクローラ１６２は、デジタル補足を識別するためにネットワークアクセス可能なリソースをクローリングし得る。例えば、コンテンツクローラ１６２は、デジタル補足サーバ１７２によって提供されるウェブページのような、インターネットを介してアクセス可能なウェブページにアクセスし得る。ネットワークアクセス可能なリソースをクローリングすることは、ウェブサーバからリソースを要求することと、当該リソースの少なくとも一部分を分析することとを含み得る。デジタル補足は、デジタル補足に関する情報を提供するＸＭＬまたはＪＳＯＮデータのような、ネットワークアクセス可能なリソースによって提供されるメタデータに基づいて識別され得る。ある実施形態では、クローラは、以前にクロールされたネットワークアクセス可能なリソースからリンクを抽出することに基づいて、ネットワークアクセス可能なリソースを識別する。コンテンツクローラ１６２はまた、ユーザによって提出された入力の受信に基づいて、クロールすべきネットワークアクセス可能なリソースを識別し得る。例えば、ユーザは、ウェブフォームまたはアプリケーションプログラミングインターフェイス（ＡＰＩ）を介して、デジタル補足を含むネットワークアクセス可能なリソースにＵＲＬ（または他の情報）を提出し得る。ある実施形態では、コンテンツクローラ１６２は、識別されたデジタル補足のインデックスを生成する。コンテンツクローラ１６２はまた、関連性スコアまたは人気（名声）スコアのような、デジタル補足に関連付けられるスコアを生成し得る。 Content crawler 162 may crawl network accessible resources to identify digital supplements. For example, content crawler 162 may access web pages accessible via the Internet, such as web pages provided by digital supplement server 172. Crawling a network-accessible resource may include requesting the resource from a web server and analyzing at least a portion of the resource. Digital supplements may be identified based on metadata provided by network-accessible resources, such as XML or JSON data that provides information about the digital supplement. In some embodiments, the crawler identifies network-accessible resources based on extracting links from previously crawled network-accessible resources. Content crawler 162 may also identify network-accessible resources to crawl based on receiving input submitted by users. For example, a user may submit a URL (or other information) to a network-accessible resource containing a digital supplement via a web form or application programming interface (API). In some embodiments, content crawler 162 generates an index of the identified digital supplements. Content crawler 162 may also generate a score associated with the digital supplement, such as a relevance score or popularity (fame) score.

デジタル補足検索エンジン１６４は、検索クエリを受信し、１つまたは複数の潜在的に関連するデジタル補足を含み得る応答を生成する。例えば、デジタル補足検索エンジン１６４は、クライアントコンピューティングデバイス１０２から画像コンテンツクエリを受信し得る。画像コンテンツクエリは、画像を含み得る。デジタル補足検索エンジン１６４は、画像内の補足アンカーを識別し得、識別された補足アンカーに基づいて、関連付けられた、または潜在的に関連するデジタル補足を識別し得る。デジタル補足検索エンジン１６４は、デジタル補足または当該デジタル補足にアクセスするために使用され得る情報を含む応答をクライアントコンピューティングデバイス１０２に送信し得る。ある実施形態では、デジタル補足検索エンジン１６４は、複数のデジタル補足に関連付けられる情報を返し得る。例えば、デジタル補足のリストは、クエリに対する応答に含まれ得る。当該リストは、補足アンカーへの関連性、人気、またはデジタル補足の他の特性に基づいて、順序付けされ得る。 Digital supplement search engine 164 receives a search query and generates a response that may include one or more potentially relevant digital supplements. For example, digital supplementary search engine 164 may receive image content queries from client computing device 102. Image content queries may include images. Digital supplement search engine 164 may identify supplement anchors within the image and may identify associated or potentially related digital supplements based on the identified supplement anchors. Digital supplement search engine 164 may send a response to client computing device 102 that includes the digital supplement or information that may be used to access the digital supplement. In some embodiments, digital supplement search engine 164 may return information associated with multiple digital supplements. For example, a list of digital supplements may be included in a response to a query. The list may be ordered based on relevance to the supplement anchor, popularity, or other characteristics of the digital supplement.

画像コンテンツクエリは、例えば、カメラアセンブリ１１２によって取得された画像、または、カメラアセンブリ１１２によって取得された画像に関連付けられるテキストもしくは他のデータを含み得る。画像コンテンツクエリはまた、クライアントコンピューティングデバイス１０２の位置またはクライアントコンピューティングデバイス１０２のユーザの識別子のような他の情報を含み得る。ある実施形態では、検索サーバ１５２は、（例えば、もし、ユーザが、当該ユーザの位置に関する情報をユーザのアカウントに関連付ける位置サービスをクライアントコンピューティングデバイス１０２上で有効にした場合）、ユーザ識別子からクライアントコンピューティングデバイス１０２の予想される位置を決定し得る。 Image content queries may include, for example, images captured by camera assembly 112 or text or other data associated with images captured by camera assembly 112. The image content query may also include other information, such as the location of client computing device 102 or an identifier of the user of client computing device 102. In some embodiments, the search server 152 retrieves the client information from the user identifier (e.g., if the user has enabled location services on the client computing device 102 that associate information about the user's location with the user's account). An expected location of computing device 102 may be determined.

デジタル補足データ記憶部１６６は、デジタル補足に関する情報を記憶する。ある実施形態では、デジタル補足データ記憶部１６６は、デジタル補足のインデックスを含む。例えば、インデックスは、コンテンツクローラ１６２によって生成され得る。デジタル補足検索エンジン１６４は、検索クエリに応答するためにインデックスを使用し得る。 Digital supplement data storage unit 166 stores information regarding digital supplements. In some embodiments, digital supplement data storage 166 includes an index of digital supplements. For example, the index may be generated by content crawler 162. Digital supplementary search engine 164 may use the index to respond to search queries.

デジタル補足サーバ１７２は、コンピューティングデバイスである。デジタル補足サーバ１７２は、デジタル補足を提供する。ある実施形態では、デジタル補足サーバ１７２は、メモリ１８０と、プロセッサアセンブリ１７４と、通信モジュール１７６とを含む。メモリ１８０は、デジタル補足１８２およびメタデータ１８４を含み得る。ある実施形態では、メモリ１８０はまた、必ずしもデジタル補足ではないウェブページのような他のネットワークアクセス可能なリソースを含み得る。たとえば、メモリ１８０は、１つまたは複数のデジタル補足に関する詳細およびそれらのデジタル補足にアクセスする方法を提供するためのメタデータを含むウェブページを記憶し得る。さらに、メモリ１８０は、例えば、ウェブページおよびデジタル補足のようなネットワークアクセス可能なリソースを用いてＨＴＴＰ要求のような要求に応答する、ウェブサーバのようなリソースサービス提供エンジンを含み得る。 Digital supplementary server 172 is a computing device. Digital supplement server 172 provides digital supplements. In some embodiments, digital supplementary server 172 includes memory 180, processor assembly 174, and communication module 176. Memory 180 may include digital supplement 182 and metadata 184. In some embodiments, memory 180 may also include other network-accessible resources, such as web pages, which are not necessarily digital supplements. For example, memory 180 may store web pages that include metadata to provide details about one or more digital supplements and how to access those digital supplements. Additionally, memory 180 may include a resource serving engine, such as a web server, that responds to requests, such as HTTP requests, with network-accessible resources such as web pages and digital supplements.

デジタル補足１８２は、ユーザの周囲の物理的環境において何かに対する補足として提供され得る任意の種類のコンテンツである。デジタル補足１８２はまた、（例えば、ユーザ周辺の以前の物理的環境の）格納された画像を補足できる任意のタイプのコンテンツを含み得る。例えば、デジタル補足は、画像、当該画像内で識別されるオブジェクトもしくは製品、または、場所のような補足アンカーに関連付けられ得る。デジタル補足１８２は、１つまたは複数の画像、オーディオコンテンツ、テキストデータ、映像、ゲーム、データファイル、アプリケーション、または構造化されたテキスト文書を含み得る。構造化されたテキスト文書の例は、ハイパーテキストマークアップ言語（ＨＴＭＬ）文書、ＸＭＬ文書、および他の種類の構造化されたテキスト文書を含む。 Digital supplement 182 is any type of content that can be provided as a supplement to something in the user's physical environment. Digital supplement 182 may also include any type of content that can supplement stored images (eg, of a previous physical environment around the user). For example, a digital supplement may be associated with a supplemental anchor such as an image, an object or product identified within the image, or a location. Digital supplement 182 may include one or more images, audio content, textual data, video, games, data files, applications, or structured text documents. Examples of structured text documents include Hypertext Markup Language (HTML) documents, XML documents, and other types of structured text documents.

デジタル補足１８２は、アプリケーションを起動させることができ、そのアプリケーションに対するパラメータを定義し得る。デジタル補足１８２はまた、要求をサーバに送信させ（例えば、ＨＴＴＰ要求）、その要求に対するパラメータを定義し得る。ある実施形態では、デジタル補足１８２は、購入を完了するためのワークフローのような、活動を完了するためのワークフローとして開始する。例えば、デジタル補足１８２は、特定の商品をユーザのショッピングカートに追加し、クーポンコードを追加し、そして、購入確認ページを検索するサーバに、ＨＴＴＰ要求を送信し得る。 Digital supplement 182 may launch an application and may define parameters for the application. Digital supplement 182 may also cause requests to be sent to the server (eg, HTTP requests) and define parameters for the requests. In some embodiments, digital supplement 182 begins as a workflow for completing an activity, such as a workflow for completing a purchase. For example, digital supplement 182 may send an HTTP request to a server that adds a particular item to the user's shopping cart, adds a coupon code, and retrieves a purchase confirmation page.

メタデータ１８４は、デジタル補足を記述するデータである。メタデータ１８４は、デジタル補足サーバ１７２によって提供される又は他の場所に提供される、１つまたは複数のデジタル補足を記述し得る。デジタル補足のためのメタデータ１８４は、以下のうちの１つ以上を含み得る：タイプインジケータ、アンカーインジケータ、名前、説明、プレビュースニペットまたは抜粋、関連する画像、デジタル補足に対するＵＲＬのようなリンク、およびデジタル補足に関連付けられるアプリケーションの識別子。メタデータはまた、発行者名、発行者の説明、および発行者に関連付けられる画像またはアイコンのような、デジタル補足の発行者に関する情報を含み得る。ある実施形態では、メタデータはまた、デジタル補足に関するコンテキスト情報、またはデジタル補足を提供するために満足しなければならないコンテキスト情報を含む。例えば、メタデータは、デジタル補足にアクセスするために満たされなければならない条件（例えば、地理的条件、クライアントコンピューティングデバイス要件、必要なアプリケーション）を含み得る。例示的なコンテキスト情報は、位置、画像内で識別されるエンティティ、または画像内で識別される複数のエンティティ（例えば、いくつかのデジタル補足は、画像内で認識されるエンティティの組み合わせを必要とする場合がある）を含む。認識されたエンティティは、補足アンカーであり得る。ある実施形態では、認識されたエンティティは、アンカーを補足せず、代わりにコンテキスト情報を提供する。メタデータ１８４はまた、デジタル補足に関連付けられる補足アンカー（例えば、テキスト、コード、エンティティ、またはエンティティのタイプ）を含み得る。 Metadata 184 is data that describes the digital supplement. Metadata 184 may describe one or more digital supplements provided by digital supplement server 172 or provided elsewhere. Metadata 184 for a digital supplement may include one or more of the following: a type indicator, an anchor indicator, a name, a description, a preview snippet or excerpt, an associated image, a URL-like link to the digital supplement, and Identifier of the application associated with the digital supplement. The metadata may also include information about the digital supplement's publisher, such as a publisher name, a publisher description, and an image or icon associated with the publisher. In some embodiments, the metadata also includes contextual information about the digital supplement or that must be satisfied to provide the digital supplement. For example, the metadata may include conditions that must be met to access the digital supplement (eg, geographic conditions, client computing device requirements, required applications). Exemplary context information includes a location, an entity identified within an image, or multiple entities identified within an image (e.g., some digital supplements require a combination of entities recognized within an image). (in some cases). A recognized entity may be a supplementary anchor. In some embodiments, the recognized entity does not supplement the anchor, but instead provides context information. Metadata 184 may also include supplemental anchors (eg, text, code, entities, or types of entities) associated with the digital supplement.

メタデータ１８４は、様々な形式で格納され得る。ある実施形態では、メタデータ１８４はデータベースに格納される。メタデータ１８４は、ＸＭＬファイル、ＪＳＯＮファイル、または他のフォーマットファイルとして格納され得る。ある実施形態では、デジタル補足サーバ１７２は、データベースからメタデータ１８４を取り出し、ＸＭＬ、ＪＳＯＮとしてメタデータ１８４をフォーマットし、またはそうでなければ、クライアントまたは検索サーバ１５２からの要求に対する応答を提供する。例えば、検索サーバ１５２は、デジタル補足データ記憶部１６６に記憶され、かつ、クライアントコンピューティングデバイス１０２からの検索要求に応答するために使用されるデータを生成するために、メタデータ１８４にアクセスし得る。 Metadata 184 may be stored in a variety of formats. In some embodiments, metadata 184 is stored in a database. Metadata 184 may be stored as an XML file, JSON file, or other format file. In some embodiments, digital capture server 172 retrieves metadata 184 from a database, formats metadata 184 as XML, JSON, or otherwise provides a response to a request from a client or search server 152. For example, search server 152 may access metadata 184 to generate data that is stored in digital supplemental data store 166 and used to respond to search requests from client computing device 102. .

通信モジュール１０６は、検索サーバ１５２またはデジタル補足サーバ１７２のような、他のコンピューティングデバイスと通信するための１つまたは複数のデバイスを含む。通信モジュール１０６は、ネットワーク１９０のような無線または有線ネットワークを介して通信し得る。検索サーバ１５２の通信モジュール１５６およびデジタル補足サーバ１７２の通信モジュール１７６は、通信モジュール１０６と同様であり得る。 Communication module 106 includes one or more devices for communicating with other computing devices, such as search server 152 or digital supplement server 172. Communication module 106 may communicate via a wireless or wired network, such as network 190. Communication module 156 of search server 152 and communication module 176 of digital supplement server 172 may be similar to communication module 106.

ディスプレイデバイス１０８は、例えば、ＬＣＤ（液晶ディスプレイ）画面、ＬＥＤ（発光ダイオード）画面、ＯＬＥＤ（有機発光ダイオード）画面、タッチ画面、または画像もしくは情報をユーザに表示するための任意の他の画面もしくはディスプレイを含み得る。ある実施形態では、ディスプレイデバイス１０８は、ユーザの目の一部に光を投影するように構成された光プロジェクタを含む。 Display device 108 may be, for example, an LCD (Liquid Crystal Display) screen, an LED (Light Emitting Diode) screen, an OLED (Organic Light Emitting Diode) screen, a touch screen, or any other screen or display for displaying images or information to a user. may include. In some embodiments, display device 108 includes a light projector configured to project light onto a portion of the user's eye.

メモリ１２０は、１つまたは複数の非一時的コンピュータ読み取り可能な記憶媒体を含み得る。メモリ１２０は、取得された画像に基づいて画像コンテンツクエリを生成し、画像コンテンツクエリを送信し、当該画像コンテンツクエリに対する応答を受信し、画像コンテンツクエリに対する応答において識別されるデジタル補足を提示するために、のように、本明細書で説明される技術を実施するためにクライアントコンピューティングデバイス１０２によって使用可能な命令およびデータを記憶し得る。検索サーバ１５２のメモリ１６０およびデジタル補足サーバ１７２のメモリ１８０は、メモリ１２０と同様であり得、それぞれ、検索サーバ１５２およびデジタル補足サーバ１７２の技術を実施するために使用可能なデータ命令を記憶し得る。 Memory 120 may include one or more non-transitory computer-readable storage media. Memory 120 is configured to generate an image content query based on the acquired images, transmit the image content query, receive a response to the image content query, and present digital supplements identified in the response to the image content query. may store instructions and data that can be used by client computing device 102 to implement the techniques described herein, such as. Memory 160 of search server 152 and memory 180 of digital supplementation server 172 may be similar to memory 120 and may store data instructions that can be used to implement the techniques of search server 152 and digital supplementation server 172, respectively. .

プロセッサアセンブリ１０４は、ビジュアルサーチのためのデジタル補足関連付けおよび検索に関連付けられる様々なタスクを実行するために、メモリ１２０によって格納された命令のような、命令を実行可能な１つまたは複数のデバイスを含む。たとえば、プロセッサアセンブリ１０４は、中央処理ユニット（ＣＰＵ）および／またはグラフィックプロセッサユニット（ＧＰＵ）を含み得る。たとえば、ＧＰＵが存在する場合、ユーザインターフェイスの生成および表示、またはデジタル補足の部分の表示のような、いくつかの画像／ビデオレンダリングタスクは、ＣＰＵからＧＰＵにオフロードされ得る。ある実施形態では、いくつかの画像認識タスクはまた、ＣＰＵからＧＰＵにオフロードされ得る。 Processor assembly 104 executes one or more devices capable of executing instructions, such as instructions stored by memory 120, to perform various tasks associated with digital supplementary association and retrieval for visual search. include. For example, processor assembly 104 may include a central processing unit (CPU) and/or a graphics processor unit (GPU). For example, if a GPU is present, some image/video rendering tasks may be offloaded from the CPU to the GPU, such as generating and displaying a user interface or displaying portions of a digital supplement. In some embodiments, some image recognition tasks may also be offloaded from the CPU to the GPU.

図１は示さないが、ある実施形態は、ヘッドマウンテッドディスプレイデバイス（ＨＭＤ）を含む。ＨＭＤは、クライアントコンピューティングデバイス１０２とは別個のデバイスであってもよく、または、クライアントコンピューティングデバイス１０２は、ＨＭＤを含み得る。ある実施形態では、クライアントコンピューティングデバイス１０２は、ケーブルを介してＨＭＤと通信する。例えば、クライアントコンピューティングデバイス１０２は、映像信号および／またはオーディオ信号をＨＭＤに送信して、ユーザに表示することができ、ＨＭＤは、動き、位置、および／または、向き情報をクライアントコンピューティングデバイス１０２に送信し得る。 Although not shown in FIG. 1, certain embodiments include a head-mounted display device (HMD). The HMD may be a separate device from client computing device 102, or client computing device 102 may include an HMD. In some embodiments, client computing device 102 communicates with the HMD via a cable. For example, the client computing device 102 may send video and/or audio signals to the HMD for display to the user, and the HMD may transmit motion, position, and/or orientation information to the client computing device 102. can be sent to.

クライアントコンピューティングデバイス１０２はまた、ワイヤレス通信プロトコルを使用してクライアントコンピューティングデバイス１０２と通信するコントローラのような、様々なユーザ入力構成要素（図示しない）を含み得る。ある実施形態では、クライアントコンピューティングデバイス１０２は、有線接続（例えば、ユニバーサルシリアルバス（ＵＳＢ）ケーブル）を介して、または無線通信プロトコル（例えば、任意のＷｉＦｉプロトコル、任意のBlueToothプロトコル、Zigbee（登録商標）など）を介して、ＨＭＤ（図示しない）と通信し得る。ある実施形態では、クライアントコンピューティングデバイス１０２は、ＨＭＤの構成要素であり、ＨＭＤのハウジング内に収容され得る。 Client computing device 102 may also include various user input components (not shown), such as a controller that communicates with client computing device 102 using a wireless communication protocol. In some embodiments, the client computing device 102 connects via a wired connection (e.g., Universal Serial Bus (USB) cable) or via a wireless communication protocol (e.g., any WiFi protocol, any BlueTooth protocol, Zigbee® ), etc.) may communicate with an HMD (not shown). In some embodiments, client computing device 102 is a component of an HMD and may be housed within a housing of the HMD.

ネットワーク１９０は、インターネット、ローカルエリアネットワーク（ＬＡＮ）、ワイヤレスローカルエリアネットワーク（ＷＬＡＮ）、および／または任意の他のネットワークであり得る。クライアントコンピューティングデバイス１０２は、例えば、オーディオ／映像信号を受信でき、これは、一例の例示的な実施形態では、デジタル補足の一部として、ネットワークを介して提供され得る。 Network 190 may be the Internet, a local area network (LAN), a wireless local area network (WLAN), and/or any other network. Client computing device 102 may, for example, receive audio/video signals, which in one example embodiment may be provided over a network as part of a digital supplement.

図２は、クライアントコンピューティングデバイス１０２の実施形態がデジタル補足にアクセスしている例示的な物理的空間２００の三人称視点である。この例では、物理的空間２００はオブジェクト２２２を含む。ここで、オブジェクト２２２は、物理的空間２００の壁上のアートワークである。オブジェクト２２２は、クライアントコンピューティングデバイス１０２のカメラアセンブリ１１２の視野２０４内に含まれる。 FIG. 2 is a third-person view of an exemplary physical space 200 in which an embodiment of client computing device 102 is accessing digital supplements. In this example, physical space 200 includes object 222 . Here, object 222 is artwork on the wall of physical space 200. Object 222 is included within field of view 204 of camera assembly 112 of client computing device 102.

例示的なユーザインターフェイス画面２０６も示されている。ユーザインターフェイス画面２０６は、例えば、クライアントコンピューティングデバイス１０２のユーザインターフェイスエンジン１３０によって生成され得る。ユーザインターフェイス画面２０６は、画像表示パネル２０８およびデジタル補足選択パネル２１０を含む。画像表示パネル２０８は、画像を表示する。例えば、画像表示パネル２０８は、クライアントコンピューティングデバイス１０２のカメラアセンブリ１１２からのリアルタイムフィードに対応する画像を示し得る。ある実施形態では、画像表示パネル２０８は、以前に取得された画像またはクライアントコンピューティングデバイス１０２のメモリ１２０から取り出された画像を示す。 An example user interface screen 206 is also shown. User interface screen 206 may be generated by user interface engine 130 of client computing device 102, for example. User interface screen 206 includes an image display panel 208 and a digital supplement selection panel 210. Image display panel 208 displays images. For example, image display panel 208 may show an image corresponding to a real-time feed from camera assembly 112 of client computing device 102. In some embodiments, image display panel 208 shows previously captured images or images retrieved from memory 120 of client computing device 102 .

ある実施形態では、ユーザインターフェイス画面２０６は、クライアントコンピューティングデバイス１０２のディスプレイデバイス上でユーザに提示される。ある実施形態では、ユーザインターフェイス画面２０６は、物理的空間の画像（または、コンピューティングデバイスのカメラによって取得されている映像フィード）上にそのようにオーバーレイされ得る。加えて、ユーザインターフェイス画面２０６は、ユーザによって装着されたＨＭＤを使用して、ユーザの視野にわたってＡＲコンテンツとして表示され得る。 In some embodiments, user interface screen 206 is presented to a user on a display device of client computing device 102. In some embodiments, the user interface screen 206 may be so overlaid onto an image of the physical space (or a video feed being captured by a camera of the computing device). Additionally, user interface screen 206 may be displayed as AR content across the user's field of view using an HMD worn by the user.

画像表示パネル２０８はまた、画像に関連する注釈またはユーザインターフェイス要素を含み得る。例えば、画像表示パネル２０８は、画像内のオブジェクト（例えば、オブジェクト２２２）が補足アンカーとして認識されたというインジケータを含み得る。当該インジケータは、識別された補足アンカーに関連付けられるデジタル補足に関する情報にアクセスまたは閲覧するための、ユーザ作動可能コントロールを含み得る。ある状況では、画像表示パネル２０８に表示される画像は、補足アンカーとして認識される複数のオブジェクトを含んでもよく、画像表示パネル２０８は、それらの補足アンカーを識別するために画像を重ねる複数の注釈を含み得る。 Image display panel 208 may also include annotations or user interface elements associated with the image. For example, image display panel 208 may include an indicator that an object in the image (eg, object 222) has been recognized as a supplemental anchor. The indicator may include user actuatable controls to access or view information regarding the digital supplement associated with the identified supplemental anchor. In some situations, images displayed on image display panel 208 may include multiple objects that are recognized as supplemental anchors, and image display panel 208 may include multiple annotations overlaying the images to identify those supplemental anchors. may include.

補足アンカーは、クライアントコンピューティングデバイス１０２の補足アンカー識別エンジンによって認識され得る。ある実施形態では、補足アンカーは、画像を検索サーバ１５２に送信することによって識別される。次いで、検索サーバ１５２は、画像を分析し、当該画像内の補足アンカーを識別し得る。ある実施形態では、検索サーバ１５２は、補足アンカーに関連付けられる位置（たとえば、画像座標）または任意の識別されたオブジェクトのディメンジョンのうちの１つまたは複数を、クライアントコンピューティングデバイス１０２に送信し得る。次いで、クライアントコンピューティングデバイス１０２は、画像内の補足アンカー（または関連付けられたオブジェクト）を識別する注釈を示すためにユーザインターフェイス画面を更新し得る。ある実施形態では、クライアントコンピューティングデバイス１０２は、カメラアセンブリ１１２によって取得された映像ストリーム（たとえば、順次に取得された画像シーケンス）内の補足アンカー（または関連付けられたオブジェクト）の位置を追跡し得る（例えば、補足アンカー識別エンジン１２４は、検索サーバ１５２によって識別された補足アンカーを追跡し得る）。 The supplemental anchor may be recognized by the supplemental anchor identification engine of client computing device 102. In some embodiments, supplemental anchors are identified by sending images to search server 152. Search server 152 may then analyze the image and identify supplemental anchors within the image. In some embodiments, search server 152 may send one or more of the location (eg, image coordinates) or dimensions of any identified object associated with the supplemental anchor to client computing device 102. Client computing device 102 may then update the user interface screen to show annotations that identify supplementary anchors (or associated objects) within the image. In some embodiments, client computing device 102 may track the position of a supplemental anchor (or associated object) within a video stream (e.g., a sequentially acquired image sequence) acquired by camera assembly 112 ( For example, supplemental anchor identification engine 124 may track supplemental anchors identified by search server 152).

デジタル補足選択パネル２１０は、提示のためのデジタル補足をユーザが選択することを可能にする。例えば、デジタル補足選択パネル２１０は、各々がデジタル補足に関連付けられるユーザ作動可能コントロールを含むメニューを含み得る。この例では、デジタル補足選択パネル２１０は、各々が関連付けられたデジタル補足に関する情報を含む、ユーザ作動可能コントロール２１２およびユーザ作動可能コントロール２１４を含む。例えば、当該ユーザ作動可能コントロールは、検索サーバ１５２から受信され得る、名前（またはタイトル）、簡単な説明、および当該デジタル補足に関連付けられた画像のうちの１つまたは複数を表示し得る。ユーザ作動可能コントロール２１２またはユーザ作動可能コントロール２１４が作動すると、関連付けられたデジタル補足のコンテンツがユーザに提示され得る。デジタル補足をユーザに提示することは、クライアントコンピューティングデバイス１０２に、当該デジタル補足からの画像、映像、テキスト、他のコンテンツ、またはそれらの組み合わせを含むユーザインターフェイス画面を表示させることを含み得る。ある実施形態では、デジタル補足コンテンツは、画像またはカメラフィード上で画像表示パネル２０８上にオーバーレイとして表示される。デジタル補足コンテンツは、三次元の拡張現実コンテンツであり得る。 Digital supplement selection panel 210 allows the user to select digital supplements for presentation. For example, digital supplement selection panel 210 may include a menu that each includes user actuatable controls associated with a digital supplement. In this example, digital supplement selection panel 210 includes user-actuatable controls 212 and user-actuatable controls 214, each containing information about an associated digital supplement. For example, the user-actuatable control may display one or more of a name (or title), a short description, and an image associated with the digital supplement that may be received from search server 152. Upon actuation of user-actuatable control 212 or user-actuatable control 214, associated digital supplemental content may be presented to the user. Presenting a digital supplement to a user may include causing client computing device 102 to display a user interface screen that includes images, video, text, other content, or a combination thereof from the digital supplement. In some embodiments, the digital supplemental content is displayed as an overlay on the image display panel 208 over the image or camera feed. The digital supplemental content may be three-dimensional augmented reality content.

ある実施形態では、デジタル補足を提示することは、クライアントコンピューティングデバイス１０２にインストールされているアプリケーション（例えば、他のアプリケーション１４０の１つ）を起動することを含む。デジタル補足を提示することはまた、当該デジタル補足に関連付けられるＵＲＬに要求を送信することを含み得る。当該要求は、画像内で識別される製品またはオブジェクトの識別子のような、デジタル補足に関連付けられたパラメータを含み得る。ある実施形態では、画像コンテンツクエリからの画像（または他のコンテンツ）は、当該要求とともにパラメータを渡される。画像はまた、デジタル補足サーバ１７２に関連付けられたＡＰＩを介して提供され得る。ある実施形態では、クライアントコンピューティングデバイス１０２は、画像をデジタル補足サーバ１７２に送信する。ある実施形態では、検索サーバ１５２は、画像をデジタル補足サーバ１７２に送信し得る。例えば、ユーザがデジタル補足を選択することに応答して、クライアントコンピューティングデバイス１０２は、選択のインジケータを検索サーバ１５２に送信でき、次いで、検索サーバ１５２は、対応するデジタル補足サーバに画像を送信し得る。クライアントコンピューティングデバイス１０２はまた、デジタル補足サーバ１７２が画像にアクセスするために使用できる検索サーバ１５２上の場所にＵＲＬを送信し得る。有益なことに、これらの実施は、当該クライアントコンピューティングデバイスが送信する必要があるデータの量を削減し得る。 In some embodiments, presenting the digital supplement includes launching an application (eg, one of the other applications 140) installed on the client computing device 102. Presenting the digital supplement may also include sending a request to a URL associated with the digital supplement. The request may include parameters associated with the digital supplement, such as an identifier for the product or object identified in the image. In some embodiments, images (or other content) from an image content query are passed parameters with the request. Images may also be provided via an API associated with digital supplementation server 172. In some embodiments, client computing device 102 transmits the image to digital capture server 172. In some embodiments, search server 152 may send the images to digital supplementation server 172. For example, in response to a user selecting a digital supplement, client computing device 102 can send an indicator of the selection to search server 152, which in turn sends the image to the corresponding digital supplement server. obtain. Client computing device 102 may also send a URL to a location on search server 152 that digital supplementation server 172 can use to access the images. Beneficially, these implementations may reduce the amount of data that the client computing device needs to transmit.

ユーザ作動可能コントロール２１２に関連付けられたデジタル補足は、美術館からの情報のようなオブジェクト２２２についての情報を表示させ得る。ユーザ作動可能コントロール２１４に関連付けられたデジタル補足は、美術館に関連する情報を表示させ得る。例えば、デジタル補足の提示は、美術館ツアー上の停止場所を完了したとして印をつけさせ、次の停止場所の情報を表示させ得る。 Digital supplements associated with user actuatable controls 212 may cause information about object 222 to be displayed, such as information from a museum. Digital supplements associated with user actuatable controls 214 may cause information related to the museum to be displayed. For example, the presentation of a digital supplement may cause a stop on a museum tour to be marked as complete and information about the next stop may be displayed.

図３は、本明細書で説明される実施形態に従う、デジタル補足のトリガを有効化する例示的な方法３００の図である。この方法３００は、例えば、検索サーバ１５２のコンテンツクローラ１６２によって実行されて、ユーザが画像コンテンツクエリに基づいてデジタル補足にアクセスすることを可能にし得る。 FIG. 3 is an illustration of an example method 300 of enabling a trigger for digital supplementation, according to embodiments described herein. The method 300 may be performed, for example, by the content crawler 162 of the search server 152 to allow users to access digital supplements based on image content queries.

動作３０２において、デジタル補足を指定するデータが受信される。当該データは、デジタル補足およびデジタル補足が提供されるべき状況を識別し得る。デジタル補足を指定するデータは、様々な方法で受信され得る。例えば、デジタル補足を指定するデータは、デジタル補足に関するメタデータを含むウェブページのようなネットワークアクセス可能なリソースから受信され得る。デジタル補足を指定するデータはまた、例えば、検索サーバ１５２によって提供されるＡＰＩまたはフォームを介して受信され得る。デジタル補足を指定するデータはまた、メモリロケーションまたはデータストアから受信され得る。 At operation 302, data specifying digital supplementation is received. The data may identify digital supplements and situations in which digital supplements should be provided. Data specifying digital supplements may be received in a variety of ways. For example, data specifying a digital supplement may be received from a network-accessible resource such as a web page that includes metadata about the digital supplement. Data specifying digital supplements may also be received via an API or form provided by search server 152, for example. Data specifying the digital supplement may also be received from a memory location or data store.

デジタル補足に関するデータは、デジタル補足にアクセスするためにクライアントコンピューティングデバイスによって使用可能なアクセスデータを含み得る。例えば、アクセスデータは、デジタル補足のＵＲＬと、そのＵＲＬに渡されるパラメータとを含み得る。アクセスデータはまた、アプリケーションの識別子および当該アプリケーションのためのパラメータを含み得る。デジタル補足に関するデータはまた、デジタル補足に関する記述データを含み得る。当該記述データは、デジタル補足に関する情報をユーザに（例えば、ユーザがデジタル補足を選択し得るメニュー上で）提示するためにクライアントコンピューティングデバイスによって使用され得る。当該記述データは、たとえば、名前（またはタイトル）、説明、出版社の名前、および画像を含み得る。デジタル補足に関するデータはまた、補足アンカーの識別子を含み得る。 Data regarding the digital supplement may include access data usable by a client computing device to access the digital supplement. For example, the access data may include a URL of the digital supplement and parameters passed to the URL. The access data may also include an identifier of the application and parameters for the application. Data regarding the digital supplement may also include descriptive data regarding the digital supplement. The descriptive data may be used by the client computing device to present information about the digital supplement to the user (eg, on a menu from which the user may select the digital supplement). Such descriptive data may include, for example, a name (or title), a description, a publisher's name, and an image. Data regarding the digital supplement may also include an identifier of the supplement anchor.

動作３０４において、受信データに基づくデータ構造インスタンスが生成される。データ構造は、例えば、データベース内のレコードであり得る。データベースはリレーショナルデータベースであってもよく、データ構造インスタンスは、補足アンカーに関連付けられた１つまたは複数のレコードに（たとえば、外のキーを介して）リンクされ得る。 At act 304, a data structure instance is generated based on the received data. A data structure may be, for example, a record in a database. The database may be a relational database, and a data structure instance may be linked (eg, via an external key) to one or more records associated with a supplementary anchor.

動作３０６において、データ構造インスタンスを生成した後、画像コンテンツクエリによるデジタル補足の検索が可能になる。例えば、データ構造インスタンスに関連付けられたデータベースフィールドは、デジタル補足検索エンジン１６４が、当該関連付けられたデジタル補足にアクセスして返すことができるように、アクティブに設定され得る。ある実施形態では、デジタル補足のトリガは、データベースのレコードを保存またはコミットすることを含み得る。ある実施形態では、デジタル補足の検索を可能にすることは、クライアントコンピューティングデバイスによるデジタル補足のトリガを有効化することを含む。例えば、インスタンスが生成された後、デジタル補足は、検索に応答してクライアントコンピューティングデバイスに戻され、クライアントコンピューティングデバイスによってアクティブにされ又は提示され得る。 At operation 306, after generating the data structure instance, searching for the digital supplement with an image content query is enabled. For example, a database field associated with a data structure instance may be set to active so that digital supplement search engine 164 can access and return the associated digital supplement. In some embodiments, triggering a digital supplement may include saving or committing a record to a database. In some embodiments, enabling retrieval of the digital supplement includes enabling triggering of the digital supplement by the client computing device. For example, after the instance is generated, the digital supplement may be returned to, activated or presented by the client computing device in response to the search.

図４は、本明細書で説明される実施形態に従う、デジタル補足のトリガを有効化する例示的な方法４００の図である。この方法４００は、例えば、検索サーバ１５２のコンテンツクローラ１６２によって実行されて、ユーザが画像コンテンツクエリに基づいてデジタル補足にアクセスすることを可能にし得る。 FIG. 4 is an illustration of an example method 400 of enabling a trigger for digital supplementation, according to embodiments described herein. The method 400 may be performed, for example, by the content crawler 162 of the search server 152 to allow users to access digital supplements based on image content queries.

動作４０２において、ネットワークアクセス可能なリソースが分析される。ある実施形態では、ネットワークアクセス可能なリソースは、たとえば、デジタル補足サーバ１７２によって提示されるウェブページである。ある実施形態では、一組のネットワークアクセス可能なリソースが分析される。当該一組のネットワークアクセス可能なリソースは、フォームまたはＡＰＩを介した提出に基づいて生成され得る。ある実施形態では、当該一組のネットワークアクセス可能なリソースは、ＵＲＬを識別するために他のネットワークアクセス可能なリソースをクローリングすることによって生成され得る。このクローリングプロセスは、再帰的に実行され得る。 At operation 402, network accessible resources are analyzed. In some embodiments, the network-accessible resource is, for example, a web page presented by digital supplementary server 172. In some embodiments, a set of network accessible resources is analyzed. The set of network-accessible resources may be generated based on submission via a form or API. In some embodiments, the set of network-accessible resources may be generated by crawling other network-accessible resources to identify URLs. This crawling process may be performed recursively.

動作４０４において、ネットワークアクセス可能なリソース内のデジタル補足に関連付けられるメタデータが識別される。ある実施形態では、ネットワークアクセス可能なリソースは、デジタル補足に関連付けられるメタデータのインジケータを含み得る。例えば、ネットワークアクセス可能なリソースは、メタデータを含むネットワークアクセス可能なリソースの一部分を識別するタグを含み得る。当該タグは、特定のタイプまたは属性を有するＸＭＬタグであり得る。当該タグは、メタデータを含むＪＳＯＮデータ構造を含むスクリプトタグのようなＨＴＭＬタグであり得る。 At operation 404, metadata associated with the digital supplement within the network-accessible resource is identified. In some embodiments, the network-accessible resource may include an indicator of metadata associated with the digital supplement. For example, a network-accessible resource may include a tag that identifies a portion of the network-accessible resource that includes metadata. The tag may be an XML tag with specific types or attributes. The tag can be an HTML tag, such as a script tag that includes a JSON data structure that includes metadata.

動作４０６において、メタデータに基づくデジタル補足データ構造インスタンスが生成される。動作４０６は、動作３０４と同様であり得る。 At operation 406, a metadata-based digital supplementary data structure instance is generated. Act 406 may be similar to act 304.

動作４０８において、画像コンテンツクエリが受信される。画像コンテンツクエリは、例えば、クライアントコンピューティングデバイス１０２のようなクライアントコンピューティングデバイスによって送信され得る。ある実施形態では、画像コンテンツクエリは画像を含む。画像コンテンツクエリはまた、画像を記述するテキストデータを含み得る。たとえば、テキストデータは、当該クライアントコンピューティングデバイスのカメラアセンブリによって取得された画像内の補足アンカーの識別子を含み得る。ある実施形態では、画像コンテンツクエリはまた、当該クライアントコンピューティングデバイスの位置またはクライアントコンピューティングデバイスに関連付けられたユーザアカウントの識別子のような、他の情報を含む。 At operation 408, an image content query is received. Image content queries may be sent by a client computing device, such as client computing device 102, for example. In some embodiments, the image content query includes images. The image content query may also include textual data that describes the image. For example, the text data may include an identifier of a supplemental anchor within an image captured by a camera assembly of the client computing device. In some embodiments, the image content query also includes other information, such as the location of the client computing device or an identifier of a user account associated with the client computing device.

動作４１０において、複数のデジタル補足データ構造インスタンスは、画像コンテンツクエリに基づいて識別される。ある実施形態では、補足アンカーは、画像コンテンツクエリで提供される画像内で識別される。次いで、補足アンカーは、関連付けられたデジタル補足についてインデックスまたはデータベースに問い合わせるために使用され得る。ある実施形態では、クエリとともに提供される他のデータは、クライアントコンピューティングデバイスの位置またはユーザアカウントに関連付けられた情報のような、デジタル補足も識別するために使用され得る。ある実施形態では、複数の補足アンカーは、関連付けられる補足アンカーを識別するために使用され得る。 At operation 410, multiple digital supplementary data structure instances are identified based on the image content query. In some embodiments, supplemental anchors are identified within images provided in an image content query. The supplement anchor may then be used to query an index or database for the associated digital supplement. In some embodiments, other data provided with the query may also be used to identify digital supplements, such as the location of the client computing device or information associated with a user account. In some embodiments, multiple supplemental anchors may be used to identify associated supplemental anchors.

動作４１２において、複数のデジタル補足データ構造インスタンスの順序付けが、決定される。当該順序付けは、デジタル補足に関連付けられる様々なスコア、または画像コンテンツクエリに対するデジタル補足の関連性に基づき得る。ある実施形態では、画像コンテンツクエリに対するデジタル補足の関連性に対応する関連性スコアは、複数のデジタル補足データ構造インスタンスを順序付けるために使用され得る
関連性スコアは、デジタル補足の内容、当該デジタル補足にリンクするネットワークアクセス可能なリソース（または、デジタル補足に関連付けられたネットワークアクセス可能なリソース）のコンテンツ、リンクテキスト、または、他のネットワークアクセス可能なリソース上のデジタル補足へのリンクの近くのコンテンツのうちの１つまたは複数のような、複数の因子から決定され得る。 At operation 412, an ordering of the plurality of digital supplementary data structure instances is determined. The ordering may be based on various scores associated with the digital supplements or the relevance of the digital supplements to the image content query. In some embodiments, a relevance score corresponding to the relevance of a digital supplement to an image content query may be used to order multiple digital supplement data structure instances. Content on a Network-Accessible Resource (or a Network-Accessible Resource associated with a Digital Supplement) that links to, link text, or nearby content that links to a Digital Supplement on other Network-Accessible Resources. may be determined from multiple factors, such as one or more of the following:

スコアはまた、人気メトリックに基づき得る。名声メトリックは、人気メトリックの一例である。当該名声メトリックは、何個の他のネットワークリソースがデジタル補足にリンクするかと、それらの他のネットワークアクセス可能なリソースの名声スコアとの組み合わせに基づき得る。ある実施形態では、人気スコアは、デジタルリソースがどのくらい頻繁に選択されるか又は選択されたかに基づき得る。ある実施形態では、人気スコアは、デジタルリソースが画像コンテンツクエリのためにどのくらい頻繁に選択されるかに対応し得る。 Scores may also be based on popularity metrics. A reputation metric is an example of a popularity metric. The prestige metric may be based on a combination of how many other network resources link to the digital supplement and the prestige scores of those other network-accessible resources. In some embodiments, the popularity score may be based on how often the digital resource is or has been selected. In some embodiments, the popularity score may correspond to how often a digital resource is selected for image content queries.

スコアは、決定されてもよく、又は、データストアもしくはＡＰＩから取り出され得る。ある実施形態では、ＡＰＩは、デジタル補足のスコアを取り出すためにアクセスされる。例えば、スコアは、補足アンカーに基づく検索用語に関してデジタルリソースの関連性および／または人気を決定した検索エンジンから検索され得る。 The score may be determined or retrieved from a data store or API. In some embodiments, the API is accessed to retrieve the score of the digital supplement. For example, scores may be retrieved from search engines that have determined the relevance and/or popularity of digital resources with respect to search terms based on supplementary anchors.

複数のデジタル補足データ構造はまた、特定のユーザ（例えば、クライアントコンピューティングデバイスのユーザ）による使用頻度または当該特定のユーザによる最近の使用（recency）に基づいて順序付けされ得る。ある実施形態では、複数のデジタル補足データ構造は、ランダムに順序付けられる。 The plurality of digital supplementary data structures may also be ordered based on frequency of use or recency of use by a particular user (eg, a user of a client computing device). In some embodiments, the plurality of digital supplementary data structures are randomly ordered.

動作４１４において、画像コンテンツクエリは、複数のデジタル補足データ構造インスタンスに基づいて応答される。例えば、複数のデジタル補足データ構造インスタンスに関連付けられる情報は、動作４１２において決定された順序でクライアントコンピューティングデバイスに送信され得る。ある実施形態では、当該情報は、デジタル補足のユーザ選択を受信するように構成されたメニューまたは別のタイプのユーザインターフェイス内に示され得る記述データを含む。当該情報はまた、デジタル補足にアクセスまたは提示するためにクライアントコンピューティングデバイスによって使用され得るアクセスデータを含み得る。 At operation 414, the image content query is answered based on a plurality of digital supplementary data structure instances. For example, information associated with multiple digital supplementary data structure instances may be sent to the client computing device in the order determined in operation 412. In some embodiments, the information includes descriptive data that may be presented within a menu or other type of user interface configured to receive user selections of digital supplements. The information may also include access data that may be used by the client computing device to access or present the digital supplement.

図５は、本明細書で説明される実施形態に従う、デジタル補足を検索し提示する例示的な方法５００の図である。この方法５００は、画像コンテンツクエリに基づいてデジタル補足を識別して当該デジタル補足にアクセスするために、例えば、クライアントコンピューティングデバイス１０２のアプリケーション１２２によって実行され得る。 FIG. 5 is an illustration of an example method 500 for searching and presenting digital supplements, according to embodiments described herein. The method 500 may be performed, for example, by the application 122 of the client computing device 102 to identify and access digital supplements based on an image content query.

動作５０２において、画像に基づく画像コンテンツクエリは、サーバコンピューティングデバイス（例えば、検索サーバ１５２）に送信される。例えば、画像は、クライアントコンピューティングデバイス１０２のカメラアセンブリ１１２で取得され得る。当該画像はまた、カメラアセンブリ１１２によって以前に取得された画像のような記憶された画像であり得る。ある実施形態では、画像コンテンツクエリは、画像のみを含む。ある実施形態では、画像コンテンツクエリは、追加情報を含む。たとえば、画像コンテンツクエリは、クライアントコンピューティングデバイス１０２の位置またはクライアントコンピューティングデバイス１０２のユーザに関連付けられたアカウントの識別子のような情報を含み得る。アプリケーション１２２はまた、（例えば、補足アンカー識別エンジン１２４を用いて）画像内のアンカーを識別し得る。画像コンテンツクエリは、識別されたアンカーの識別子（例えば、テキスト、数字、または他の種類の識別子）を含み得る。少なくともある実施形態では、画像コンテンツクエリは、画像を含まない。 At act 502, an image-based image content query is sent to a server computing device (eg, search server 152). For example, images may be captured with camera assembly 112 of client computing device 102. The image may also be a stored image, such as an image previously captured by camera assembly 112. In some embodiments, the image content query includes only images. In some embodiments, the image content query includes additional information. For example, the image content query may include information such as the location of client computing device 102 or an identifier of an account associated with a user of client computing device 102. Application 122 may also identify anchors within the image (eg, using supplemental anchor identification engine 124). The image content query may include an identifier (eg, text, number, or other type of identifier) of the identified anchor. In at least some embodiments, the image content query does not include images.

ある実施形態では、画像コンテンツクエリをサーバに送信することは、ＡＰＩを呼び出すことを含む。ある実施形態では、画像コンテンツクエリをサーバに送信することは、サーバによって提供されるＡＰＩを呼び出すことを含む。ある実施形態では、画像コンテンツクエリをサーバに送信することは、ＨＴＴＰプロトコルを使用して（例えば、ＧＥＴリクエストまたはＰＯＳＴリクエストを提出して）フォームを提出することを含む。 In some embodiments, sending the image content query to the server includes calling an API. In some embodiments, sending the image content query to the server includes calling an API provided by the server. In some embodiments, sending the image content query to the server includes submitting a form using the HTTP protocol (eg, by submitting a GET or POST request).

動作５０４において、デジタル補足を識別する画像コンテンツクエリに対する応答が受信される。当該応答は、検索サーバ１５２からネットワーク１９０を介して受信され得る。応答は、検索サーバ１５２によって画像コンテンツクエリに基づいて識別された１つまたは複数のデジタル補足を含み得る。例えば、応答は、デジタル補足に関連付けられたデータのアレイを含み得る。ある実施形態では、デジタル補足に関連付けられたデータは、ユーザが選択するためのデジタル補足オプションを提示するために使用され得る記述データを含み得る。例えば、記述データは、名前、短い説明、発行者名、および画像を含み得る。データはまた、ＵＲＬを介した要求に含める当該ＵＲＬおよびパラメータ、またはアプリケーション名および関連パラメータのようなアクセスデータを含み得る。当該データはまた、（例えば、補足アンカーが検索サーバ１５２によって識別される場合）画像コンテンツクエリで送信される画像における補足アンカーの位置、座標、または寸法を含み得る。 At operation 504, a response to an image content query that identifies a digital supplement is received. The response may be received from search server 152 via network 190 . The response may include one or more digital supplements identified by search server 152 based on the image content query. For example, the response may include an array of data associated with the digital supplement. In some embodiments, data associated with a digital supplement may include descriptive data that may be used to present digital supplement options for a user to select. For example, descriptive data may include a name, short description, publisher name, and image. The data may also include access data such as the URL and parameters to include in the request via the URL, or the application name and associated parameters. The data may also include the location, coordinates, or dimensions of the supplemental anchor in the image sent in the image content query (eg, if the supplemental anchor is identified by search server 152).

動作５０６では、デジタル補足に関連付けられる情報を含むユーザインターフェイス画面が表示される。ある実施形態では、ユーザインターフェイス画面は、（たとえば、提供された座標に基づいて）識別された補足アンカーをオーバーレイする注釈を含む。注釈は、識別された補足アンカーに関連付けられた画像内のオブジェクトに関する情報を提供し得る。当該注釈は、デジタル補足を提示または起動するように作動され得る、ユーザ作動可能コントロールを含み得る。ユーザインターフェイス画面はまた、動作５０４において受信された応答において識別される複数のデジタル補足から選択するために使用され得るデジタル補足選択パネルを含み得る。ある実施形態では、ユーザインターフェイス画面は、デジタル補足によって指定されたＵＲＬを開くウェブブラウザによって生成され得る。ユーザインターフェイス画面はまた、デジタル補足を提供するために起動される別のアプリケーションによって生成され得る。 At operation 506, a user interface screen is displayed that includes information associated with the digital supplement. In some embodiments, the user interface screen includes annotations that overlay identified supplemental anchors (eg, based on the provided coordinates). The annotation may provide information about objects within the image associated with the identified supplemental anchor. The annotation may include user actuatable controls that may be activated to present or launch a digital supplement. The user interface screen may also include a digital supplement selection panel that may be used to select from multiple digital supplements identified in the response received in operation 504. In some embodiments, the user interface screen may be generated by a web browser opening a URL specified by the digital supplement. User interface screens may also be generated by another application launched to provide digital supplements.

図６は、本明細書で説明される実施形態に従う、画像に基づいてデジタル補足を識別および提示する例示的な方法６００の図である。この方法６００は、画像コンテンツクエリに基づいてデジタル補足を識別して当該デジタル補足にアクセスするために、例えば、クライアントコンピューティングデバイス１０２のアプリケーション１２２によって実行され得る。 FIG. 6 is an illustration of an example method 600 for identifying and presenting digital supplements based on images, according to embodiments described herein. The method 600 may be performed, for example, by the application 122 of the client computing device 102 to identify and access digital supplements based on an image content query.

動作６０２において、画像が取得される。例えば、画像は、クライアントコンピューティングデバイス１０２のカメラアセンブリ１１２によって取得され得る。ある実施形態では、一連の画像（すなわち、映像）が、カメラアセンブリ１１２によって取得され得る。 At operation 602, an image is acquired. For example, images may be captured by camera assembly 112 of client computing device 102. In some embodiments, a series of images (ie, video) may be captured by camera assembly 112.

動作６０４において、画像に基づく画像コンテンツクエリは、検索サーバ１５２のようなサーバコンピューティングデバイスに送信される。動作６０４は、動作５０２と同様であり得る。画像のシーケンスが取得される実施形態では、画像コンテンツクエリは、複数の画像または画像のシーケンスを含み得る。ある実施形態では、画像のシーケンスは、当該サーバコンピューティングデバイスにストリーミングされ得る。 At operation 604, the image-based image content query is sent to a server computing device, such as search server 152. Act 604 may be similar to act 502. In embodiments where a sequence of images is obtained, the image content query may include multiple images or a sequence of images. In some embodiments, the sequence of images may be streamed to the server computing device.

動作６０６において、複数のデジタル補足を識別する画像コンテンツクエリに対する応答が受信される。動作６０６は、前述の動作５０４と同様であり得る。 At operation 606, a response to an image content query that identifies a plurality of digital supplements is received. Act 606 may be similar to act 504 described above.

動作６０８において、複数のデジタル補足から一つのデジタル補足を選択するためのユーザ作動可能コントロールを含むユーザインターフェイス画面が表示される。例えば、デジタル補足選択パネルが表示され得る。デジタル補足選択パネルは、応答において識別される複数のデジタル補足のうちの１つに各々が関連付けられる複数のユーザ作動可能コントロールを含み得る。デジタル補足選択は、サーバコンピューティングデバイスによって提供されるデジタル補足の順序付けまたはランキングに基づいて、ユーザ作動可能な制御を準備し得る。デジタル補足選択パネルは、ユーザが作動可能コントロールを垂直に、水平に、またはその他の方法で配置し得る。ユーザ作動可能コントロールは、ユーザがデジタル補足を選択するかどうかを決定する際に考慮できる、関連付けられたデジタル補足に関する情報に関連付けられるか、または当該情報を含み得る。例えば、表示される情報は、デジタル補足の名前、説明、画像、および発行者名のうちの１つまたは複数を含み得る。 At operation 608, a user interface screen is displayed that includes user actuatable controls for selecting a digital supplement from a plurality of digital supplements. For example, a digital supplemental selection panel may be displayed. The digital supplement selection panel may include a plurality of user actuatable controls each associated with one of a plurality of digital supplements identified in the response. Digital supplement selection may provide user actuatable controls based on the ordering or ranking of digital supplements provided by the server computing device. The digital supplemental selection panel may arrange the user actuatable controls vertically, horizontally, or otherwise. The user actuatable controls may be associated with or include information about the associated digital supplement that the user may consider in determining whether to select the digital supplement. For example, the displayed information may include one or more of the digital supplement's name, description, image, and publisher name.

動作６１０において、デジタル補足を選択するためのユーザ入力が受け取られる。ユーザ入力は、マウスまたは他のデバイスを用いたクリックであり得る。ユーザ入力はまた、スタイラスまたは指からのタッチ入力であり得る。ユーザ入力の別の例は、略タッチ入力（例えば、指またはポインティングデバイスを画面に近接して保持すること）である。ある実施形態では、ユーザ入力はまた、手のジェスチャ、頭の動き、目での動作、または発話入力を含み得る。 At operation 610, user input is received to select a digital supplement. User input may be a click with a mouse or other device. User input may also be touch input from a stylus or finger. Another example of user input is near-touch input (eg, holding a finger or pointing device close to the screen). In some embodiments, user input may also include hand gestures, head movements, eye movements, or speech input.

動作６１２において、情報は、選択されたデジタル補足に関連付けられたリソースに提供される。例えば、クライアントコンピューティングデバイスのユーザについての情報は、（情報を提供する許可が提供されている場合）デジタル補足を提供するサーバに送信され得る。情報はまた、デジタル補足を提供するアプリケーションに提供され得る。様々な種類の情報が提供され得る。例えば、情報は、ユーザ名、ユーザの嗜好、または場所のようなユーザ情報を含み得る。 At operation 612, information is provided to resources associated with the selected digital supplement. For example, information about a user of a client computing device may be sent to a server that provides digital supplements (if permission to provide the information is provided). Information may also be provided to applications that provide digital supplements. Various types of information may be provided. For example, the information may include user information such as a user name, user preferences, or location.

情報はまた、画像または画像のシーケンスのような画像コンテンツクエリに関連付けられる情報を含み得る。情報はまた、画像内の１つまたは複数の補足アンカーの識別子および／または位置を含み得る。この情報は、デジタル補足をユーザに提供するために使用され得る。例えば、デジタル補足のＡＲコンテンツは、画像に基づいて形成されて位置決めされ得る。 The information may also include information associated with an image content query, such as an image or sequence of images. The information may also include an identifier and/or location of one or more supplemental anchors within the image. This information may be used to provide digital supplements to the user. For example, digital supplementary AR content may be created and positioned based on the image.

情報は、クライアントコンピューティングデバイス１０２によって、デジタル補足に関連付けられるリソース（例えば、デジタル補足サーバ１７２）に直接送信され得る。ある実施形態では、情報は、検索サーバ１５２によって、デジタル補足に関連付けられているリソースに提供される（例えば、クライアントコンピューティングデバイスは、それほど多くのデータを送信する必要がない）。これらの実施のうちの少なくともいくつかでは、クライアントコンピューティングデバイス１０２は、選択されたデジタル補足を識別する選択情報を検索サーバ１５２に送信し得る。選択を受信し、ユーザが情報の共有を許可したことを確認すると、検索サーバ１５２は、デジタル補足を提供するリソースに情報を送信し得る。クライアントコンピューティングデバイス１０２はまた、情報を共有できるようにユーザに促し得る。ある実施形態では、検索サーバ１５２は、デジタル補足データ構造インスタンス（これは、デジタル補足に関連付けられるメタデータに基づくことができる）に基づいてリソースに送信する情報を決定し得る。 Information may be sent by client computing device 102 directly to a resource associated with the digital supplement (eg, digital supplement server 172). In some embodiments, information is provided by search server 152 to resources associated with the digital supplement (eg, the client computing device does not need to send as much data). In at least some of these implementations, client computing device 102 may transmit selection information identifying the selected digital supplement to search server 152. Upon receiving the selection and confirming that the user has authorized sharing of the information, search server 152 may send the information to the resource that provides the digital supplement. Client computing device 102 may also prompt users to share information. In some embodiments, search server 152 may determine information to send to a resource based on a digital supplement data structure instance, which may be based on metadata associated with the digital supplement.

動作６１４において、ユーザインターフェイスは、選択されたデジタル補足に基づいて更新される。動作６１４は、動作５０６と同様であり得る。 At operation 614, the user interface is updated based on the selected digital supplement. Act 614 may be similar to act 506.

図７Ａ～７Ｃは、画像コンテンツ検索を行なってデジタル補足を表示するためにクライアントコンピューティングデバイス１０２の実施形態によって表示されるユーザインターフェイス画面の概略図である。図７Ａには、ユーザインターフェイス画面７００ａが示されている。ユーザインターフェイス画面７００ａは、画像表示パネル７０８および情報パネル７３０を含む。この例では、画像表示パネル７０８は、（例えば、店内で見つけるかもしれない）ワインボトルで一杯の棚の画像を表示している。画像表示パネル７０８はまた、インジケータ７４０およびインジケータ７４２を含む。これらのインジケータの各々は、当該インジケータの下の画像に示されるワインボトルが補足アンカー（例えば、この場合、認識された製品として）として認識されていることを示す。インジケータ７４０およびインジケータ７４２は、ユーザ作動可能コントロールの例である。情報パネル７３０内では、「興味があるものをタップして下さい」という指示が提供される。 7A-7C are schematic illustrations of user interface screens displayed by an embodiment of client computing device 102 to perform image content searches and display digital supplements. FIG. 7A shows a user interface screen 700a. User interface screen 700a includes an image display panel 708 and an information panel 730. In this example, image display panel 708 is displaying an image of a shelf full of wine bottles (such as those that might be found in a store). Image display panel 708 also includes an indicator 740 and an indicator 742. Each of these indicators indicates that the wine bottle shown in the image below the indicator is recognized as a complementary anchor (eg, in this case as a recognized product). Indicator 740 and indicator 742 are examples of user actuatable controls. Within the information panel 730, instructions are provided to "tap on what interests you."

図７Ｂでは、ユーザインターフェイス画面７００ｂが、ユーザがインジケータ７４０を作動させた後に示される。作動後、デジタル補足からの注釈７４４が表示される。注釈７４４は、ユーザが購入するワインのボトルを選択するのに役立ち得る、ワインの格付けに関する情報を含む。 In FIG. 7B, user interface screen 700b is shown after the user activates indicator 740. After activation, annotations 744 from the digital supplement are displayed. Annotations 744 include information regarding wine ratings that may assist the user in selecting a bottle of wine to purchase.

図７Ｃでは、別のユーザインターフェイス画面７００ｃは、ユーザがインジケータ７４０を作動させた後に示される。ユーザインターフェイス画面７００ｃは、（例えば、注釈７４４の作動後、またはユーザが図７Ｂの情報パネル７３０上でスワイプアップする場合）、ユーザインターフェイス画面７００ｂが表示されることの代わりに又はそれに加えて、示される。図７Ｃには、拡張された情報パネル７３２が示されている。拡張された情報パネル７３２は、図７Ａおよび図７Ｂで取り上げられた情報パネル７３０よりも多くのユーザインターフェイス画面７００ｃを占める。 In FIG. 7C, another user interface screen 700c is shown after the user activates indicator 740. User interface screen 700c may be shown instead of or in addition to user interface screen 700b being displayed (e.g., after actuation of annotation 744 or when the user swipes up on information panel 730 of FIG. 7B). It will be done. An expanded information panel 732 is shown in FIG. 7C. Expanded information panel 732 occupies more user interface screen 700c than information panel 730 featured in FIGS. 7A and 7B.

拡張情報パネル７３２は、デジタル補足選択パネル７１０およびデジタル補足コンテンツ表示パネル７３４を含む。デジタル補足選択パネル７１０は、ユーザ作動可能コントロール７１２と、ユーザ作動可能コントロール７１４と、（部分的にしか見えない）ユーザ作動可能コントロール７１６とを含む。ある実施形態では、ユーザがデジタル補足選択パネル７１０上でスワイプすると、追加のユーザ作動可能コントロールが表示され得る。デジタル補足選択パネル７１０のユーザ作動可能コントロールは、ランク付けされた順序で配列され得る。ユーザ作動可能コントロール７１２は、食事ペアリングのためのデジタル補足に関連付けられる。ユーザ作動可能コントロール７１２が作動すると、選択されたワインのための食べ物および食事のペアリング情報を表示するデジタル補足が表示され得る。ユーザ作動可能コントロール７１４は、写真を保存するデジタル補足に関連付けられている。作動すると、写真を保存するアプリケーションが作動させられて画像が提供され得る。追加の情報は、識別された補足アンカーのような写真と共に保存され得る。 Extended information panel 732 includes a digital supplement selection panel 710 and a digital supplement content display panel 734. Digital supplementary selection panel 710 includes user-actuatable controls 712, user-actuatable controls 714, and (only partially visible) user-actuatable controls 716. In some embodiments, when a user swipes on digital supplemental selection panel 710, additional user-actuatable controls may be displayed. The user actuatable controls of digital supplementary selection panel 710 may be arranged in a ranked order. User actuatable controls 712 are associated with digital supplements for meal pairing. When user actuatable control 712 is activated, a digital supplement may be displayed that displays food and meal pairing information for the selected wine. User actuatable controls 714 are associated with a digital supplement that stores photos. When activated, a photo storage application may be activated to provide the image. Additional information may be saved with the photo, such as identified supplemental anchors.

デジタル補足コンテンツ表示パネル７３４は、デジタル補足からのコンテンツを表示し得る。デジタル補足コンテンツ表示パネル７３４は、デフォルトのデジタル補足を、または、識別された補足アンカーに関連付けられている最高ランクのデジタル補足を表示し得る。この例では、デジタル補足コンテンツ表示パネル７３４は、選択された補足アンカーに関連付けられている製品に関する製品情報を含む。この場合、ワイン名、レーティング、原産地、画像、およびコメントが提供される。 Digital supplement content display panel 734 may display content from the digital supplement. Digital supplement content display panel 734 may display the default digital supplement or the highest ranking digital supplement associated with the identified supplement anchor. In this example, digital supplemental content display panel 734 includes product information about the product associated with the selected supplemental anchor. In this case, the wine name, rating, country of origin, image, and comments are provided.

図８Ａ～８Ｃは、画像コンテンツ検索を行なってデジタル補足を表示するためにクライアントコンピューティングデバイス１０２の実施形態によって表示されるユーザインターフェイス画面の概略図である。この例では、画像コンテンツ検索は、レシートの画像に基づく。 8A-8C are schematic illustrations of user interface screens displayed by an embodiment of client computing device 102 to perform image content searches and display digital supplements. In this example, the image content search is based on images of receipts.

図８Ａには、ユーザインターフェイス画面８００ａが示されている。ユーザインターフェイス画面８００ａは、画像表示パネル８０８および情報パネル８３０を含む。この例では、画像表示パネル８０８は、レストランからのレシートの画像を表示している。画像表示パネル８０８はまた、インジケータ８４０と、インジケータ８４２と、注釈８４４と、ハイライトオーバーレイ８４６とを含む。この場合、インジケータ８４０は、文書として当該レシートに関連付けられており、インジケータ８４２は、当該レシートに名付けられた特定のレストランに関連付けられている。識別されたレシート文書および識別されたレストラン名は、両方とも補足アンカーの例である。 FIG. 8A shows a user interface screen 800a. User interface screen 800a includes an image display panel 808 and an information panel 830. In this example, image display panel 808 is displaying an image of a receipt from a restaurant. Image display panel 808 also includes indicators 840, indicators 842, annotations 844, and highlight overlays 846. In this case, indicator 840 is associated with the receipt as a document, and indicator 842 is associated with the particular restaurant named on the receipt. The identified receipt document and the identified restaurant name are both examples of supplemental anchors.

注釈８４４は、先端計算器を提供するデジタル補足に関連付けられている。この例では、例示的な先端計算器が注釈８４４に含まれ、画像表示パネル８０８上の適切な位置に重ねられる。ある実施形態では、デジタル補足は、デフォルトによって選択されてもよく、適切な補足アンカーを識別すると表示され得る。ハイライトオーバーレイ８４６は、先端計算器のデジタル補足によって使用される情報を含むレシート文書の一部分上にオーバーレイされる。 Annotations 844 are associated with digital supplements that provide advanced calculators. In this example, an exemplary tip calculator is included in annotation 844 and superimposed on the image display panel 808 at the appropriate location. In some embodiments, the digital supplement may be selected by default and displayed upon identifying the appropriate supplement anchor. A highlight overlay 846 is overlaid on a portion of the receipt document containing information used by the digital supplement of the advanced calculator.

この例では、情報パネル８３０に表示される項目は、インジケータ８４０が作動されたかのように、文書としてのレシートに関連する。ある実施形態では、識別された補足アンカーは、たとえば、ユーザの過去の動作、同様の画像に対する他のユーザの動作、補足アンカーの信頼スコア、または、補足アンカーが関連する画像の部分の位置またはサイズに基づく、ユーザの可能性のある関連性または関心に基づいて、ランク付けされる。次いで、情報パネル８３０は、少なくともある実施形態では、最も高くランク付けされた補足アンカーに関連する項目を表示し得る。もし、代わりに、インジケータ８４２が作動された場合、情報パネル８３０は、特定のレストランに関する項目を含み得る。 In this example, the items displayed on information panel 830 relate to the receipt as a document, as if indicator 840 had been activated. In some embodiments, the identified supplementary anchor may be based on, for example, the user's past behavior, the behavior of other users on similar images, the confidence score of the supplementary anchor, or the location or size of the portion of the image to which the supplemental anchor relates. Based on the user's likely relevance or interest. Information panel 830 may then display items related to the highest ranked supplemental anchor, at least in some embodiments. If indicator 842 is activated instead, information panel 830 may include items regarding the particular restaurant.

ここで、情報パネル８３０は、デジタル補足選択パネル８１０を含む。当該デジタル補足パネルは、ユーザ作動可能コントロール８１２と、ユーザ作動可能コントロール８１４と、ユーザ作動可能コントロール８１６とを含む。この例では、ユーザ作動可能コントロール８１２は先端計算器のデジタル補足に関連付けられており、ユーザ作動可能コントロール８１４はチェック分割デジタル補足に関連付けられており、ユーザ作動可能コントロール８１６は経費報告デジタル補足に関連付けられている。例えば、ユーザ作動可能コントロール８１２が作動すると、先端計算器のパラメータを調整するためのユーザインターフェイスコントロールは、（例えば、パーセンテージを調整するために）表示され得る。 Here, information panel 830 includes digital supplement selection panel 810. The digital supplement panel includes user-actuatable controls 812 , user-actuatable controls 814 , and user-actuatable controls 816 . In this example, user-actuatable controls 812 are associated with the Tip Calculator Digital Supplement, user-activated controls 814 are associated with the Check Split Digital Supplement, and user-activable controls 816 are associated with the Expense Report Digital Supplement. It is being For example, when user-actuatable control 812 is activated, user interface controls for adjusting parameters of the tip calculator (eg, to adjust percentages) may be displayed.

図８Ｂでは、ユーザインターフェイス画面８００ｂは、ユーザがユーザ作動可能コントロール８１４を作動させた後に示されている。作動後、ユーザが請求書をどのように分割するかを計算するのに役立つ項目を含む拡張情報パネル８３２が、示される。例えば、請求書を分割する人数は、それぞれが支払うべき額を決定するために入力され得る。 In FIG. 8B, user interface screen 800b is shown after the user has activated user-actuatable controls 814. In FIG. After activation, an expanded information panel 832 is shown that includes items to help the user calculate how to split the bill. For example, the number of people splitting the bill can be entered to determine the amount each should pay.

図８Ｃでは、ユーザインターフェイス画面８００ｃは、ユーザがユーザ作動可能コントロール８１６を作動させた後に示される。作動後、ユーザが経費報告書にレシートを記憶するのに役立つ項目を含む拡張情報パネル８３４が示される。例えば、ユーザは、レシートが関連付けられるべき経費報告（例えば、「シドニー旅行２０１８」）を選択できる。一度当該経費報告が選択されると、レシートの画像は経費報告提出または管理システムにアップロードされ得る。ある実施形態では、画像表示パネル８０８に表示される全画像がアップロードされる。ある実施形態では、画像の一部分がアップロードされる（例えば、画像は、レシートのみを含むようにトリミングされる）。 In FIG. 8C, user interface screen 800c is shown after the user activates user-actuatable controls 816. After activation, an expanded information panel 834 is shown that includes items to help the user store receipts in an expense report. For example, a user can select an expense report (eg, "Sydney Trip 2018") with which the receipt should be associated. Once the expense report is selected, an image of the receipt may be uploaded to the expense report submission or management system. In some embodiments, all images displayed on image display panel 808 are uploaded. In some embodiments, a portion of the image is uploaded (eg, the image is cropped to include only the receipt).

図９Ａおよび図９Ｂは、画像コンテンツ検索を行なってデジタル補足を表示するためにクライアントコンピューティングデバイス１０２の実施形態によって表示されるユーザインターフェイス画面の概略図である。この例では、画像コンテンツ検索は、顔の画像に基づく。 9A and 9B are schematic illustrations of user interface screens displayed by an embodiment of client computing device 102 to perform image content searches and display digital supplements. In this example, the image content search is based on images of faces.

図９Ａには、ユーザインターフェイス画面９００ａが示されている。ユーザインターフェイス画面９００ａは、画像表示パネル９０８および情報パネル９３０を含む。この例では、画像表示パネル９０８は、顔の画像を表示している。ここで、当該顔は、補足アンカーの一例である。情報パネル９３０は、画像（すなわち顔）内の補足アンカーについて識別されたデジタル補足のためのユーザ作動可能コントロール９１２を含む。ユーザ作動可能コントロール９１２は、眼鏡に結び付けるためのデジタル補足に関連付けられている。 FIG. 9A shows a user interface screen 900a. User interface screen 900a includes an image display panel 908 and an information panel 930. In this example, image display panel 908 is displaying an image of a face. Here, the face is an example of a supplementary anchor. Information panel 930 includes user-actuatable controls 912 for digital supplementation identified for supplemental anchors within the image (ie, faces). User actuatable controls 912 are associated with a digital supplement for coupling to the glasses.

図９Ｂでは、ユーザインターフェイス画面９００ｂは、ユーザがユーザ作動可能コントロール９１２を作動させた後に示される。作動後、ユーザが画像内の顔上の眼鏡を視覚的に試すのを助けるための項目を含む拡張情報パネル９３２が示される。ここでは、複数の眼鏡スタイルが表示されており、ユーザは試行するペアを選択できる。一対の眼鏡を選択すると、ＡＲコンテンツ９６０は、画像表示パネル９０８に重ねられる。ここで、ＡＲコンテンツ９６０は、選択された眼鏡に対応し、画像内の顔に合致する大きさである。ある実施形態では、眼鏡を試着するためのデジタル補足が選択されると、画像表示パネル９０８に示される画像は、デジタル補足を提供するサーバに送信され、それにより、当該画像は、ＡＲコンテンツ９６０をどこに、かつ、どのように配置して、どのような大きさとするべきか、または、眼鏡を試着するように推奨すべきかを決定するために、分析され得る。 In FIG. 9B, user interface screen 900b is shown after the user activates user-actuatable controls 912. In FIG. After activation, an expanded information panel 932 is shown that includes items to help the user visually try out the glasses on the face in the image. Here, multiple eyeglass styles are displayed and the user can select a pair to try. When a pair of glasses is selected, AR content 960 is superimposed on image display panel 908. Here, the AR content 960 corresponds to the selected glasses and is sized to match the face in the image. In some embodiments, when a digital supplement for trying on glasses is selected, the image shown on the image display panel 908 is sent to a server that provides the digital supplement, such that the image includes AR content 960. It can be analyzed to determine where and how to place it and what size it should be or whether it should be recommended to try on the glasses.

図１０Ａ～１０Ｃは、画像コンテンツ検索を行なってデジタル補足を表示するためにクライアントコンピューティングデバイス１０２の実施形態によって表示されるユーザインターフェイス画面の概略図である。この例では、画像コンテンツ検索は、カタログ内の家具の画像に基づく。 10A-10C are schematic illustrations of user interface screens displayed by an embodiment of client computing device 102 to perform image content searches and display digital supplements. In this example, the image content search is based on images of furniture in the catalog.

図１０Ａには、ユーザインターフェイス画面１０００ａが示されている。ユーザインターフェイス画面１０００ａは、画像表示パネル１００８を含む。この例では、画像表示パネル１００８は、家具カタログのページの一部の画像を表示している。画像表示パネルはまた、インジケータ１０４０と、インジケータ１０４２と、インジケータ１０４４とを含む。この例では、インジケータ１０４０はベッドに関連付けられており、インジケータ１０４２は装飾品に関連付けられており、インジケータ１０４４はラグに関連付けられている。カタログにおけるベッド、装飾品、およびラグの画像は、補足アンカーの例である。 FIG. 10A shows a user interface screen 1000a. User interface screen 1000a includes an image display panel 1008. In this example, the image display panel 1008 is displaying an image of a portion of a page of a furniture catalog. The image display panel also includes an indicator 1040, an indicator 1042, and an indicator 1044. In this example, indicator 1040 is associated with the bed, indicator 1042 is associated with the ornament, and indicator 1044 is associated with the rug. Images of beds, decorations, and rugs in catalogs are examples of supplemental anchors.

図１０Ｂでは、ユーザインターフェイス画面１０００ｂは、（例えば、インジケータ１０４０が表示される場所またはその近くで画面に触れることによって）ユーザがインジケータ１０４０を選択した後に、示される。ユーザインターフェイス画面１０００ｂは、デジタル補足選択パネル１０１０および情報パネル１０３０を含む。情報パネル１０３０は、選択されたインジケータに関連付けられている補足アンカーに関する情報（例えば、製品名、説明、画像）を含む。 In FIG. 10B, user interface screen 1000b is shown after the user selects indicator 1040 (eg, by touching the screen at or near where indicator 1040 is displayed). User interface screen 1000b includes a digital supplement selection panel 1010 and an information panel 1030. Information panel 1030 includes information about the supplemental anchor associated with the selected indicator (eg, product name, description, image).

デジタル補足選択パネル１０１０は、ユーザ作動可能コントロール１０１２およびユーザ作動可能コントロール１０１４を含む。ユーザ作動可能コントロール１０１２は、家庭内ビューを提供するデジタル補足に関連付けられている。ユーザ作動可能コントロール１０１４は、別のデジタル補足（例えば、ソーシャルメディアサイトに投稿するためのデジタル補足）に関連付けられている。 Digital supplementary selection panel 1010 includes user-actuatable controls 1012 and user-actuatable controls 1014. User actuatable controls 1012 are associated with digital supplements that provide an in-home view. User actuatable controls 1014 are associated with another digital supplement (eg, a digital supplement for posting to a social media site).

図１０Ｃでは、ユーザインターフェイス画面１０００ｃは、ユーザ作動可能コントロール１０１２の作動後に示される。ユーザインターフェイス画面１０００ｃは、画像表示パネル１００８と、デジタル補足選択パネル１０１０と、縮小情報パネル１０３２とを含む。縮小情報パネル１０３２は、作動されると、情報パネルをポップアップさせて表示させられ得るユーザ作動可能コントロールを含み得る。 In FIG. 10C, user interface screen 1000c is shown after actuation of user actuatable controls 1012. In FIG. User interface screen 1000c includes an image display panel 1008, a digital supplement selection panel 1010, and a reduced information panel 1032. The reduced information panel 1032 may include user actuatable controls that, when activated, can cause the information panel to pop up and display.

ここで、画像表示パネル１００８は、部屋の画像を表示し、ＡＲコンテンツ１０６０を含む。ＡＲコンテンツ１０６０は、画像パネル上に重ねられたインジケータ１０４０に関連付けられているベッドの３Ｄモデルを含む。ユーザは、ベッドが室内にどのように適合するかを見るために、室内のＡＲコンテンツ１０６０の位置を調節し得る。ある実施形態では、家庭内ビューのためのデジタル補足が選択されると、画像表示パネル１００８に示される画像は、デジタル補足を提供するサーバに送信され、それにより、当該画像は、ＡＲコンテンツ１０６０をどこに、かつ、どのように位置付けて、どのような大きさとするべきかを決定するために分析され得る。ある実施形態では、ＡＲコンテンツ１０６０は、画像コンテンツクエリよりも後に提供され得る。 Here, the image display panel 1008 displays an image of the room and includes AR content 1060. AR content 1060 includes a 3D model of a bed associated with indicator 1040 overlaid on the image panel. The user may adjust the position of the AR content 1060 in the room to see how the bed fits into the room. In some embodiments, when a digital supplement for an in-home view is selected, the image shown on the image display panel 1008 is sent to a server that provides the digital supplement, such that the image includes AR content 1060. It can be analyzed to determine where and how it should be located and what size it should be. In some embodiments, AR content 1060 may be provided after the image content query.

図１１Ａ～１１Ｃは、店内で様々な画像コンテンツ検索を行うためにクライアントコンピューティングデバイス１０２の実施形態によって表示されるユーザインターフェイス画面の概略図である。この例では、画像コンテンツ検索は、店内で取得された商品の画像に基づく。 11A-11C are schematic diagrams of user interface screens displayed by an embodiment of client computing device 102 to perform various image content searches within a store. In this example, the image content search is based on images of products obtained within the store.

図１１Ａには、ユーザインターフェイス画面１１００ａが示されている。ユーザインターフェイス画面１１００ａは、画像表示パネル１１０８および情報パネル１１３０を含む。この例では、画像表示パネル１１０８は、店内で撮影された画像を表示している。画像表示パネル１１０８はまた、花瓶に関連付けられているインジケータ１１４０を含む。画像表示パネル１１０８に表示される花瓶は、補足アンカーの一例である。情報パネル１１３０は、当該花瓶に関する商品情報および当該花瓶を購入するための機能性を含むデジタル補足を表示している。デジタル補足は、例えば、花瓶の購入を開始するワークフローを含み得る。この例において、当該デジタル補足は、画像コンテンツおよびクライアントコンピューティングデバイスの位置に基づいて識別され、それにより、画像が取得された店によって公開された（または店に関連付けられている）デジタル補足が識別され得、また、クライアントコンピューティングデバイスが店内にあるときに画像コンテンツクエリに対し、高ランキングの結果として提供され得る。ある実施形態では、もしクライアントコンピューティングデバイスの位置が変更された場合、同じ画像に対して異なるデジタル補足が提供されるであろう。 FIG. 11A shows a user interface screen 1100a. User interface screen 1100a includes an image display panel 1108 and an information panel 1130. In this example, image display panel 1108 displays images taken inside the store. Image display panel 1108 also includes an indicator 1140 associated with the vase. The vase displayed on image display panel 1108 is an example of a supplementary anchor. Information panel 1130 displays digital supplements including product information about the vase and functionality for purchasing the vase. The digital supplement may include, for example, a workflow to initiate the purchase of a vase. In this example, the digital supplement is identified based on the image content and the location of the client computing device, thereby identifying the digital supplement published by (or associated with) the store from which the image was obtained. and may be provided as a high ranking result for image content queries when the client computing device is in-store. In some embodiments, a different digital supplement will be provided for the same image if the location of the client computing device changes.

図１１Ｂには、ユーザインターフェイス画面１１００ｂが示されている。ユーザインターフェイス画面１１００ｂは、画像表示パネル１１０８および情報パネル１１３０を含む。この例では、画像表示パネル１１０８は、店内で取得された別の画像を表示している。画像表示パネル１１０８はまた、ラグに関連付けられているインジケータ１１４２を含む。画像表示パネル１１０８に表示されるラグは、補足アンカーの一例である。情報パネル１１３０は、ラグに関する製品情報およびサイズを選択してラグを購入するための機能性を含むデジタル補足を表示している。図１１Ａのように、当該デジタル補足は、画像コンテンツおよびクライアントコンピューティングデバイスの位置に基づいて識別される。 FIG. 11B shows a user interface screen 1100b. User interface screen 1100b includes an image display panel 1108 and an information panel 1130. In this example, image display panel 1108 is displaying another image captured within the store. Image display panel 1108 also includes indicators 1142 associated with the lugs. The lag displayed on image display panel 1108 is an example of a supplemental anchor. Information panel 1130 displays a digital supplement that includes product information about the rug and the functionality to select a size and purchase the rug. As in FIG. 11A, the digital supplement is identified based on the image content and the location of the client computing device.

図１１Ｃには、ユーザインターフェイス画面１１００ｃが示されている。ユーザインターフェイス画面１１００ｃは、画像表示パネル１１０８および情報パネル１１３０を含む。この例では、画像表示パネル１１０８は、店内で取得された別の画像を表示している。画像表示パネル１１０８はまた、花瓶に関連付けられているインジケータ１１４４を含む。画像表示パネル１１０８に表示される花瓶は、補足アンカーの一例である。情報パネル１１３０は、花瓶に関する製品情報を含むデジタル補足を表示している。情報パネル１１３０はまた、クーポンインジケータ１１３２およびクーポンを引き換えるための機能性を含む。クーポンを引き換えることは、販売店に関連付けられているウェブサイトから割引価格で商品を購入することを含み得る。ある実施形態では、清算の間割引を確保するために使用され得るクーポンコードが提示される。図１１Ａおよび１１Ｂと同様に、デジタル補足は、画像コンテンツおよびクライアントコンピューティングデバイスの位置に基づいて識別される。 FIG. 11C shows a user interface screen 1100c. User interface screen 1100c includes an image display panel 1108 and an information panel 1130. In this example, image display panel 1108 is displaying another image captured within the store. Image display panel 1108 also includes an indicator 1144 associated with the vase. The vase displayed on image display panel 1108 is an example of a supplementary anchor. Information panel 1130 displays a digital supplement containing product information about the vase. Information panel 1130 also includes a coupon indicator 1132 and functionality for redeeming coupons. Redeeming a coupon may include purchasing merchandise at a discounted price from a website associated with the retailer. In some embodiments, a coupon code is presented that can be used to secure a discount during checkout. Similar to FIGS. 11A and 11B, digital supplements are identified based on image content and the location of the client computing device.

図１２Ａ～図１２Ｃは、様々な画像コンテンツ検索中にクライアントコンピューティングデバイス１０２の実施形態によって表示されるユーザインターフェイス画面の概略図である。この例では、画像コンテンツ検索は、（例えば、映画館で取得されるかもしれない）映画ポスターの画像に基づく。 12A-12C are schematic diagrams of user interface screens displayed by embodiments of client computing device 102 during various image content searches. In this example, the image content search is based on images of movie posters (eg, as may be obtained at a movie theater).

図１２Ａには、ユーザインターフェイス画面１２００ａが示されている。ユーザインターフェイス画面１２００ａは、画像表示パネル１２０８を含む。この例では、画像表示パネル１２０８は、映画ポスターの画像を表示している。画像表示パネル１２０８はまた、画像内で識別された映画ポスターに関連付けられているインジケータ１２４０を含む。映画ポスターは、補足アンカーの一例である。インジケータ１２４０は、作動されるとデジタル補足またはデジタル補足を選択するためのメニューを表示する、ユーザ作動可能コントロールを含み得る。 FIG. 12A shows a user interface screen 1200a. User interface screen 1200a includes an image display panel 1208. In this example, image display panel 1208 is displaying an image of a movie poster. Image display panel 1208 also includes an indicator 1240 associated with the movie poster identified within the image. A movie poster is an example of a supplemental anchor. Indicator 1240 may include a user actuatable control that, when activated, displays a digital supplement or a menu for selecting a digital supplement.

図１２Ｂには、ユーザインターフェイス画面１２００ｂが示されている。画像表示パネル１２０８はまた、画像内で識別された映画ポスターに関連付けられているプレビューデジタル補足１２４２を含む。例えば、プレビューデジタル補足１２４２は、（図１２Ａの）インジケータ１２４０の作動後に示され得る。プレビューデジタル補足１２４２は、識別された映画ポスターに関連付けられている映画からの画像または映像を映画ポスターの画像の上に重ね合わせ得る。 FIG. 12B shows a user interface screen 1200b. Image display panel 1208 also includes a preview digital supplement 1242 associated with the movie poster identified within the image. For example, preview digital supplement 1242 may be shown after actuation of indicator 1240 (of FIG. 12A). Preview digital supplement 1242 may superimpose images or footage from the movie associated with the identified movie poster onto the image of the movie poster.

図１２Ｃには、ユーザインターフェイス画面１２００ｃが示されている。画像表示パネル１２０８は、格付けインジケータ１２４４および格付けインジケータ１２４６も含む。格付けインジケータ１２４４および格付けインジケータ１２４６は、映画ポスターを含む画像コンテンツクエリに応答して、１つまたは複数のデジタル補足によって生成され得る。デジタル補足は、例えば、映画ポスターに関連付けられている映画のために、格付け情報を画像内で重ね合わせ得る。格付けインジケータ１２４４および格付けインジケータ１２４６は、作動されると、格付けおよび関連付けられる映画に関する追加情報を示させるユーザ作動可能コントロールを含み得る。 FIG. 12C shows a user interface screen 1200c. Image display panel 1208 also includes rating indicator 1244 and rating indicator 1246. Rating indicator 1244 and rating indicator 1246 may be generated by one or more digital supplements in response to an image content query that includes a movie poster. The digital supplement may overlay rating information within the image, for example, for a movie associated with a movie poster. Rating indicator 1244 and rating indicator 1246 may include user actuatable controls that, when activated, cause the rating and additional information about the associated movie to be indicated.

図１３は、本明細書で説明される（例えば、クライアントコンピューティングデバイス１０２、検索サーバ１５２、およびデジタル補足サーバ１７２を実施するための）技術とともに使用され得る、コンピュータデバイス１３００およびモバイルコンピュータデバイス１３５０の例を示す。コンピューティングデバイス１３００は、プロセッサ１３０２と、メモリ１３０４と、記憶装置１３０６と、メモリ１３０４および高速拡張ポート１３１０に接続するための高速インターフェイス１３０８と、低速バス１３１４および記憶装置１３０６に接続するための低速インターフェイス１３１２とを含む。構成要素１３０２，１３０４，１３０６，１３０８，１３１０および１３１２の各々は、様々なバスを使用して相互接続されており、共通のマザーボードに、または必要に応じて他の方法で据え付けられ得る。プロセッサ１３０２は、高速インターフェイス１３０８に結合されたディスプレイ１３１６のような外部の入力／出力装置上にＧＵＩのためのグラフィカル情報を表示するために、メモリ１３０４または記憶装置１３０６に格納された命令を含む、コンピューティングデバイス１３００内での実行のための命令を処理できる。他の実施形態では、複数のプロセッサおよび／または複数のバスが、必要に応じて、複数のメモリおよびタイプのメモリとともに使用され得る。また、複数のコンピューティングデバイス１３００が接続されてもよく、各デバイスは、（例えば、サーババンク、ブレードサーバのグループ、またはマルチプロセッサシステムとして）必要な動作の部分を提供する。 FIG. 13 illustrates a computer device 1300 and a mobile computer device 1350 that may be used with the techniques described herein (e.g., to implement client computing device 102, search server 152, and digital supplement server 172). Give an example. Computing device 1300 includes a processor 1302, memory 1304, storage 1306, a high speed interface 1308 for connecting to memory 1304 and high speed expansion port 1310, and a low speed interface for connecting to low speed bus 1314 and storage 1306. 1312. Each of the components 1302, 1304, 1306, 1308, 1310 and 1312 are interconnected using various buses and may be mounted on a common motherboard or otherwise as desired. Processor 1302 includes instructions stored in memory 1304 or storage 1306 for displaying graphical information for a GUI on an external input/output device, such as a display 1316 coupled to high-speed interface 1308. Instructions can be processed for execution within computing device 1300. In other embodiments, multiple processors and/or multiple buses may be used, along with multiple memories and types of memory, as desired. Also, multiple computing devices 1300 may be connected, each device providing a portion of the required operation (eg, as a server bank, group of blade servers, or multiprocessor system).

メモリ１３０４は、コンピューティングデバイス１３００内の情報を記憶する。一実施形態では、メモリ１３０４は１つまたは複数の揮発性メモリユニットである。別の実施形態では、メモリ１３０４は、１つまたは複数の不揮発性メモリユニットである。メモリ１３０４はまた、磁気または光ディスクのような別の形態のコンピュータ読み取り可能な記憶媒体であり得る。 Memory 1304 stores information within computing device 1300. In one embodiment, memory 1304 is one or more volatile memory units. In another embodiment, memory 1304 is one or more non-volatile memory units. Memory 1304 may also be another form of computer readable storage medium, such as a magnetic or optical disk.

記憶装置１３０６は、コンピューティングデバイス１３００のために大容量ストレージを提供し得る。一実施形態では、記憶装置１３０６は、フロッピー（登録商標）ディスク装置、ハードディスク装置、光学ディスク装置、またはテープ装置、フラッシュメモリもしくは他の同様のソリッドステートメモリ装置、またはストレージエリアネットワークもしくは他の構成内の装置を含むデバイスのアレイなどのコンピュータ読み取り可能な記憶媒体であり得るか、またはそれを含み得る。コンピュータプログラム製品は、情報担体で有形に具現化され得る。コンピュータプログラム製品はまた、実行されると、上述の方法のような１つまたは複数の方法を実行する命令を含み得る。情報担体は、メモリ１３０４、記憶装置１３０６、またはプロセッサ１３０２上のメモリのようなコンピュータ読み取り可能な記憶媒体または機械可読媒体である。 Storage device 1306 may provide mass storage for computing device 1300. In one embodiment, the storage device 1306 is a floppy disk drive, hard disk drive, optical disk drive, or tape drive, flash memory or other similar solid state memory device, or within a storage area network or other configuration. may be or may include a computer-readable storage medium such as an array of devices including apparatus. A computer program product may be tangibly embodied in an information carrier. The computer program product may also include instructions that, when executed, perform one or more methods, such as those described above. The information carrier is a computer-readable storage medium or machine-readable medium, such as memory 1304 , storage device 1306 , or memory on processor 1302 .

高速コントローラ１３０８は、コンピューティングデバイス１３００のための帯域幅集約的動作を管理する一方、低速コントローラ１３１２は、より低い帯域幅集約的動作を管理する。このような機能の割り当ては、例示にすぎない。一実施形態では、高速コントローラ１３０８は、メモリ１３０４に、（例えば、グラフィックプロセッサ又はアクセラレータを介して）ディスプレイ１３１６に、および、様々な拡張カード（図示しない）を受け入れることができる高速拡張ポート１３１０に結合されている。この実施形態では、低速コントローラ１３１２は、記憶装置１３０６および低速拡張ポート１３１４に結合されている。様々な通信ポート（例えば、ＵＳＢ、ブルートゥース（登録商標）、イーサネット（登録商標）、ワイヤレスイーサネット（登録商標））を含み得る低速拡張ポートは、例えば、ネットワークアダプタを介して、キーボード、ポインティングデバイス、スキャナ、またはスイッチもしくはルータのようなネットワーキングデバイスのように、１つまたは複数の入力／出力装置に結合され得る。 A high-speed controller 1308 manages bandwidth-intensive operations for the computing device 1300, while a low-speed controller 1312 manages less bandwidth-intensive operations. Such functional assignments are exemplary only. In one embodiment, high speed controller 1308 is coupled to memory 1304, to display 1316 (e.g., via a graphics processor or accelerator), and to high speed expansion port 1310 that can accept various expansion cards (not shown). has been done. In this embodiment, low speed controller 1312 is coupled to storage device 1306 and low speed expansion port 1314. Low-speed expansion ports, which may include a variety of communication ports (e.g., USB, Bluetooth, Ethernet, Wireless Ethernet), can be used to connect keyboards, pointing devices, scanners, etc. via network adapters. , or to one or more input/output devices, such as a networking device such as a switch or router.

コンピューティングデバイス１３００は、図に示されるように、いくつかの異なる形態で実施され得る。例えば、それは、標準サーバ１３２０として、またはそのようなサーバのグループにおいて複数回実施され得る。また、それは、ラックサーバシステム１３２４の一部として実施され得る。加えて、それは、ラップトップコンピュータ１３２２のようなパーソナルコンピュータとして実施され得る。代替的には、コンピューティングデバイス１３００からの構成要素は、デバイス１３５０のようなモバイルデバイス（図示しない）内の他の構成要素と組み合わされ得る。そのようなデバイスの各々は、コンピューティングデバイス１３００，１３５０のうちの１つまたは複数を含み得、また、システム全体は、互いに通信する複数のコンピューティングデバイス１３００、１３５０から構成され得る。 Computing device 1300 may be implemented in several different forms, as illustrated. For example, it may be implemented multiple times as a standard server 1320 or in a group of such servers. Also, it may be implemented as part of rack server system 1324. Additionally, it can be implemented as a personal computer, such as a laptop computer 1322. Alternatively, components from computing device 1300 may be combined with other components within a mobile device (not shown), such as device 1350. Each such device may include one or more of the computing devices 1300, 1350, and the entire system may be comprised of multiple computing devices 1300, 1350 in communication with each other.

コンピューティングデバイス１３５０は、構成要素のうち特に、プロセッサ１３５２、メモリ１３６４、ディスプレイ１３５４のような入力／出力装置、通信インターフェイス１３６６、およびトランシーバ１３６８を含む。デバイス１３５０はまた、追加のストレージを提供するために、マイクロドライブまたは他のデバイスのような記憶装置を備え得る。構成要素１３５０，１３５２，１３６４，１３５４，１３６６および１３６８の各々は、様々なバスを使用して相互接続されており、構成要素のいくつかは、共通のマザーボード上に、または必要に応じて他の方法で、据え付けられ得る。 Computing device 1350 includes a processor 1352, memory 1364, input/output devices such as a display 1354, a communication interface 1366, and a transceiver 1368, among other components. Device 1350 may also include storage, such as a microdrive or other device, to provide additional storage. Each of the components 1350, 1352, 1364, 1354, 1366 and 1368 are interconnected using various buses, with some of the components on a common motherboard or on other method, it can be installed.

プロセッサ１３５２は、メモリ１３６４に格納された命令を含む、コンピューティングデバイス１３５０内で命令を実行できる。当該プロセッサは、別個の複数のアナログおよびデジタルプロセッサを含むチップのチップセットとして実施され得る。当該プロセッサは、たとえば、ユーザインターフェイスの制御、デバイス１３５０によって実行されるアプリケーション、およびデバイス１３５０によるワイヤレス通信のような、デバイス１３５０の他の構成要素の協調を提供し得る。 Processor 1352 can execute instructions within computing device 1350, including instructions stored in memory 1364. The processor may be implemented as a chipset of chips that includes separate analog and digital processors. The processor may provide coordination of other components of device 1350, such as control of a user interface, applications executed by device 1350, and wireless communications by device 1350.

プロセッサ１３５２は、ディスプレイ１３５４に結合された制御インターフェイス１３５８およびディスプレイインターフェイス１３５６を介してユーザと通信し得る。ディスプレイ１３５４は、例えば、ＴＦＴＬＣＤ（Thin-Film-Transistor Liquid Crystal display）、ＬＥＤ（Light Emitting Diode）、ＯＬＥＤ（Organic Light Emitting Diode）ディスプレイ、または他の適切なディスプレイ技術であり得る。ディスプレイインターフェイス１３５６は、グラフィックおよび他の情報をユーザに提示するためにディスプレイ１３５４を駆動するための適切な回路を含み得る。制御インターフェイス１３５８は、ユーザからコマンドを受信し、それらを変換してプロセッサ１３５２に提出し得る。加えて、外部インターフェイス１３６２は、デバイス１３５０と他のデバイスとの近距離通信を可能にするために、プロセッサ１３５２と通信して提供され得る。外部インターフェイス１３６２は、たとえば、ある実施形態では有線通信のために、または、他の実施形態では無線通信のために準備し得、複数のインターフェイスも使用され得る。 Processor 1352 may communicate with a user via a control interface 1358 and a display interface 1356 coupled to display 1354. Display 1354 may be, for example, a Thin-Film-Transistor Liquid Crystal display (TFT), a Light Emitting Diode (LED), an Organic Light Emitting Diode (OLED) display, or other suitable display technology. Display interface 1356 may include suitable circuitry to drive display 1354 to present graphics and other information to the user. Control interface 1358 may receive commands from a user, convert and submit them to processor 1352. Additionally, an external interface 1362 may be provided in communication with processor 1352 to enable close range communication between device 1350 and other devices. External interface 1362 may be provided for wired communication in some embodiments or wireless communication in other embodiments, for example, and multiple interfaces may also be used.

メモリ１３６４は、コンピューティングデバイス１３５０内の情報を記憶する。メモリ１３６４は、１つまたは複数のコンピュータ読み取り可能な記憶媒体、１つまたは複数の揮発性メモリユニット、または１つまたは複数の不揮発性メモリユニットのうちの１つまたは複数として実施され得る。拡張メモリ１３７４はまた、例えば、ＳＩＭＭ（Single In-Line Memory Module）カードインターフェイスを含み得る拡張インターフェイス１３７２を介してデバイス１３５０に提供および接続され得る。そのような拡張メモリ１３７４は、デバイス１３５０のための追加の記憶空間を提供でき、または、デバイス１３５０のためのアプリケーションまたは他の情報を記憶し得る。具体的には、拡張メモリ１３７４は、上述のプロセスを実行または補足するための命令を含むことができ、また、セキュアな情報も含み得る。したがって、たとえば、拡張メモリ１３７４は、デバイス１３５０のためのセキュリティモジュールとして提供され得、また、デバイス１３５０の安全な使用を可能にする命令でプログラムされ得る。加えて、セキュアなアプリケーションは、ＳＩＭＭカード上に識別情報をハッキング不能に置くことのように、追加の情報と共にＳＩＭＭカードを介して提供され得る。 Memory 1364 stores information within computing device 1350. Memory 1364 may be implemented as one or more of one or more computer-readable storage media, one or more volatile memory units, or one or more non-volatile memory units. Expansion memory 1374 may also be provided and connected to device 1350 via expansion interface 1372, which may include, for example, a Single In-Line Memory Module (SIMM) card interface. Such expanded memory 1374 can provide additional storage space for device 1350 or store applications or other information for device 1350. In particular, expanded memory 1374 may include instructions to perform or supplement the processes described above, and may also include secure information. Thus, for example, expanded memory 1374 may be provided as a security module for device 1350 and may be programmed with instructions to enable secure use of device 1350. Additionally, secure applications may be provided via the SIMM card with additional information, such as placing identifying information on the SIMM card in a non-hackable manner.

当該メモリは、以下で説明されるように、たとえば、フラッシュメモリおよび／またはＮＶＲＡＭメモリを含み得る。一実施形態では、コンピュータプログラム製品は、情報担体において有形に具現化される。コンピュータプログラム製品は、実行されると、上述の方法のような、１つまたは複数の方法を実行する命令を含む。情報担体は、例えばトランシーバ１３６８または外部インターフェイス１３６２を介して受信され得る、メモリ１３６４、拡張メモリ１３７４、またはプロセッサ１３５２上のメモリのようなコンピュータ読み取り可能な記憶媒体または機械読み取り可能な媒体である。 The memory may include, for example, flash memory and/or NVRAM memory, as described below. In one embodiment, a computer program product is tangibly embodied in an information carrier. The computer program product includes instructions that, when executed, perform one or more methods, such as those described above. The information carrier is a computer-readable storage medium or machine-readable medium, such as memory 1364, expansion memory 1374, or memory on processor 1352, which may be received via transceiver 1368 or external interface 1362, for example.

デバイス１３５０は、必要に応じてデジタル信号処理回路を含み得る通信インターフェイス１３６６を介してワイヤレスで通信し得る。通信インターフェイス１３６６は、とりわけ、ＧＳＭ（登録商標）ボイス呼、ＳＭＳ、ＥＭＳ、またはＭＭＳメッセージング、ＣＤＭＡ、ＴＤＭＡ、ＰＤＣ、ＷＣＤＭＡ（登録商標）、ＣＤＭＡ２０００、またはＧＰＲＳのような様々なモードまたはプロトコルの下で通信を提供し得る。そのような通信は、例えば、無線周波数トランシーバ１３６８を介して行われ得る。加えて、ブルートゥース、Ｗｉ－Ｆｉ、または他のそのようなトランシーバ（図示しない）を用いてのように、短距離通信が発生し得る。加えて、ＧＰＳ（全地球測位システム）受信機モジュール１３７０は、デバイス１３５０上で実行するアプリケーションによって必要に応じて使用され得る、追加のナビゲーションおよび位置に関連付けられた無線データをデバイス１３５０に提供し得る。 Device 1350 may communicate wirelessly via communication interface 1366, which may optionally include digital signal processing circuitry. The communication interface 1366 can communicate under various modes or protocols such as GSM voice calls, SMS, EMS, or MMS messaging, CDMA, TDMA, PDC, WCDMA, CDMA2000, or GPRS, among others. may provide communications. Such communication may occur via radio frequency transceiver 1368, for example. Additionally, short range communications may occur, such as using Bluetooth, Wi-Fi, or other such transceivers (not shown). In addition, a GPS (Global Positioning System) receiver module 1370 may provide additional navigation and location-related wireless data to the device 1350, which may be used as needed by applications running on the device 1350. .

デバイス１３５０はまた、ユーザから発話情報を受信し、それを使用可能なデジタル情報に変換し得る、音声コーデック１３６０を使用して、聴覚的に通信し得る。音声コーデック１３６０は、同様に、例えば、デバイス１３５０のハンドセット内のスピーカを介してのように、ユーザのための可聴音を生成し得る。そのような音は、音声電話からの音を含んでもよく、記録された音（例えば、音声メッセージ、音楽ファイルなど）を含んでもよく、デバイス１３５０上で動作するアプリケーションによって生成される音も含み得る。 Device 1350 may also communicate audibly using audio codec 1360, which may receive spoken information from a user and convert it into usable digital information. Audio codec 1360 may also generate audible sound for the user, such as, for example, via a speaker in the handset of device 1350. Such sounds may include sounds from voice calls, may include recorded sounds (e.g., voice messages, music files, etc.), and may also include sounds generated by applications running on device 1350. .

コンピューティングデバイス１３５０は、図に示されるように、いくつかの異なる形態で実施され得る。例えば、それは、携帯電話１３８０として実施され得る。また、それは、スマートフォン１３８２、パーソナルデジタルアシスタント、または他の同様のモバイルデバイスの一部として実施され得る。 Computing device 1350 may be implemented in several different forms, as shown. For example, it may be implemented as a mobile phone 1380. Also, it may be implemented as part of a smartphone 1382, personal digital assistant, or other similar mobile device.

本明細書で説明されるシステムおよび技術の様々な実施形態は、デジタル電子回路、集積回路、特別に設計されたＡＳＩＣ（特定用途向け集積回路）、コンピュータハードウェア、ファームウェア、ソフトウェア、および／またはそれらの組み合せで実現され得る。これらの様々な実施は、ストレージシステム、少なくとも１つの入力デバイス、および少なくとも１つの出力デバイスからデータおよび命令を受信し、それらにデータおよび命令を送信するように結合された、専用のまたは汎用であり得る少なくとも１つのプログラム可能なプロセッサを含むプログラム可能なシステム上で実行可能な、および／または解釈可能な、１つまたは複数のコンピュータプログラムにおける実施を含み得る。 Various embodiments of the systems and techniques described herein include digital electronic circuits, integrated circuits, specially designed ASICs (Application Specific Integrated Circuits), computer hardware, firmware, software, and/or the like. It can be realized by a combination of. These various implementations may be specialized or general purpose, coupled to receive data and instructions from and transmit data and instructions to a storage system, at least one input device, and at least one output device. may include implementation in one or more computer programs executable and/or interpretable on a programmable system including at least one programmable processor that obtains the information.

これらのコンピュータプログラム（また、プログラム、ソフトウェア、ソフトウェアアプリケーションまたはコードとしても知られる）は、プログラム可能なプロセッサのための機械命令を含み、高水準手続きおよび／またはオブジェクト指向プログラミング言語で、および／またはアセンブリ／機械言語で実施され得る。本明細書で使用されるように、「機械読み取り可能な媒体」、「コンピュータ読み取り可能な記憶媒体」という用語は、機械読み取り可能な信号として機械命令を受信する機械読み取り可能な媒体を含む、機械命令および／またはデータをプログラム可能なプロセッサに提供するために使用される任意のコンピュータプログラム製品、装置、および／またはデバイス（例えば、磁気ディスク、光ディスク、メモリ、プログラマブルロジックデバイス（ＰＬＤ））を指す。用語「機械読み取り可能な信号」は、機械命令および／またはデータをプログラム可能なプロセッサに提供するために使用される任意の信号を指す。 These computer programs (also known as programs, software, software applications, or code) contain machine instructions for a programmable processor and are written in high-level procedural and/or object-oriented programming languages and/or in assembly. /Can be implemented in machine language. As used herein, the terms "machine-readable medium" and "computer-readable storage medium" include a machine-readable medium that receives machine instructions as machine-readable signals. Refers to any computer program product, apparatus, and/or device (eg, magnetic disk, optical disk, memory, programmable logic device (PLD)) that is used to provide instructions and/or data to a programmable processor. The term "machine readable signal" refers to any signal used to provide machine instructions and/or data to a programmable processor.

ユーザとの対話を提供するために、本明細書で説明されるシステムおよび技術は、ユーザに情報を表示するためのディスプレイデバイス（ＬＥＤ（発光ダイオード）またはＯＬＥＤ（有機ＬＥＤ）またはＬＣＤ（液晶ディスプレイ）モニタ／スクリーン）と、ユーザがコンピュータに入力を与え得るキーボードおよびポインティングデバイス（例えば、マウスまたはトラックボール）と、を有するコンピュータ上で実施され得る。他の種類のデバイスもまた、ユーザとの相互作用を提供するために使用されることができ、例えば、ユーザに提供されるフィードバックは、任意の形態の感覚フィードバック（例えば、視覚フィードバック、聴覚フィードバック、または触覚フィードバック）であり得、また、ユーザからの入力は、音響入力、音声入力、または触覚入力を含む任意の形態で受信され得る。 To provide user interaction, the systems and techniques described herein use display devices (LEDs (light emitting diodes) or OLEDs (organic LEDs) or LCDs (liquid crystal displays) to display information to the user. The computer may be implemented on a computer that has a monitor/screen) and a keyboard and pointing device (eg, a mouse or trackball) through which a user may provide input to the computer. Other types of devices may also be used to provide interaction with the user, for example, the feedback provided to the user may include any form of sensory feedback (e.g., visual feedback, auditory feedback, or haptic feedback), and input from the user may be received in any form including acoustic input, voice input, or tactile input.

本明細書で説明されるシステムおよび技術は、（たとえば、データサーバとしての）バックエンドコンポーネントを含む、または、（たとえば、アプリケーションサーバとしての）ミドルウェアコンポーネントを含む、または、フロントエンドコンポーネント（例えば、ユーザが本明細書に記載するシステムおよび技術の実施形態と相互作用し得るグラフィカルユーザインターフェイスまたはウェブブラウザを有するクライアントコンピュータ）を含む、または、そのようなバックエンド、ミドルウェア、もしくはフロントエンドコンポーネントの任意の組み合せを含む、コンピューティングシステムにおいて実施され得る。システムの構成要素は、デジタルデータ通信（例えば、通信ネットワーク）の任意の形態または媒体によって相互接続され得る。通信ネットワークの例は、ローカルエリアネットワーク（「ＬＡＮ」）、ワイドエリアネットワーク（「ＷＡＮ」）、およびインターネットを含む。 The systems and techniques described herein include back-end components (e.g., as data servers), or middleware components (e.g., as application servers), or front-end components (e.g., user a client computer having a graphical user interface or a web browser that can interact with embodiments of the systems and techniques described herein), or any combination of such back-end, middleware, or front-end components. may be implemented in a computing system including. The components of the system may be interconnected by any form or medium of digital data communication (eg, a communication network). Examples of communication networks include local area networks (“LANs”), wide area networks (“WANs”), and the Internet.

コンピューティングシステムは、クライアントおよびサーバを含み得る。クライアントおよびサーバは、一般に、互いに遠隔であり、典型的には、通信ネットワークを介して相互作用する。クライアントとサーバとの関係は、それぞれのコンピュータ上で実行されて互いにクライアント－サーバ関係を有するコンピュータプログラムによって生じる。 A computing system may include clients and servers. Clients and servers are generally remote from each other and typically interact via a communications network. The relationship between clients and servers is created by computer programs running on their respective computers and having a client-server relationship with each other.

ある実施形態では、図１３に示されるコンピューティングデバイスは、ＡＲヘッドセット／ＨＭＤデバイス１３９０とインターフェイスして、挿入されたコンテンツを物理的空間内で視聴するための拡張環境を生成するためのセンサを含み得る。例えば、図１３に示されるコンピューティングデバイス１３５０または他のコンピューティングデバイスに含まれる１つまたは複数のセンサは、ＡＲヘッドセット１３９０に入力を与えることができ、または、一般にＡＲ空間に入力を与え得る。センサは、タッチスクリーン、加速度計、ジャイロスコープ、圧力センサ、バイオメトリックセンサ、温度センサ、湿度センサ、および周囲光センサを含み得るが、これらに限定されない。コンピューティングデバイス１３５０は、センサを使用して、ＡＲ空間におけるコンピューティングデバイスの絶対位置および／または検出された回転を決定でき、そして、その絶対位置および／または検出された回転は、ＡＲ空間への入力として使用され得る。例えば、コンピューティングデバイス１３５０は、コントローラ、レーザポインタ、キーボード、武器のような仮想オブジェクトとしてＡＲ空間に組み込まれ得る。ＡＲ空間に組み込まれるとき、ユーザによるコンピューティングデバイス／仮想オブジェクトの位置決めは、ユーザが、ＡＲ空間内において特定の方法で仮想オブジェクトを見るように、コンピューティングデバイスを位置決めすることを可能にし得る。例えば、仮想オブジェクトがレーザポインタを表す場合、ユーザは、コンピューティングデバイスが実際のレーザポインタであるかのように、当該コンピューティングデバイスを操作し得る。ユーザは、コンピューティングデバイスを左右に、上下に、円形に動かすことができ、レーザポインタを使用するのと同様に当該デバイスを使用できる。ある実施形態では、ユーザは、仮想レーザポインタを使用して、目標とする所を狙うことが得る。 In some embodiments, the computing device shown in FIG. 13 includes sensors for interfacing with an AR headset/HMD device 1390 to create an augmented environment for viewing inserted content within a physical space. may be included. For example, one or more sensors included in computing device 1350 shown in FIG. 13 or other computing device may provide input to AR headset 1390, or may provide input to AR space in general. . Sensors may include, but are not limited to, touch screens, accelerometers, gyroscopes, pressure sensors, biometric sensors, temperature sensors, humidity sensors, and ambient light sensors. Computing device 1350 can determine the absolute position and/or detected rotation of the computing device in AR space using the sensor, and the absolute position and/or detected rotation is connected to AR space. Can be used as input. For example, computing device 1350 may be incorporated into the AR space as a virtual object such as a controller, laser pointer, keyboard, weapon, etc. When embedded in an AR space, positioning of a computing device/virtual object by a user may allow the user to position the computing device to view the virtual object in a particular way within the AR space. For example, if the virtual object represents a laser pointer, the user may manipulate the computing device as if it were an actual laser pointer. A user can move the computing device from side to side, up and down, and in a circular motion, allowing the user to use the device in a manner similar to using a laser pointer. In some embodiments, a user may use a virtual laser pointer to aim at a target.

ある実施形態では、コンピューティングデバイス１３５０に含まれるか又はそれに結合される１つまたは複数の入力デバイスは、ＡＲ空間への入力として使用され得る。入力装置は、タッチスクリーン、キーボード、１つまたは複数のボタン、トラックパッド、タッチパッド、ポインティングデバイス、マウス、トラックボール、ジョイスティック、カメラ、マイクロフォン、入力機能を有するイヤホンまたは耳の穴に差し込むタイプのイヤホン（buds）、ゲームコントローラ、または他の接続可能な入力装置を含み得るが、これらに限定されない。コンピューティングデバイスがＡＲ空間に組み込まれるとき、コンピューティングデバイス１３５０に含まれる入力デバイスと相互作用するユーザは、ＡＲ空間において特定のアクションを生じさせることができる。 In some embodiments, one or more input devices included in or coupled to computing device 1350 may be used as input to the AR space. Input devices include a touch screen, keyboard, one or more buttons, trackpad, touch pad, pointing device, mouse, trackball, joystick, camera, microphone, and earphones or earphones with input capabilities. (buds), game controllers, or other connectable input devices. When a computing device is embedded in an AR space, a user interacting with input devices included in the computing device 1350 can cause certain actions in the AR space.

ある実施形態では、コンピューティングデバイス１３５０のタッチスクリーンは、ＡＲ空間内でタッチパッドとしてレンダリングされ得る。ユーザは、コンピューティングデバイス１３５０のタッチスクリーンと相互作用し得る。相互作用は、例えばＡＲヘッドセット１３９０において、ＡＲ空間内でレンダリングされたタッチパッド上での動きとしてレンダリングされる。当該レンダリングされた動きは、ＡＲ空間内で仮想オブジェクトを制御できる。 In some embodiments, the touch screen of computing device 1350 may be rendered as a touchpad within the AR space. A user may interact with the touch screen of computing device 1350. The interaction is rendered as movement on a touchpad rendered in AR space, for example in an AR headset 1390. The rendered motion can control virtual objects within the AR space.

ある実施形態では、コンピューティングデバイス１３５０に含まれる１つまたは複数の出力デバイスは、ＡＲ空間内でＡＲヘッドセット１３９０のユーザに出力および／またはフィードバックを提供できる。出力および／またはフィードバックは、視覚的、触覚的、または音声であり得る。出力および／またはフィードバックは、振動、１つまたは複数のライトまたはストロボの点灯および消灯または点滅および／または発光、アラームを鳴動すること、チャイムを鳴らすこと、楽曲を演奏すること、および、オーディオファイルの演奏を含み得るが、これらに限定されない。出力デバイスは、振動モータ、振動コイル、圧電デバイス、静電デバイス、発光ダイオード（ＬＥＤ）、ストロボ、およびスピーカを含み得るが、これらに限定されない。 In some embodiments, one or more output devices included in computing device 1350 can provide output and/or feedback to a user of AR headset 1390 within an AR space. Output and/or feedback may be visual, tactile, or audio. Outputs and/or feedback may include vibrations, turning on and off or flashing and/or emitting one or more lights or strobes, sounding an alarm, sounding a chime, playing a musical piece, and playing an audio file. may include, but are not limited to, musical performances. Output devices may include, but are not limited to, vibrating motors, vibrating coils, piezoelectric devices, electrostatic devices, light emitting diodes (LEDs), strobes, and speakers.

ある実施形態では、コンピューティングデバイス１３５０は、コンピュータによって生成される３Ｄ環境において別のオブジェクトとして現れ得る。ユーザによるコンピューティングデバイス１３５０との相互作用（例えば、回転すること、振ること、タッチスクリーンに触れること、タッチスクリーンを横切って指をスワイプすること）は、ＡＲ空間内におけるオブジェクトとの相互作用と解釈され得る。ＡＲ空間におけるレーザポインタの例において、コンピューティングデバイス１３５０は、コンピュータによって生成される３Ｄ環境における仮想レーザポインタとして現れる。ユーザがコンピューティングデバイス１３５０を操作すると、ＡＲ空間内のユーザは、レーザポインタの動きを見る。ユーザは、コンピューティングデバイス１３５０またはＡＲヘッドセット１３９０上のＡＲ環境におけるコンピューティングデバイス１３５０との相互作用からのフィードバックを受信する。コンピューティングデバイスとのユーザの相互作用は、制御可能なデバイスのためのＡＲ環境において生成されるユーザインターフェイスとの相互作用に変換され得る。 In some embodiments, computing device 1350 may appear as a separate object in a computer-generated 3D environment. User interactions with computing device 1350 (e.g., rotating, shaking, touching the touch screen, swiping a finger across the touch screen) are interpreted as interactions with objects within the AR space. can be done. In the example of a laser pointer in AR space, computing device 1350 appears as a virtual laser pointer in a computer-generated 3D environment. As the user manipulates the computing device 1350, the user in the AR space sees movement of the laser pointer. A user receives feedback from interactions with computing device 1350 in an AR environment on computing device 1350 or AR headset 1390. A user's interaction with a computing device may be translated into interaction with a user interface generated in an AR environment for the controllable device.

ある実施形態では、コンピューティングデバイス１３５０は、タッチスクリーンを含み得る。例えば、ユーザは、制御可能なデバイスのためのユーザインターフェイスと相互作用するためにタッチスクリーンと相互作用できる。例えば、タッチスクリーンは、制御可能なデバイスの特性を制御できるスライダのようなユーザインターフェイス要素を含み得る。 In some embodiments, computing device 1350 may include a touch screen. For example, a user can interact with a touch screen to interact with a user interface for a controllable device. For example, a touch screen may include user interface elements such as sliders that can control characteristics of a controllable device.

コンピューティングデバイス１３００は、限定はしないが、ラップトップ、デスクトップ、ワークステーション、携帯情報端末、サーバ、ブレードサーバ、メインフレーム、および他の適切なコンピュータを含む、様々な形態のデジタルコンピュータおよびデバイスを表すことが意図される。コンピューティングデバイス１３５０は、携帯情報端末、携帯電話、スマートフォン、および他の同様のコンピューティングデバイスのようなモバイルデバイスの様々な形態を表すことが意図される。本明細書に示される構成要素、それらの接続および関係、ならびにそれらの機能は、例示的なものにすぎず、本明細書で説明され、および／または請求される発明の実施形態を限定することを意図していない。 Computing device 1300 represents various forms of digital computers and devices, including, but not limited to, laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other suitable computers. It is intended that Computing device 1350 is intended to represent various forms of mobile devices, such as personal digital assistants, cell phones, smartphones, and other similar computing devices. The components, their connections and relationships, and their functions depicted herein are exemplary only and are not intended to limit the embodiments of the invention described and/or claimed herein. is not intended.

いくつかの実施形態が説明された。しかしながら、本明細書の精神および範囲から逸脱することなく、様々な修正が行われ得ることが理解されるであろう。 Several embodiments have been described. However, it will be understood that various modifications may be made without departing from the spirit and scope of the specification.

加えて、図に示される論理フローは、望ましい結果を達成するために、示された特定の順序または逐次の順序を必要としない。加えて、他のステップが提供されてもよく、または、ステップが記載されたフローから排除されてもよく、他の構成要素が当該記載されたシステムに追加されてもよく、または、当該システムら除去され得る。したがって、他の実施形態は、以下の特許請求の範囲内である。 Additionally, the logic flows illustrated in the figures do not require the particular order shown or sequential order to achieve desirable results. In addition, other steps may be provided or steps may be omitted from the described flow, and other components may be added to or modified from the described system. Can be removed. Accordingly, other embodiments are within the scope of the following claims.

説明される実施形態のいくつかの特徴が本明細書に記載されるように説明されたが、多くの修正、置換、変更、および均等物が当業者には想起されるであろう。したがって、添付の特許請求の範囲は、実施形態の範囲内に含まれるそのようなすべての修正および変更を網羅することを意図していることを理解されたい。それらは、限定ではなく例として提示されたにすぎず、形態および詳細の様々な変更が行われ得ることを理解されたい。本明細書で説明される装置および／または方法の任意の部分は、相互に排他的な組み合わせを除き、任意の組み合わせで組み合わせられ得る。本明細書で説明される実施形態は、説明された異なる実施形態の機能、構成要素、および／または特徴の様々な組み合せ、および／または、サブコンビネーションを含み得る。
Although certain features of the illustrated embodiments have been described as described herein, many modifications, substitutions, changes, and equivalents will occur to those skilled in the art. It is therefore to be understood that the appended claims are intended to cover all such modifications and changes that fall within the scope of the embodiments. It is to be understood that they are presented by way of example only and not as a limitation, and that various changes in form and detail may be made. Any portions of the apparatus and/or methods described herein may be combined in any combination, except in mutually exclusive combinations. The embodiments described herein may include various combinations and/or subcombinations of the features, components, and/or features of different described embodiments.

Claims

A computer-implemented method, the method comprising:
a search server receiving a web page from a digital supplementary server ;
the search server obtains an image supplement anchor and a digital supplement associated with the image supplement anchor by analyzing metadata of the web page ;
the search server generating a data structure instance associating the digital supplement with the image supplement anchor , the digital supplement being virtual content ;
storing the data structure instance in a database of the search server ;
the search server receiving from a client computing device an image content query that includes an image of a physical object, the database including a plurality of other data structure instances, and the database including a plurality of other data structure instances; each includes a digital supplement and one or more image supplement anchors associated with each digital supplement, the method further comprising:
the search server , in response to receiving the image content query , obtains digital supplements associated with image supplement anchors identified from images included in the image content query;
transmitting the obtained digital supplement to the client computing device to cause the client computing device to display the digital supplement as augmented reality content overlaid on an image of the physical object; Method.

2. The computer-implemented method of claim 1 , further comprising crawling a plurality of web pages including the web page.

3. The computer-implemented method of claim 1 or 2 , further comprising associating the data structure instance with a popularity score of the web page.

4. The computer-implemented method of any of claims 1-3 , further comprising associating the data structure instance with a relevance score of the image supplement anchor based on a web page.

5. The computer-implemented method of any preceding claim, wherein the metadata includes a name, description, image, and uniform resource locator.

6. The computer-implemented method of claim 5 , wherein the metadata includes an application identifier.

Sending the obtained digital supplements to the client computing device includes sending a list of digital supplements, the list including the digital supplements from the data structure instance and the plurality of other data structures. 4. The computer-implemented method of claim 3 , including digital supplements from one of the instances, ordered based on the popularity score .

Sending the obtained digital supplements to the client computing device includes sending a list of digital supplements, the list including digital supplements from the data structure instance and the plurality of other data structure instances. 5. The computer-implemented method of claim 4, wherein the computer-implemented method includes a digital supplement from one of the digital supplements, and is ordered based on the relevance score.

A computing device,
at least one processor;
a memory storing instructions, the instructions, when executed by the at least one processor, cause the computing device to:
receive a web page from a digital supplementary server ;
obtaining an image supplement anchor and a digital supplement associated with the image supplement anchor by analyzing metadata of the web page ;
generating a data structure instance that associates the digital supplement with the image supplement anchor, the digital supplement being virtual content;
storing the data structure instance in a database;
an image content query including an image of a physical object is received from a client computing device, the database including a plurality of other data structure instances, each of the plurality of other data structure instances including a digital supplement and a respective one or more image supplement anchors associated with the digital supplement;
in response to receiving the image content query , retrieving a digital supplement associated with an image supplement anchor identified from an image included in the image content query;
A computing device that causes the acquired digital supplement to be transmitted to the client computing device to cause the client computing device to display the digital supplement as augmented reality content overlaid on an image of the physical object.

The instructions further cause the computing device to:
10. The computing device of claim 9, configured to crawl a plurality of web pages including the web page.

The instructions further cause the computing device to:
11. The computing device of claim 9 or 10, further comprising associating the data structure instance with a popularity score of the web page.

The instructions further cause the computing device to:
A computing device according to any of claims 9 to 11, further causing associating the data structure instance with a relevance score of the image supplement anchor based on a web page.

A computing device according to any of claims 9 to 12 , wherein the metadata includes a name, a description, an image, and a uniform resource locator.

14. The computing device of claim 13 , wherein the metadata includes an application identifier.

Sending the obtained digital supplements to the client computing device includes sending a list of digital supplements, the list including digital supplements from the data structure instance and the plurality of other data structure instances. 12. The computing device of claim 11 , wherein the computing device includes a digital supplement from one of the digital supplements and is ordered based on the popularity score .

Sending the obtained digital supplements to the client computing device includes sending a list of the digital supplements, the list including the digital supplements from the data structure instance and the plurality of other data structures. and a digital supplement from one of the instances, ordered based on the relevance score.

A computing device,
at least one processor;
a memory storing instructions, the instructions, when executed by at least one processor, causing the computing device to perform the method of any of claims 1 to 8 . .

9. A computer program product comprising instructions, said instructions, when executed by one or more processors, causing said one or more processors to perform the method of any of claims 1 to 8 . program.