KR101329102B1

KR101329102B1 - augmented reality - image retrieval system using layout descriptor and image feature.

Info

Publication number: KR101329102B1
Application number: KR1020120020018A
Authority: KR
Inventors: 이재영; 온기원; 안기영
Original assignee: 주식회사 케이쓰리아이
Priority date: 2012-02-28
Filing date: 2012-02-28
Publication date: 2013-11-14
Also published as: KR20130098470A; WO2013129729A1

Abstract

본 발명은 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템에 관한 것으로, 더욱 상세하게는 증강현실 서비스의 대상이 되는 인쇄매체, 신문 등의 PDF 파일로부터 추출하여 저장된 데이터베이스에서 원하는 페이지를 효율적으로 검출하는 것으로서, 페이지의 전체 레이아웃으로 조회할 때 레이아웃 기술자(Layout descriptor)를 이용하여 매칭시키며, 제한된 페이지 범위 내에서의 이미지 특징점 벡터를 이용하여 매칭시켜 모바일 단말기기와 네트워크 환경하에서 문서이미지관리서버로부터 증강현실 서비스의 대상이 되는 페이지면을 조회하는 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템에 관한 것이다.
본 발명에 의하면, 증강현실 서비스의 대상이 되는 인쇄매체, 신문 등의 PDF 파일로부터 추출하여 저장된 데이터베이스에서 원하는 페이지를 효율적으로 검출하도록 하는데 있으며, 특히 페이지의 전체 레이아웃으로 조회할 때 레이아웃 기술자(Layout descriptor)를 이용하여 매칭시키며, 제한된 페이지 범위 내에서의 이미지 특징점 벡터를 이용하여 매칭시켜 모바일 단말기기와 네트워크 환경하에서 문서이미지관리서버로부터 증강현실 서비스의 대상이 되는 페이지면을 조회할 수 있게 된다.The present invention relates to a real-time augmented reality image retrieval system using a layout descriptor and an image feature point, more specifically, to extract a desired page from a PDF file of a print medium, a newspaper, etc., which is a target of an augmented reality service, and efficiently stores a desired page. When searching by the entire layout of the page, it matches by using a layout descriptor, and matches by using image feature point vectors within a limited page range and augments it from a document image management server under a mobile terminal and a network environment. A real time augmented reality image retrieval system using a layout descriptor for retrieving a page surface that is a target of a real service and an image feature point.
According to the present invention, it is possible to efficiently detect a desired page in a stored database extracted from a PDF file of a print medium, a newspaper, etc., which is a target of an augmented reality service, and in particular, when looking up the entire layout of a page (Layout descriptor) ), And by using image feature point vectors within a limited page range, it is possible to search the page surface of the augmented reality service from the document image management server under the mobile terminal and the network environment.

Description

Augmented reality-image retrieval system using layout descriptor and image feature.}

본 발명은 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템에 관한 것으로, 더욱 상세하게는 증강현실 서비스의 대상이 되는 인쇄매체, 신문 등의 PDF 파일로부터 추출하여 저장된 데이터베이스에서 원하는 페이지를 효율적으로 검출하는 것으로서, 페이지의 전체 레이아웃으로 조회할 때 레이아웃 기술자(Layout descriptor)를 이용하여 매칭시키며, 제한된 페이지 범위 내에서의 이미지 특징점 벡터를 이용하여 매칭시켜 모바일 단말기기와 네트워크 환경하에서 문서이미지관리서버로부터 증강현실 서비스의 대상이 되는 페이지면을 조회하는 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템에 관한 것이다.
The present invention relates to a real-time augmented reality image retrieval system using a layout descriptor and an image feature point, more specifically, to extract a desired page from a PDF file of a print medium, a newspaper, etc., which is a target of an augmented reality service, and efficiently stores a desired page. When searching by the entire layout of the page, it matches by using a layout descriptor, and matches by using image feature point vectors within a limited page range and augments it from a document image management server under a mobile terminal and a network environment. A real time augmented reality image retrieval system using a layout descriptor for retrieving a page surface that is a target of a real service and an image feature point.

통상적으로 전자책이란 좁은 의미로는 종이에 인쇄된 책이 아닌 XML, 디지털 이미지, 멀티미디어 등으로 제작해 컴퓨터나 PDA, 휴대폰과 같은 단말기 화면을 통해 보는 디지털 책을 말한다. In general, an e-book refers to a digital book that is produced through XML, digital images, multimedia, etc., rather than a book printed on paper, and viewed through terminal screens such as computers, PDAs, and mobile phones.

최근 컴퓨터와 핸드폰, PMP 등과 같은 단말기로 이용할 수 있는 다양한 전자책이 개발되어 왔으며, PDF, XML 등의 1세대 전자책과 Flash 등 멀티미디어가 결합된 2세대 전자책이 활발히 개발되어 기존의 종이책(paper book)을 대체할 수 있는 수단으로 각광받아 왔다. Recently, various e-books that can be used as terminals such as computers, mobile phones, and PMPs have been developed. Second-generation e-books that combine multimedia such as PDF, XML, and first generation e-books have been actively developed. It has been in the spotlight as a means to replace paper books.

이는 종이책에서 얻을 수 없는 정보를 다양한 멀티미디어 콘텐츠를 통해 획득할 수 있으며, 동일한 종이책에 목적과 상황에 따른 다양한 멀티미디어 시나리오를 결합할 수 있어 컨텐츠의 다양화를 제공하는 잇점이 존재하게 된다.This can be obtained through a variety of multimedia content information that can not be obtained from a paper book, there is an advantage that can provide a variety of content by combining a variety of multimedia scenarios according to the purpose and situation in the same paper book.

상기의 기술은 시각적 정보를 증강현실 기술을 이용하여 제공하게 되므로 페이지의 인식 및 추적기법은 가장 중요한 부분이 된다. Since the above technology provides visual information by using augmented reality technology, page recognition and tracking techniques become the most important part.

종래에는 페이지를 인식하기 위해 주로 마커와 텍스쳐 기반의 자연 특징점 매칭 기법이 적용되었다.Conventionally, natural feature point matching techniques based on markers and textures have been applied to recognize pages.

마커 기반 기법은 페이지 인식 및 추적이 가능하지만, 인쇄된 출판물에 특정패턴의 마커를 부착해야 할 뿐만 아니라 마커의 일부분이 가려질 경우 콘텐츠가 증강되지 않는 문제점이 있다. Marker-based techniques are capable of page recognition and tracking, but there is a problem in that the content is not augmented when a part of the marker is obscured, as well as attaching a marker of a specific pattern to the printed publication.

또한, 구현 가능한 마커 패턴의 한계로 방대한 분량의 페이지를 갖는 책에는 적용하기가 어렵다.In addition, it is difficult to apply to a book having a large amount of pages due to the limitation of the marker pattern that can be implemented.

특히, 인쇄된 페이지를 사용자의 카메라로 촬영하여 이미지 특징점만으로 데이터베이스에서 매칭 과정을 수행하기에는 대상 페이지들간의 유사성과 방대한 특징점의 양으로 인해 빠르고 정확한 페이지 인식이 어려운 문제점이 발생하게 된다.Particularly, in order to perform a matching process in a database using only image feature points by photographing printed pages with a user's camera, a problem of fast and accurate page recognition is difficult due to the similarity between target pages and the amount of feature points.

따라서, 인식의 편리성 및 효율적인 페이지 매칭 방법을 제공하는 기술을 요구하게 되었다.
Therefore, there is a need for a technology that provides a method for convenient recognition and efficient page matching.

없음.none.

따라서 본 발명은 상기와 같은 종래 기술의 문제점을 감안하여 제안된 것으로서, 본 발명의 목적은 증강현실 서비스의 대상이 되는 인쇄매체, 신문 등의 PDF 파일로부터 추출하여 저장된 데이터베이스에서 원하는 페이지를 효율적으로 검출하도록 하는데 있으며, 특히 페이지의 전체 레이아웃으로 조회할 때 레이아웃 기술자(Layout descriptor)를 이용하여 매칭시키며, 제한된 페이지 범위 내에서의 이미지 특징점 벡터를 이용하여 매칭시켜 모바일 단말기기와 네트워크 환경하에서 문서이미지관리서버로부터 증강현실 서비스의 대상이 되는 페이지면을 조회할 수 있도록 하는데 있다.
Therefore, the present invention has been proposed in view of the above-described problems of the prior art, and an object of the present invention is to efficiently detect a desired page from a database stored in a PDF file extracted from a print medium, a newspaper, etc., which is an object of an augmented reality service. In particular, when looking up the entire layout of a page, it is matched by using a layout descriptor, and it is matched by using the image feature point vector within the limited page range from the document image management server under the mobile terminal and the network environment. It is to be able to search the page surface of the augmented reality service.

본 발명이 해결하고자 하는 과제를 달성하기 위하여,In order to achieve the object of the present invention,

본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템은,Real-time augmented reality image retrieval system using a layout descriptor and image feature point in accordance with an embodiment of the present invention,

카메라로부터 이미지를 캡쳐하여 해당 이미지의 특징점을 추출하여 벡터로 변환하며, 인접 픽셀간의 거리를 임계값 기준으로 설정하여 레이아웃 블록을 생성한 후 레이아웃 기술자 엑스엠엘(XML)을 생성하며, 상기 생성된 이미지 특징점 벡터와 레이아웃 기술자 엑스엠엘을 문서이미지관리서버로 송출하여 이미지 조회를 요청하여 문서이미지관리서버로부터 제공되는 이미지를 렌더링하여 화면에 출력시키기 위한 모바일단말기(100)와;Capture the image from the camera and extract the feature points of the image into a vector, create a layout block by setting the distance between adjacent pixels as a threshold value, and generate a layout descriptor XML (XML), the generated image A mobile terminal 100 for transmitting the feature point vector and the layout descriptor XML to the document image management server and requesting an image inquiry to render an image provided from the document image management server and output it to the screen;

레이아웃 기술자 항목 및 이미지 특징점 벡터 정보를 저장하고 있는 이미지정보디비를 포함하여 구성되며, 모바일단말기로부터 이미지 조회 요청시 제공되는 레이아웃 기술자 엑스엠엘을 참조하여 해당 레이아웃 조건에 부합되는 이미지들을 이미지정보디비에서 검색하여 레이아웃 매칭 결과리스트를 작성하며, 상기 레이아웃 매칭 결과리스트에서 이미지 특징점 벡터와 매칭되는 이미지를 조회하여 추출된 해당 이미지를 모바일단말기에 제공하기 위한 문서이미지관리서버(200);를 포함하여 구성되어 본 발명의 과제를 해결하게 된다.
It includes the image information DB that stores the layout descriptor item and the image feature point vector information, and searches for the image in the image information DB by referring to the layout descriptor XML provided when the image is requested from the mobile terminal. A document image management server 200 for creating a layout matching result list and providing a corresponding image extracted by searching for an image matching the image feature point vector in the layout matching result list to the mobile terminal. The problem of the invention is solved.

이상의 구성 및 작용을 지니는 본 발명에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템은,Real-time augmented reality image retrieval system using a layout descriptor and image feature point according to the present invention having the above configuration and action,

증강현실 서비스의 대상이 되는 인쇄매체, 신문 등의 PDF 파일로부터 추출하여 저장된 데이터베이스에서 원하는 페이지를 효율적으로 검출하도록 하는데 있으며, 특히 페이지의 전체 레이아웃으로 조회할 때 레이아웃 기술자(Layout descriptor)를 이용하여 매칭시키며, 제한된 페이지 범위 내에서의 이미지 특징점 벡터를 이용하여 매칭시켜 모바일 단말기기와 네트워크 환경하에서 문서이미지관리서버로부터 증강현실 서비스의 대상이 되는 페이지면을 조회할 수 있게 된다.
It extracts from a PDF file such as a print media or newspaper, which is the target of augmented reality service, to efficiently detect a desired page in a stored database. Especially, when searching the entire layout of a page, it matches using a layout descriptor. By using image feature point vectors within a limited page range, the page surface which is augmented reality service target can be retrieved from document image management server under mobile terminal and network environment.

도 1은 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 전체 구성도이다.
도 2는 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 모바일단말기 블록도이다.
도 3은 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 문서이미지관리서버 블록도이다.
도 4는 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 이미지정보디비구축부 블록도이다.
도 5는 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 증강현실을 제공하는 흐름을 나타낸 흐름도이다.
도 6은 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 이미지정보디비에 정보를 저장하는 흐름을 나타낸 흐름도이다.
도 7은 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 이미지정보디비의 데이터필드를 나타낸 예시도이다.
도 8은 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 피디에프 파일을 이미지로 변환한 예를 나타낸 예시도이다.
도 9는 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 전체 페이지 레이아웃을 나타낸 예시도이다.
도 10은 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 레이아웃 기술자 엑스엠엘(XML)을 나타낸 예시도이다.1 is an overall configuration diagram of a real-time augmented reality image retrieval system using a layout descriptor and image feature points according to an embodiment of the present invention.
2 is a block diagram of a mobile terminal of a real-time augmented reality image retrieval system using a layout descriptor and image feature points according to an embodiment of the present invention.
3 is a block diagram of a document image management server of a real-time augmented reality image retrieval system using a layout descriptor and image feature points according to an embodiment of the present invention.
4 is a block diagram of an image information DB construction unit of a real-time augmented reality image retrieval system using a layout descriptor and image feature points according to an exemplary embodiment of the present invention.
5 is a flowchart illustrating an augmented reality of a real-time augmented reality image retrieval system using a layout descriptor and an image feature point according to an embodiment of the present invention.
6 is a flowchart illustrating a flow of storing information in an image information DB of a real-time augmented reality image retrieval system using a layout descriptor and an image feature point according to an embodiment of the present invention.
7 is an exemplary diagram illustrating a data field of an image information DB of a real-time augmented reality image retrieval system using a layout descriptor and image feature points according to an embodiment of the present invention.
8 is an exemplary diagram illustrating an example of converting a PD file to an image of a real-time augmented reality image retrieval system using a layout descriptor and an image feature point according to an embodiment of the present invention.
9 is an exemplary view showing the entire page layout of a real-time augmented reality image search system using a layout descriptor and image feature points according to an embodiment of the present invention.
FIG. 10 is an exemplary diagram illustrating a layout descriptor XML of a real-time augmented reality image retrieval system using a layout descriptor and image feature points according to an embodiment of the present invention.

이하, 본 발명에 의한 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 실시예를 통해 상세히 설명하도록 한다.Hereinafter, an embodiment of a real-time augmented reality image search system using a layout descriptor and an image feature point according to the present invention will be described in detail.

도 1은 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 전체 구성도이다.1 is an overall configuration diagram of a real-time augmented reality image retrieval system using a layout descriptor and image feature points according to an embodiment of the present invention.

도 1에 도시한 바와 같이, 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템은,As shown in FIG. 1, a real-time augmented reality image retrieval system using a layout descriptor and an image feature point according to an embodiment of the present invention is provided.

레이아웃 기술자 항목 및 이미지 특징점 벡터 정보를 저장하고 있는 이미지정보디비를 포함하여 구성되며, 모바일단말기로부터 이미지 조회 요청시 제공되는 레이아웃 기술자 엑스엠엘을 참조하여 해당 레이아웃 조건에 부합되는 이미지들을 이미지정보디비에서 검색하여 레이아웃 매칭 결과리스트를 작성하며, 상기 레이아웃 매칭 결과리스트에서 이미지 특징점 벡터와 매칭되는 이미지를 조회하여 추출된 해당 이미지를 모바일단말기에 제공하기 위한 문서이미지관리서버(200);를 포함하여 구성되게 된다.It includes the image information DB that stores the layout descriptor item and the image feature point vector information, and searches for the image in the image information DB by referring to the layout descriptor XML provided when the image is requested from the mobile terminal. A document image management server 200 for creating a layout matching result list and providing a corresponding image extracted by searching for an image matching the image feature point vector in the layout matching result list to the mobile terminal. .

상기 모바일단말기를 통해 문서이미지관리서버로부터 증강 현실 컨텐츠(이미지)를 제공받음에 있어 증강현실에 대상되는 페이지면을 조회하게 되는 것이다.When receiving the augmented reality content (image) from the document image management server through the mobile terminal is to query the page surface that is augmented reality.

모바일단말게는 조회 요청 쿼리 이미지를 가공하여 블록을 형성한 뒤 레이아웃 기술자(Layout descriptor)를 XML로 기록한다. The mobile terminal processes the query request query image to form a block and records the layout descriptor as XML.

상기 Layout descriptor XML은 block의 전체 개수, 중앙기준 위치별 block의 개수, 구분선(separator)의 개수와 위치 등을 포함한다.The layout descriptor XML includes the total number of blocks, the number of blocks for each central reference position, the number and positions of separators, and the like.

상기 Layout descriptor와 추출된 이미지 특징점 정보로써 문서이미지관리서버에 조회를 요청하면 문서이미지관리서버에서는 Layout descriptor XML 내용을 데이터베이스 조회 조건으로 파싱하여 조회를 수행한다. When a request is made to the document image management server using the layout descriptor and the extracted image feature point information, the document image management server parses the layout descriptor XML content as a database inquiry condition and performs the inquiry.

페이지 간의 레이아웃의 유사성 때문에 매칭 결과 다수개의 페이지가 인출될 수 있으며, 인출된 페이지 범위 내에서 이미지 특징점 매칭 작업을 수행하여 유사성(Similarity)이 가장 높은 페이지를 인식 결과로써 모바일단말기에 제시하게 된다.Due to the similarity of layout between pages, a plurality of pages may be retrieved as a result of the matching, and the image feature point matching operation is performed within the retrieved page range, and the page having the highest similarity is presented to the mobile terminal as a recognition result.

도 2는 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 모바일단말기 블록도이다.2 is a block diagram of a mobile terminal of a real-time augmented reality image retrieval system using a layout descriptor and image feature points according to an embodiment of the present invention.

도 2에 도시한 바와 같이, 본 발명의 모바일단말기(100)는,As shown in Figure 2, the mobile terminal 100 of the present invention,

카메라로부터 획득되는 이미지를 캡쳐하기 위한 이미지캡쳐부(110)와,An image capture unit 110 for capturing an image obtained from a camera;

상기 캡쳐된 이미지를 획득하여 다운샘플링, 노이즈 제거, 기울기 보정의 전처리 과정을 수행하기 위한 이미지전처리수행부(120)와,An image preprocessing unit 120 for acquiring the captured image and performing preprocessing of downsampling, noise removal, and tilt correction;

상기 이미지전처리수행부에 의해 전처리 과정을 수행한 이미지를 획득하여 특징점을 추출하기 위한 이미지특징점추출부(130)와,An image feature point extracting unit 130 for extracting a feature point by acquiring an image having undergone a preprocessing process by the image preprocessing unit;

이미지의 인접 픽셀간의 거리를 임계값 기준으로 설정하여 레이아웃 블록을 생성하기 위한 레이아웃블록생성부(140)와,A layout block generator 140 for generating a layout block by setting a distance between adjacent pixels of an image as a threshold value;

상기 이미지특징점추출부에 의해 추출된 이미지 특징점을 획득하여 벡터로 변환하기 위한 이미지특징점벡터변환부(150)와,An image feature point vector converter 150 for acquiring an image feature point extracted by the image feature point extractor and converting the image feature point into a vector;

상기 레이아웃블록생성부에 의해 생성된 레이아웃 블록으로부터 레이아웃 기술자 엑스엠엘(XML)을 생성하기 위한 레이아웃기술자엑스엠엘생성부(160)와,A layout descriptor XML generation unit 160 for generating a layout descriptor XML from the layout block generated by the layout block generation unit;

상기 이미지특징점벡터변환부에 의해 변환된 이미지 특징점 벡터와 레이아웃기술자엑스엠엘생성부에 의해 생성된 레이아웃 기술자 엑스엠엘을 획득하여 문서이미지관리서버로 이미지 조회를 요청하기 위한 이미지조회요청부(170)와,An image lookup request unit 170 for acquiring an image feature point vector converted by the image feature point vector converting unit and a layout descriptor XML generated by the layout descriptor XMl generation unit and requesting an image search to a document image management server; ,

문서이미지관리서버에서 송출된 이미지 페이지 상의 3차원 좌표를 추적하기 위한 트래킹모듈부(180)와,Tracking module unit 180 for tracking the three-dimensional coordinates on the image page sent from the document image management server,

상기 트래킹모듈부에 의해 추적된 3차원 좌표를 참조하여 렌더링을 수행하기 위한 렌더링모듈부(190)와,A rendering module unit 190 for performing rendering with reference to the 3D coordinates tracked by the tracking module unit;

상기 각각의 부 간의 신호 흐름을 제어하기 위한 단말기제어부(195)를 포함하여 구성되게 된다.It is configured to include a terminal control unit 195 for controlling the signal flow between the respective units.

상기 이미지캡쳐부(110)는 카메라로부터의 연속 프레임에서 이미지를 캡쳐하게 된다.The image capture unit 110 is to capture an image in a continuous frame from the camera.

상기 이미지전처리수행부(120)는 캡쳐된 이미지를 획득하여 다운샘플링, 노이즈 제거, 기울기 보정 등의 전처리 과정을 수행하게 된다.The image preprocessing unit 120 acquires the captured image and performs preprocessing such as downsampling, noise removal, and tilt correction.

상기 다운샘플링, 노이즈 제거, 기울기 보정은 당업자들에게 널리 알려진 기술이므로 상세한 설명은 생략하겠다.The downsampling, noise removal, and tilt correction are well known to those skilled in the art, and thus detailed descriptions thereof will be omitted.

이후, 본 발명의 특징적인 부분은 특징점 추출 및 특징점 벡터 변환과 레이아웃 블록생성 및 레이아웃 기술자 XML 생성인 것이다.The characteristic parts of the present invention are the feature point extraction and feature point vector transformation and layout block generation and layout descriptor XML generation.

이를 통해 검색의 정확성과 프로세싱 시간의 단축을 효율적으로 수행할 수 있게 되는 것이다.This makes it possible to efficiently perform search accuracy and shorten processing time.

상기 이미지특징점추출부(130)는 이미지전처리수행부에 의해 전처리 과정을 수행한 이미지를 획득하여 특징점을 추출하게 되며, 레이아웃블록생성부(140)는 이미지의 인접 픽셀간의 거리를 임계값 기준으로 설정하여 레이아웃 블록을 생성하게 된다.The image feature point extractor 130 obtains an image that has been preprocessed by the image preprocessing unit and extracts feature points, and the layout block generator 140 sets distances between adjacent pixels of the image based on a threshold value. To create the layout block.

이후, 이미지특징점벡터변환부(150)는 이미지특징점추출부에 의해 추출된 이미지 특징점을 획득하여 벡터로 변환하게 되며, 상기 레이아웃기술자엑스엠엘생성부(160)는 레이아웃블록생성부에 의해 생성된 레이아웃 블록으로부터 레이아웃 기술자 엑스엠엘(XML)을 생성하게 된다.Thereafter, the image feature point vector converter 150 acquires the image feature points extracted by the image feature point extractor and converts the image feature points into a vector. The layout descriptor X-Mn generator 160 generates a layout generated by the layout block generator. The layout descriptor XML (XML) is generated from the block.

상기 이미지 상에서 특징점을 추출하는 기술과 벡터로 변환하는 기술은 이미 당업자들에게 널리 알려진 기술이므로 상세한 설명은 생략하겠다.Techniques for extracting feature points on the image and converting the feature points into vectors are well known to those skilled in the art, and thus detailed descriptions thereof will be omitted.

상기 레이아웃기술자엑스엠엘생성부에 의해 레이아웃 블록으로부터 레이아웃 기술자 엑스엠엘(XML)을 생성하는 것은 도 10에 도시하였다.Generating layout descriptor XML from the layout block by the layout descriptor XML generating unit is illustrated in FIG. 10.

상기 처리 과정을 마친 후, 이미지조회요청부(170)에서는 이미지특징점벡터변환부에 의해 변환된 이미지 특징점 벡터와 레이아웃기술자엑스엠엘생성부에 의해 생성된 레이아웃 기술자 엑스엠엘을 획득하여 문서이미지관리서버로 이미지 조회를 요청하게 된다.After completing the above processing, the image lookup request unit 170 obtains the image feature point vector converted by the image feature point vector converting unit and the layout descriptor XML generated by the layout engineer XML generation unit to obtain a document image management server. You will be asked to retrieve an image.

이후, 트래킹모듈부(180)에서 문서이미지관리서버로부터 송출된 이미지 페이지 상의 3차원 좌표를 추적하게 되며, 렌더링모듈부(190)에 의해 트래킹모듈부로부터 추적된 3차원 좌표를 참조하여 렌더링을 수행하여 증강 현실 정보를 제공하게 되는 것이다.Thereafter, the tracking module unit 180 tracks the three-dimensional coordinates on the image page sent from the document image management server, and performs rendering with reference to the three-dimensional coordinates tracked from the tracking module unit by the rendering module unit 190. To provide augmented reality information.

상기 트래킹 프로세싱 기술과 렌더링 기술은 증강현실에서 일반적으로 사용되는 기술이므로 상세한 설명은 생략하겠다.Since the tracking processing technique and the rendering technique are generally used in augmented reality, detailed description thereof will be omitted.

도 3은 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 문서이미지관리서버 블록도이다.3 is a block diagram of a document image management server of a real-time augmented reality image retrieval system using a layout descriptor and image feature points according to an embodiment of the present invention.

도 3에 도시한 바와 같이, 상기 문서이미지관리서버(200)는,As shown in Figure 3, the document image management server 200,

모바일단말기로부터 이미지 조회 요청시, 송출되는 레이아웃 기술자 엑스엠엘을 획득하여 이미지정보디비 조회 조건으로 변환하기 위하여 파싱하기 위한 파싱부(210)와,Parsing unit 210 for parsing in order to obtain the layout descriptor XML transmitted to the image information request from the mobile terminal to convert the image information DB query conditions, and

이미지정보디비로부터 레이아웃 조건에 부합하는 이미지를 조회하여 레이아웃 매칭 결과리스트를 작성하는 매칭결과리스트작성부(220)와,A matching result list creating unit 220 for searching an image corresponding to the layout condition from the image information DB and creating a layout matching result list;

모바일단말기로부터 이미지 조회 요청시, 송출되는 이미지 특징점 벡터를 획득하여 이미지 특징점 벡터와 매칭되는 이미지를 매칭결과리스트작성부에 의해 작성된 레이아웃 매칭 결과리스트에서 조회하기 위한 이미지매칭조회부(230)와,An image matching query unit 230 for retrieving an image feature point vector transmitted from the mobile terminal and retrieving an image matching the image feature point vector from a layout matching result list created by the matching result list creating unit;

원본 저장경로, 이미지 저장경로, 레이아웃 기술자 항목, 이미지 특징점 벡터 정보를 포함하여 저장하고 관리하기 위한 이미지정보디비(240)와,An image information DB 240 for storing and managing an original storage path, an image storage path, a layout descriptor item, and image feature point vector information;

상기 이미지매칭조회부에 의해 조회된 이미지를 모바일단말기로 송출하기 위한 이미지송출부(250)와,An image transmitting unit 250 for transmitting an image queried by the image matching inquiry unit to a mobile terminal;

상기 각각의 부와 디비 간의 신호 흐름을 제어하기 위한 서버제어부(260)를 포함하여 구성되는 것을 특징으로 한다.It characterized in that it comprises a server control unit 260 for controlling the signal flow between the respective units and the DB.

즉, 파싱부(210)는 모바일단말기로부터 이미지 조회 요청시, 송출되는 레이아웃 기술자 엑스엠엘(XML)을 획득하여 이미지정보디비 조회 조건으로 변환하기 위하여 파싱하게 된다.That is, the parsing unit 210 acquires the layout descriptor XML (XML) transmitted when the image inquiry request is received from the mobile terminal and parses it to convert the image information into the DB information retrieval condition.

상기 매칭결과리스트작성부(220)는 이미지정보디비(240)로부터 레이아웃 조건에 부합하는 이미지를 조회하여 레이아웃 매칭 결과리스트를 작성하게 된다.The matching result list creating unit 220 inquires an image matching the layout condition from the image information DB 240 and creates a layout matching result list.

이후에 이미지매칭조회부(230)는 모바일단말기로부터 이미지 조회 요청시, 송출되는 이미지 특징점 벡터를 획득하여 이미지 특징점 벡터와 매칭되는 이미지를 매칭결과리스트작성부에 의해 작성된 레이아웃 매칭 결과리스트에서 조회하게 되는데, 특징적인 것은 조회 결과 임계값 이상의 유사성(similarity rate)을 가지고 있는 이미지를 획득하였으면 조회 결과를 이미지송출부로 전송하게 되며, 조회 결과 임계값 미만의 유사성(similarity rate)을 가지고 있는 이미지를 획득하였으면 다른 이미지에 대하여 매칭 연산을 수행하는 것이다.Thereafter, when the image matching query unit 230 requests an image search from the mobile terminal, the image matching query unit 230 obtains an image feature point vector to be sent and searches an image matching the image feature point vector in the layout matching result list created by the matching result list creating unit. The characteristic is that if an image with similarity rate above the threshold is obtained, the search result is sent to the image transmitter. If an image with a similarity rate below the threshold is obtained, another image is obtained. It performs a matching operation on the image.

상기 이미지송출부(250)는 이미지매칭조회부에 의해 조회된 이미지를 모바일단말기로 송출하게 되고, 상기 모바일단말기에서는 해당 이미지를 트래킹하여 렌더링하게 되는 것이다.The image transmitting unit 250 transmits the image inquired by the image matching inquiry unit to the mobile terminal, and the mobile terminal tracks and renders the image.

도 4는 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 이미지정보디비구축부 블록도이다.4 is a block diagram of an image information DB construction unit of a real-time augmented reality image retrieval system using a layout descriptor and image feature points according to an exemplary embodiment of the present invention.

도 4에 도시한 바와 같이, 본 발명의 이미지정보디비구축부(270)는 레이아웃 기술자 항목, 이미지 특징점 벡터 정보를 원본 이미지 파일로부터 추출하여 이미지정보디비에 저장시키게 된다.As shown in FIG. 4, the image information DB construction unit 270 of the present invention extracts the layout descriptor item and the image feature point vector information from the original image file and stores the extracted information in the image information DB.

구체적으로 설명하자면, 상기 이미지정보디비구축부(270)는,Specifically, the image information DB building unit 270,

피디에프 파일을 이미지로 변환하기 위한 이미지변환부(271)와,An image converter 271 for converting a PD file into an image,

상기 이미지변환부에 의해 변환된 이미지를 획득하여 블록을 형성하기 위한 이미지블록화부(272)와,An image blocker 272 for acquiring an image converted by the image converter and forming a block;

상기 이미지블록화부에 의해 형성된 이미지 블록들간의 수직 거리가 임계치 이하인 경우에 하나의 블록으로 통합하기 위한 인접블록통합부(273)와,An adjacent block integrating unit 273 for integrating into one block when the vertical distance between the image blocks formed by the image blocking unit is less than or equal to a threshold;

상기 이미지블록화부 및 인접블록통합부에 의해 획득된 전체 레이아웃을 기준으로 각 블록의 상대적 위치를 설명하는 레이아웃 기술자(Layout descriptor)를 추출하기 위한 레이아웃기술자추출부(274)와,A layout descriptor extracting unit 274 for extracting a layout descriptor describing a relative position of each block based on the entire layout obtained by the image block forming unit and the adjacent block integrating unit;

상기 이미지변환부(271)에 의해 변환된 이미지를 획득하여 다운샘플링, 노이즈 제거, 기울기 보정의 전처리 과정을 수행하고, 전처리 과정을 수행한 이미지에서 특징점을 추출하기 위한 디비저장용특징점추출부(275)와,Performing preprocessing of downsampling, noise removal, and tilt correction by acquiring the image converted by the image conversion unit 271, and extracting feature point extractor 275 for extracting feature points from the preprocessed image. )Wow,

상기 전처리 과정 및 특징점 추출을 수행한 이미지 정보를 획득하여 불변량 좌표계로 변환하기 위한 불변량좌표계변환부(276)와,An invariant coordinate system conversion unit 276 for acquiring image information obtained by performing the preprocessing and feature point extraction and converting the image information into an invariant coordinate system;

상기 전처리 과정 및 특징점 추출을 수행한 이미지 정보를 획득하여 해쉬 인덱스를 산출하기 위한 해쉬인덱스산출부(277)와,A hash index calculation unit 277 for calculating a hash index by acquiring image information obtained by performing the preprocessing and feature point extraction;

이미지 아이디, 원본 저장경로, 이미지 저장경로, 레이아웃 기술자 항목들, 이미지 특징점 벡터 정보를 획득하여 해쉬 테이블을 생성시킨 후 이미지정보디비에 저장시키는 디비구축모듈부(278)을 포함하여 구성되게 된다.And a DB building module unit 278 for generating a hash table by acquiring an image ID, an original storage path, an image storage path, layout descriptor items, and image feature point vector information, and storing the image in an image information DB.

상기 이미지변환부(271)는 피디에프 파일을 도 8에 도시한 바와 같이, 이미지로 변환하게 되며, 피디에프 파일은 페이지 한 개당 개별적으로 주어지게 된다.The image conversion unit 271 converts the PD file into an image, as shown in FIG. 8, and the PD file is individually given per page.

상기 이미지블록화부(272)는 이미지변환부에 의해 변환된 이미지를 획득하여 블록을 형성하게 된다.The image blocker 272 forms a block by acquiring an image converted by the image converter.

이후, 인접블록통합부(273)는 이미지블록화부에 의해 형성된 이미지 블록들간의 수직 거리가 임계치 이하인 경우에 하나의 블록으로 통합하게 된다.Thereafter, the adjacent block integrating unit 273 merges into one block when the vertical distance between the image blocks formed by the image block forming unit is less than or equal to the threshold.

즉, 레이아웃 기술자를 좀 더 단순화하기 위하여 이미지블록화부에서 형성된 이미지 블록들을 취합할 필요가 있으며 블록간의 수직거리가 임계치 이하인 경우 도 9에 도시한 바와 같이, 하나의 블록으로 통합하게 된다.That is, in order to further simplify the layout descriptor, it is necessary to collect image blocks formed in the image block unit, and when the vertical distance between the blocks is less than or equal to the threshold, as shown in FIG. 9, the blocks are integrated into one block.

이후, 레이아웃기술자추출부(274)는 이미지블록화부 및 인접블록통합부에 의해 획득된 전체 레이아웃을 기준으로 각 블록의 상대적 위치를 설명하는 레이아웃 기술자(Layout descriptor)를 추출하게 된다.Thereafter, the layout descriptor extracting unit 274 extracts a layout descriptor describing a relative position of each block based on the entire layout obtained by the image block forming unit and the adjacent block integrating unit.

예를 들어, 페이지 중간을 기준으로 좌/우의 블록수, 단락간 구분자의 사분면 위치 등을 추출하게 되며, 이러한 속성들을 데이터베이스의 인덱스로 활용하게 되는 것이다.For example, the number of blocks on the left and right and the quadrant position of the separator between paragraphs are extracted based on the middle of the page, and these attributes are used as an index of the database.

상기 디비저장용특징점추출부(275)는 이미지변환부(271)에 의해 변환된 이미지를 획득하여 다운샘플링, 노이즈 제거, 기울기 보정의 전처리 과정을 수행하고, 전처리 과정을 수행한 이미지에서 특징점을 추출하게 된다.The feature storing point extractor 275 for storing the image is obtained by converting the image converted by the image converter 271 to perform presampling of downsampling, noise removal, and tilt correction, and extracts feature points from the preprocessed image. Done.

상기 특징점 추출은 일반적으로 바이너리 이미지 생성과 가우시안필터로 blurred 이미지를 생성하게 되며, 상기 결과 도출된 각 단어영역(word regions)의 중심을 feature point로 추출하는 것이다.In general, the feature point extraction generates a blurred image using a binary image generation and a Gaussian filter, and extracts the center of each of the resulting word regions as a feature point.

이는 모바일단말기의 전처리과정과 동일한 과정이다.This is the same process as the preprocessing of the mobile terminal.

상기 불변량좌표계변환부(276)는 전처리 과정 및 특징점 추출을 수행한 이미지 정보를 획득하여 불변량 좌표계로 변환하게 된다.The invariant coordinate system conversion unit 276 obtains image information obtained by performing a preprocessing process and feature point extraction and converts the image information into a invariant coordinate system.

상기 불변량 계산 방식은 이미 당업자들에게 알려진 기술이므로 간단하게 설명하도록 한다.Since the invariant calculation method is already known to those skilled in the art, it will be briefly described.

즉, 상기 특징점추출과정에서 추출된 각 특징점(feature point)마다 7개의 인접점을 찾게 되며, 인접점간의 관계로 불변량을 나타내기 위하여 4개의 인접점을 잇는 두 삼각형 간의 cross ratio를 계산하게 되며, 상기와 같은 방식으로 모든 인접점에 대한 조합으로 cross ratio를 산출해 내는 것이다.That is, seven adjacent points are found for each feature point extracted in the feature point extraction process, and the cross ratio between two triangles connecting four adjacent points is calculated to represent invariant relations between adjacent points. In the same manner as above, the cross ratio is calculated by combining all adjacent points.

상기 해쉬인덱스산출부(277)는 전처리 과정 및 특징점 추출을 수행한 이미지 정보를 획득하여 해쉬 인덱스를 산출하게 된다.The hash index calculation unit 277 calculates a hash index by obtaining image information obtained by performing a preprocessing process and feature point extraction.

상기 해쉬 인덱스 산출 과정은 이미 당업자들에게 알려진 기술이므로 상세한 설명은 생략하겠다.Since the hash index calculation process is already known to those skilled in the art, a detailed description thereof will be omitted.

본 발명에서는 빠른 검색을 위해 해쉬 구조(hash scheme)를 채용한 것이다. In the present invention, a hash scheme is adopted for fast searching.

이후, 상기 디비구축모듈부(278)는 이미지 아이디, 원본 저장경로, 이미지 저장경로, 레이아웃 기술자 항목들, 이미지 특징점 벡터 정보를 획득하여 해쉬 테이블을 생성시킨 후 이미지정보디비에 저장시키게 되는 것이다.(도 7 참조)Thereafter, the DB building module unit 278 acquires an image ID, an original storage path, an image storage path, layout descriptor items, and image feature point vector information, generates a hash table, and stores the hash table in the image information DB. See FIG. 7)

도 5는 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 증강현실을 제공하는 흐름을 나타낸 흐름도이다.5 is a flowchart illustrating an augmented reality of a real-time augmented reality image retrieval system using a layout descriptor and an image feature point according to an embodiment of the present invention.

도 5에 도시한 바와 같이, 이미지캡쳐부(110)가 카메라로부터 획득되는 이미지를 캡쳐(S100)하게 되며, 이미지전처리수행부(120)가 캡쳐된 이미지를 획득하여 다운샘플링, 노이즈 제거, 기울기 보정의 전처리 과정을 수행(S110)하게 된다.As shown in FIG. 5, the image capture unit 110 captures an image obtained from a camera (S100), and the image preprocessing unit 120 acquires the captured image to downsample, remove noise, and correct the tilt. The pretreatment process is performed (S110).

이후에 이미지특징점추출부(130)가 이미지전처리수행부에 의해 전처리 과정을 수행한 이미지를 획득하여 특징점을 추출(S120)하게 되며, 이미지특징점벡터변환부(150)가 이미지특징점추출부에 의해 추출된 이미지 특징점을 획득하여 벡터로 변환(S125)하게 되는 것이다.Thereafter, the image feature point extractor 130 obtains an image that has been preprocessed by the image preprocessing unit to extract the feature point (S120), and the image feature point vector converter 150 is extracted by the image feature point extractor. The image feature point is obtained and converted into a vector (S125).

또한, 레이아웃블록생성부(140)가 이미지의 인접 픽셀간의 거리를 임계값 기준으로 설정하여 레이아웃 블록을 생성(S130)하게 되며, 레이아웃기술자엑스엠엘생성부(160)가 레이아웃블록생성부에 의해 생성된 레이아웃 블록으로부터 레이아웃 기술자 엑스엠엘(XML)을 생성(S135)하게 되는 것이다.In addition, the layout block generation unit 140 sets the distance between adjacent pixels of the image based on a threshold value to generate a layout block (S130), and the layout descriptor XM generation unit 160 generates the layout block by the layout block generation unit. The layout descriptor XML (XML) is generated from the used layout block (S135).

이후에, 이미지조회요청부(170)가 이미지특징점벡터변환부에 의해 변환된 이미지 특징점 벡터와 레이아웃기술자엑스엠엘생성부에 의해 생성된 레이아웃 기술자 엑스엠엘을 획득하여 문서이미지관리서버로 이미지 조회를 요청(S140)하게 되는 것이다.Subsequently, the image lookup request unit 170 obtains the image feature point vector converted by the image feature point vector converting unit and the layout descriptor XML generated by the layout engineer XML generation unit, and requests an image inquiry to the document image management server. (S140) will be.

이후에, 문서이미지관리서버(200)의 파싱부(210)가 모바일단말기로부터 이미지 조회 요청시, 송출되는 레이아웃 기술자 엑스엠엘을 획득하여 이미지정보디비 조회 조건으로 변환하기 위하여 파싱(S150)하게 되며, 매칭결과리스트작성부(220)가 이미지정보디비로부터 레이아웃 조건에 부합하는 이미지를 조회(S160)하여 레이아웃 매칭 결과리스트를 작성(S170)하게 되며, 이미지매칭조회부(230)가 모바일단말기로부터 송출되는 이미지 특징점 벡터를 획득하여 이미지 특징점 벡터와 매칭되는 이미지를 매칭결과리스트작성부에 의해 작성된 레이아웃 매칭 결과리스트에서 조회(S180)하게 되는 것이다.Subsequently, when the parsing unit 210 of the document image management server 200 requests an image inquiry from the mobile terminal, the parsing unit 210 parses to obtain the layout descriptor XML that is sent and converts it into an image information DB inquiry condition (S150). The matching result list creation unit 220 inquires an image matching the layout condition from the image information DB (S160) to create a layout matching result list (S170), and the image matching query unit 230 is sent from the mobile terminal. By obtaining the image feature point vector, an image matching the image feature point vector is searched in the layout matching result list created by the matching result list generator (S180).

이때, 이미지매칭조회부(230)는 임계값 이상의 유사성을 갖는지를 판단(S190)하게 되며, 임계값 이상의 유사성을 갖는 이미지를 이미지송출부에서 획득하여 모바일단말기로 송출하게 된다.At this time, the image matching inquiry unit 230 determines whether or not having similarity above the threshold value (S190), and obtains an image having similarity above the threshold value from the image transmitting unit and transmits it to the mobile terminal.

이때, 상기 모바일단말기에서는 트래킹모듈부(180)이 이미지 페이지 상의 3차원 좌표를 추적(S200)하게 되며, 렌더링모듈부(190)이 트래킹모듈부에 의해 추적된 3차원 좌표를 참조하여 렌더링을 수행(S210)하여 화면에 증강현실 정보를 출력하게 되는 것이다.In this case, in the mobile terminal, the tracking module 180 tracks the 3D coordinates on the image page (S200), and the rendering module 190 performs rendering by referring to the 3D coordinates tracked by the tracking module unit. (S210) to output the augmented reality information on the screen.

도 6은 본 발명의 일실시예에 따른 레이아웃 기술자와 이미지 특징점을 이용한 실시간 증강현실 이미지 검색시스템의 이미지정보디비에 정보를 저장하는 흐름을 나타낸 흐름도이다.6 is a flowchart illustrating a flow of storing information in an image information DB of a real-time augmented reality image retrieval system using a layout descriptor and an image feature point according to an embodiment of the present invention.

도 6에 도시한 바와 같이, 피디에프 파일을 로딩(S300)한 후, 이미지변환부(271)가 피디에프 파일을 이미지로 변환(S310)하게 된다.As shown in FIG. 6, after the PD file is loaded (S300), the image converter 271 converts the PD file to an image (S310).

이후에, 이미지블록화부(272)가 이미지변환부에 의해 변환된 이미지를 획득하여 블록을 형성(S350)하게 되며, 인접블록통합부(273)가 이미지블록화부에 의해 형성된 이미지 블록들간의 수직 거리가 임계치 이하인 경우에 하나의 블록으로 통합(S360)하게 되며, 레이아웃기술자추출부(274)가 이미지블록화부 및 인접블록통합부에 의해 획득된 전체 레이아웃을 기준으로 각 블록의 상대적 위치를 설명하는 레이아웃 기술자(Layout descriptor)를 추출(S370)하게 된다.Thereafter, the image blocker 272 acquires the image converted by the image converter to form a block (S350), and the adjacent block integrator 273 vertical distances between the image blocks formed by the image blocker. Is less than or equal to the threshold (S360) is integrated into one block, the layout descriptor extraction unit 274 is a layout for explaining the relative position of each block based on the overall layout obtained by the image block unit and the adjacent block integration unit A descriptor is extracted (S370).

한편, 상기 피디에프 파일을 이미지로 변환(S310)에서 레이아웃 기술자 추출 이외에도 특징점 추출 과정을 진행하여야 한다.Meanwhile, in addition to the layout descriptor extraction, the feature point extraction process should be performed in the conversion of the PD file into an image (S310).

즉, 디비저장용특징점추출부(275)가 이미지변환부(271)에 의해 변환된 이미지를 획득하여 다운샘플링, 노이즈 제거, 기울기 보정의 전처리 과정을 수행하고, 전처리 과정을 수행한 이미지에서 특징점을 추출(S320)하게 되며, 불변량좌표계변환부(276)가 전처리 과정 및 특징점 추출을 수행한 이미지 정보를 획득하여 불변량 좌표계로 변환(S330)하게 된다.That is, the feature storing unit extractor 275 for storing the data acquires the image converted by the image converting unit 271 to perform preprocessing of downsampling, noise removal, and tilt correction, and then extracts the feature points from the preprocessing image. In operation S320, the invariant coordinate system converter 276 acquires the image information obtained by performing the preprocessing and the feature point extraction, and converts the information into the invariant coordinate system S330.

이후에, 해쉬인덱스산출부(277)가 상기 전처리 과정 및 특징점 추출을 수행한 이미지 정보를 획득하여 해쉬 인덱스를 산출(S340)하게 된다.Subsequently, the hash index calculation unit 277 obtains image information obtained by performing the preprocessing and feature point extraction to calculate a hash index (S340).

이때, 디비구축모듈부(278)는 이미지 아이디, 원본 저장경로, 이미지 저장경로, 레이아웃 기술자 항목들, 이미지 특징점 벡터 정보를 획득하여 해쉬 테이블을 생성시킨 후 이미지정보디비에 저장시키게 되는 것이다.In this case, the DB building module unit 278 obtains an image ID, an original storage path, an image storage path, layout descriptor items, and image feature point vector information, generates a hash table, and stores the hash table in the image information DB.

상기와 같은 구성 및 동작을 통해 증강현실 서비스의 대상이 되는 인쇄매체, 신문 등의 PDF 파일로부터 추출하여 저장된 데이터베이스에서 원하는 페이지를 효율적으로 검출하도록 하는데 있으며, 특히 페이지의 전체 레이아웃으로 조회할 때 레이아웃 기술자(Layout descriptor)를 이용하여 매칭시키며, 제한된 페이지 범위 내에서의 이미지 특징점 벡터를 이용하여 매칭시켜 모바일 단말기기와 네트워크 환경하에서 문서이미지관리서버로부터 증강현실 서비스의 대상이 되는 페이지면을 조회할 수 있게 된다.Through the configuration and operation as described above, it extracts from a PDF file such as a print medium or a newspaper, which is a target of augmented reality service, to efficiently detect a desired page in a stored database. (Layout descriptor) is matched and image feature point vectors within a limited page range can be matched to search the page surface of the augmented reality service from the document image management server under the mobile terminal and the network environment. .

이상에서와 같은 내용의 본 발명이 속하는 기술분야의 당업자는 본 발명의 기술적 사상이나 필수적 특징을 변경하지 않고서 다른 구체적인 형태로 실시될 수 있다는 것을 이해할 수 있을 것이다. 그러므로 이상에서 기술한 실시 예들은 모든 면에서 예시된 것이며 한정적인 것이 아닌 것으로서 이해해야만 한다. It will be appreciated by those skilled in the art that the present invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. Therefore, the above-described embodiments are to be understood as illustrative in all respects and not restrictive.

본 발명의 범위는 상기 상세한 설명보다는 후술하는 특허청구범위에 의하여 나타내어지며, 특허청구 범위의 의미 및 범위 그리고 그 등가 개념으로부터 도출되는 모든 변경 또는 변형된 형태가 본 발명의 범위에 포함되는 것으로 해석되어야 한다.
The scope of the present invention is defined by the appended claims rather than the detailed description and all changes or modifications derived from the meaning and scope of the claims and their equivalents are to be construed as being included within the scope of the present invention do.

100 : 모바일단말기
200 : 문서이미지관리서버100: mobile terminal
200: document image management server

Claims

delete

In the real-time augmented reality image retrieval system using a layout descriptor and image feature points,
Capture the image from the camera and extract the feature points of the image into a vector, create a layout block by setting the distance between adjacent pixels as a threshold value, and generate a layout descriptor XML (XML), the generated image A mobile terminal 100 for transmitting the feature point vector and the layout descriptor XML to the document image management server and requesting an image inquiry to render an image provided from the document image management server and output it to the screen;
Parsing unit 210 for parsing in order to obtain the layout descriptor XML transmitted to the image information request from the mobile terminal to convert the image information DB query conditions, and
A matching result list creating unit 220 for searching an image corresponding to the layout condition from the image information DB and creating a layout matching result list;
An image matching query unit 230 for retrieving an image feature point vector transmitted from the mobile terminal and retrieving an image matching the image feature point vector from a layout matching result list created by the matching result list creating unit;
An image information DB 240 for storing and managing an original storage path, an image storage path, a layout descriptor item, and image feature point vector information;
An image transmitting unit 250 for transmitting an image queried by the image matching inquiry unit to a mobile terminal;
And a document image management server 200 including a server controller 260 for controlling the signal flow between the respective units and DBs.
The image matching inquiry unit 230,
If an image with a similarity rate above the threshold is obtained, the search result is sent to the image transmitter. If an image with a similarity rate below the threshold is acquired, matching is made with other images. Real-time augmented reality image retrieval system using a layout descriptor and an image feature point characterized in that performing the operation.

delete

In the real-time augmented reality image retrieval system using a layout descriptor and image feature points,
Capture the image from the camera and extract the feature points of the image into a vector, create a layout block by setting the distance between adjacent pixels as a threshold value, and generate a layout descriptor XML (XML), the generated image A mobile terminal 100 for transmitting the feature point vector and the layout descriptor XML to the document image management server and requesting an image inquiry to render an image provided from the document image management server and output it to the screen;
Parsing unit 210 for parsing in order to obtain the layout descriptor XML transmitted to the image information request from the mobile terminal to convert the image information DB query conditions, and
A matching result list creating unit 220 for searching an image corresponding to the layout condition from the image information DB and creating a layout matching result list;
An image matching query unit 230 for retrieving an image feature point vector transmitted from the mobile terminal and retrieving an image matching the image feature point vector from a layout matching result list created by the matching result list creating unit;
An image information DB 240 for storing and managing an original storage path, an image storage path, a layout descriptor item, and image feature point vector information;
An image transmitting unit 250 for transmitting an image queried by the image matching inquiry unit to a mobile terminal;
An image converter 271 for converting a PD file into an image,
An image blocker 272 for acquiring an image converted by the image converter and forming a block;
An adjacent block integrating unit 273 for integrating into one block when the vertical distance between the image blocks formed by the image blocking unit is less than or equal to a threshold;
A layout descriptor extracting unit 274 for extracting a layout descriptor describing a relative position of each block based on the entire layout obtained by the image block forming unit and the adjacent block integrating unit;
Performing preprocessing of downsampling, noise removal, and tilt correction by acquiring the image converted by the image conversion unit 271, and extracting feature point extractor 275 for extracting feature points from the preprocessed image. )Wow,
An invariant coordinate system conversion unit 276 for acquiring image information obtained by performing the preprocessing and feature point extraction and converting the image information into an invariant coordinate system;
A hash index calculation unit 277 for calculating a hash index by acquiring image information obtained by performing the preprocessing and feature point extraction;
Image information DB comprising a DVD construction module unit 278 that obtains an image ID, an original storage path, an image storage path, layout descriptor items, image feature point vector information, and generates a hash table and stores the image in the image information DB. The construction unit 270,
Document image management server 200 including a server control unit 260 for controlling the signal flow between the respective units and DB; Real-time augmented reality using a layout descriptor and the image feature point comprising a Image Search System.

The method according to claim 6,
The layout descriptor extraction unit 274,
A real-time augmented reality image retrieval system using a layout descriptor and image feature points, characterized by extracting the quadrant positions of the left and right blocks of the reference and the separator between paragraphs and using them as indexes of the image information DB.

delete