US6574632B2 - Multiple engine information retrieval and visualization system - Google Patents
Multiple engine information retrieval and visualization system Download PDFInfo
- Publication number
- US6574632B2 US6574632B2 US09/195,773 US19577398A US6574632B2 US 6574632 B2 US6574632 B2 US 6574632B2 US 19577398 A US19577398 A US 19577398A US 6574632 B2 US6574632 B2 US 6574632B2
- Authority
- US
- United States
- Prior art keywords
- document
- documents
- retrieved
- user
- information retrieval
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/338—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/912—Applications of a database
- Y10S707/917—Text
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/953—Organization of data
- Y10S707/959—Network
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99935—Query augmenting and refining, e.g. inexact access
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99943—Generating database or data structure, e.g. via user interface
Definitions
- the present invention relates to the field of information retrieval systems, and, more particularly, to computer based information retrieval and visualization systems.
- Information retrieval systems search and retrieve data from a collection of documents in response to user input queries.
- Ever increasing volumes of data are rendering traditional information retrieval systems ineffective in production environments.
- search engines that support search and retrieval with non-prohibitive search times.
- These larger data collections necessitate the need to formulate accurate queries, as well as the need to intuitively present the results to the user to increase retrieval efficiency of the desired information.
- search engines exist, such as, for example, Excite, Infoseek, Yahaoo, Alta Vista, Sony Search Engine and Lycos. Private document collections may also be searched using these search engines.
- a common goal of each search engine is to yield a highly accurate set of results to satisfy the information desired.
- Two accuracy measures often used to evaluate information retrieval systems are recall and precision. Recall is the ratio of the number of the relevant documents retrieved from the total number of relevant documents available collection-wide. Precision is the ratio of the number of relevant documents retrieved from the total number of documents retrieved.
- users require only a few highly relevant documents to form a general assessment of the topic, as opposed to detailed knowledge obtained by reading many related documents.
- Time constraints and interest level typically limit the user to reviewing the top documents before determining if the results of a query are accurate and satisfactory. In such cases, retrieval times and precision accuracy are at a premium, with recall potentially being less important.
- a recent user study conducted by Excite Corporation demonstrated that less than five percent of the users looked beyond the first screen of documents returned in response to their queries.
- Other studies conducted on a wide range of operational environments have shown that the average number of terms provided by the user as an input query are often less than two and rarely greater than four. Therefore, high precision with efficient search times may typically be more critical than high recall.
- An article by Cavnar titled “Using an N-Gram Based Document Representation with a Vector Processing Retrieval Model,” discloses the use of a n-gram technology and a vector space model in a single information retrieval system.
- the two search retrieval techniques are combined such that the vector processing model is used for documents and queries, and the n-gram frequencies are used as the basis for the vector element values instead of the traditional term frequencies.
- the information retrieval system disclosed by Cavnar is a hybrid between an n-gram search engine and a vector space model search engine.
- an information retrieval system for selectively retrieving documents from a document database using multiple search engines and a three-dimensional visualization approach. More particularly, the system comprises an input interface for accepting at least one user search query, and a plurality of search engines for retrieving documents from the document database based upon at least one user search query. Each of the search engines advantageously produces a common mathematical representation of each retrieved document. The system further comprises a display and visualization display means for mapping respective mathematical representations of the retrieved documents onto the display.
- At least one search engine produces a document context vector representation and an axis context vector representation of each retrieved document.
- the document context vector representation is the sum of all the words in a document after reducing low content words, and is used to compare documents and queries.
- the axis context vector representation is a sum of the words in each axis after reducing low content words, and is used for building a query for a document cluster.
- the axis context vector is also used by the visualization means to map onto the display.
- the present invention provides precision in retrieving documents from a document database by providing users with multiple input interaction modes, and fusing results obtained from multiple information retrieval search engines, each supporting a different retrieval strategy, and by supporting relevance feedback mechanisms.
- the multiple engine information retrieval and visualization system allows users to build and tailor a query as they further define the topic of interest, moving from a generic search to specific topic areas through query inputs. Users can increase or decrease the system precision, effecting the number of documents that are retrieved as relevant.
- the weights on the retrieval engines can be modified to favor different engines based on the query types.
- a method aspect of the invention is for selectively retrieving documents from a document database using an information retrieval system comprising a plurality of search engines.
- the method preferably comprises the steps of generating at least one user search query and retrieving documents from the document database based upon the user search query.
- Each search engine searches the document database and produces a common mathematical representation of each retrieved document.
- the respective mathematical representations of the retrieved documents are mapped onto a display.
- the method further preferably comprises the steps of producing a document context vector representation of each retrieved document, and producing an axis context vector representation of each retrieved document.
- the step of mapping preferably comprises the step of mapping the axis context vector representations of the retrieved documents onto the display.
- training the dictionary words comprises the steps of receiving a training document, creating context vectors for each word in the training document, and converging the context vectors toward each other for the context vectors representing words appearing close to one another based upon contextual usage.
- Assigning axis representation comprises the step of assigning each dictionary word to an axis having the largest component.
- the method further preferably comprises the steps of displaying a mathematical representation of the retrieved documents from the document database corresponding to the search query.
- FIG. 2 a is a schematic display view of an example vector in 3-dimensional space with the words “budget” and “music” assigned to an axis according to the present invention.
- FIG. 2 b is a graph of a distribution of the words through the first thirty axis for the example illustrated in FIG. 2 a.
- FIG. 4 is a sample display of a document being reduced according to the present invention.
- FIG. 6 are graphs of comparative word retrieval recall and precision results according to the present invention.
- FIG. 8 are graphs illustrating recall and precision of an example search using the VSM document axis according to the present invention.
- FIG. 9 are graphs illustrating selection of document retrieval engine penalty according to the present invention.
- FIG. 10 is a diagram illustrating an assignment of the list location penalty according to the present invention.
- FIGS. 11 a - 11 d are display screens for example searches according to the present invention.
- FIG. 12 a is a display screen showing an example of the 3-dimensional viewer containing documents retrieved for the McVeigh trial topic using the query keywords: McVeigh, trial, Oklahoma City, and bomb, according to the present invention.
- FIG. 16 are graphs showing results using a modified ranking algorithm according to the present invention.
- the retrieval system 10 selectively retrieves documents from a document database. System features and benefits are listed in Table 1 for the retrieval system 10 .
- the retrieval system 10 includes an input interface 12 for accepting at least one user interface query.
- the input interface 12 also allows users to build queries for a topic of interest, execute the queries, examine retrieved documents, and build additional or refine existing queries.
- a plurality of search engines are used for retrieving documents from the document database based upon the user search query, wherein each search query produces a common mathematical representation of each retrieved document.
- the plurality of search engines include n-gram search engine 14 , a Vector Space Model (VSM) search engine 16 which, in turn, includes a neural network training portion 18 to query a document corpus 20 to retrieve relevant documents. Results of the retrieval engines 14 , 16 are fused together and ranked.
- VSM Vector Space Model
- the fusing together and ranking of the retrieved documents is performed by a ranking portion 22 of the computer system.
- the retrieval system 10 further comprises visualization display means 24 for mapping respective mathematical representations of the retrieved documents onto a display.
- the visualization display means 24 allow users to explore various aspects of the retrieved documents, and look for additional relevant documents by redefining the topic query corpus 26 via the input interface 12 .
- VSM Strength unique terms e.g. example documents used as proper nouns
- input mis-spelled words document meaning short documents (e.g. e-mail)
- Weakness long documents unique terms e.g. proper nouns - terms that did not appear in the training corpus and hence do not appear in the VSM's dictionary
- an input query is partitioned into n-grams to form an n-gram query 30 .
- An n-gram is a consecutive sequence of n characters, with n being a positive integer.
- the premise of an n-gram search engine 14 is to separate terms into word fragments of size n, then design algorithms that use these fragments to determine whether or not a match exists. For example, the first 15 tri-grams (3-grams) in the phrase “information retrieval” are listed below:
- Relevant documents are rapidly identified by looking for the occurrence in the document of the least-frequent n-gram of a search string, such as a keyword. If the least frequently occurring n-gram of a search term is not present in a document, then the term itself is not in the document. If the least frequently occurring n-gram is present, the search continues for the entire string.
- a 3-character sliding window (3-grams) is used.
- a tri-gram frequency analysis is performed on a representative sample of documents. From this analysis, a table of tri-gram probabilities is developed for assigning an occurrence probability p(t) to each of the unique tri-grams.
- the developed table is used to determine the LFT for any given query, and the frequencies were not evenly distributed throughout the tri-grams.
- certain tri-grams TER, ING, THE
- JXF, DBP, GGG occur only under highly unusual circumstances.
- the n-gram retrieval search engine 14 counts the number of occurrences of the string. Since typical early searches are keywords, the n-gram search engine 14 acts as a filter to quickly identify candidate documents to be reviewed early in the retrieval process. It is especially useful for phrases not in the dictionary/vocabulary 44 of the VSM search engine 16 . The identified relevant documents are then used on the next pass through the retrieval system 10 . Additionally, if the keyword phrase of interest is unique enough, the documents containing the keyword phrase are used to create a document subset. Further queries are performed on just the document subset, which increases the search speed.
- the retrieval system 10 also comprises a Vector Space Model (VSM) search engine 16 to represent documents in an n-dimensional vector space.
- VSM Vector Space Model
- the strengths and weaknesses of the VSM 16 are listed above in Table 2.
- a context vector model is implemented for the retrieval system 10 .
- Words appearing in the document training corpus 40 are represented as vectors in the n-dimensional vector space, ⁇ .
- This similarity measure is the cosine of the angle between the two vectors.
- a vector for each document, ⁇ is constructed based on the terms in a document.
- a query is considered to be like a document, so a document and a query can be compared by comparing their respective vectors in the vector space.
- Documents whose content, as measured by the terms in the document, correspond most closely to the content of the query are judged to be the most relevant.
- the documents are retrieved through keyword, word clusters (series of words), and example document queries mapped into the n-dimensional vector space. The documents whose respective vectors are a minimal distance from the query's vector are retrieved.
- Keywords, keyword phrases, single documents, and document clusters are provided as input to the retrieval system's 10 VSM component as queries.
- Queries 42 constructed by the VSM 16 can be broadly or narrowly focused, depending on the keywords, phrases, and example documents used in the queries.
- the document's score is obtained by computing the distance between the vectors representing the query and the document. Scores for relevant documents typically range from approximately 0.45 to 1. The closer to 1, the better the document matches the search query.
- a neural network (NN) training portion 18 is used within the retrieval system 10 to train the word vectors, ⁇ , in the VSM search engine 16 .
- the NN training algorithm 18 is based on the training rule for Kohonen's Self-Organizing Map. This unsupervised learning algorithm organizes a high-dimensional vector space based on features within the training data so that items with similar usage are clustered together. Heavier weights are placed on words in closer proximity.
- the neural network 18 also accounts for training that has already taken place by adjusting the lesser trained words.
- the training algorithm is described as follows:
- ⁇ w k i should be much larger than ⁇ d, since the document's context vector is very random until some word training has been done. Eventually, ⁇ d may dominate ⁇ w k i since k i shrinks rapidly as word training continues.
- Application of the neural network 18 training rule causes the vectors for words with similar meaning, as defined by similar usage in the document corpus 20 , to converge towards each other.
- words are assigned to the closest axis in the n-dimensional vector space.
- An axis represents the direction of each vector in the n-dimensional space.
- Words are assigned to the closest axis by taking the highest cosine similarity among the axes.
- FIG. 2 a shows an example vector in a 3-dimensional space, with the words “budget” and “music” being assigned to an axis.
- FIG. 2 b shows a fairly even distribution of words through the first thirty axes.
- a document is cleaned by removing stop-words, performing stemming, and inserting compound words.
- a document is further reduced by examining the word axis representation. Document cleaning and reduction is performed in portion 46 , as illustrated in FIG. 1 .
- the number of words in each axis are also counted.
- the axes containing the highest percentage of words are retained until over 70% of the document is represented. In lieu of 70%, other percentage levels are acceptable.
- FIG. 3 shows the percentage of words in the first 30 document axes before reduction and after reduction.
- FIG. 4 shows text of a document that has been reduced from 58 axes to 26 axes.
- Reducing the words in a document increases the speed of the document representation and improves the query matching. Reduction increases the match because it removes terms that lower the values of the higher axes used to match other documents. Tests have been performed to determine what is a sufficient amount of document reduction without removing too many words. Reduction beyond 70% begins to remove some of the unique words of the document. Therefore, documents reduced up to 70% give the best performance.
- FIG. 5 An example of test results performed on over 2000 web news stories is shown in FIG. 5 .
- the topic was “find awards/honors given to people and things, i.e., television shows”.
- the search started with the set of words: honor, award, mvp, noble prize, and hall of fame.
- the search was performed on documents which contain 100%, 90%, 80%, 70%, and 60% of the original document.
- FIG. 5 shows the results using the keyword search.
- the top 10 relevant documents found in the reduction were then used in the second pass to further define the query.
- FIG. 6 shows the results of the keyword and document example search.
- Information for a document is stored in two ways: a document context vector, and an axis context vector.
- the document context vector, X ⁇ is the sum of all the words that are in the document after clean up and reduction. It is used for building a single document query, and to compare documents and queries.
- a document's axis context vector is the sum of the words in each axis vector after document clean up and reduction.
- the axis context vector is used for building a query for a document cluster and the 3-D display.
- a query whether it consists of keywords, keyword clusters, example documents, or document clusters, is considered to be like a document.
- a query is represented by an n-dimensional context vector, y ⁇ . Therefore, a query can be compared within the document corpus 20 .
- Positive queries are combinations of words and documents.
- Single word and single document queries use the entire word/document as the query.
- the document axes with the highest usage are used to build the query.
- Table 3 shows three documents with an example vector size of 6 to build a positive query.
- the query is built using axes 1 , 3 , and 5 since they contain the highest axis usage among the three relevant documents. The default is to use the axis used by all the documents, and the next highest used axes. The user is allowed to lower or raise the number of axes used to build the query.
- FIG. 7 shows examples of multiple query retrieval.
- the multiple query done on documents with a measure of similarity greater then 0.6 retrieves more relevant documents in the top 25 retrieved documents.
- the documents retrieved are mainly the documents retrieved by the individual queries, with the multiple query identifying one new relevant document in the top 25.
- the multiple queries can be used to help further identify relevant documents, through repetition of the document being repeated in several queries. Multiple document queries with dissimilar documents identify more irrelevant documents. These queries correspond to a cosine measure of similarity less then 0.6.
- a negative query is very similar to building a positive query. Instead of looking for the most frequently used axes, the least frequently used axes are examined. That is, the least frequently used axes in the specified relevant documents relative to the axis used by the bad documents are examined. Table 4 shows a negative query being built. In this example, the least frequently used axes 2 , 4 , and 6 are used with respect to the good documents to build the negative query. As with building the positive query, the user can also raise or lower the number of axes used in building the negative query.
- the retrieval system 10 initiates each retrieval engine to calculate a document's score for each query.
- the retrieval engines maintain only the high level scores.
- the retrieval system 10 standardizes the scores from each retrieval engine to range from 0 to 1.
- the user can adjust the lowest acceptable score and retrieval engine weight to effect score results to favor/disfavor a particular retrieval engine.
- a ranking processor 22 uses an algorithm to fuse the results of the retrieval engines and ranks the documents based on the number of times the document was selected, highest score, lowest score, average score, location in the query list and number of retrieval engines locating the document. Irrelevant documents and queries can be removed.
- Each topic is made up of multiple queries from each of the retrieval components.
- the scores are scaled by query and for the entire topic, i.e., all the queries.
- Each set of scores are a separate entry into the ranking algorithm.
- the query 42 for the VSM 16 contains the entire vector and the axis vectors. Scores are obtained by taking the cosine similarity measure of the query vector with the entire vector of each of the documents in the document corpus 20 . The closer to one the better the match, where only high positive scores are kept. If none of the document scores equals the value 1, then the documents are scaled based on the highest score for the query. This is a quick method of increasing the scores on potentially relevant VSM documents, thus allowing fusing of the VSM highest results with the n-gram result scores.
- a cosine similarity measure of the query vector against the corresponding axis of the documents in the document corpus 20 can also be applied.
- applying a cosine similarity measure is also useful for word queries.
- the axis for the document typically contain a large number of axes, and a large number of documents unrelated to the topic are retrieved.
- retrieved documents describing “awards/honors given to people and things receiving awards” shows the effect of using the document axis queries, as shown in Table 5 and FIG. 8 .
- the precision and recall scores are lower, and 16% more irrelevant documents were retrieved. This causes more work to be performed in locating relevant documents.
- the filter quickly identifies candidate documents to be reviewed early in the retrieval process.
- the filter is especially useful for keywords or phrases which may not have appeared in the document corpus 40 used to train the VSM 16 component of the retrieval system 10 .
- the identified relevant documents are used on the next pass through the retrieval system 10 .
- a tri-gram was selected due to the speed of processing and small amount of storage for the least frequency table corresponding to block 34 in FIG. 1 .
- the n-gram frequency count can vary widely.
- the n-grams need to be in the range from 0 to 1 to correspond with the standardized range.
- the documents having a large number of the specified tri-grams (3-grams) appearing in one document were examined.
- the documents were divided into three groups: few matches, high matches, and scaled documents.
- the few matches were removed from the scaling calculation. Few matches consists of a large number of documents (greater than 50) with a couple of matches (approximately ranging from 1-20 matches) per document.
- High matches have a large number of matches in a single document.
- These documents need to be examined and are set to a value 1.
- the remaining documents are scaled between 0 and 1.
- the scaling helps to identify documents that have the most 3-gram matches and should be reviewed.
- taking the mean or dividing by the largest number does not provide a good representation of the n-gram documents. Furthermore, looking at a document that had only had a few matching occurrences is not desirable.
- a statistical method is used to locate the clusters of ref to be scaled.
- the mean and standar deviation are calculated without the largest n-gram frequency value. If the largest value fits within three standar deviations of the mean, then the number is used as the scaling factor. If the largest value does not fit, it is considered to be outside the cluster range. When the process is repeated, it takes out the nex larges value. Accordingly, this process is repeated until the largest removed frequency value falls within the range of the third standard deviation. Numbers larger than the selected scaling number are set to one.
- An example of calculating the scaling value is show in Table 6.
- Table 7 shows the number of files used to calculate the n-gram scaling factor applied to data provided for a TREC-6 conference.
- TREC is an acronym for Text REtrieval Conference. This conference is a workshop series that encourages research in information retrieval from large text applications by providing a large test collection, uniform scoring procedures, and a forum for organizations interested in comparing their results.
- the user specifies the lowest acceptable score for each of the retrieval engines. This helps eliminate the lower scoring documents. This also controls the number of documents shown to the user. The higher the number, the fewer the documents shown to the user. Additionally, the documents should reveal better matches to the query.
- Each retrieval engine is assigned a specific percentage by the user. The document scores for the retrieval engine are reduced by the specified percentage. Depending upon the query type, different retrieval engines can be emphasized. For example, potentially misspelled words may put more emphasis on the n-gram search engine 14 . Document example queries place more emphasis on the VSM search engine 16 .
- An algorithm ranks the document scores from the different retrieval engines. The algorithm rates the following items: number of times document identified per query and per retrieval engine, maximum and minimum score, average score and penalty points. An algorithm performing these functions is well known to one skilled in the art and will not be described in further detail herein. The ranking is thus determined by a score.
- Penalty points are assigned to a document based on the retrieval engine and the document location in the ranked list.
- a penalty is added to a document for each retrieval engine not identifying it as relevant. Multiple engines retrieving the document is a strong indication of a relevant document. The score must be above the supplied value after it is calculated according to predetermined scaling and retrieval engine weight. If all the engines retrieve the document, then the penalty score is 0. Referring to FIG. 9, the value 200 was chosen because it supplied enough of a penalty that it moved a document's rank below documents that appeared in more then one method.
- the algorithm allows each document to receive a penalty point for its location in each query list. This is intended to reward the documents that are located close to the top of the list in the individual queries, by assigning fewer penalty points.
- FIG. 10 further illustrates the penalty point assignment.
- High precision in the retrieval system 10 is derived by providing users multiple input interaction modes, fusing results obtained from multiple information retrieval search engines 14 and 16 , each supporting a different retrieval strategy, and by supporting relevance feedback mechanisms.
- the basic premise of relevance feedback is to implement information retrieval in multiple passes. The user refines the query in each pass based on results of previous queries. Typically, the user indicates which of the documents presented in response to an initial query are relevant, and new terms are added to the query based on this selection. Additionally, existing terms in the query can be re-weighted based on user feedback.
- a web-browser-based user interface 12 Primary user interaction with the retrieval system 10 is through a web-browser-based user interface 12 .
- Users can build and tailor queries as the topic of interest is further defined, moving from a generic search to specific topic areas through query inputs. Queries may consist of a single keyword, multiple keywords (or phrases), keyword clusters, an example document, and document clusters.
- a user can increase or decrease the system precision, effecting the number of documents that will be rated as relevant.
- the weights on the retrieval engines can be modified to favor different engines based on the type of query.
- a set of documents retrieved for a particular topic many exhibit a variety of aspects. This can be seen, for example, in a search retrieving information about the Oklahoma City bombing of 1996. There are relevant articles about the bomb, damage from the bomb blast, rescue work, the victims, suspects, the Timothy McVeigh trial, and the Terry Nichols trial, just to name a few.
- the retrieval system 10 is used to search for documents relevant to the trial of Timothy McVeigh for the Oklahoma City bombing.
- the document corpus consists of over 2000 news stories from the CNN web site on a variety of topics. In this case, a user begins by creating a new topic of interest: McVeigh Trial. Since this is a new topic, there are no queries associated with the topic.
- a ranked list of documents, complete with score, is returned to the user. Clicking on the document retrieves the text through HTML hyperlinks. This enables a user to determine the relevance, from his point of view, of a document to the topic. Top documents can be reviewed, and both relevant and irrelevant documents are identified and marked as such. Irrelevant documents are filtered from subsequent queries. Removal of the higher-scoring irrelevant documents allows lower scoring documents to be accepted on the final result list. Documents can also be marked for use as examples in additional queries for the topic. Such stories are then added to the list of queries, as shown in FIG. 11 d.
- visualization display means comprises an n-dimensional document visualization display for enhancing user understanding of the retrieved document set.
- This tool supports multiple levels of data abstraction, clustered document presentation, data thresholding, and a variety of user interaction paradigms.
- the n-dimensional document visualization display enables the user to view different aspects of the document's topic.
- the visualization display means displays a similarity measure of the documents. The information retrieval system 10 is thus able to reduce the display down to the most important aspects of the document.
- the Oklahoma City bombing stories have a number of different aspects: the bomb, building damage, the victims, victim's families, the Timothy McVeigh trial, etc.
- Displaying documents in a 3-dimensional space enables a user to see document clusters, the relationships of documents to each other, and also aids in the location of additional documents that may be relevant to a query. Documents near identified relevant documents (through queries) can be easily reviewed for topic relevance. The user is able to manipulate the dimensional view to gain new views of document relationships. Changing the document's dimensionality allows the information to be viewed for different topic aspects to aid in further identification of relevant documents.
- FIG. 12 a shows an example of the 3-dimensional viewer containing documents retrieved for the McVeigh trial topic using the query keywords: McVeigh, trial, Oklahoma City, and bomb.
- Document locations are represented in space by a box. Additionally in this view, documents determined as relevant by the information retrieval system 10 displays the document name next to the box.
- Clustering of documents can be observed in several areas. A first cluster is with respect to the Mcveigh trial stories, plus additional stories related to the topic, but not identified by retrieval system 10 . A second cluster is with respect to the Lau stories deals with bombing, and are near the keyword “bomb”. A third cluster is with respect to the O. J. Simpson trial stories, and appears near the word “trial”.
- each document in the retrieved document corpus may be represented mathematically in the 3-dimensional space by a colored cube.
- a red cube could represent a query request—in this case the words “trial”, “bombing”, “McVeigh” and “Oklahoma City”.
- Yellow text could be used to indicate the relevant documents found through text queries submitted to the retrieval system 10 .
- Additional colors may be used to indicated document clusters, i.e., documents related in some aspect.
- the colors selected to represent a query request, relevant documents and document clusters are not limited to red and yellow. These colors are for illustrative purposes, wherein other colors are acceptable for indicating such information to a user. Therefore, using different colors to represent different aspects of the retrieved documents on the display would allow the user to more quickly identify the relevant information to be retrieved.
- FIG. 14 a shows a close-up focused on the word “trial”, and the text has been turned on so that the file names of documents represented by the boxes are also displayed. It is noted that the O. J. Simpson trial articles have clustered near the word trial. A user can double click on any of the boxes to bring up the associated document.
- FIG. 14 b shows another perspective on the whole corpus of retrieved documents using different dimensions. Again, it is noted that the relevant stories appear to separate from the other stories using the retrieval system 10 . To look at some of the documents which have not been identified, the user can double click on any of the boxes to bring up the associated document, and make an inspection to determine whether or not it is relevant to the Timothy McVeigh trial.
- the information retrieval system 10 has been evaluated using the benchmark data and query sets provided by the National Institute of Standards and Technology as part of the 6th Text REtrieval Conference (TREC-6). The results obtained demonstrated high precision for limited-sized retrieval sets.
- the Text REtrieval Conference (TREC) workshop series encourages research in information retrieval from large text applications by providing a large test collection, uniform scoring procedures, and a forum for organizations interested in comparing their results.
- TREC has become the major experimental effort in the field of information retrieval.
- the information retrieval system 10 of the present invention competed in the Manual Ad Hoc Retrieval task for the large data set.
- the n-gram search engine 14 was used as a high speed filter to provide document examples.
- the VSM search engine 16 was used for final retrieval and document scoring.
- Precision is the percentage of documents from the retrieved set that are relevant to the query.
- Recall is the percentage of relevant documents retrieved from the total number of relevant documents available within the collection.
- Time constraints and interest-level limit the typical user of an information retrieval system to reviewing the top documents before determining if the results of a query are accurate and satisfactory.
- the user needs precise, representative documents at the top of the list to make this determination.
- the information retrieval system 10 emphasizes precision (accuracy) and speed while retrieving relevant documents high on the list.
- the information retrieval system 10 permits the user to build and tailor the query as he or she further defines the topic.
- the system also permits movement from a generic search to a specific topic area through query inputs.
- the information retrieval system 10 maintains a high level of precision for the top 5, 10, 20 and 30 documents retrieved.
- precision for the Applicants' invention (which is assigned to Harris corporation) is higher than the other TREC teams that retrieved more relevant documents. This fits well with the fact that time constraints and interest-level on the part of an information retrieval system user often limit the user to reviewing the top documents before the user determines if the results of a particular query were accurate and satisfactory.
- the documents were scored for the entire topic, i.e., all the queries were combined into the ranking.
- a modified algorithm included individual query list locations into the overall scoring, which improved the results, as shown in FIG. 16 .
- the information retrieval system 10 is an efficient, high-level precision information retrieval and visualization system.
- the information retrieval system 10 allows interactive formation of query refinement, and fuses results form multiple retrieval engines to leverage the strengths of the each one.
- the information retrieval system 10 allows for efficient maintenance, i.e., making it easy to add new documents.
- the information retrieval system 10 also allows for multiple dictionaries and vocabularies, thus allowing a user to develop role-based dictionaries and/or vocabularies for searching specific databases.
- the information retrieval system 10 provides a user interface for user interaction as well as a 3-dimensional presentation of the retrieved documents for more efficiently exploring the documents retrieved in response to a user's search query.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
TABLE 1 |
System Features |
FEATURES | BENEFITS |
Fusion of multiple retrieval | Improved retrieval performance |
engines | over independent search engines |
An architecture that supports | Flexible search strategies |
the addition of new search | |
engines | |
Search by keyword or document | Tailored queries |
example | |
Multiple query capability for a | Emphasize best search engine |
topic | and queries for a topic |
Document reduction | Improved performance and |
smaller document footprint | |
Partitioning of document corpus | Reduces search space |
based on similarity | |
Search screening | Remove topical but irrelevant |
documents | |
Manual Relevance Feedback | Query refinement |
Web-based user interface | Familiar style facilitates |
quick review of retrieved | |
documents and selection of | |
example documents for use as | |
additional queries | |
3-D Visualization of retrieved | Facilitates intuitive |
document sets | identification of additional |
relevant documents | |
TABLE 2 |
Retrieval Engine Strengths and Weaknesses |
n-gram | VSM | ||
Strength | unique terms (e.g. | example documents used as |
proper nouns) | input | |
mis-spelled words | document meaning | |
short documents (e.g. | ||
e-mail) | ||
Weakness | long documents | unique terms (e.g. proper |
nouns - terms that did not | ||
appear in the training corpus | ||
and hence do not appear in | ||
the VSM's dictionary) | ||
inf | orm | ati | on_ | ret | ||
nfo | rma | tio | n_r | etr | ||
for | mat | ion | _re | tri | ||
TABLE 3 |
Positive Query |
Axis | Doc |
1 | |
|
||
1 | x | x | x | |
2 | x | |||
3 | x | x | x | |
4 | x | |||
5 | x | x | ||
6 | x | |||
TABLE 4 |
Negative Query Example |
Relevant Documents | |
Axis | Doc |
1 | |
|
|
|
1 | x | x | x | x |
2 | x | x | ||
3 | x | x | x | |
4 | x | x | ||
5 | x | x | ||
6 | x | x | ||
TABLE 5 |
Example of Using VSM Document Axis |
n-gram, | n-gram, VSM, VSM | ||
VSM, VSM | word axis, VSM | ||
word axis | document axis | ||
Number |
12 | 17 |
located | ||
Number |
17 | 17 |
in the corpus | ||
Number documents retrieved | 42 | 705 |
TABLE 6 |
Scaling Example |
Original | |||
Data | Removed Largest # | Removed Next Largest # | |
n- |
1 | 1 | 1 |
|
2 | 2 | 2 |
Occur- | 1 | 1 | 1 |
|
3 | 3 | 2 |
12 | 2 | 2 | |
2 | 2 | 1 | |
2 | 1 | 1 | |
1 | 1 | ||
1 | |||
Mean | 2.77 | 1.625 | 1.42 |
Standard | 3.52 | 0.744 | 0.534 |
|
|||
3 Standard | 13.33 | 3.857 | 3.03 |
Deviations | |||
Comment | If used | Remove the largest | Remove the next largest # - |
largest | # -12. The # 12 | 3. The # 3 does fall within | |
value. | does not fall within | three standard deviations. | |
12/12 = | three standard devi- | |
|
1 | ations of the | All numbers larger than 3 | |
3/12 = | remaining values | are set to | |
0.25 | 1. | ||
2/12 = | 12 = 1 | ||
0.16 | 3/3 = 1 | ||
1/12 = | 2/3 = 0.66 | ||
0.08 | 1/3 = 0.33 | ||
It is | The documents with values | ||
doubtful | 1-0.66 would probably be | ||
that | reviewed. | ||
anything | |||
other | |||
then the | |||
document | |||
with the | |||
value of | |||
one | |||
would be | |||
reviewed | |||
TABLE 7 |
Number of Files Used to |
Calculate the n-gram Scaling |
Query # |
301 | Query #337 | |
# Files | 79,777 | 550 | 3,271 |
Range of files dropped out | 1-18 | 1-3 | 1-10 |
(# matches/file) | |||
# of files dropped out | 79,155 | 434 | 2,547 |
Average | 60 | 10 | 48 |
# Documents Scaled | 522 | 73 | 538 |
Range of files where | 60-829 | 4-10 | 48-7,789 |
documents = 1 | |||
# Documents = 1 | 100 | 43 | 186 |
Claims (27)
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/195,773 US20030069873A1 (en) | 1998-11-18 | 1998-11-18 | Multiple engine information retrieval and visualization system |
US09/195,773 US6574632B2 (en) | 1998-11-18 | 1998-11-18 | Multiple engine information retrieval and visualization system |
US10/356,958 US6701318B2 (en) | 1998-11-18 | 2003-02-03 | Multiple engine information retrieval and visualization system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/195,773 US6574632B2 (en) | 1998-11-18 | 1998-11-18 | Multiple engine information retrieval and visualization system |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/356,958 Division US6701318B2 (en) | 1998-11-18 | 2003-02-03 | Multiple engine information retrieval and visualization system |
Publications (1)
Publication Number | Publication Date |
---|---|
US6574632B2 true US6574632B2 (en) | 2003-06-03 |
Family
ID=22722748
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/195,773 Expired - Fee Related US6574632B2 (en) | 1998-11-18 | 1998-11-18 | Multiple engine information retrieval and visualization system |
US09/195,773 Granted US20030069873A1 (en) | 1998-11-18 | 1998-11-18 | Multiple engine information retrieval and visualization system |
US10/356,958 Expired - Fee Related US6701318B2 (en) | 1998-11-18 | 2003-02-03 | Multiple engine information retrieval and visualization system |
Family Applications After (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/195,773 Granted US20030069873A1 (en) | 1998-11-18 | 1998-11-18 | Multiple engine information retrieval and visualization system |
US10/356,958 Expired - Fee Related US6701318B2 (en) | 1998-11-18 | 2003-02-03 | Multiple engine information retrieval and visualization system |
Country Status (1)
Country | Link |
---|---|
US (3) | US6574632B2 (en) |
Cited By (101)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020103786A1 (en) * | 2000-08-08 | 2002-08-01 | Surendra Goel | Searching content on web pages |
US20020103797A1 (en) * | 2000-08-08 | 2002-08-01 | Surendra Goel | Displaying search results |
US20020145630A1 (en) * | 2001-02-21 | 2002-10-10 | International Business Machines Corporation | GUI for representing entity matches utilizing graphical transitions performed directly on the matching object |
US20020169765A1 (en) * | 2000-10-27 | 2002-11-14 | Ivan Saltz | Limit engine database management system |
US20030004941A1 (en) * | 2001-06-29 | 2003-01-02 | International Business Machines Corporation | Method, terminal and computer program for keyword searching |
US20030014405A1 (en) * | 2001-07-09 | 2003-01-16 | Jacob Shapiro | Search engine designed for handling long queries |
US20030066025A1 (en) * | 2001-07-13 | 2003-04-03 | Garner Harold R. | Method and system for information retrieval |
US20040049498A1 (en) * | 2002-07-03 | 2004-03-11 | Dehlinger Peter J. | Text-classification code, system and method |
US20040117189A1 (en) * | 1999-11-12 | 2004-06-17 | Bennett Ian M. | Query engine for processing voice based queries including semantic decoding |
US20040139067A1 (en) * | 2002-12-19 | 2004-07-15 | International Business Machines Corporation | Computer system, method, and program product for generating a data structure for information retrieval, and an associated graphical user interface |
US6799199B1 (en) * | 2000-01-11 | 2004-09-28 | The Relegence Corporation | Media monitor system |
US20040220842A1 (en) * | 1999-09-14 | 2004-11-04 | Barney Jonathan A. | Method and system for rating patents and other intangible assets |
US20040254957A1 (en) * | 2003-06-13 | 2004-12-16 | Nokia Corporation | Method and a system for modeling user preferences |
US20040267728A1 (en) * | 2003-01-29 | 2004-12-30 | Kemal Delic | Maintenance of information retrieval systems using global metrics |
US20040267740A1 (en) * | 2000-10-30 | 2004-12-30 | Microsoft Corporation | Image retrieval systems and methods with semantic and feature based relevance feedback |
US20050004910A1 (en) * | 2003-07-02 | 2005-01-06 | Trepess David William | Information retrieval |
US20050010553A1 (en) * | 2000-10-30 | 2005-01-13 | Microsoft Corporation | Semi-automatic annotation of multimedia objects |
US6895552B1 (en) * | 2000-05-31 | 2005-05-17 | Ricoh Co., Ltd. | Method and an apparatus for visual summarization of documents |
US20050165805A1 (en) * | 2001-06-29 | 2005-07-28 | International Business Machines Corporation | Method and system for spatial information retrieval for hyperlinked documents |
US20050192926A1 (en) * | 2004-02-18 | 2005-09-01 | International Business Machines Corporation | Hierarchical visualization of a semantic network |
EP1587013A1 (en) * | 2004-04-14 | 2005-10-19 | Fujitsu Limited | Information processing technique relating to relation between users and documents |
US20050234891A1 (en) * | 2004-03-15 | 2005-10-20 | Yahoo! Inc. | Search systems and methods with integration of user annotations |
US20050262133A1 (en) * | 2000-11-25 | 2005-11-24 | Lg Electronics Inc. | Multimedia retrieval method using multi-weighted feature |
US20060036451A1 (en) * | 2004-08-10 | 2006-02-16 | Lundberg Steven W | Patent mapping |
US7007019B2 (en) * | 1999-12-21 | 2006-02-28 | Matsushita Electric Industrial Co., Ltd. | Vector index preparing method, similar vector searching method, and apparatuses for the methods |
US7007008B2 (en) | 2000-08-08 | 2006-02-28 | America Online, Inc. | Category searching |
US20060059135A1 (en) * | 2004-09-10 | 2006-03-16 | Eran Palmon | Conducting a search directed by a hierarchy-free set of topics |
US20060089926A1 (en) * | 2004-10-27 | 2006-04-27 | Harris Corporation, Corporation Of The State Of Delaware | Method for re-ranking documents retrieved from a document database |
US20060173817A1 (en) * | 2004-12-29 | 2006-08-03 | Chowdhury Abdur R | Search fusion |
US20060173839A1 (en) * | 2004-10-27 | 2006-08-03 | Harris Corporation | Method for re-ranking documents retrieved from a multi-lingual document database |
US20060179051A1 (en) * | 2005-02-09 | 2006-08-10 | Battelle Memorial Institute | Methods and apparatus for steering the analyses of collections of documents |
US20060206483A1 (en) * | 2004-10-27 | 2006-09-14 | Harris Corporation | Method for domain identification of documents in a document database |
US20060218140A1 (en) * | 2005-02-09 | 2006-09-28 | Battelle Memorial Institute | Method and apparatus for labeling in steered visual analysis of collections of documents |
US20070016574A1 (en) * | 2005-07-14 | 2007-01-18 | International Business Machines Corporation | Merging of results in distributed information retrieval |
US20070038588A1 (en) * | 1999-02-02 | 2007-02-15 | Alan Sullivan | Neural network system and method for controlling information output based on user feedback |
US7191223B1 (en) * | 2000-01-11 | 2007-03-13 | The Relegence Corporation | System and method for real-time alerts |
US20070078886A1 (en) * | 1993-11-19 | 2007-04-05 | Rivette Kevin G | Intellectual property asset manager (IPAM) for context processing of data objects |
US20070094297A1 (en) * | 2005-09-07 | 2007-04-26 | Barney Jonathan A | Method of determining an obsolescence rate of a technology |
US7225180B2 (en) | 2000-08-08 | 2007-05-29 | Aol Llc | Filtering search results |
US20070124690A1 (en) * | 2000-12-29 | 2007-05-31 | Aol Llc | Message screening system |
US20070198578A1 (en) * | 2005-07-27 | 2007-08-23 | Lundberg Steven W | Patent mapping |
US20080040660A1 (en) * | 2000-02-23 | 2008-02-14 | Alexander Georke | Method And Apparatus For Processing Electronic Documents |
US20080059456A1 (en) * | 2004-12-29 | 2008-03-06 | Aol Llc, A Delaware Limited Liability Company (Formerly Known As America Online, Inc.) | Domain Expert Search |
US20080069448A1 (en) * | 2006-09-15 | 2008-03-20 | Turner Alan E | Text analysis devices, articles of manufacture, and text analysis methods |
US20080071762A1 (en) * | 2006-09-15 | 2008-03-20 | Turner Alan E | Text analysis devices, articles of manufacture, and text analysis methods |
US7356461B1 (en) * | 2002-01-14 | 2008-04-08 | Nstein Technologies Inc. | Text categorization method and apparatus |
US20080172368A1 (en) * | 2004-12-29 | 2008-07-17 | Aol Llc | Query routing |
US20080235192A1 (en) * | 2007-03-19 | 2008-09-25 | Mitsuhisa Kanaya | Information retrieval system and information retrieval method |
US20090034805A1 (en) * | 2006-05-10 | 2009-02-05 | Aol Llc | Using Relevance Feedback In Face Recognition |
US7546280B1 (en) * | 2005-08-30 | 2009-06-09 | Quintura, Inc. | Use of neural networks for keyword generation |
US20090222444A1 (en) * | 2004-07-01 | 2009-09-03 | Aol Llc | Query disambiguation |
US20090259506A1 (en) * | 1999-09-14 | 2009-10-15 | Barney Jonathan A | Method and system for rating patents and other intangible assets |
US7640336B1 (en) | 2002-12-30 | 2009-12-29 | Aol Llc | Supervising user interaction with online services |
US20100010973A1 (en) * | 2008-07-09 | 2010-01-14 | International Business Machines Corporation | Vector Space Lightweight Directory Access Protocol Data Search |
US20100094823A1 (en) * | 2008-10-14 | 2010-04-15 | Mathieu Lemaire | Enhanced linear presentation of search results based on search result metadata |
US20100257193A1 (en) * | 2009-04-07 | 2010-10-07 | Microsoft Corporation | Search query extension |
WO2010128323A1 (en) * | 2009-05-05 | 2010-11-11 | Aurix Limited | User interface for use in non-deterministic searching |
US20110047145A1 (en) * | 2007-02-19 | 2011-02-24 | Quintura, Inc. | Search engine graphical interface using maps of search terms and images |
US20110047111A1 (en) * | 2005-09-26 | 2011-02-24 | Quintura, Inc. | Use of neural networks for annotating search results |
US7907755B1 (en) | 2006-05-10 | 2011-03-15 | Aol Inc. | Detecting facial similarity based on human perception of facial similarity |
US20110103689A1 (en) * | 2009-11-02 | 2011-05-05 | Harry Urbschat | System and method for obtaining document information |
US20110103688A1 (en) * | 2009-11-02 | 2011-05-05 | Harry Urbschat | System and method for increasing the accuracy of optical character recognition (OCR) |
US20110106823A1 (en) * | 2009-11-02 | 2011-05-05 | Harry Urbschat | System and method of using dynamic variance networks |
US20110113385A1 (en) * | 2009-11-06 | 2011-05-12 | Craig Peter Sayers | Visually representing a hierarchy of category nodes |
US20110179002A1 (en) * | 2010-01-19 | 2011-07-21 | Dell Products L.P. | System and Method for a Vector-Space Search Engine |
US8078557B1 (en) | 2005-09-26 | 2011-12-13 | Dranias Development Llc | Use of neural networks for keyword generation |
US8131702B1 (en) * | 2004-03-31 | 2012-03-06 | Google Inc. | Systems and methods for browsing historical content |
US20120078934A1 (en) * | 2001-08-27 | 2012-03-29 | Bdgb Enterprise Software S.A.R.L. | Method for automatically indexing documents |
US8180754B1 (en) | 2008-04-01 | 2012-05-15 | Dranias Development Llc | Semantic neural network for aggregating query searches |
US20120124036A1 (en) * | 2010-11-16 | 2012-05-17 | Microsoft Corporation | Browsing related image search result sets |
US8346953B1 (en) | 2007-12-18 | 2013-01-01 | AOL, Inc. | Methods and systems for restricting electronic content access based on guardian control decisions |
US20130091436A1 (en) * | 2006-06-22 | 2013-04-11 | Linkedin Corporation | Content visualization |
CN101996191B (en) * | 2009-08-14 | 2013-08-07 | 北京大学 | Method and system for searching for two-dimensional cross-media element |
US8595146B1 (en) | 2004-03-15 | 2013-11-26 | Aol Inc. | Social networking permissions |
US20140115527A1 (en) * | 2012-10-24 | 2014-04-24 | Lucky Sort, Inc. | Gathering, selecting and graphing n-grams |
US8849807B2 (en) | 2010-05-25 | 2014-09-30 | Mark F. McLellan | Active search results page ranking technology |
US9058395B2 (en) | 2003-05-30 | 2015-06-16 | Microsoft Technology Licensing, Llc | Resolving queries based on automatic determination of requestor geographic location |
US9159584B2 (en) | 2000-08-18 | 2015-10-13 | Gannady Lapir | Methods and systems of retrieving documents |
US9396254B1 (en) * | 2007-07-20 | 2016-07-19 | Hewlett-Packard Development Company, L.P. | Generation of representative document components |
US9602573B1 (en) | 2007-09-24 | 2017-03-21 | National Science Foundation | Automatic clustering for self-organizing grids |
US10095778B2 (en) | 2005-09-27 | 2018-10-09 | Patentratings, Llc | Method and system for probabilistically quantifying and visualizing relevance between two or more citationally or contextually related data objects |
US10546273B2 (en) | 2008-10-23 | 2020-01-28 | Black Hills Ip Holdings, Llc | Patent mapping |
US10579662B2 (en) | 2013-04-23 | 2020-03-03 | Black Hills Ip Holdings, Llc | Patent claim scope evaluator |
US10606878B2 (en) | 2017-04-03 | 2020-03-31 | Relativity Oda Llc | Technology for visualizing clusters of electronic documents |
US10614082B2 (en) | 2011-10-03 | 2020-04-07 | Black Hills Ip Holdings, Llc | Patent mapping |
US10810693B2 (en) | 2005-05-27 | 2020-10-20 | Black Hills Ip Holdings, Llc | Method and apparatus for cross-referencing important IP relationships |
US10860657B2 (en) | 2011-10-03 | 2020-12-08 | Black Hills Ip Holdings, Llc | Patent mapping |
US11461862B2 (en) | 2012-08-20 | 2022-10-04 | Black Hills Ip Holdings, Llc | Analytics generation for patent portfolio management |
US11467883B2 (en) | 2004-03-13 | 2022-10-11 | Iii Holdings 12, Llc | Co-allocating a reservation spanning different compute resources types |
US11496415B2 (en) | 2005-04-07 | 2022-11-08 | Iii Holdings 12, Llc | On-demand access to compute resources |
US11494235B2 (en) | 2004-11-08 | 2022-11-08 | Iii Holdings 12, Llc | System and method of providing system jobs within a compute environment |
US11526304B2 (en) | 2009-10-30 | 2022-12-13 | Iii Holdings 2, Llc | Memcached server functionality in a cluster of data processing nodes |
US11630704B2 (en) | 2004-08-20 | 2023-04-18 | Iii Holdings 12, Llc | System and method for a workload management and scheduling module to manage access to a compute environment according to local and non-local user identity information |
US11652706B2 (en) | 2004-06-18 | 2023-05-16 | Iii Holdings 12, Llc | System and method for providing dynamic provisioning within a compute environment |
US11650857B2 (en) | 2006-03-16 | 2023-05-16 | Iii Holdings 12, Llc | System and method for managing a hybrid computer environment |
US11658916B2 (en) | 2005-03-16 | 2023-05-23 | Iii Holdings 12, Llc | Simple integration of an on-demand compute environment |
US11714839B2 (en) | 2011-05-04 | 2023-08-01 | Black Hills Ip Holdings, Llc | Apparatus and method for automated and assisted patent claim mapping and expense planning |
US11720290B2 (en) | 2009-10-30 | 2023-08-08 | Iii Holdings 2, Llc | Memcached server functionality in a cluster of data processing nodes |
US11960937B2 (en) | 2004-03-13 | 2024-04-16 | Iii Holdings 12, Llc | System and method for an optimizing reservation in time of compute resources based on prioritization function and reservation policy parameter |
US12120040B2 (en) | 2005-03-16 | 2024-10-15 | Iii Holdings 12, Llc | On-demand compute environment |
US12147439B2 (en) | 2023-06-27 | 2024-11-19 | Black Hills IP Holdings, LLC. | Patent mapping |
Families Citing this family (300)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AUPQ131399A0 (en) * | 1999-06-30 | 1999-07-22 | Silverbrook Research Pty Ltd | A method and apparatus (NPAGE02) |
US6326988B1 (en) * | 1999-06-08 | 2001-12-04 | Monkey Media, Inc. | Method, apparatus and article of manufacture for displaying content in a multi-dimensional topic space |
US6883135B1 (en) | 2000-01-28 | 2005-04-19 | Microsoft Corporation | Proxy server using a statistical model |
US6704727B1 (en) * | 2000-01-31 | 2004-03-09 | Overture Services, Inc. | Method and system for generating a set of search terms |
US7333983B2 (en) * | 2000-02-03 | 2008-02-19 | Hitachi, Ltd. | Method of and an apparatus for retrieving and delivering documents and a recording media on which a program for retrieving and delivering documents are stored |
CA2400161C (en) * | 2000-02-22 | 2015-11-24 | Metacarta, Inc. | Spatially coding and displaying information |
US8645137B2 (en) | 2000-03-16 | 2014-02-04 | Apple Inc. | Fast, language-independent method for user authentication by voice |
DE10014066A1 (en) * | 2000-03-22 | 2001-09-27 | Definiens Ag | Computerized object location method uses repetition of object group selection, evaluation and assessment until required object is located |
US6636848B1 (en) * | 2000-05-31 | 2003-10-21 | International Business Machines Corporation | Information search using knowledge agents |
JP3499808B2 (en) * | 2000-06-29 | 2004-02-23 | 本田技研工業株式会社 | Electronic document classification system |
US20020078091A1 (en) * | 2000-07-25 | 2002-06-20 | Sonny Vu | Automatic summarization of a document |
US7103838B1 (en) * | 2000-08-18 | 2006-09-05 | Firstrain, Inc. | Method and apparatus for extracting relevant data |
US7080073B1 (en) | 2000-08-18 | 2006-07-18 | Firstrain, Inc. | Method and apparatus for focused crawling |
US6915294B1 (en) | 2000-08-18 | 2005-07-05 | Firstrain, Inc. | Method and apparatus for searching network resources |
AUPR208000A0 (en) * | 2000-12-15 | 2001-01-11 | 80-20 Software Pty Limited | Method of document searching |
EP1244027A1 (en) * | 2001-03-23 | 2002-09-25 | Siemens Business Services GmbH & Co. OHG | Method of categorizing a document into a document hierarchy |
US7188141B2 (en) * | 2001-06-29 | 2007-03-06 | International Business Machines Corporation | Method and system for collaborative web research |
US7117434B2 (en) | 2001-06-29 | 2006-10-03 | International Business Machines Corporation | Graphical web browsing interface for spatial data navigation and method of navigating data blocks |
US8316306B2 (en) * | 2001-10-15 | 2012-11-20 | Maya-Systems Inc. | Method and system for sequentially navigating axes of elements |
US7606819B2 (en) * | 2001-10-15 | 2009-10-20 | Maya-Systems Inc. | Multi-dimensional locating system and method |
US7680817B2 (en) * | 2001-10-15 | 2010-03-16 | Maya-Systems Inc. | Multi-dimensional locating system and method |
US20030084219A1 (en) * | 2001-10-26 | 2003-05-01 | Maxxan Systems, Inc. | System, apparatus and method for address forwarding for a computer network |
US7085846B2 (en) * | 2001-12-31 | 2006-08-01 | Maxxan Systems, Incorporated | Buffer to buffer credit flow control for computer network |
US7145914B2 (en) | 2001-12-31 | 2006-12-05 | Maxxan Systems, Incorporated | System and method for controlling data paths of a network processor subsystem |
US20050114198A1 (en) * | 2003-11-24 | 2005-05-26 | Ross Koningstein | Using concepts for ad targeting |
US7379970B1 (en) | 2002-04-05 | 2008-05-27 | Ciphermax, Inc. | Method and system for reduced distributed event handling in a network environment |
US20030195956A1 (en) * | 2002-04-15 | 2003-10-16 | Maxxan Systems, Inc. | System and method for allocating unique zone membership |
US20030200330A1 (en) * | 2002-04-22 | 2003-10-23 | Maxxan Systems, Inc. | System and method for load-sharing computer network switch |
US20030202510A1 (en) * | 2002-04-26 | 2003-10-30 | Maxxan Systems, Inc. | System and method for scalable switch fabric for computer network |
US7146359B2 (en) * | 2002-05-03 | 2006-12-05 | Hewlett-Packard Development Company, L.P. | Method and system for filtering content in a discovered topic |
JP4116329B2 (en) * | 2002-05-27 | 2008-07-09 | 株式会社日立製作所 | Document information display system, document information display method, and document search method |
CA2395905A1 (en) * | 2002-07-26 | 2004-01-26 | Teraxion Inc. | Multi-grating tunable chromatic dispersion compensator |
US20040030766A1 (en) * | 2002-08-12 | 2004-02-12 | Michael Witkowski | Method and apparatus for switch fabric configuration |
US20080058106A1 (en) * | 2002-10-07 | 2008-03-06 | Maya-Systems Inc. | Multi-dimensional locating game system and method |
AU2003282342A1 (en) * | 2002-11-13 | 2004-06-03 | Kenneth, Nadav | Method and system for using query information to enhance categorization and navigation within the whole knowledge base |
WO2004068300A2 (en) * | 2003-01-25 | 2004-08-12 | Purdue Research Foundation | Methods, systems, and data structures for performing searches on three dimensional objects |
US20040177081A1 (en) * | 2003-03-03 | 2004-09-09 | Scott Dresden | Neural-based internet search engine with fuzzy and learning processes implemented at multiple levels |
US8055669B1 (en) * | 2003-03-03 | 2011-11-08 | Google Inc. | Search queries improved based on query semantic information |
EP1611546B1 (en) * | 2003-04-04 | 2013-01-02 | Icosystem Corporation | Methods and systems for interactive evolutionary computing (iec) |
US7283997B1 (en) * | 2003-05-14 | 2007-10-16 | Apple Inc. | System and method for ranking the relevance of documents retrieved by a query |
US20050060287A1 (en) * | 2003-05-16 | 2005-03-17 | Hellman Ziv Z. | System and method for automatic clustering, sub-clustering and cluster hierarchization of search results in cross-referenced databases using articulation nodes |
US7206780B2 (en) * | 2003-06-27 | 2007-04-17 | Sbc Knowledge Ventures, L.P. | Relevance value for each category of a particular search result in the ranked list is estimated based on its rank and actual relevance values |
US7333960B2 (en) | 2003-08-01 | 2008-02-19 | Icosystem Corporation | Methods and systems for applying genetic operators to determine system conditions |
US7356518B2 (en) * | 2003-08-27 | 2008-04-08 | Icosystem Corporation | Methods and systems for multi-participant interactive evolutionary computing |
US8086619B2 (en) * | 2003-09-05 | 2011-12-27 | Google Inc. | System and method for providing search query refinements |
US20050060289A1 (en) * | 2003-09-12 | 2005-03-17 | Mark Keenan | A method for relaxing multiple constraints in search and calculation and then displaying results |
JP4349875B2 (en) * | 2003-09-19 | 2009-10-21 | 株式会社リコー | Document filtering apparatus, document filtering method, and document filtering program |
WO2005050482A1 (en) | 2003-10-21 | 2005-06-02 | Nielsen Media Research, Inc. | Methods and apparatus for fusing databases |
US7584221B2 (en) | 2004-03-18 | 2009-09-01 | Microsoft Corporation | Field weighting in text searching |
US20070300142A1 (en) * | 2005-04-01 | 2007-12-27 | King Martin T | Contextual dynamic advertising based upon captured rendered text |
WO2006014454A1 (en) * | 2004-07-06 | 2006-02-09 | Icosystem Corporation | Methods and apparatus for query refinement using genetic algorithms |
US7707220B2 (en) * | 2004-07-06 | 2010-04-27 | Icosystem Corporation | Methods and apparatus for interactive searching techniques |
US7311666B2 (en) * | 2004-07-10 | 2007-12-25 | Trigeminal Solutions, Inc. | Apparatus for collecting information |
US7395260B2 (en) * | 2004-08-04 | 2008-07-01 | International Business Machines Corporation | Method for providing graphical representations of search results in multiple related histograms |
US8261196B2 (en) * | 2004-08-04 | 2012-09-04 | International Business Machines Corporation | Method for displaying usage metrics as part of search results |
EP1779263A1 (en) * | 2004-08-13 | 2007-05-02 | Swiss Reinsurance Company | Speech and textual analysis device and corresponding method |
US7540051B2 (en) * | 2004-08-20 | 2009-06-02 | Spatial Systems, Inc. | Mapping web sites based on significance of contact and category |
US7606793B2 (en) * | 2004-09-27 | 2009-10-20 | Microsoft Corporation | System and method for scoping searches using index keys |
US7739277B2 (en) * | 2004-09-30 | 2010-06-15 | Microsoft Corporation | System and method for incorporating anchor text into ranking search results |
US7761448B2 (en) | 2004-09-30 | 2010-07-20 | Microsoft Corporation | System and method for ranking search results using click distance |
JP4639734B2 (en) * | 2004-09-30 | 2011-02-23 | 富士ゼロックス株式会社 | Slide content processing apparatus and program |
US7827181B2 (en) * | 2004-09-30 | 2010-11-02 | Microsoft Corporation | Click distance determination |
US20060112079A1 (en) * | 2004-11-23 | 2006-05-25 | International Business Machines Corporation | System and method for generating personalized web pages |
US7716198B2 (en) * | 2004-12-21 | 2010-05-11 | Microsoft Corporation | Ranking search results using feature extraction |
WO2006081698A1 (en) * | 2005-02-02 | 2006-08-10 | Sdn Ag | Search engine based self-teaching system |
US7792833B2 (en) * | 2005-03-03 | 2010-09-07 | Microsoft Corporation | Ranking search results using language types |
US20060200460A1 (en) * | 2005-03-03 | 2006-09-07 | Microsoft Corporation | System and method for ranking search results using file types |
JP4314204B2 (en) * | 2005-03-11 | 2009-08-12 | 株式会社東芝 | Document management method, system and program |
US7519580B2 (en) * | 2005-04-19 | 2009-04-14 | International Business Machines Corporation | Search criteria control system and method |
US8020110B2 (en) * | 2005-05-26 | 2011-09-13 | Weisermazars Llp | Methods for defining queries, generating query results and displaying same |
US20070005588A1 (en) * | 2005-07-01 | 2007-01-04 | Microsoft Corporation | Determining relevance using queries as surrogate content |
US9286388B2 (en) * | 2005-08-04 | 2016-03-15 | Time Warner Cable Enterprises Llc | Method and apparatus for context-specific content delivery |
US7599917B2 (en) * | 2005-08-15 | 2009-10-06 | Microsoft Corporation | Ranking search results using biased click distance |
EP1755051A1 (en) * | 2005-08-15 | 2007-02-21 | Mitsubishi Electric Information Technology Centre Europe B.V. | Method and apparatus for accessing data using a symbolic representation space |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
EP1927058A4 (en) * | 2005-09-21 | 2011-02-02 | Icosystem Corp | System and method for aiding product design and quantifying acceptance |
US20070214119A1 (en) * | 2006-03-07 | 2007-09-13 | Microsoft Corporation | Searching within a Site of a Search Result |
US20070214158A1 (en) * | 2006-03-08 | 2007-09-13 | Yakov Kamen | Method and apparatus for conducting a robust search |
US8150827B2 (en) * | 2006-06-07 | 2012-04-03 | Renew Data Corp. | Methods for enhancing efficiency and cost effectiveness of first pass review of documents |
US7809704B2 (en) * | 2006-06-15 | 2010-10-05 | Microsoft Corporation | Combining spectral and probabilistic clustering |
EP2032224A2 (en) * | 2006-06-26 | 2009-03-11 | Icosystem Corporation | Methods and systems for interactive customization of avatars and other animate or inanimate items in video games |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
KR100878535B1 (en) * | 2006-10-26 | 2009-01-13 | 삼성전자주식회사 | Apparatus and method for searching for contents |
JP4856196B2 (en) * | 2007-01-30 | 2012-01-18 | 富士通株式会社 | Configuration check information collection method, configuration check information collection device, and configuration check information collection program |
US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
US7933904B2 (en) * | 2007-04-10 | 2011-04-26 | Nelson Cliff | File search engine and computerized method of tagging files with vectors |
US8484192B1 (en) * | 2007-04-30 | 2013-07-09 | Google Inc. | Media search broadening |
US8631440B2 (en) | 2007-04-30 | 2014-01-14 | Google Inc. | Program guide user interface |
US20090225322A1 (en) * | 2007-05-07 | 2009-09-10 | Sparta, Inc. | Selection of interrogation wavelengths in optical bio-detection systems |
US9218412B2 (en) * | 2007-05-10 | 2015-12-22 | Microsoft Technology Licensing, Llc | Searching a database of listings |
US8826123B2 (en) * | 2007-05-25 | 2014-09-02 | 9224-5489 Quebec Inc. | Timescale for presenting information |
EP2176730A4 (en) * | 2007-08-08 | 2011-04-20 | Baynote Inc | Method and apparatus for context-based content recommendation |
US8601392B2 (en) * | 2007-08-22 | 2013-12-03 | 9224-5489 Quebec Inc. | Timeline for presenting information |
US8069404B2 (en) | 2007-08-22 | 2011-11-29 | Maya-Systems Inc. | Method of managing expected documents and system providing same |
US20090055368A1 (en) * | 2007-08-24 | 2009-02-26 | Gaurav Rewari | Content classification and extraction apparatus, systems, and methods |
US20090055242A1 (en) * | 2007-08-24 | 2009-02-26 | Gaurav Rewari | Content identification and classification apparatus, systems, and methods |
US7716228B2 (en) * | 2007-09-25 | 2010-05-11 | Firstrain, Inc. | Content quality apparatus, systems, and methods |
US9348912B2 (en) * | 2007-10-18 | 2016-05-24 | Microsoft Technology Licensing, Llc | Document length as a static relevance feature for ranking search results |
US7840569B2 (en) | 2007-10-18 | 2010-11-23 | Microsoft Corporation | Enterprise relevancy ranking using a neural network |
US20090106221A1 (en) * | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Ranking and Providing Search Results Based In Part On A Number Of Click-Through Features |
US8332411B2 (en) * | 2007-10-19 | 2012-12-11 | Microsoft Corporation | Boosting a ranker for improved ranking accuracy |
US7779019B2 (en) * | 2007-10-19 | 2010-08-17 | Microsoft Corporation | Linear combination of rankers |
US7937389B2 (en) * | 2007-11-01 | 2011-05-03 | Ut-Battelle, Llc | Dynamic reduction of dimensions of a document vector in a document search and retrieval system |
US7895225B1 (en) * | 2007-12-06 | 2011-02-22 | Amazon Technologies, Inc. | Identifying potential duplicates of a document in a document corpus |
US9330720B2 (en) | 2008-01-03 | 2016-05-03 | Apple Inc. | Methods and apparatus for altering audio output signals |
US8260772B2 (en) * | 2008-01-31 | 2012-09-04 | SAP France S.A. | Apparatus and method for displaying documents relevant to the content of a website |
US8615733B2 (en) * | 2008-01-31 | 2013-12-24 | SAP France S.A. | Building a component to display documents relevant to the content of a website |
JP4850859B2 (en) * | 2008-02-05 | 2012-01-11 | 株式会社コナミデジタルエンタテインメント | Information processing apparatus, information processing apparatus control method, and program |
US8739050B2 (en) * | 2008-03-07 | 2014-05-27 | 9224-5489 Quebec Inc. | Documents discrimination system and method thereof |
US20090248669A1 (en) * | 2008-04-01 | 2009-10-01 | Nitin Mangesh Shetti | Method and system for organizing information |
US8645409B1 (en) * | 2008-04-02 | 2014-02-04 | Google Inc. | Contextual search term evaluation |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US8812493B2 (en) | 2008-04-11 | 2014-08-19 | Microsoft Corporation | Search results ranking using editing distance and document information |
US8032469B2 (en) * | 2008-05-06 | 2011-10-04 | Microsoft Corporation | Recommending similar content identified with a neural network |
US10496753B2 (en) | 2010-01-18 | 2019-12-03 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
KR100987330B1 (en) * | 2008-05-21 | 2010-10-13 | 성균관대학교산학협력단 | A system and method generating multi-concept networks based on user's web usage data |
US20090313286A1 (en) * | 2008-06-17 | 2009-12-17 | Microsoft Corporation | Generating training data from click logs |
JP5226401B2 (en) * | 2008-06-25 | 2013-07-03 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Apparatus and method for supporting retrieval of document data |
US8171031B2 (en) * | 2008-06-27 | 2012-05-01 | Microsoft Corporation | Index optimization for ranking using a linear model |
US8161036B2 (en) * | 2008-06-27 | 2012-04-17 | Microsoft Corporation | Index optimization for ranking using a linear model |
US20100030549A1 (en) | 2008-07-31 | 2010-02-04 | Lee Michael M | Mobile device having human language translation capability with positional feedback |
US20120166414A1 (en) * | 2008-08-11 | 2012-06-28 | Ultra Unilimited Corporation (dba Publish) | Systems and methods for relevance scoring |
US8607155B2 (en) * | 2008-09-12 | 2013-12-10 | 9224-5489 Quebec Inc. | Method of managing groups of arrays of documents |
US9317599B2 (en) * | 2008-09-19 | 2016-04-19 | Nokia Technologies Oy | Method, apparatus and computer program product for providing relevance indication |
JPWO2010058519A1 (en) * | 2008-11-18 | 2012-04-19 | 日本電気株式会社 | Hybrid search system, hybrid search method, and hybrid search program |
US8396865B1 (en) * | 2008-12-10 | 2013-03-12 | Google Inc. | Sharing search engine relevance data between corpora |
WO2010067118A1 (en) | 2008-12-11 | 2010-06-17 | Novauris Technologies Limited | Speech recognition involving a mobile device |
US10255566B2 (en) | 2011-06-03 | 2019-04-09 | Apple Inc. | Generating and processing task items that represent tasks to perform |
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
US9858925B2 (en) | 2009-06-05 | 2018-01-02 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
US8321398B2 (en) * | 2009-07-01 | 2012-11-27 | Thomson Reuters (Markets) Llc | Method and system for determining relevance of terms in text documents |
US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
US9298722B2 (en) * | 2009-07-16 | 2016-03-29 | Novell, Inc. | Optimal sequential (de)compression of digital data |
US20110161340A1 (en) * | 2009-12-31 | 2011-06-30 | Honeywell International Inc. | Long-term query refinement system |
US10705794B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
US10553209B2 (en) | 2010-01-18 | 2020-02-04 | Apple Inc. | Systems and methods for hands-free notification summaries |
US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant |
US10679605B2 (en) | 2010-01-18 | 2020-06-09 | Apple Inc. | Hands-free list-reading by intelligent automated assistant |
US8977584B2 (en) | 2010-01-25 | 2015-03-10 | Newvaluexchange Global Ai Llp | Apparatuses, methods and systems for a digital conversation management platform |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US8782734B2 (en) * | 2010-03-10 | 2014-07-15 | Novell, Inc. | Semantic controls on data storage and access |
US8339094B2 (en) * | 2010-03-11 | 2012-12-25 | GM Global Technology Operations LLC | Methods, systems and apparatus for overmodulation of a five-phase machine |
US10643227B1 (en) | 2010-03-23 | 2020-05-05 | Aurea Software, Inc. | Business lines |
US11367295B1 (en) | 2010-03-23 | 2022-06-21 | Aurea Software, Inc. | Graphical user interface for presentation of events |
US10546311B1 (en) | 2010-03-23 | 2020-01-28 | Aurea Software, Inc. | Identifying competitors of companies |
US9760634B1 (en) | 2010-03-23 | 2017-09-12 | Firstrain, Inc. | Models for classifying documents |
US8832103B2 (en) * | 2010-04-13 | 2014-09-09 | Novell, Inc. | Relevancy filter for new data based on underlying files |
US9129300B2 (en) * | 2010-04-21 | 2015-09-08 | Yahoo! Inc. | Using external sources for sponsored search AD selection |
US20110264530A1 (en) | 2010-04-23 | 2011-10-27 | Bryan Santangelo | Apparatus and methods for dynamic secondary content and data insertion and delivery |
JP5083367B2 (en) * | 2010-04-27 | 2012-11-28 | カシオ計算機株式会社 | SEARCH DEVICE, SEARCH METHOD, AND COMPUTER PROGRAM |
US8661047B2 (en) * | 2010-05-17 | 2014-02-25 | Microsoft Corporation | Geometric mechanism for privacy-preserving answers |
US8738635B2 (en) | 2010-06-01 | 2014-05-27 | Microsoft Corporation | Detection of junk in search result ranking |
US8713021B2 (en) * | 2010-07-07 | 2014-04-29 | Apple Inc. | Unsupervised document clustering using latent semantic density analysis |
US8825649B2 (en) * | 2010-07-21 | 2014-09-02 | Microsoft Corporation | Smart defaults for data visualizations |
US9703871B1 (en) * | 2010-07-30 | 2017-07-11 | Google Inc. | Generating query refinements using query components |
WO2012027262A1 (en) * | 2010-08-23 | 2012-03-01 | Google Inc. | Parallel document mining |
US10762293B2 (en) | 2010-12-22 | 2020-09-01 | Apple Inc. | Using parts-of-speech tagging and named entity recognition for spelling correction |
US9189129B2 (en) | 2011-02-01 | 2015-11-17 | 9224-5489 Quebec Inc. | Non-homogeneous objects magnification and reduction |
US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
US8355922B1 (en) * | 2011-04-07 | 2013-01-15 | Symantec Corporation | Systems and methods for prioritizing review of items |
US8620933B2 (en) | 2011-04-11 | 2013-12-31 | Google Inc. | Illustrating cross channel conversion paths |
US8510326B2 (en) | 2011-04-11 | 2013-08-13 | Google Inc. | Priority dimensional data conversion path reporting |
US10057736B2 (en) | 2011-06-03 | 2018-08-21 | Apple Inc. | Active transport based notifications |
US9202297B1 (en) | 2011-07-12 | 2015-12-01 | Domo, Inc. | Dynamic expansion of data visualizations |
US9792017B1 (en) | 2011-07-12 | 2017-10-17 | Domo, Inc. | Automatic creation of drill paths |
US10001898B1 (en) | 2011-07-12 | 2018-06-19 | Domo, Inc. | Automated provisioning of relational information for a summary data visualization |
US8655907B2 (en) | 2011-07-18 | 2014-02-18 | Google Inc. | Multi-channel conversion path position reporting |
US8959450B2 (en) | 2011-08-22 | 2015-02-17 | Google Inc. | Path explorer visualization |
US8994660B2 (en) | 2011-08-29 | 2015-03-31 | Apple Inc. | Text correction processing |
CA2790799C (en) | 2011-09-25 | 2023-03-21 | Mathieu Audet | Method and apparatus of navigating information element axes |
US8782042B1 (en) | 2011-10-14 | 2014-07-15 | Firstrain, Inc. | Method and system for identifying entities |
US9495462B2 (en) | 2012-01-27 | 2016-11-15 | Microsoft Technology Licensing, Llc | Re-ranking search results |
US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
US9483461B2 (en) | 2012-03-06 | 2016-11-01 | Apple Inc. | Handling speech synthesis of content for multiple languages |
US8880525B2 (en) * | 2012-04-02 | 2014-11-04 | Xerox Corporation | Full and semi-batch clustering |
US9015201B2 (en) * | 2012-04-24 | 2015-04-21 | Honeywell International Inc. | Discriminative classification using index-based ranking of large multimedia archives |
US9280610B2 (en) | 2012-05-14 | 2016-03-08 | Apple Inc. | Crowd sourcing information to fulfill user requests |
US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
US9519693B2 (en) | 2012-06-11 | 2016-12-13 | 9224-5489 Quebec Inc. | Method and apparatus for displaying data element axes |
US9646080B2 (en) | 2012-06-12 | 2017-05-09 | 9224-5489 Quebec Inc. | Multi-functions axis-based interface |
US8977613B1 (en) | 2012-06-12 | 2015-03-10 | Firstrain, Inc. | Generation of recurring searches |
US9495129B2 (en) | 2012-06-29 | 2016-11-15 | Apple Inc. | Device, method, and user interface for voice-activated navigation and browsing of a document |
US8793258B2 (en) * | 2012-07-31 | 2014-07-29 | Hewlett-Packard Development Company, L.P. | Predicting sharing on a social network |
US9576574B2 (en) | 2012-09-10 | 2017-02-21 | Apple Inc. | Context-sensitive handling of interruptions by intelligent digital assistant |
US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
US10467530B2 (en) * | 2012-11-08 | 2019-11-05 | Georges Harik | Searching text via function learning |
US10592480B1 (en) | 2012-12-30 | 2020-03-17 | Aurea Software, Inc. | Affinity scoring |
US9418145B2 (en) | 2013-02-04 | 2016-08-16 | TextWise Company, LLC | Method and system for visualizing documents |
KR102423670B1 (en) | 2013-02-07 | 2022-07-22 | 애플 인크. | Voice trigger for a digital assistant |
US20140250376A1 (en) * | 2013-03-04 | 2014-09-04 | Microsoft Corporation | Summarizing and navigating data using counting grids |
US9368114B2 (en) | 2013-03-14 | 2016-06-14 | Apple Inc. | Context-sensitive handling of interruptions |
AU2014233517B2 (en) | 2013-03-15 | 2017-05-25 | Apple Inc. | Training an at least partial voice command system |
WO2014144579A1 (en) | 2013-03-15 | 2014-09-18 | Apple Inc. | System and method for updating an adaptive speech recognition model |
US10127475B1 (en) | 2013-05-31 | 2018-11-13 | Google Llc | Classifying images |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion |
WO2014197336A1 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
WO2014197335A1 (en) | 2013-06-08 | 2014-12-11 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
KR101922663B1 (en) | 2013-06-09 | 2018-11-28 | 애플 인크. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant |
US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
JP2016521948A (en) | 2013-06-13 | 2016-07-25 | アップル インコーポレイテッド | System and method for emergency calls initiated by voice command |
KR101749009B1 (en) | 2013-08-06 | 2017-06-19 | 애플 인크. | Auto-activating smart responses based on activities from remote devices |
US9519859B2 (en) | 2013-09-06 | 2016-12-13 | Microsoft Technology Licensing, Llc | Deep structured semantic model produced using click-through data |
US8744840B1 (en) | 2013-10-11 | 2014-06-03 | Realfusion LLC | Method and system for n-dimentional, language agnostic, entity, meaning, place, time, and words mapping |
US20150199339A1 (en) * | 2014-01-14 | 2015-07-16 | Xerox Corporation | Semantic refining of cross-lingual information retrieval results |
US9477654B2 (en) | 2014-04-01 | 2016-10-25 | Microsoft Corporation | Convolutional latent semantic models and their applications |
US9535960B2 (en) | 2014-04-14 | 2017-01-03 | Microsoft Corporation | Context-sensitive search using a deep learning model |
US9620105B2 (en) | 2014-05-15 | 2017-04-11 | Apple Inc. | Analyzing audio input for efficient speech and music recognition |
US10592095B2 (en) | 2014-05-23 | 2020-03-17 | Apple Inc. | Instantaneous speaking of content on touch devices |
US9502031B2 (en) | 2014-05-27 | 2016-11-22 | Apple Inc. | Method for supporting dynamic grammars in WFST-based ASR |
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
US10289433B2 (en) | 2014-05-30 | 2019-05-14 | Apple Inc. | Domain specific language for encoding assistant dialog |
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
US9842101B2 (en) | 2014-05-30 | 2017-12-12 | Apple Inc. | Predictive conversion of language input |
US9734193B2 (en) | 2014-05-30 | 2017-08-15 | Apple Inc. | Determining domain salience ranking from ambiguous words in natural speech |
US9785630B2 (en) | 2014-05-30 | 2017-10-10 | Apple Inc. | Text prediction using combined word N-gram and unigram language models |
AU2015266863B2 (en) | 2014-05-30 | 2018-03-15 | Apple Inc. | Multi-command single utterance input method |
US10078631B2 (en) | 2014-05-30 | 2018-09-18 | Apple Inc. | Entropy-guided text prediction using combined word and character n-gram language models |
US9633004B2 (en) | 2014-05-30 | 2017-04-25 | Apple Inc. | Better resolution when referencing to concepts |
US9760559B2 (en) | 2014-05-30 | 2017-09-12 | Apple Inc. | Predictive text input |
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US10659851B2 (en) | 2014-06-30 | 2020-05-19 | Apple Inc. | Real-time digital assistant knowledge updates |
US20160019316A1 (en) | 2014-07-21 | 2016-01-21 | Splunk Inc. | Wizard for creating a correlation search |
US10089580B2 (en) | 2014-08-11 | 2018-10-02 | Microsoft Technology Licensing, Llc | Generating and using a knowledge-enhanced model |
US10446141B2 (en) | 2014-08-28 | 2019-10-15 | Apple Inc. | Automatic speech recognition based on user feedback |
US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger |
US9606986B2 (en) | 2014-09-29 | 2017-03-28 | Apple Inc. | Integrated word N-gram and class M-gram language models |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
US9646609B2 (en) | 2014-09-30 | 2017-05-09 | Apple Inc. | Caching apparatus for serving phonetic pronunciations |
US9886432B2 (en) | 2014-09-30 | 2018-02-06 | Apple Inc. | Parsimonious handling of word inflection via categorical stem + suffix N-gram language models |
US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition |
US10552013B2 (en) | 2014-12-02 | 2020-02-04 | Apple Inc. | Data detection |
US9711141B2 (en) | 2014-12-09 | 2017-07-18 | Apple Inc. | Disambiguating heteronyms in speech synthesis |
US9865280B2 (en) | 2015-03-06 | 2018-01-09 | Apple Inc. | Structured dictation using intelligent automated assistants |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
US9899019B2 (en) | 2015-03-18 | 2018-02-20 | Apple Inc. | Systems and methods for structured stem and suffix language models |
US9842105B2 (en) | 2015-04-16 | 2017-12-12 | Apple Inc. | Parsimonious continuous-space phrase representations for natural language processing |
US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance |
US10127220B2 (en) | 2015-06-04 | 2018-11-13 | Apple Inc. | Language identification from short strings |
US10101822B2 (en) | 2015-06-05 | 2018-10-16 | Apple Inc. | Language input correction |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10186254B2 (en) | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection |
US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models |
US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US10372714B2 (en) * | 2016-02-05 | 2019-08-06 | International Business Machines Corporation | Automated determination of document utility for a document corpus |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
JP6400037B2 (en) * | 2016-03-17 | 2018-10-03 | ヤフー株式会社 | Determination apparatus and determination method |
US10909450B2 (en) | 2016-03-29 | 2021-02-02 | Microsoft Technology Licensing, Llc | Multiple-action computational model training and operation |
CN105808526B (en) * | 2016-03-30 | 2019-07-30 | 北京京东尚科信息技术有限公司 | Commodity short text core word extracting method and device |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
DK179588B1 (en) | 2016-06-09 | 2019-02-22 | Apple Inc. | Intelligent automated assistant in a home environment |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
US10810377B2 (en) | 2017-01-31 | 2020-10-20 | Boomi, Inc. | Method and system for information retreival |
US11188824B2 (en) * | 2017-02-17 | 2021-11-30 | Google Llc | Cooperatively training and/or using separate input and subsequent content neural networks for information retrieval |
DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
DK179560B1 (en) | 2017-05-16 | 2019-02-18 | Apple Inc. | Far-field extension for digital assistant services |
CN107180103A (en) * | 2017-05-31 | 2017-09-19 | 成都明途科技有限公司 | The more fast and convenient interactive system of search |
CA3007166C (en) | 2017-06-05 | 2024-04-30 | 9224-5489 Quebec Inc. | Method and apparatus of aligning information element axes |
US10592542B2 (en) * | 2017-08-31 | 2020-03-17 | International Business Machines Corporation | Document ranking by contextual vectors from natural language query |
CN107957989B9 (en) | 2017-10-23 | 2021-01-12 | 创新先进技术有限公司 | Cluster-based word vector processing method, device and equipment |
CN108170663A (en) | 2017-11-14 | 2018-06-15 | 阿里巴巴集团控股有限公司 | Term vector processing method, device and equipment based on cluster |
US20200019632A1 (en) * | 2018-07-11 | 2020-01-16 | Home Depot Product Authority, Llc | Presentation of related and corrected queries for a search engine |
CN109977286B (en) * | 2019-03-21 | 2022-10-28 | 中国科学技术大学 | Information retrieval method based on content |
US11403849B2 (en) | 2019-09-25 | 2022-08-02 | Charter Communications Operating, Llc | Methods and apparatus for characterization of digital content |
CN111339261A (en) * | 2020-03-17 | 2020-06-26 | 北京香侬慧语科技有限责任公司 | Document extraction method and system based on pre-training model |
US12118025B2 (en) * | 2020-10-28 | 2024-10-15 | Ihc Invest, Inc. | Comprehension engine to comprehend contents of selected documents |
US11461372B1 (en) * | 2021-03-18 | 2022-10-04 | Bae Systems Information And Electronic Systems Integration Inc. | Data clustering in logic devices using unsupervised learning |
CN114780673B (en) * | 2022-03-28 | 2024-04-30 | 西安远诺技术转移有限公司 | Scientific and technological achievement management method and platform based on field matching |
Citations (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4599692A (en) | 1984-01-16 | 1986-07-08 | Itt Corporation | Probabilistic learning element employing context drive searching |
US5062143A (en) | 1990-02-23 | 1991-10-29 | Harris Corporation | Trigram-based method of language identification |
US5325298A (en) | 1990-11-07 | 1994-06-28 | Hnc, Inc. | Methods for generating or revising context vectors for a plurality of word stems |
US5649193A (en) | 1993-03-12 | 1997-07-15 | Kabushiki Kaisha Toshiba | Document detection system using detection result presentation for facilitating user's comprehension |
US5675819A (en) | 1994-06-16 | 1997-10-07 | Xerox Corporation | Document information retrieval using global word co-occurrence patterns |
US5696962A (en) | 1993-06-24 | 1997-12-09 | Xerox Corporation | Method for computerized information retrieval using shallow linguistic analysis |
US5706497A (en) | 1994-08-15 | 1998-01-06 | Nec Research Institute, Inc. | Document retrieval using fuzzy-logic inference |
US5713016A (en) | 1995-09-05 | 1998-01-27 | Electronic Data Systems Corporation | Process and system for determining relevance |
US5717913A (en) | 1995-01-03 | 1998-02-10 | University Of Central Florida | Method for detecting and extracting text data using database schemas |
US5724571A (en) | 1995-07-07 | 1998-03-03 | Sun Microsystems, Inc. | Method and apparatus for generating query responses in a computer-based document retrieval system |
US5724567A (en) | 1994-04-25 | 1998-03-03 | Apple Computer, Inc. | System for directing relevance-ranked data objects to computer users |
US5745893A (en) | 1995-11-30 | 1998-04-28 | Electronic Data Systems Corporation | Process and system for arrangement of documents |
US5765150A (en) | 1996-08-09 | 1998-06-09 | Digital Equipment Corporation | Method for statistically projecting the ranking of information |
US5774888A (en) * | 1996-12-30 | 1998-06-30 | Intel Corporation | Method for characterizing a document set using evaluation surrogates |
US5835905A (en) * | 1997-04-09 | 1998-11-10 | Xerox Corporation | System for predicting documents relevant to focus documents by spreading activation through network representations of a linked collection of documents |
US5987446A (en) * | 1996-11-12 | 1999-11-16 | U.S. West, Inc. | Searching large collections of text using multiple search engines concurrently |
US6026397A (en) * | 1996-05-22 | 2000-02-15 | Electronic Data Systems Corporation | Data analysis system and method |
US6029172A (en) * | 1996-08-28 | 2000-02-22 | U.S. Philips Corporation | Method and system for selecting an information item |
US6038561A (en) * | 1996-10-15 | 2000-03-14 | Manning & Napier Information Services | Management and analysis of document information text |
US6041331A (en) * | 1997-04-01 | 2000-03-21 | Manning And Napier Information Services, Llc | Automatic extraction and graphic visualization system and method |
US6216134B1 (en) * | 1998-06-25 | 2001-04-10 | Microsoft Corporation | Method and system for visualization of clusters and classifications |
US6269362B1 (en) * | 1997-12-19 | 2001-07-31 | Alta Vista Company | System and method for monitoring web pages by comparing generated abstracts |
-
1998
- 1998-11-18 US US09/195,773 patent/US6574632B2/en not_active Expired - Fee Related
- 1998-11-18 US US09/195,773 patent/US20030069873A1/en active Granted
-
2003
- 2003-02-03 US US10/356,958 patent/US6701318B2/en not_active Expired - Fee Related
Patent Citations (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4599692A (en) | 1984-01-16 | 1986-07-08 | Itt Corporation | Probabilistic learning element employing context drive searching |
US5062143A (en) | 1990-02-23 | 1991-10-29 | Harris Corporation | Trigram-based method of language identification |
US5325298A (en) | 1990-11-07 | 1994-06-28 | Hnc, Inc. | Methods for generating or revising context vectors for a plurality of word stems |
US5649193A (en) | 1993-03-12 | 1997-07-15 | Kabushiki Kaisha Toshiba | Document detection system using detection result presentation for facilitating user's comprehension |
US5696962A (en) | 1993-06-24 | 1997-12-09 | Xerox Corporation | Method for computerized information retrieval using shallow linguistic analysis |
US5724567A (en) | 1994-04-25 | 1998-03-03 | Apple Computer, Inc. | System for directing relevance-ranked data objects to computer users |
US5675819A (en) | 1994-06-16 | 1997-10-07 | Xerox Corporation | Document information retrieval using global word co-occurrence patterns |
US5706497A (en) | 1994-08-15 | 1998-01-06 | Nec Research Institute, Inc. | Document retrieval using fuzzy-logic inference |
US5717913A (en) | 1995-01-03 | 1998-02-10 | University Of Central Florida | Method for detecting and extracting text data using database schemas |
US5724571A (en) | 1995-07-07 | 1998-03-03 | Sun Microsystems, Inc. | Method and apparatus for generating query responses in a computer-based document retrieval system |
US5713016A (en) | 1995-09-05 | 1998-01-27 | Electronic Data Systems Corporation | Process and system for determining relevance |
US5745893A (en) | 1995-11-30 | 1998-04-28 | Electronic Data Systems Corporation | Process and system for arrangement of documents |
US6026397A (en) * | 1996-05-22 | 2000-02-15 | Electronic Data Systems Corporation | Data analysis system and method |
US5765150A (en) | 1996-08-09 | 1998-06-09 | Digital Equipment Corporation | Method for statistically projecting the ranking of information |
US6029172A (en) * | 1996-08-28 | 2000-02-22 | U.S. Philips Corporation | Method and system for selecting an information item |
US6038561A (en) * | 1996-10-15 | 2000-03-14 | Manning & Napier Information Services | Management and analysis of document information text |
US5987446A (en) * | 1996-11-12 | 1999-11-16 | U.S. West, Inc. | Searching large collections of text using multiple search engines concurrently |
US5774888A (en) * | 1996-12-30 | 1998-06-30 | Intel Corporation | Method for characterizing a document set using evaluation surrogates |
US6041331A (en) * | 1997-04-01 | 2000-03-21 | Manning And Napier Information Services, Llc | Automatic extraction and graphic visualization system and method |
US5835905A (en) * | 1997-04-09 | 1998-11-10 | Xerox Corporation | System for predicting documents relevant to focus documents by spreading activation through network representations of a linked collection of documents |
US6269362B1 (en) * | 1997-12-19 | 2001-07-31 | Alta Vista Company | System and method for monitoring web pages by comparing generated abstracts |
US6216134B1 (en) * | 1998-06-25 | 2001-04-10 | Microsoft Corporation | Method and system for visualization of clusters and classifications |
Non-Patent Citations (3)
Title |
---|
"Displaying Data in Multidimensional Relevance Space with 2D Visualization Maps," Assa et al., Proceedings of the 1997 Visualization Conference, Phoenix, Arizona, USA, ACM Press, pp. 127-134.* * |
Cavnar, Using an N-Gram-Based Document Representation with a Vector Processing Retrieval Model, Ann Arbor, Michigan, 9 pages Date Unknown. |
Shaw et al., Combination of Multiple Searches, Department of Computer Science, Virginia Tech., Blacksburg, Virginia, 4 pages Date Unknown. |
Cited By (225)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070078886A1 (en) * | 1993-11-19 | 2007-04-05 | Rivette Kevin G | Intellectual property asset manager (IPAM) for context processing of data objects |
US9092545B2 (en) * | 1993-11-19 | 2015-07-28 | Rose Blush Software Llc | Intellectual property asset manager (IPAM) for context processing of data objects |
US20070038588A1 (en) * | 1999-02-02 | 2007-02-15 | Alan Sullivan | Neural network system and method for controlling information output based on user feedback |
US8260733B2 (en) | 1999-02-02 | 2012-09-04 | Garbortese Holdings, Llc | Neural network system and method for controlling information output based on user feedback |
US9177349B2 (en) | 1999-09-14 | 2015-11-03 | Patentratings, Llc | Method and system for rating patents and other intangible assets |
US20090259506A1 (en) * | 1999-09-14 | 2009-10-15 | Barney Jonathan A | Method and system for rating patents and other intangible assets |
US7962511B2 (en) | 1999-09-14 | 2011-06-14 | Patentratings, Llc | Method and system for rating patents and other intangible assets |
US20040220842A1 (en) * | 1999-09-14 | 2004-11-04 | Barney Jonathan A. | Method and system for rating patents and other intangible assets |
US20040117189A1 (en) * | 1999-11-12 | 2004-06-17 | Bennett Ian M. | Query engine for processing voice based queries including semantic decoding |
US7725307B2 (en) * | 1999-11-12 | 2010-05-25 | Phoenix Solutions, Inc. | Query engine for processing voice based queries including semantic decoding |
US7007019B2 (en) * | 1999-12-21 | 2006-02-28 | Matsushita Electric Industrial Co., Ltd. | Vector index preparing method, similar vector searching method, and apparatuses for the methods |
US6799199B1 (en) * | 2000-01-11 | 2004-09-28 | The Relegence Corporation | Media monitor system |
US7191223B1 (en) * | 2000-01-11 | 2007-03-13 | The Relegence Corporation | System and method for real-time alerts |
US6999957B1 (en) * | 2000-01-11 | 2006-02-14 | The Relegence Corporation | System and method for real-time searching |
US20080040660A1 (en) * | 2000-02-23 | 2008-02-14 | Alexander Georke | Method And Apparatus For Processing Electronic Documents |
US6895552B1 (en) * | 2000-05-31 | 2005-05-17 | Ricoh Co., Ltd. | Method and an apparatus for visual summarization of documents |
US7519906B2 (en) | 2000-05-31 | 2009-04-14 | Ricoh Company, Ltd. | Method and an apparatus for visual summarization of documents |
US20050160369A1 (en) * | 2000-05-31 | 2005-07-21 | Marko Balabanovic | Method and an apparatus for visual summarization of documents |
US7523103B2 (en) | 2000-08-08 | 2009-04-21 | Aol Llc | Category searching |
US7359951B2 (en) * | 2000-08-08 | 2008-04-15 | Aol Llc, A Delaware Limited Liability Company | Displaying search results |
US20020103786A1 (en) * | 2000-08-08 | 2002-08-01 | Surendra Goel | Searching content on web pages |
US20020103797A1 (en) * | 2000-08-08 | 2002-08-01 | Surendra Goel | Displaying search results |
US7558805B2 (en) | 2000-08-08 | 2009-07-07 | Aol Llc | Searching content on web pages |
US20060184515A1 (en) * | 2000-08-08 | 2006-08-17 | America Online, Inc., A Delaware Corporation | Category searching |
US7047229B2 (en) | 2000-08-08 | 2006-05-16 | America Online, Inc. | Searching content on web pages |
US20090177655A1 (en) * | 2000-08-08 | 2009-07-09 | Aol Llc | Category searching |
US7225180B2 (en) | 2000-08-08 | 2007-05-29 | Aol Llc | Filtering search results |
US9128992B2 (en) | 2000-08-08 | 2015-09-08 | Microsoft Technology Licensing, Llc | Displaying search results |
US9122731B2 (en) | 2000-08-08 | 2015-09-01 | Microsoft Technology Licensing, Llc | Displaying search results |
US7970750B2 (en) | 2000-08-08 | 2011-06-28 | Aol Inc. | Category searching |
US9116966B2 (en) | 2000-08-08 | 2015-08-25 | Microsoft Technology Licensing, Llc | Displaying search results |
US7984061B1 (en) | 2000-08-08 | 2011-07-19 | Aol Inc. | Displaying search results |
US7007008B2 (en) | 2000-08-08 | 2006-02-28 | America Online, Inc. | Category searching |
US9159584B2 (en) | 2000-08-18 | 2015-10-13 | Gannady Lapir | Methods and systems of retrieving documents |
US20020169765A1 (en) * | 2000-10-27 | 2002-11-14 | Ivan Saltz | Limit engine database management system |
US7158989B2 (en) * | 2000-10-27 | 2007-01-02 | Buc International Corporation | Limit engine database management system |
US20050114325A1 (en) * | 2000-10-30 | 2005-05-26 | Microsoft Corporation | Semi-automatic annotation of multimedia objects |
US7349895B2 (en) | 2000-10-30 | 2008-03-25 | Microsoft Corporation | Semi-automatic annotation of multimedia objects |
US6970860B1 (en) * | 2000-10-30 | 2005-11-29 | Microsoft Corporation | Semi-automatic annotation of multimedia objects |
US7499916B2 (en) | 2000-10-30 | 2009-03-03 | Microsoft Corporation | Image retrieval systems and methods with semantic and feature based relevance feedback |
US7529732B2 (en) | 2000-10-30 | 2009-05-05 | Microsoft Corporation | Image retrieval systems and methods with semantic and feature based relevance feedback |
US7099860B1 (en) * | 2000-10-30 | 2006-08-29 | Microsoft Corporation | Image retrieval systems and methods with semantic and feature based relevance feedback |
US20050055344A1 (en) * | 2000-10-30 | 2005-03-10 | Microsoft Corporation | Image retrieval systems and methods with semantic and feature based relevance feedback |
US20050010553A1 (en) * | 2000-10-30 | 2005-01-13 | Microsoft Corporation | Semi-automatic annotation of multimedia objects |
US20040267740A1 (en) * | 2000-10-30 | 2004-12-30 | Microsoft Corporation | Image retrieval systems and methods with semantic and feature based relevance feedback |
US7627556B2 (en) | 2000-10-30 | 2009-12-01 | Microsoft Corporation | Semi-automatic annotation of multimedia objects |
US20050262133A1 (en) * | 2000-11-25 | 2005-11-24 | Lg Electronics Inc. | Multimedia retrieval method using multi-weighted feature |
US8776222B2 (en) | 2000-12-29 | 2014-07-08 | Facebook, Inc. | Message screening system |
US8099780B2 (en) | 2000-12-29 | 2012-01-17 | Aol Inc. | Message screening system |
US9621501B2 (en) | 2000-12-29 | 2017-04-11 | Facebook, Inc. | Message screening system utilizing supervisory screening and approval |
US20070124690A1 (en) * | 2000-12-29 | 2007-05-31 | Aol Llc | Message screening system |
US9083666B2 (en) | 2000-12-29 | 2015-07-14 | Facebook, Inc. | Message screening system utilizing supervisory screening and approval |
US6727922B2 (en) * | 2001-02-21 | 2004-04-27 | International Business Machines Corporation | GUI for representing entity matches utilizing graphical transitions performed directly on the matching object |
US20020145630A1 (en) * | 2001-02-21 | 2002-10-10 | International Business Machines Corporation | GUI for representing entity matches utilizing graphical transitions performed directly on the matching object |
US20050165805A1 (en) * | 2001-06-29 | 2005-07-28 | International Business Machines Corporation | Method and system for spatial information retrieval for hyperlinked documents |
US20030004941A1 (en) * | 2001-06-29 | 2003-01-02 | International Business Machines Corporation | Method, terminal and computer program for keyword searching |
US20030014405A1 (en) * | 2001-07-09 | 2003-01-16 | Jacob Shapiro | Search engine designed for handling long queries |
US20030066025A1 (en) * | 2001-07-13 | 2003-04-03 | Garner Harold R. | Method and system for information retrieval |
US9141691B2 (en) * | 2001-08-27 | 2015-09-22 | Alexander GOERKE | Method for automatically indexing documents |
US20120078934A1 (en) * | 2001-08-27 | 2012-03-29 | Bdgb Enterprise Software S.A.R.L. | Method for automatically indexing documents |
US7356461B1 (en) * | 2002-01-14 | 2008-04-08 | Nstein Technologies Inc. | Text categorization method and apparatus |
US7024408B2 (en) * | 2002-07-03 | 2006-04-04 | Word Data Corp. | Text-classification code, system and method |
US20040049498A1 (en) * | 2002-07-03 | 2004-03-11 | Dehlinger Peter J. | Text-classification code, system and method |
US7428541B2 (en) * | 2002-12-19 | 2008-09-23 | International Business Machines Corporation | Computer system, method, and program product for generating a data structure for information retrieval, and an associated graphical user interface |
US20090006378A1 (en) * | 2002-12-19 | 2009-01-01 | International Business Machines Corporation | Computer system method and program product for generating a data structure for information retrieval and an associated graphical user interface |
US8229900B2 (en) | 2002-12-19 | 2012-07-24 | International Business Machines Corporation | Generating a data structure for information retrieval |
US20040139067A1 (en) * | 2002-12-19 | 2004-07-15 | International Business Machines Corporation | Computer system, method, and program product for generating a data structure for information retrieval, and an associated graphical user interface |
USRE45558E1 (en) | 2002-12-30 | 2015-06-09 | Facebook, Inc. | Supervising user interaction with online services |
US7904554B1 (en) | 2002-12-30 | 2011-03-08 | Aol Inc. | Supervising user interaction with online services |
US7640336B1 (en) | 2002-12-30 | 2009-12-29 | Aol Llc | Supervising user interaction with online services |
US20040267728A1 (en) * | 2003-01-29 | 2004-12-30 | Kemal Delic | Maintenance of information retrieval systems using global metrics |
US9058395B2 (en) | 2003-05-30 | 2015-06-16 | Microsoft Technology Licensing, Llc | Resolving queries based on automatic determination of requestor geographic location |
US20040254957A1 (en) * | 2003-06-13 | 2004-12-16 | Nokia Corporation | Method and a system for modeling user preferences |
US8230364B2 (en) * | 2003-07-02 | 2012-07-24 | Sony United Kingdom Limited | Information retrieval |
US20050004910A1 (en) * | 2003-07-02 | 2005-01-06 | Trepess David William | Information retrieval |
US20050192926A1 (en) * | 2004-02-18 | 2005-09-01 | International Business Machines Corporation | Hierarchical visualization of a semantic network |
US11960937B2 (en) | 2004-03-13 | 2024-04-16 | Iii Holdings 12, Llc | System and method for an optimizing reservation in time of compute resources based on prioritization function and reservation policy parameter |
US12124878B2 (en) | 2004-03-13 | 2024-10-22 | Iii Holdings 12, Llc | System and method for scheduling resources within a compute environment using a scheduler process with reservation mask function |
US11467883B2 (en) | 2004-03-13 | 2022-10-11 | Iii Holdings 12, Llc | Co-allocating a reservation spanning different compute resources types |
US8005850B2 (en) * | 2004-03-15 | 2011-08-23 | Yahoo! Inc. | Search systems and methods with integration of user annotations |
US8005835B2 (en) * | 2004-03-15 | 2011-08-23 | Yahoo! Inc. | Search systems and methods with integration of aggregate user annotations |
US20050256867A1 (en) * | 2004-03-15 | 2005-11-17 | Yahoo! Inc. | Search systems and methods with integration of aggregate user annotations |
US8595146B1 (en) | 2004-03-15 | 2013-11-26 | Aol Inc. | Social networking permissions |
US20050234891A1 (en) * | 2004-03-15 | 2005-10-20 | Yahoo! Inc. | Search systems and methods with integration of user annotations |
US10367860B2 (en) | 2004-03-15 | 2019-07-30 | Oath Inc. | Social networking permissions |
US8548975B1 (en) | 2004-03-31 | 2013-10-01 | Google Inc. | Systems and methods for browsing historical content |
US9760636B1 (en) | 2004-03-31 | 2017-09-12 | Google Inc. | Systems and methods for browsing historical content |
US8131702B1 (en) * | 2004-03-31 | 2012-03-06 | Google Inc. | Systems and methods for browsing historical content |
US9141703B1 (en) | 2004-03-31 | 2015-09-22 | Google Inc. | Systems and methods for browsing historical content |
EP1587013A1 (en) * | 2004-04-14 | 2005-10-19 | Fujitsu Limited | Information processing technique relating to relation between users and documents |
US11652706B2 (en) | 2004-06-18 | 2023-05-16 | Iii Holdings 12, Llc | System and method for providing dynamic provisioning within a compute environment |
US12009996B2 (en) | 2004-06-18 | 2024-06-11 | Iii Holdings 12, Llc | System and method for providing dynamic provisioning within a compute environment |
US8768908B2 (en) | 2004-07-01 | 2014-07-01 | Facebook, Inc. | Query disambiguation |
US9183250B2 (en) | 2004-07-01 | 2015-11-10 | Facebook, Inc. | Query disambiguation |
US20090222444A1 (en) * | 2004-07-01 | 2009-09-03 | Aol Llc | Query disambiguation |
US9697577B2 (en) | 2004-08-10 | 2017-07-04 | Lucid Patent Llc | Patent mapping |
US11080807B2 (en) | 2004-08-10 | 2021-08-03 | Lucid Patent Llc | Patent mapping |
US11776084B2 (en) | 2004-08-10 | 2023-10-03 | Lucid Patent Llc | Patent mapping |
US20060036451A1 (en) * | 2004-08-10 | 2006-02-16 | Lundberg Steven W | Patent mapping |
US20110072014A1 (en) * | 2004-08-10 | 2011-03-24 | Foundationip, Llc | Patent mapping |
US11630704B2 (en) | 2004-08-20 | 2023-04-18 | Iii Holdings 12, Llc | System and method for a workload management and scheduling module to manage access to a compute environment according to local and non-local user identity information |
US20060059135A1 (en) * | 2004-09-10 | 2006-03-16 | Eran Palmon | Conducting a search directed by a hierarchy-free set of topics |
US20060089926A1 (en) * | 2004-10-27 | 2006-04-27 | Harris Corporation, Corporation Of The State Of Delaware | Method for re-ranking documents retrieved from a document database |
US20060173839A1 (en) * | 2004-10-27 | 2006-08-03 | Harris Corporation | Method for re-ranking documents retrieved from a multi-lingual document database |
US7801887B2 (en) | 2004-10-27 | 2010-09-21 | Harris Corporation | Method for re-ranking documents retrieved from a document database |
US7814105B2 (en) | 2004-10-27 | 2010-10-12 | Harris Corporation | Method for domain identification of documents in a document database |
US7603353B2 (en) | 2004-10-27 | 2009-10-13 | Harris Corporation | Method for re-ranking documents retrieved from a multi-lingual document database |
US20060206483A1 (en) * | 2004-10-27 | 2006-09-14 | Harris Corporation | Method for domain identification of documents in a document database |
US20110016113A1 (en) * | 2004-10-27 | 2011-01-20 | HARRIS CORPORATION, a Delaware corporation. | Method for re-ranking documents retrieved from a document database |
US11537435B2 (en) | 2004-11-08 | 2022-12-27 | Iii Holdings 12, Llc | System and method of providing system jobs within a compute environment |
US11709709B2 (en) | 2004-11-08 | 2023-07-25 | Iii Holdings 12, Llc | System and method of providing system jobs within a compute environment |
US11861404B2 (en) | 2004-11-08 | 2024-01-02 | Iii Holdings 12, Llc | System and method of providing system jobs within a compute environment |
US11886915B2 (en) | 2004-11-08 | 2024-01-30 | Iii Holdings 12, Llc | System and method of providing system jobs within a compute environment |
US11762694B2 (en) | 2004-11-08 | 2023-09-19 | Iii Holdings 12, Llc | System and method of providing system jobs within a compute environment |
US12008405B2 (en) | 2004-11-08 | 2024-06-11 | Iii Holdings 12, Llc | System and method of providing system jobs within a compute environment |
US11537434B2 (en) | 2004-11-08 | 2022-12-27 | Iii Holdings 12, Llc | System and method of providing system jobs within a compute environment |
US11494235B2 (en) | 2004-11-08 | 2022-11-08 | Iii Holdings 12, Llc | System and method of providing system jobs within a compute environment |
US11656907B2 (en) | 2004-11-08 | 2023-05-23 | Iii Holdings 12, Llc | System and method of providing system jobs within a compute environment |
US12039370B2 (en) | 2004-11-08 | 2024-07-16 | Iii Holdings 12, Llc | System and method of providing system jobs within a compute environment |
US20080172368A1 (en) * | 2004-12-29 | 2008-07-17 | Aol Llc | Query routing |
US8135737B2 (en) | 2004-12-29 | 2012-03-13 | Aol Inc. | Query routing |
US20080059456A1 (en) * | 2004-12-29 | 2008-03-06 | Aol Llc, A Delaware Limited Liability Company (Formerly Known As America Online, Inc.) | Domain Expert Search |
US8005813B2 (en) | 2004-12-29 | 2011-08-23 | Aol Inc. | Domain expert search |
US8521713B2 (en) | 2004-12-29 | 2013-08-27 | Microsoft Corporation | Domain expert search |
US20060173817A1 (en) * | 2004-12-29 | 2006-08-03 | Chowdhury Abdur R | Search fusion |
US7818314B2 (en) * | 2004-12-29 | 2010-10-19 | Aol Inc. | Search fusion |
US20060218140A1 (en) * | 2005-02-09 | 2006-09-28 | Battelle Memorial Institute | Method and apparatus for labeling in steered visual analysis of collections of documents |
US20060179051A1 (en) * | 2005-02-09 | 2006-08-10 | Battelle Memorial Institute | Methods and apparatus for steering the analyses of collections of documents |
US12120040B2 (en) | 2005-03-16 | 2024-10-15 | Iii Holdings 12, Llc | On-demand compute environment |
US11658916B2 (en) | 2005-03-16 | 2023-05-23 | Iii Holdings 12, Llc | Simple integration of an on-demand compute environment |
US11522811B2 (en) | 2005-04-07 | 2022-12-06 | Iii Holdings 12, Llc | On-demand access to compute resources |
US11765101B2 (en) | 2005-04-07 | 2023-09-19 | Iii Holdings 12, Llc | On-demand access to compute resources |
US11533274B2 (en) | 2005-04-07 | 2022-12-20 | Iii Holdings 12, Llc | On-demand access to compute resources |
US11496415B2 (en) | 2005-04-07 | 2022-11-08 | Iii Holdings 12, Llc | On-demand access to compute resources |
US11831564B2 (en) | 2005-04-07 | 2023-11-28 | Iii Holdings 12, Llc | On-demand access to compute resources |
US10810693B2 (en) | 2005-05-27 | 2020-10-20 | Black Hills Ip Holdings, Llc | Method and apparatus for cross-referencing important IP relationships |
US11798111B2 (en) | 2005-05-27 | 2023-10-24 | Black Hills Ip Holdings, Llc | Method and apparatus for cross-referencing important IP relationships |
US7984039B2 (en) | 2005-07-14 | 2011-07-19 | International Business Machines Corporation | Merging of results in distributed information retrieval |
US20070016574A1 (en) * | 2005-07-14 | 2007-01-18 | International Business Machines Corporation | Merging of results in distributed information retrieval |
US9201956B2 (en) | 2005-07-27 | 2015-12-01 | Schwegman Lundberg & Woessner, P.A. | Patent mapping |
US20070198578A1 (en) * | 2005-07-27 | 2007-08-23 | Lundberg Steven W | Patent mapping |
US9659071B2 (en) | 2005-07-27 | 2017-05-23 | Schwegman Lundberg & Woessner, P.A. | Patent mapping |
US8161025B2 (en) | 2005-07-27 | 2012-04-17 | Schwegman, Lundberg & Woessner, P.A. | Patent mapping |
US7546280B1 (en) * | 2005-08-30 | 2009-06-09 | Quintura, Inc. | Use of neural networks for keyword generation |
US7949581B2 (en) | 2005-09-07 | 2011-05-24 | Patentratings, Llc | Method of determining an obsolescence rate of a technology |
US20070094297A1 (en) * | 2005-09-07 | 2007-04-26 | Barney Jonathan A | Method of determining an obsolescence rate of a technology |
US20110047111A1 (en) * | 2005-09-26 | 2011-02-24 | Quintura, Inc. | Use of neural networks for annotating search results |
US8229948B1 (en) | 2005-09-26 | 2012-07-24 | Dranias Development Llc | Context-based search query visualization and search query context management using neural networks |
US8533130B2 (en) | 2005-09-26 | 2013-09-10 | Dranias Development Llc | Use of neural networks for annotating search results |
US8078557B1 (en) | 2005-09-26 | 2011-12-13 | Dranias Development Llc | Use of neural networks for keyword generation |
US10095778B2 (en) | 2005-09-27 | 2018-10-09 | Patentratings, Llc | Method and system for probabilistically quantifying and visualizing relevance between two or more citationally or contextually related data objects |
US11650857B2 (en) | 2006-03-16 | 2023-05-16 | Iii Holdings 12, Llc | System and method for managing a hybrid computer environment |
US9773160B2 (en) | 2006-05-10 | 2017-09-26 | Aol Inc. | Using relevance feedback in face recognition |
US8542888B2 (en) | 2006-05-10 | 2013-09-24 | Aol Inc. | Detecting facial similarity based on human perception of facial similarity |
US20090034805A1 (en) * | 2006-05-10 | 2009-02-05 | Aol Llc | Using Relevance Feedback In Face Recognition |
US9141878B2 (en) | 2006-05-10 | 2015-09-22 | Aol Inc. | Detecting facial similarity based on human perception of facial similarity |
US7783085B2 (en) * | 2006-05-10 | 2010-08-24 | Aol Inc. | Using relevance feedback in face recognition |
US7907755B1 (en) | 2006-05-10 | 2011-03-15 | Aol Inc. | Detecting facial similarity based on human perception of facial similarity |
US20110085710A1 (en) * | 2006-05-10 | 2011-04-14 | Aol Inc. | Using relevance feedback in face recognition |
US20110129145A1 (en) * | 2006-05-10 | 2011-06-02 | Aol Inc | Detecting facial similarity based on human perception of facial similarity |
US8194939B2 (en) | 2006-05-10 | 2012-06-05 | Aol Inc. | Using relevance feedback in face recognition |
US8233679B2 (en) | 2006-05-10 | 2012-07-31 | Aol Inc. | Detecting facial similarity based on human perception of facial similarity |
US10067662B2 (en) | 2006-06-22 | 2018-09-04 | Microsoft Technology Licensing, Llc | Content visualization |
US9213471B2 (en) * | 2006-06-22 | 2015-12-15 | Linkedin Corporation | Content visualization |
US8751940B2 (en) * | 2006-06-22 | 2014-06-10 | Linkedin Corporation | Content visualization |
US10042540B2 (en) | 2006-06-22 | 2018-08-07 | Microsoft Technology Licensing, Llc | Content visualization |
US20130091436A1 (en) * | 2006-06-22 | 2013-04-11 | Linkedin Corporation | Content visualization |
US8996993B2 (en) * | 2006-09-15 | 2015-03-31 | Battelle Memorial Institute | Text analysis devices, articles of manufacture, and text analysis methods |
US20080069448A1 (en) * | 2006-09-15 | 2008-03-20 | Turner Alan E | Text analysis devices, articles of manufacture, and text analysis methods |
US8452767B2 (en) * | 2006-09-15 | 2013-05-28 | Battelle Memorial Institute | Text analysis devices, articles of manufacture, and text analysis methods |
US20080071762A1 (en) * | 2006-09-15 | 2008-03-20 | Turner Alan E | Text analysis devices, articles of manufacture, and text analysis methods |
US8533185B2 (en) | 2007-02-19 | 2013-09-10 | Dranias Development Llc | Search engine graphical interface using maps of search terms and images |
US20110047145A1 (en) * | 2007-02-19 | 2011-02-24 | Quintura, Inc. | Search engine graphical interface using maps of search terms and images |
US20080235192A1 (en) * | 2007-03-19 | 2008-09-25 | Mitsuhisa Kanaya | Information retrieval system and information retrieval method |
US9396254B1 (en) * | 2007-07-20 | 2016-07-19 | Hewlett-Packard Development Company, L.P. | Generation of representative document components |
US9602573B1 (en) | 2007-09-24 | 2017-03-21 | National Science Foundation | Automatic clustering for self-organizing grids |
US10735505B2 (en) | 2007-09-24 | 2020-08-04 | The Research Foundation For The State University Of New York | Automatic clustering for self-organizing grids |
US11522952B2 (en) | 2007-09-24 | 2022-12-06 | The Research Foundation For The State University Of New York | Automatic clustering for self-organizing grids |
US8346953B1 (en) | 2007-12-18 | 2013-01-01 | AOL, Inc. | Methods and systems for restricting electronic content access based on guardian control decisions |
US8180754B1 (en) | 2008-04-01 | 2012-05-15 | Dranias Development Llc | Semantic neural network for aggregating query searches |
US20100010973A1 (en) * | 2008-07-09 | 2010-01-14 | International Business Machines Corporation | Vector Space Lightweight Directory Access Protocol Data Search |
US8918383B2 (en) * | 2008-07-09 | 2014-12-23 | International Business Machines Corporation | Vector space lightweight directory access protocol data search |
US20100094823A1 (en) * | 2008-10-14 | 2010-04-15 | Mathieu Lemaire | Enhanced linear presentation of search results based on search result metadata |
US10546273B2 (en) | 2008-10-23 | 2020-01-28 | Black Hills Ip Holdings, Llc | Patent mapping |
US11301810B2 (en) | 2008-10-23 | 2022-04-12 | Black Hills Ip Holdings, Llc | Patent mapping |
US20100257193A1 (en) * | 2009-04-07 | 2010-10-07 | Microsoft Corporation | Search query extension |
US8224839B2 (en) | 2009-04-07 | 2012-07-17 | Microsoft Corporation | Search query extension |
WO2010128323A1 (en) * | 2009-05-05 | 2010-11-11 | Aurix Limited | User interface for use in non-deterministic searching |
US8818998B2 (en) | 2009-05-05 | 2014-08-26 | Aurix Limited | User interface for use in non-deterministic searching |
CN101996191B (en) * | 2009-08-14 | 2013-08-07 | 北京大学 | Method and system for searching for two-dimensional cross-media element |
US11526304B2 (en) | 2009-10-30 | 2022-12-13 | Iii Holdings 2, Llc | Memcached server functionality in a cluster of data processing nodes |
US11720290B2 (en) | 2009-10-30 | 2023-08-08 | Iii Holdings 2, Llc | Memcached server functionality in a cluster of data processing nodes |
US20110106823A1 (en) * | 2009-11-02 | 2011-05-05 | Harry Urbschat | System and method of using dynamic variance networks |
US20110103689A1 (en) * | 2009-11-02 | 2011-05-05 | Harry Urbschat | System and method for obtaining document information |
US9152883B2 (en) | 2009-11-02 | 2015-10-06 | Harry Urbschat | System and method for increasing the accuracy of optical character recognition (OCR) |
US9158833B2 (en) | 2009-11-02 | 2015-10-13 | Harry Urbschat | System and method for obtaining document information |
US9213756B2 (en) | 2009-11-02 | 2015-12-15 | Harry Urbschat | System and method of using dynamic variance networks |
US20110103688A1 (en) * | 2009-11-02 | 2011-05-05 | Harry Urbschat | System and method for increasing the accuracy of optical character recognition (OCR) |
US8954893B2 (en) * | 2009-11-06 | 2015-02-10 | Hewlett-Packard Development Company, L.P. | Visually representing a hierarchy of category nodes |
US20110113385A1 (en) * | 2009-11-06 | 2011-05-12 | Craig Peter Sayers | Visually representing a hierarchy of category nodes |
US20110179002A1 (en) * | 2010-01-19 | 2011-07-21 | Dell Products L.P. | System and Method for a Vector-Space Search Engine |
US8849807B2 (en) | 2010-05-25 | 2014-09-30 | Mark F. McLellan | Active search results page ranking technology |
US9372873B2 (en) | 2010-11-16 | 2016-06-21 | Microsoft Technology Licensing, Llc | Browsing related image search result sets |
US20120124036A1 (en) * | 2010-11-16 | 2012-05-17 | Microsoft Corporation | Browsing related image search result sets |
US9384216B2 (en) * | 2010-11-16 | 2016-07-05 | Microsoft Technology Licensing, Llc | Browsing related image search result sets |
US11714839B2 (en) | 2011-05-04 | 2023-08-01 | Black Hills Ip Holdings, Llc | Apparatus and method for automated and assisted patent claim mapping and expense planning |
US11775538B2 (en) | 2011-10-03 | 2023-10-03 | Black Hills Ip Holdings, Llc | Systems, methods and user interfaces in a patent management system |
US11714819B2 (en) | 2011-10-03 | 2023-08-01 | Black Hills Ip Holdings, Llc | Patent mapping |
US11048709B2 (en) | 2011-10-03 | 2021-06-29 | Black Hills Ip Holdings, Llc | Patent mapping |
US11789954B2 (en) | 2011-10-03 | 2023-10-17 | Black Hills Ip Holdings, Llc | System and method for patent and prior art analysis |
US10860657B2 (en) | 2011-10-03 | 2020-12-08 | Black Hills Ip Holdings, Llc | Patent mapping |
US11797546B2 (en) | 2011-10-03 | 2023-10-24 | Black Hills Ip Holdings, Llc | Patent mapping |
US11803560B2 (en) | 2011-10-03 | 2023-10-31 | Black Hills Ip Holdings, Llc | Patent claim mapping |
US10614082B2 (en) | 2011-10-03 | 2020-04-07 | Black Hills Ip Holdings, Llc | Patent mapping |
US12135724B2 (en) | 2011-10-03 | 2024-11-05 | Black Hills Ip Holdings, Llc | Systems, methods and user interfaces in a patent management system |
US11360988B2 (en) | 2011-10-03 | 2022-06-14 | Black Hills Ip Holdings, Llc | Systems, methods and user interfaces in a patent management system |
US11256706B2 (en) | 2011-10-03 | 2022-02-22 | Black Hills Ip Holdings, Llc | System and method for patent and prior art analysis |
US12008670B2 (en) | 2012-08-20 | 2024-06-11 | Black Hills IP Holdings, LLC. | Analytics generation for patent portfolio management |
US11461862B2 (en) | 2012-08-20 | 2022-10-04 | Black Hills Ip Holdings, Llc | Analytics generation for patent portfolio management |
US20140115527A1 (en) * | 2012-10-24 | 2014-04-24 | Lucky Sort, Inc. | Gathering, selecting and graphing n-grams |
US9536223B2 (en) * | 2012-10-24 | 2017-01-03 | Twitter, Inc. | Gathering, selecting and graphing n-grams |
US11354344B2 (en) | 2013-04-23 | 2022-06-07 | Black Hills Ip Holdings, Llc | Patent claim scope evaluator |
US10579662B2 (en) | 2013-04-23 | 2020-03-03 | Black Hills Ip Holdings, Llc | Patent claim scope evaluator |
US10606878B2 (en) | 2017-04-03 | 2020-03-31 | Relativity Oda Llc | Technology for visualizing clusters of electronic documents |
US12147439B2 (en) | 2023-06-27 | 2024-11-19 | Black Hills IP Holdings, LLC. | Patent mapping |
Also Published As
Publication number | Publication date |
---|---|
US20030130998A1 (en) | 2003-07-10 |
US6701318B2 (en) | 2004-03-02 |
US20030069873A1 (en) | 2003-04-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6574632B2 (en) | Multiple engine information retrieval and visualization system | |
US8024331B2 (en) | Construction of trainable semantic vectors and clustering, classification, and searching using trainable semantic vectors | |
Buckley et al. | New retrieval approaches using SMART: TREC 4 | |
EP0704075B1 (en) | Method and apparatus for indexing searching and displaying data | |
EP1199647B1 (en) | Methods, apparatus and computer program products for information retrieval and document classification utilizing a multidimensional subspace | |
US5625767A (en) | Method and system for two-dimensional visualization of an information taxonomy and of text documents based on topical content of the documents | |
US6611825B1 (en) | Method and system for text mining using multidimensional subspaces | |
US5761685A (en) | Method and system for real-time information analysis of textual material | |
US8676802B2 (en) | Method and system for information retrieval with clustering | |
US7113943B2 (en) | Method for document comparison and selection | |
Croft | Advances in information retrieval: recent research from the center for intelligent information retrieval | |
Caid et al. | Learned vector-space models for document retrieval | |
WO2009154570A1 (en) | System and method for aligning and indexing multilingual documents | |
US20070214154A1 (en) | Data Storage And Retrieval | |
Fox et al. | SENTINEL: A multiple engine information retrieval and visualization system | |
JP4967133B2 (en) | Information acquisition apparatus, program and method thereof | |
Aas | A survey on personalized information filtering systems for the world wide web | |
Newby | Metric multidimensional information space | |
Mukherjea et al. | Automatically determining semantics for world wide web multimedia information retrieval | |
WO2002037328A2 (en) | Integrating search, classification, scoring and ranking | |
Hauck et al. | Concept-based searching and browsing: A geoscience experiment | |
Ye et al. | A visualised software library: nested self-organising maps for retrieving and browsing reusable software assets | |
Kohonen et al. | Self-organization of very large document collections | |
Duffing | Thematico-visual image retrieval: how to deal with partially indexed corpora | |
Wen et al. | A search engine for images |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: HARRIS CORPORATION, FLORIDA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:FOX, KEVIN L.;KNEPPER, MARGARET M.;KILLIAM, ROBERT A.;AND OTHERS;REEL/FRAME:013676/0748;SIGNING DATES FROM 19990604 TO 19990616 |
|
AS | Assignment |
Owner name: HARRIS CORPORATION, FLORIDA Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE THIRD ASSIGNOR'S NAME PREVIOUSLY RECORDED ON REEL 013676, FRAME 0748;ASSIGNORS:FOX, KEVIN L.;KNEPPER, MARGARET M.;KILLAM, ROBERT A.;AND OTHERS;REEL/FRAME:014172/0905;SIGNING DATES FROM 19990604 TO 19990616 |
|
CC | Certificate of correction | ||
FPAY | Fee payment |
Year of fee payment: 4 |
|
FPAY | Fee payment |
Year of fee payment: 8 |
|
REMI | Maintenance fee reminder mailed | ||
LAPS | Lapse for failure to pay maintenance fees | ||
STCH | Information on status: patent discontinuation |
Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
|
FP | Lapsed due to failure to pay maintenance fee |
Effective date: 20150603 |