CN110532492A - A kind of forum data management classification system and method - Google Patents

A kind of forum data management classification system and method Download PDF

Info

Publication number
CN110532492A
CN110532492A CN201910793205.2A CN201910793205A CN110532492A CN 110532492 A CN110532492 A CN 110532492A CN 201910793205 A CN201910793205 A CN 201910793205A CN 110532492 A CN110532492 A CN 110532492A
Authority
CN
China
Prior art keywords
data
forum
module
unit
original
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910793205.2A
Other languages
Chinese (zh)
Inventor
王斌
杨晓春
孙学磊
王�琦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Northeastern University China
Original Assignee
Northeastern University China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Northeastern University China filed Critical Northeastern University China
Priority to CN201910793205.2A priority Critical patent/CN110532492A/en
Publication of CN110532492A publication Critical patent/CN110532492A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1458Management of the backup or restore process
    • G06F11/1464Management of the backup or restore process for networked environments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/604Tools and structures for managing or administering access control systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Bioethics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Hardware Design (AREA)
  • Computer Security & Cryptography (AREA)
  • Computing Systems (AREA)
  • Quality & Reliability (AREA)
  • Automation & Control Theory (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The present invention provides a kind of forum data management classification system and method, is related to big data processing technology field.The system and method acquires original forum data from multiple data sources by data acquisition module in real time;And data preprocessing module is passed to after being cached original forum data via message queue unit and carries out data prediction;It is stored after carrying out building index from the forum data of the Json format exported in data preprocessing module;Intelligent classification is carried out to forum data in algorithm engine module, keeps every class forum data corresponding with a kind of administrator;Enter system by the forum administrator of authentication module to analyze the forum data in extent of competence.Present system and method can be with quick search to related content to be searched, to be lifted at the efficiency for inquiring data in forum by carrying out Classification Management to forum data.

Description

A kind of forum data management classification system and method
Technical field
The present invention relates to big data processing technology field more particularly to a kind of forum data management classification system and methods.
Background technique
Forum (Forums) is the incorporation in online commerce services.Forum may operate a library, one Chatroom, the advertisement inventory for allowing people to carry out real-time information interchange or even classify there are one it.Forum has been mutual at present Thing very universal, some people can deliver novel in forum or continuously update certain contents in networking, and also someone can discuss Altar initiates certain topic, then has a lot of other users that can the topic be commented on or be delivered the opinion of oneself.
With the high speed development of internet, the data volume that forum generates increases with exponential model, traditional one-of-a-kind system It can no longer meet user to the process demand of magnanimity forum information and for magnanimity forum data with relevant database Store problem analysis.
If belonging to different types according to the difference of its topic or content to these magnanimity forum datas, to these numbers According to Classification Management is carried out, the storage of magnanimity forum data and the query processing in later period can be facilitated, improve search efficiency.
Summary of the invention
The technical problem to be solved by the present invention is in view of the above shortcomings of the prior art, provide a kind of forum data management point Class system and method realizes the Classification Management to forum data.
In order to solve the above technical problems, the technical solution used in the present invention is: on the one hand, the present invention provides a kind of forum Data management categorizing system, including acquisition layer, storage process layer, visualization layer;
The acquisition layer includes data acquisition module, message queue unit, data preprocessing module;The data acquisition module Block is used to acquire original forum data in real time from multiple data sources;The message queue unit is for adopting data acquisition module The original forum data of collection is cached, when the processing speed of data preprocessing module is less than the processing speed of data acquisition module When, message queue plays buffer function, when data preprocessing module breaks down and restores, will read again from message queue Take original forum data;The data preprocessing module is used to carry out forum data original in message queue unit the mistake of data Filter, extraction, duplicate removal and type conversion, export the forum data of Json format;
The storage process layer includes data memory module, data analytical calculation module;The data memory module includes Distributed storage module and index construct module;Index construct module is used to the forum data that acquisition layer exports being indexed structure It builds;The forum data that the distributed storage module is used to export acquisition layer carries out persistence storage;
The data analytical calculation module includes query analysis module and algorithm engine module;Query analysis module includes complete Literary retrieval unit, aggregate query unit, graph tool unit and warning service unit;Full-text search unit for accurately inquire, Fuzzy query and regular expression inquiry are that the keyword inputted by forum administrator quickly inquires forum data Positioning;The statistics that aggregate query and graph tool unit is used to carry out forum data summarizes and report displaying;Alert service unit It notifies to give forum related management people in the form of mail or short message for realizing by the forum data for having sensitive or emphasis vocabulary Member;Algorithm engine module is used to carry out intelligent classification to forum data;
The visualization layer includes data visualization module, authentication module;Data visualization module is for retrieving knot The page presentation of fruit and aggregate query result, and the final analysis result of data analytical calculation module is passed through into Visual Chart Display;Authentication module is used to distinguish the processing authority of different forum administrators;
The original forum data acquired in data source is transmitted to message queue unit by the data acquisition module, described Data preprocessing module receives the original forum data exported by message queue unit, and original forum data is carried out type conversion Forum data afterwards is transmitted to index construct module, and the index construct module exports forum data to distributed storage mould Block, the distributed storage module export forum's forum data to algorithm engine module, and algorithm engine module is by forum data To distributed storage module, the distributed storage module can be updated forum data for output after classification;Forum administrator Full-text search unit, aggregate query unit, graph tool unit, warning service list are sent the request to by authentication module Any cell in member, receive the unit of request by the forum data needed in request extracted from data memory module into Row analysis, and it is sent to data visualization module.
On the other hand, the present invention also provides a kind of forum data management classification methods, pass through a kind of forum data management point Class system is realized, comprising the following steps:
Step 1 acquires original forum data from multiple data sources by data acquisition module in real time;
Step 2 is passed to number after being cached the original forum data in data acquisition module via message queue unit Data preprocess module carries out data prediction work;
The data prediction extracts data information from the original forum data of text formatting, title of such as posting, interior Appearance, time, the people that posts, people location of posting, and convert thereof into Json format;
The forum data of the Json format exported from data preprocessing module is carried out building index by step 3;Index structure After building, forum data is subjected to persistent storage by distributed storage module;
Step 4, the forum data exported from distributed storage module are passed to algorithm engine module, carry out to forum data Intelligent classification, different classes of forum data will be labeled with different class labels, then pass the forum data of whole tape label Enter and be updated in distributed storage module, a class label is corresponding with a kind of forum administrator;
On a personal computer by the software installation of visualization layer, forum administrator is entering this for step 5, forum administrator It needs to carry out Authority Verification by authentication module before system, each forum administrator can only be got under its corresponding authority Class label forum data;
Step 6 enters system by the forum administrator of authentication module to the forum data progress in extent of competence Analysis;Request is passed through full-text search unit, aggregate query unit, graph tool unit, warning service unit by forum administrator In any cell analyzed, the receiving request unit the data needed after analysis are extracted from data memory module Come, then the data are back to data visualization module, data visualization module constructs chart and shows forum for result is analyzed Administrator.
The extraction of field in original forum data is realized using regular expression matching technology in the step 2;
Index structure in the step 3 uses inverted index;
Distributed storage module in the step 3 use by NoSql database building at distributed storage cluster, often All at least there is a copy data in the cluster in forum data.
The beneficial effects of adopting the technical scheme are that a kind of forum data management classification provided by the invention System and method can be with quick search to related content to be searched, to be promoted by carrying out Classification Management to forum data The efficiency of data is inquired in forum.
Detailed description of the invention
Fig. 1 is a kind of structural block diagram of forum data management classification system provided in an embodiment of the present invention;
Fig. 2 is a kind of flow chart of forum data management classification method provided in an embodiment of the present invention;
Fig. 3 is data prediction result figure provided in an embodiment of the present invention;
Fig. 4 is the flow chart of distributed storage provided in an embodiment of the present invention;
Fig. 5 is the flow chart of distributed computing provided in an embodiment of the present invention.
Specific embodiment
With reference to the accompanying drawings and examples, specific embodiments of the present invention will be described in further detail.Implement below Example is not intended to limit the scope of the invention for illustrating the present invention.
In the present embodiment, a kind of forum data management classification system, as shown in Figure 1, include acquisition layer, storage process layer, Visualization layer;
The acquisition layer includes data acquisition module, message queue unit, data preprocessing module;The data acquisition module Block is used to acquire original forum data in real time from multiple data sources;The message queue unit is for adopting data acquisition module The original forum data of collection is cached, when the processing speed of data preprocessing module is less than the processing speed of data acquisition module When, message queue plays buffer function, when data preprocessing module breaks down and restores, will read again from message queue Take original forum data;The data preprocessing module is used to carry out forum data original in message queue unit the mistake of data Filter, extraction, duplicate removal and type conversion, export the forum data of Json format;
The storage process layer includes data memory module, data analytical calculation module;The data memory module includes Distributed storage module and index construct module;Index construct module is used to the forum data that acquisition layer exports being indexed structure It builds;The distributed storage module is used to carry out persistence storage to forum data;
The data analytical calculation module includes query analysis module and algorithm engine module;Query analysis module includes complete Literary retrieval unit, aggregate query unit, graph tool unit and warning service unit;Full-text search unit for accurately inquire, Fuzzy query and regular expression inquiry are fixed to the quick inquiry of forum data progress by the keyword of department user input Position;The statistics that aggregate query and graph tool unit is used to carry out forum data summarizes and report displaying, especially by pie chart, column The diagrammatic forms such as shape figure, bar chart, thermal map, table, label-cloud show data analysis result;Warning service unit is used for It realizes and notifies the forum data with sensitive or emphasis vocabulary to give forum related management personnel in the form of mail or short message;It calculates Method engine modules are used to carry out intelligent classification to forum data;The visualization layer includes data visualization module, authentication Module;Data visualization module leads to final analysis result for search result and the page presentation of aggregate query result Cross Visual Chart expression;Authentication module is used to distinguish the processing authority of forum administrator;
The original forum data acquired in data source is transmitted to message queue unit by the data acquisition module, described Data preprocessing module receives the original forum data exported by message queue unit, and original forum data is carried out type conversion Forum data afterwards is transmitted to index construct module, and the index construct module exports forum data to distributed storage mould Block, the distributed storage module export forum data to algorithm engine module, and algorithm engine module classifies forum data After export to distributed storage module, the distributed storage module can be updated forum data;Forum administrator passes through Authentication module sends the request to full-text search unit, aggregate query unit, graph tool unit, alerts in service unit Any cell, receive the unit of request and extract the forum data needed in request point from data memory module Analysis, and it is sent to data visualization module.
The present invention also provides a kind of forum data management classification methods, real by a kind of forum data management classification system It is existing, as shown in Figure 2, comprising the following steps:
Step 1 acquires original forum data from multiple data sources by data acquisition module in real time;Acquisition layer is installed On multiple forum data sources, the generally each computer of forum administrative staff;Because forum data is typically all by more Computer record, therefore the acquisition layer can read data in slave multiple stage computers in real time, parallel;Rationally it is arranged in acquisition layer The source of data and the whereabouts of data, the source of data include the source IP address for generating port, data of data, the whereabouts of data IP address and port numbers including message queue;
Step 2 is passed to number after being cached the original forum data in data acquisition module via message queue unit Data preprocess module carries out data prediction work;
Data field needed for the customized setting of data prediction need of work user, from the forum data of text formatting Data information is extracted, title of such as posting, content, time, the people that posts, people location of posting will using regular expression matching The forum data of original character string type is converted to Json type;Data preprocessing module is for subsequent index construct sum number Prepare according to persistent storage because the data of Json format be indexed, store and network transmit when it is all more convenient;In advance Processing result is as shown in Figure 3;
The forum data of the Json format exported from data preprocessing module is carried out building index by step 3;This implementation In example, bottom uses index structure of the inverted index as forum data, to facilitate the realization of subsequent full-text search function, index After building, forum data is subjected to persistent storage by distributed storage module;The use of distributed storage module by NoSql database building at distributed storage cluster need the host node of user's cluster-specific in the build process of cluster And back end, host node is mainly responsible for the update and management of cluster state, and back end is mainly responsible for and carries out depositing for data Storage and specific query analysis, when carrying out data storage, the copy of portion can be no less than for the storage of every part of data with prevent by Loss of data caused by the reasons such as mechanical disorder;Distributed storage cluster management is as shown in Figure 4, wherein master node is responsible for Manage and maintain other nodes in cluster;Data node be data memory node, can on different machines storing data and its Copy;QnThe data stored in fragment are PnThe copy of the data stored in fragment, wherein n >=1;
Step 4, the forum data exported from distributed storage module are passed to algorithm engine module, carry out to forum data Intelligent classification stamps correctly different classes of forum data using the domain knowledge base constructed and in conjunction with machine learning algorithm Tag along sort, then will the forum data of whole tape label be passed to distributed storage module in be updated, a class label It is corresponding with a kind of forum administrator;
On a personal computer by the software installation of visualization layer, forum administrator is when logging in for step 5, forum administrator It needs to carry out Authority Verification by authentication module, each forum administrator can only be got with administrator's generic The forum data of label;Such as the administrator of responsible traffic class management can only divide the forum data with " traffic " label Analysis;Computer administrator can only carry out query analysis to the forum data with " computer " label.Authentication module Major function is just to discriminate between the processing authority of each department.
Step 6 enters system by the forum administrator of authentication module to the forum data progress in extent of competence Analysis;Forum administrator will request by full-text search unit, aggregate query unit, any cell in graph tool unit into The unit of row analysis, receiving request extracts the data needed after analysis from data memory module, then by the data It is back to data visualization module, data visualization module constructs chart and shows forum administrator for result is analyzed, each to manage Member can get corresponding data when carrying out data analysis from distributed memory system first, then carry out distributed meter It calculates to obtain the analysis result of data.In distributed treatment cluster, the node for receiving user's request is coordinator node, it Request can be sent to simultaneously and store all calculate node X of the data in cluster, X is calculate node number;In each node On individually calculated, then result is carried out to summarize calculating, entire query analysis result is finally returned into client;Entirely Process is as shown in figure 5, End-Customer end can construct corresponding Visual Report Forms according to the data of return.
Finally, it should be noted that the above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations;Although Present invention has been described in detail with reference to the aforementioned embodiments, those skilled in the art should understand that: it still may be used To modify to technical solution documented by previous embodiment, or some or all of the technical features are equal Replacement;And these are modified or replaceed, model defined by the claims in the present invention that it does not separate the essence of the corresponding technical solution It encloses.

Claims (5)

1. a kind of forum data management classification system, it is characterised in that: including acquisition layer, storage process layer, visualization layer;
The acquisition layer includes data acquisition module, message queue unit, data preprocessing module;The data acquisition module is used In acquiring original forum data in real time from multiple data sources;The message queue unit is used for data collecting module collected Original forum data is cached, when the processing speed of data preprocessing module is less than the processing speed of data acquisition module, Message queue plays buffer function, when data preprocessing module breaks down and restores, will re-read from message queue Original forum data;The data preprocessing module is used to carry out forum data original in message queue unit the mistake of data Filter, extraction, duplicate removal and type conversion, export the forum data of Json format;
The storage process layer includes data memory module, data analytical calculation module;The data memory module includes distribution Formula storage module and index construct module;Index construct module is used to the forum data that acquisition layer exports being indexed building; The forum data that the distributed storage module is used to export acquisition layer carries out persistence storage;
The data analytical calculation module includes query analysis module and algorithm engine module;Query analysis module includes that full text is examined Cable elements, aggregate query unit, graph tool unit and warning service unit;Full-text search unit is for accurately inquiring, obscuring Inquiry and regular expression are inquired, and are to carry out quickly inquiry to forum data by the keyword of forum administrator input to determine Position;The statistics that aggregate query and graph tool unit is used to carry out forum data summarizes and report displaying;Service unit is alerted to use It notifies the forum data with sensitive or emphasis vocabulary to give forum related management personnel in the form of mail or short message in realizing; Algorithm engine module is used to carry out intelligent classification to forum data;
The visualization layer includes data visualization module, authentication module;Data visualization module for search result with And the page presentation of aggregate query result, and the final analysis result of data analytical calculation module is shown by Visual Chart Show;Authentication module is used to distinguish the processing authority of different forum administrators;
The original forum data acquired in data source is transmitted to message queue unit, the data by the data acquisition module Preprocessing module receives the original forum data exported by message queue unit, after original forum data is carried out type conversion Forum data is transmitted to index construct module, and the index construct module exports forum data to distributed storage module, institute It states distributed storage module to export forum's forum data to algorithm engine module, after algorithm engine module classifies forum data To distributed storage module, the distributed storage module can be updated forum data for output;Forum administrator passes through body Part authentication module sends the request to full-text search unit, aggregate query unit, graph tool unit, alerts in service unit Any cell receives the unit of request and extracts the forum data needed in request point from data memory module Analysis, and it is sent to data visualization module.
2. a kind of forum data management classification method, real based on a kind of forum data management classification system described in claim 1 It is existing, comprising the following steps:
Step 1 acquires original forum data from multiple data sources by data acquisition module in real time;
Step 2, incoming data are pre- after being cached the original forum data in data acquisition module via message queue unit Processing module carries out data prediction work;
The data prediction extracts data information from the original forum data of text formatting, title of such as posting, content, when Between, the people that posts, people location of posting, and convert thereof into Json format;
The forum data of the Json format exported from data preprocessing module is carried out building index by step 3;Index construct is complete Forum data is carried out persistent storage by distributed storage module by Bi Hou;
Step 4, the forum data exported from distributed storage module are passed to algorithm engine module, carry out intelligence to forum data Classification, different classes of forum data will be labeled with different class labels, then by incoming point of the forum data of whole tape label It is updated in cloth storage module, a class label is corresponding with a kind of forum administrator;
On a personal computer by the software installation of visualization layer, forum administrator is entering this system for step 5, forum administrator It needs to carry out Authority Verification by authentication module before, each forum administrator can only get the class under its corresponding authority The forum data of distinguishing label;
Step 6 enters system by the forum administrator of authentication module and analyzes the forum data in extent of competence; Request is passed through appointing in full-text search unit, aggregate query unit, graph tool unit, warning service unit by forum administrator Unit one is analyzed, and the unit of receiving request extracts the data needed after analysis from data memory module, then The data are back to data visualization module, data visualization module constructs chart and shows forum to manage for result is analyzed Member.
3. a kind of forum data management classification method according to claim 2, it is characterised in that: to original in the step 2 The extraction of field is realized using regular expression matching technology in forum data.
4. a kind of forum data management classification method according to claim 2, it is characterised in that: the index in the step 3 Structure uses inverted index.
5. a kind of forum data management classification method according to claim 2, it is characterised in that: the distribution in the step 3 Formula storage module use by NoSql database building at distributed storage cluster, every forum data is in the cluster all at least There are a copy datas.
CN201910793205.2A 2019-08-27 2019-08-27 A kind of forum data management classification system and method Pending CN110532492A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910793205.2A CN110532492A (en) 2019-08-27 2019-08-27 A kind of forum data management classification system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910793205.2A CN110532492A (en) 2019-08-27 2019-08-27 A kind of forum data management classification system and method

Publications (1)

Publication Number Publication Date
CN110532492A true CN110532492A (en) 2019-12-03

Family

ID=68664378

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910793205.2A Pending CN110532492A (en) 2019-08-27 2019-08-27 A kind of forum data management classification system and method

Country Status (1)

Country Link
CN (1) CN110532492A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111125446A (en) * 2019-12-20 2020-05-08 北京睦合达信息技术股份有限公司 Data management platform and data management method
CN111858783A (en) * 2020-07-10 2020-10-30 脑谷人工智能研究院(南京)有限公司 Data induction and arrangement platform based on big data intelligent analysis
CN112181940A (en) * 2020-08-25 2021-01-05 天津农学院 Method for constructing national industrial and commercial big data processing system
CN112187953A (en) * 2020-10-13 2021-01-05 南开大学 JSON-based gene ontology mapping system and method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103544255A (en) * 2013-10-15 2014-01-29 常州大学 Text semantic relativity based network public opinion information analysis method
CN108959352A (en) * 2018-04-27 2018-12-07 北京天机数测数据科技有限公司 Time-space data analysis platform and processing method based on time and Spatial Data Model

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103544255A (en) * 2013-10-15 2014-01-29 常州大学 Text semantic relativity based network public opinion information analysis method
CN108959352A (en) * 2018-04-27 2018-12-07 北京天机数测数据科技有限公司 Time-space data analysis platform and processing method based on time and Spatial Data Model

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
李强: "《云计算及其应用》", 30 April 2018, 武汉大学出版社 *
李玮洁: "校园网舆情监测平台与网络群体演化的研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111125446A (en) * 2019-12-20 2020-05-08 北京睦合达信息技术股份有限公司 Data management platform and data management method
CN111858783A (en) * 2020-07-10 2020-10-30 脑谷人工智能研究院(南京)有限公司 Data induction and arrangement platform based on big data intelligent analysis
CN112181940A (en) * 2020-08-25 2021-01-05 天津农学院 Method for constructing national industrial and commercial big data processing system
CN112187953A (en) * 2020-10-13 2021-01-05 南开大学 JSON-based gene ontology mapping system and method
CN112187953B (en) * 2020-10-13 2022-05-03 南开大学 JSON-based gene ontology mapping system and method

Similar Documents

Publication Publication Date Title
CN110825882B (en) Knowledge graph-based information system management method
CN111708773B (en) Multi-source scientific and creative resource data fusion method
CN110532492A (en) A kind of forum data management classification system and method
CN105183869B (en) Building knowledge mapping database and its construction method
CN108776671A (en) A kind of network public sentiment monitoring system and method
JP2015524962A (en) System and method for automatically generating information-rich content from multiple microblogs, each microblog containing only sparse information
CN111708774B (en) Industry analytic system based on big data
CN105824959A (en) Public opinion monitoring method and system
CN103605651A (en) Data processing showing method based on on-line analytical processing (OLAP) multi-dimensional analysis
CN111460252A (en) Automatic search engine method and system based on network public opinion analysis
CN113505242A (en) Method and system for automatically embedding knowledge graph
CN111737421A (en) Intellectual property big data information retrieval system and storage medium
CN116384889A (en) Intelligent analysis method for information big data based on natural language processing technology
CN110543477B (en) Label construction system and method
Zhang Application of data mining technology in digital library.
Lande et al. A system for analysis of big data from social media
CN112256880A (en) Text recognition method and device, storage medium and electronic equipment
CN112860899B (en) Label generation method and device, computer equipment and computer readable storage medium
KR101327546B1 (en) System for structuring technology information and method for producing roadmap using the same
CN107330076A (en) A kind of network public sentiment information display systems and method
CN102902705A (en) Locating ambiguities in data
CN111353085A (en) Cloud mining network public opinion analysis method based on feature model
CN116701771B (en) Digital library retrieval and resource sharing system based on cloud computing
CN112506930B (en) Data insight system based on machine learning technology
CN115296892A (en) Data information service system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191203

RJ01 Rejection of invention patent application after publication