Method research on blockchain data
Short-answer questions
Describe the technical problem solved by the present invention.
What are the advantages of the method for displaying blockchain data proposed by the present invention over the existing technology?
Explain the functions and roles of the content security module and explain its possible deployment methods.
What are the ways to update the filtering rules in the content security module? Compare their advantages and disadvantages.
Combined with specific examples, explain how to set different filtering rules according to the user node type.
Combined with specific examples, explain how to set different filtering rules according to the sub-blockchain.
Explain the difference between illegal word lists and illegal sentence lists, and explain their respective application scenarios.
Compare the advantages and disadvantages of sentence vector construction methods based on fixed value dimension supplementation and based on predetermined sentence positions.
Explain the concepts of word vectors and sentence vectors, and explain their applications in the present invention.
In addition to using illegal word lists and illegal sentence lists, what other ways can be used to filter illegal content?
Answers to the short-answer questions
The present invention addresses a shortcoming of existing blockchain browsers: they display queried blockchain data directly and cannot block illegal content, which creates regulatory risk. By introducing a content security module and filtering rules, illegal content in the displayed blockchain data is blocked automatically, which makes blockchain supervision more secure.
Compared with the prior art, the present invention automatically shields illegal content in blockchain data, improves the security of blockchain supervision, prevents the spread of illegal content, and promotes the compliant application of blockchain technology.
The content security module is responsible for filtering the blockchain data displayed by the blockchain browser and removing illegal content. It can be deployed on a security server outside the terminal, or it can be a module inside the terminal but outside the blockchain browser.
The filtering rules in the content security module can be updated regularly (actively or passively) or irregularly. Irregular updates keep the rules current in real time but place a larger load on the network; regular updates reduce the network load but respond less quickly to rule changes.
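The trade-off between regular and irregular updates can be sketched as a small rule cache. This is a minimal illustration, not the invention's implementation: the class name `RuleCache`, the `fetch` callable, and the pull-based periodic refresh are all assumptions introduced here.

```python
class RuleCache:
    """Holds filtering rules locally (hypothetical helper, for illustration only)."""

    def __init__(self, fetch, interval_seconds):
        self._fetch = fetch                # assumed callable returning the latest rules
        self._interval = interval_seconds  # assumed refresh period for regular updates
        self._last_update = None
        self.rules = set()

    def refresh_if_due(self, now):
        """Regular update: pull the rules only when the interval has elapsed."""
        if self._last_update is None or now - self._last_update >= self._interval:
            self.rules = set(self._fetch())
            self._last_update = now
            return True
        return False

    def on_push(self, new_rules, now):
        """Irregular update: apply rules as soon as they arrive."""
        self.rules = set(new_rules)
        self._last_update = now
```

With a long interval, `refresh_if_due` contacts the rule source rarely but may serve stale rules in between; `on_push` keeps the rules current at the cost of a message for every change.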
For example, the filtering rules of the public security system terminal can allow some sentences describing the human body, while the filtering rules of the duty-free zone terminal can allow the words "no tax" to appear.
For example, the sub-blockchain that records electronic invoice information can shield nouns related to the human body, while the sub-blockchain that records hospital diagnoses can allow some nouns related to the human body to appear.
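The two examples above, rules selected by user node type and rules selected by sub-blockchain, can be sketched as simple lookups. All node type names, sub-blockchain identifiers, and word lists below are hypothetical placeholders, and the asterisk masking is only one assumed way of shielding a word.

```python
# Hypothetical rule tables; every name and word below is illustrative only.
DEFAULT_ILLEGAL_WORDS = {"no tax", "torso"}

ILLEGAL_WORDS_BY_NODE_TYPE = {
    "public_security": {"no tax"},  # body-related wording is allowed here
    "duty_free_zone": {"torso"},    # "no tax" is allowed here
}

ILLEGAL_WORDS_BY_SUBCHAIN = {
    "electronic_invoice": {"no tax", "torso"},  # body-related nouns are shielded
    "hospital_diagnosis": {"no tax"},           # body-related nouns are allowed
}

def rules_for_node_type(node_type):
    """Pick the illegal word list for a user node type, falling back to the default."""
    return ILLEGAL_WORDS_BY_NODE_TYPE.get(node_type, DEFAULT_ILLEGAL_WORDS)

def rules_for_subchain(subchain_id):
    """Pick the illegal word list for a sub-blockchain identifier."""
    return ILLEGAL_WORDS_BY_SUBCHAIN.get(subchain_id, DEFAULT_ILLEGAL_WORDS)

def mask(text, illegal_words):
    """Replace each illegal word with asterisks (case-sensitive substring match)."""
    for word in sorted(illegal_words, key=len, reverse=True):
        text = text.replace(word, "*" * len(word))
    return text
```

For instance, `mask("torso injury, no tax", rules_for_node_type("duty_free_zone"))` shields only the body-related word, while the same text requested from the hypothetical `hospital_diagnosis` sub-blockchain shields only "no tax".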
The illegal word list suits filtering individual sensitive words, while the illegal sentence list suits filtering sentences with specific semantics, which allows illegal content to be identified and shielded more accurately.
The fixed-value dimension supplementation method is simple to implement but may introduce irrelevant information; the predetermined sentence position method is more accurate but depends on grammatical analysis.
A word vector represents the semantics of a single word as a vector, and a sentence vector represents the semantics of an entire sentence as a vector. In the present invention, word vectors are used to construct sentence vectors, and the semantic similarity of sentences is determined by calculating the cosine distance between their sentence vectors, so that illegal content can be filtered.
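The two construction methods can be contrasted in a short sketch. It rests on an assumed reading of both: that a sentence vector is the concatenation of its word vectors, that "fixed value supplementation" means padding to a fixed length with a constant, and that "predetermined sentence positions" means reserving a slot per grammatical role. The toy vectors, `MAX_WORDS`, and `SLOTS` are hypothetical.

```python
# Toy 2-dimensional word vectors; real embeddings would come from a trained model.
WORD_VECTORS = {
    "pay": [0.9, 0.1],
    "no": [0.2, 0.8],
    "tax": [0.7, 0.6],
}
DIM = 2
MAX_WORDS = 4                               # assumed fixed sentence length
SLOTS = ("subject", "predicate", "object")  # assumed grammatical positions

def pad_sentence_vector(words, fill=0.0):
    """Concatenate word vectors, then pad with a fixed value to a fixed length."""
    vec = []
    for word in words[:MAX_WORDS]:
        vec.extend(WORD_VECTORS.get(word, [fill] * DIM))
    vec.extend([fill] * (MAX_WORDS * DIM - len(vec)))
    return vec

def slot_sentence_vector(roles, fill=0.0):
    """Place each word's vector at the position reserved for its grammatical role."""
    vec = []
    for slot in SLOTS:
        word = roles.get(slot)  # None when the sentence has no word in this role
        vec.extend(WORD_VECTORS.get(word, [fill] * DIM))
    return vec
```

The padded form needs no parsing, but the fill values carry no meaning and shift depending on sentence length. The slot form keeps comparable words aligned, but something upstream must first assign each word its role.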
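The similarity check described here can be sketched with a plain cosine distance, defined as one minus the cosine of the angle between two vectors. The threshold value, the function names, and the treatment of zero vectors are assumptions made for illustration.

```python
import math

def cosine_distance(a, b):
    """Return 1 - cos(angle between a and b); smaller means more similar."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 1.0  # assumption: a zero vector is treated as dissimilar
    return 1.0 - dot / (norm_a * norm_b)

def is_illegal(sentence_vec, illegal_sentence_vecs, threshold=0.1):
    """Flag a sentence whose vector is close to any vector on the illegal list."""
    return any(
        cosine_distance(sentence_vec, v) <= threshold
        for v in illegal_sentence_vecs
    )
```

A sentence is flagged when its vector lies within the assumed threshold of any entry on the illegal sentence list, so paraphrases with similar vectors can be caught even when no exact word matches.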
In addition to using lists, filtering can follow preset rules, such as limiting the total number of words or identifying specific patterns, and can be combined with the illegal data block identifiers or illegal transaction information identifiers provided by regulatory authorities.
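These additional filters can be sketched together. The block layout (a dict with `digest`, `height`, and `transactions`), the placeholder identifiers, the word limit, and the 11-digit pattern are all hypothetical choices made only to illustrate the idea.

```python
import re

# Hypothetical identifiers supplied by a regulator; the values are placeholders.
ILLEGAL_BLOCK_DIGESTS = {"aaa111"}
ILLEGAL_BLOCK_HEIGHTS = {1024}
ILLEGAL_TX_DIGESTS = {"tx999"}

MAX_WORDS = 50                               # assumed word-count limit
DIGIT_PATTERN = re.compile(r"\b\d{11}\b")    # assumed example of a "specific pattern"

def filter_block(block):
    """Return a displayable copy of the block, or None if the whole block is blocked."""
    if block["digest"] in ILLEGAL_BLOCK_DIGESTS or block["height"] in ILLEGAL_BLOCK_HEIGHTS:
        return None
    shown = []
    for tx in block["transactions"]:
        if tx["digest"] in ILLEGAL_TX_DIGESTS:
            continue  # transaction flagged by identifier
        text = tx["text"]
        if len(text.split()) > MAX_WORDS:
            continue  # transaction text exceeds the word limit
        text = DIGIT_PATTERN.sub("[blocked]", text)
        shown.append({"digest": tx["digest"], "text": text})
    return {"digest": block["digest"], "height": block["height"], "transactions": shown}
```

Identifier checks run first because they need no text analysis; the word limit and pattern rule then apply only to transactions that survive them.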
Essay topics
Analyze the advantages and disadvantages of the method for displaying blockchain data proposed in the present invention in different application scenarios, and explore its impact on the development of blockchain technology.
Describe in detail the design and implementation of the content security module, including the definition, storage, and update of filtering rules and the design and optimization of the filtering algorithms.
Explore how to combine artificial intelligence technologies, such as natural language processing and machine learning, to improve the accuracy and efficiency of illegal content filtering.
Analyze the challenges faced by the present invention in protecting user privacy, and propose corresponding solutions.
Explore how to strike a balance between blockchain data security and regulatory requirements, and the role of the present invention in achieving it.
Glossary
Term: Definition
Blockchain data: Data stored on the blockchain, which can be data blocks or transaction information.
Content security module: A module that filters the blockchain data displayed by the blockchain browser to remove illegal content.
Filtering rules: Rules used to filter illegal content in blockchain data.
User node: A terminal that sends blockchain query requests.
User node type: The classification of user nodes according to the business they process.
Sub-blockchain: One of the sub-chains that make up the blockchain; the data blocks on each sub-chain are connected to each other.
Sub-blockchain identifier: A marker, such as the number of the sub-blockchain, that distinguishes different sub-blockchains.
Illegal word list: A preset list of words that should be blocked whenever they appear in the blockchain data.
Illegal sentence list: A preset list of sentences that should be blocked whenever they appear in the blockchain data.
Word vector: A vector used to represent a word. The closer the meanings of two words, the smaller the cosine distance between their word vectors.
Sentence vector: A vector used to represent a sentence. The closer the meanings of two sentences, the smaller the cosine distance between their sentence vectors.
Illegal data block identifier: An identifier of an illegal data block known to the regulatory authorities, such as the digest or height of the data block.
Illegal transaction information identifier: An identifier of illegal transaction information known to the regulatory authorities, such as a digest of the transaction information.