
لینک کوتاه : https://en.magicfile.ir/?p=2257
Download source and keyword extraction code with entropy difference between internal and external mode with Visual Basic .NET
We are trying to propose a new criterion for evaluating and ranking the relevance of words in a text. This method uses the Shannon entropy difference between the inner and outer state, which refers to the fact that related words significantly reflect the author's intention to write, for example, their occurrence is modulated by the author's purpose, while That irrelevant words are randomly distributed in the text. . Using the origin of the species by Charles Darwin as a representative text sample, our detector performance is demonstrated and compared with previous suggestions. Since the reference text of the "profession" is all the writings, books, articles, etc. of an author, there is no need for his collected works. Our approach is especially suitable for individual documents where no prior information is available.
theme
One of the most important differences between texts written by humans and typing monkeys is the existence of general meaningful themes in human written texts. Keyword / Extraction and ranking of relevant words is the starting point of vital tasks such as subject identification and tracking in written texts. They are widely used in information extraction, selection and retrieval.
Internal mode and external mode in word type events in the text.
Here we give a brief introduction to the principle of the algorithm, it can help you better understand and use the dll and software. The idea of internal-external mode is based on the general idea that very important words tend to be modified by the author's intention, while common words are essentially evenly distributed throughout the text. Thus the intrinsic state of statistical properties indicates the appearance of a related word in a subject, for example, the statistical properties of clustering in each subject. At the same time, the external state shows the statistical characteristics of the disappearance of a word clustering throughout a written text and determines the relationship between the occurrence of clustering of words in a subject and the author's writing style. As shown in the figure. 2. The distance between two words that are consecutive repetitions is defined as di = ti + 1 - ti. Ti is the place of the word in the text. If _di The difference in the arrival time of di_ belongs to the intrinsic state. In other words, a given event of the word is part of an intrinsic state if its local isolation is less than the average waiting time. Let dI = {di | di } Union set for all di Is shown in the lower left figure. 2. We found through experiments that the keyword that appears in the article indicates the properties of the materials. So the entropy of the inner state is large while the entropy of the outer state is small. The general words are evenly distributed in the article, the distance between two consecutive words seems to change slightly, so the entropy difference between the inner and outer states is small. This way you can use the value of E, which is the entropy difference between the inner and outer state, to extract the keywords. In practice, to eliminate randomly distributed words and boundary conditions, we use the boundary conditions _C ~ c ~ and the normalized entropy difference _E_nor as final indicators. If you want to know more details of this algorithm.
Highlights
- We propose a new criterion for evaluating and ranking the relevance of words in a text.
- This metric uses the Shannon entropy difference between the inner and outer states.
- We believe this is a new result in keyword extraction and ranking.
- Our approach is especially suitable for individual documents where no prior information is available.
Content tags
Extract keywords from the text , Keyword extraction , Extract keywords , Extract keywords from Persian texts , Keyword with entropy difference , Keyword with entropy difference , Keyword with entropic difference ,Files that you may need

Download the source of the file converter robot

Source and code of ice cream sales management system with coding in Visual Basic .NB VB.NET environment

Download the source and code of the Instagram robot software with C #

Download the plug-in payment of the Basic Four Android application with the Pi port

Source and project code of the curriculum evaluation system in VB.NET online with mysql database
