Use of natural language processing tools for the analysis and development of scientific engineering articles
DOI:
https://doi.org/10.15381/risi.v17i2.29906Keywords:
Scientific articles, key bigrams, key phrases, natural language, text summarizing, text similarityAbstract
It is important to have tools that allow us to extract useful information from scientific texts without having to read all their content. For example, when it is necessary to determine the topics covered in scientific articles, establish an author's line of research through the review of their publications, or design the summary of an article and its keywords. So, the objective of this research is to use natural language processing techniques to extract useful information from scientific engineering articles. The twenty-two articles published by an author are taken to use them as the base document for the analysis, which is divided into: general analysis of all the articles, and particular analysis per published article. As a result of the first, the key words and bigrams were obtained, with the words “data”, “energy”, and “model” being the most frequent, and the bigrams “solar photovoltaic”, “explanatory variables”, and “renewable energies”, the most important ones. From the second analysis, it was obtained that the hierarchical bigrams for each article represent a good approximation of their keywords, and there is also a high similarity between the summaries obtained by applying natural language techniques to the articles published during the year 2024 and their summaries, being the one obtained with GPT2 presented the highest level of similarity. With the key phrases obtained with SGRank, the topic of the respective articles could be determined.
Downloads
Downloads
Published
Issue
Section
License
Copyright (c) 2024 César Aristóteles Yajure Ramírez

This work is licensed under a Creative Commons Attribution 4.0 International License.
AUTHORS RETAIN THEIR RIGHTS:
a. Authors retain their trade mark rights and patent, and also on any process or procedure described in the article.
b. Authors retain their right to share, copy, distribute, perform and publicly communicate their article (eg, to place their article in an institutional repository or publish it in a book), with an acknowledgment of its initial publication in the Revista de investigación de Sistemas e Informática.
c. Authors retain theirs right to make a subsequent publication of their work, to use the article or any part thereof (eg a compilation of his papers, lecture notes, thesis, or a book), always indicating its initial publication in the Revista de investigación de Sistemas e Informática (the originator of the work, journal, volume, number and date).