Using Generative Artificial Intelligence and Keyword Analysis to Support the Design of Research Projects in Higher Education
DOI:
https://doi.org/10.17398/1695-288X.24.2.87Keywords:
Co-word analysis, Generative Artificial Intelligence, Research Scope, Research Project, Higher EducationAbstract
At different levels of university education, students face challenges in locating bibliographic references that align with their research projects, requiring a balance between specificity and breadth in identifying relevant information. Studies show that metacognitive skills, such as keyword refinement and navigation through organized categories, enhance the efficiency and quality of search processes. In light of these challenges, this article investigates the combined use of co-word analysis and generative artificial intelligence (GenAI) as a strategy to support the definition of research project scopes in higher education. The study adopted a mixed-methods approach, involving 23 undergraduate and graduate students, who responded to two forms interspersed with the delivery of a personalized report. This report was produced through searches in bibliographic databases integrated with GenAI analysis and addressed aspects such as topic adequacy, keyword suggestions, search strategies, and a glossary of concepts. The results indicated that the procedure stimulated reflective processes among students, with the majority considering the intervention useful, particularly in expanding thematic understanding and refining the definition of keywords. It also became evident that the effectiveness of the method depends on the clarity of the initial information provided by the participants and the maturity level of their research projects.
Downloads
References
Bardin, L. (2016). Análise de conteúdo. São Paulo: Edições 70.
Bramer, W. M., Giustini, D., de Jonge, G. B., Holland, L., & Bekhuis, T. (2018). Evaluation of a new method for librarian-mediated literature searches for systematic reviews. Research Synthesis Methods, 9(4), 510–520. https://doi.org/10.1002/jrsm.1279.
Brasil. Coordenação de Aperfeiçoamento de Pessoal de Nível Superior. (2025). Portal de Periódicos da CAPES. https://www.periodicos.capes.gov.br/.
Callon, M., Courtial, J.-P., Turner, W. A., & Bauin, S. (1983). From translations to problematic networks: An introduction to co-word analysis. Social Science Information, 22(2), 191–235. https://doi.org/10.1177/053901883022002003.
Chang, Y., Xu, C., Wang, W., & Yan, X. (2024). A survey on evaluation of large language models. ACM Transactions on Intelligent Systems and Technology, 15(3), 1. https://doi.org/10.1145/3641289.
Cooksey, R., & McDonald, G. (2019). How do I scope, shape and configure my research project? In Surviving and thriving in postgraduate research (pp. 417–501). Springer. https://doi.org/10.1007/978-981-13-7747-1_12.
Dagdelen, J., Kononova, O., Horton, M., & Persson, K. A. (2024). Structured information extraction from scientific text with large language models. Nature Communications, 15, 1418. https://doi.org/10.1038/s41467-024-45563-x.
Delaney, A., & Tamás, P. (2017). Searching for evidence or approval? A commentary on database search in systematic reviews and alternative information retrieval methodologies. Research Synthesis Methods, 9(1), 124–131. https://doi.org/10.1002/jrsm.1282.
Dervin, B. (1998). Sense-making theory and practice: An overview of user interests in knowledge seeking and use. Journal of Knowledge Management, 2(2), 36–46. https://doi.org/10.1108/13673279810249369.
Ellis, D. (1989). A behavioural approach to information retrieval system design. Journal of Documentation, 45(3), 171–212. https://doi.org/10.1108/EB026843.
Grames, E. M., Stillman, A. N., Tingley, M. W., & Elphick, C. S. (2019). An automated approach to identifying search terms for systematic reviews using keyword co-occurrence networks. Methods in Ecology and Evolution, 10(10), 1645–1654. https://doi.org/10.1111/2041-210X.13268.
Han, S., Zhang, C., Wang, J., & Wang, X. (2024). A review of large language models: Fundamental architectures, key technological evolutions, interdisciplinary technologies integration, optimization and compression techniques, applications, and challenges. Electronics, 13(24), 5040. https://doi.org/10.3390/electronics13245040.
Klarin, A. (2024). How to conduct a bibliometric content analysis: Guidelines and contributions of content co-occurrence or co-word literature reviews. International Journal of Consumer Studies, 48(2), e13031. https://doi.org/10.1111/ijcs.13031.
Kuhlthau, C. C. (2004). Seeking meaning: A process approach to library and information services (2nd ed.). Libraries Unlimited.
Lee, W., Park, K., Kim, S., & Kim, Y. (2023). Toward keyword generation through large language models. In Proceedings of the International Conference on Intelligent User Interfaces (pp. 37–40). ACM. https://doi.org/10.1145/3581754.3584126.
Lei, G., Docherty, R., & Cooper, S. J. (2024). Materials science in the era of large language models: A perspective. Digital Discovery, 3(7), 1257. https://doi.org/10.1039/d4dd00074a.
Liu, M. C., Chang, H. C., Wu, J. S., & Huang, Y. C. (2013). Fostering learner’s metacognitive skills of keyword reformulation in image seeking by location-based hierarchical navigation. Educational Technology Research and Development, 61, 233–254. https://doi.org/10.1007/s11423-012-9280-3.
Maragheh, R. Y., Amini, M., & Aghajani, M. (2023). LLM-TAKE: Theme-aware keyword extraction using large language models. In Proceedings of the International Conference on Big Data (pp. 4318–4324). IEEE. https://doi.org/10.1109/BigData59044.2023.10386476.
Mbuagbaw, L., Rochwerg, B., Jaeschke, R., & Guyatt, G. (2020). A tutorial on methodological studies: The what, when, how and why. BMC Medical Research Methodology, 20(226). https://doi.org/10.1186/s12874-020-01107-7.
Peres, R., Matzler, K., Schlegelmilch, B. B., & Vom Brocke, J. (2023). On ChatGPT and beyond: How generative artificial intelligence may affect research, teaching, and practice. International Journal of Research in Marketing, 40(2), 269–275. https://doi.org/10.1016/j.ijresmar.2023.03.001.
Pessin, V. Z., Yamane, L. H., & Siman, R. R. (2022). Smart bibliometrics: An integrated method of science mapping and bibliometric analysis. Scientometrics, 127, 3695–3718. https://doi.org/10.1007/s11192-022-04406-6.
Peters, M. D. J., Godfrey, C. M., McInerney, P., Munn, Z., Tricco, A. C., & Khalil, H. (2020). Updated methodological guidance for the conduct of scoping reviews. JBI Evidence Synthesis, 18(10), 2119–2126. https://doi.org/10.11124/JBIES-20-00167.
Pęzik, P., Mikołajczyk, A., Wawrzyński, A., Żarnecki, F., & Ogrodniczuk, M. (2023). Transferable keyword extraction and generation with text-to-text language models. In J. Mikyška et al. (Eds.), Computational Science – ICCS 2023 (pp. 398–405). Springer. https://doi.org/10.1007/978-3-031-36021-3_42.
Pisu, A., Alessandri, L., Geraci, F., & Motta, E. (2024). Leveraging language models for generating ontologies of research topics. CEUR Workshop Proceedings. https://ceur-ws.org/Vol-3747/text2kg_paper6.pdf.
Saldaña, J. (2015). The coding manual for qualitative researchers (2nd ed.). Sage.
Scopus: Abstract and citation database. (2025). Elsevier. https://www.scopus.com/.
Schwartzman, S. (2022). Pesquisa e pós-graduação no Brasil: Duas faces da mesma moeda? Estudos Avançados, 36(104), 227–254. https://doi.org/10.1590/s0103-4014.2022.36104.011.
Tricco, A. C., Lillie, E., Zarin, W., O'Brien, K. K., Colquhoun, H., Levac, D., & Moher, D. (2018). PRISMA extension for scoping reviews (PRISMA-ScR): Checklist and explanation. Annals of Internal Medicine, 169(7), 467–473. https://doi.org/10.7326/M18-0850.
Tu, Y. W., Shih, M., & Tsai, C. C. (2008). Eighth graders’ web searching strategies and outcomes: The role of task types, web experiences and epistemological beliefs. Computers & Education, 51(3), 1142–1153. https://doi.org/10.1016/j.compedu.2007.11.003.
Wilson, T. D. (1981). On user studies and information needs. Journal of Documentation, 37(1), 3–15. https://doi.org/10.1108/eb026702.
Wilson, T. D. (1999). Models in information behaviour research. Journal of Documentation, 55(3), 249–270. https://doi.org/10.1108/EUM0000000007145.
Xu, D., Chen, W., Peng, W., & Wang, W. (2024). Large language models for generative information extraction: A survey. Frontiers of Computer Science, 18, 186357. https://doi.org/10.1007/s11704-024-40555-y.
Yang, H., Lin, J., & Wei, J. (2023). Research on the automatic subject-indexing method of academic papers based on climate change domain ontology. Sustainability, 15(5), 3919. https://doi.org/10.3390/su15053919
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Rafael Antunes dos Santos, Eliseo Berni Reategui

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Authors who publish in this journal accept the following conditions:
1. The Author retains copyright in the article. Upon acceptance of the article, the author shall grant to the Publisher the right of first publication of the article. with the dcoument registered with the Creative Commons Attribution-NonCommercial-NoDerivative 4.0 International (CC BY-NC-ND) license, which allows to third parties to use what is published whenever they mention the authorship of the work and the first publication in this journal.
2. Authors can make other independent and additional contractual agreements for the non-exclusive distribution of the article published in this journal (eg, include it in an institutional repository or publish it in a book) provided they clearly indicate that the work was published for the first time in this journal.
3. Authors are allowed and recommended to publish their work on the Internet (for example on institutional or personal pages) before and during the review and publication process, as it can lead to productive exchanges and a greater and faster diffusion of published work (see The Effect of Open Access).





