A mineração de textos em aplicações de pesquisa e desenvolvimento (P&D)

Universidad Complutense de Madrid
A área de Recuperação da Informação (RI) é objeto de pesquisa e estudo na Ciência da Informação e suas técnicas estão presentes dentro dos mais variados processos. A mineração de textos é uma das técnicas do aprendizado de máquina aplicada à Recuperação da Informação (RI) e sua utilização atinge até o mais crítico processo em empresas e organizações, sendo, uma delas, o processo de Pesquisa e Desenvolvimento (P&D). O estudo realiza uma análise de artigos científicos que apresentaram o uso da mineração de textos para aplicação em P&D em três bases de dados referenciais: Web of Science (WoS), SCOPUS e LISA. Com a utilização da análise bibliográfica, são apresentados aspectos da utilização da mineração de textos, aspectos da aplicação em P&D e detalhes desta aplicação aos artigos que apresentaram pertinência com o objetivo da pesquisa. Os principais resultados são apresentados em tabelas com os agrupamentos dos artigos onde a mineração de textos é utilizada para análise de patentes, análise de bases especializadas e análise da internet. Foram observadas duas grandes vertentes no uso da mineração de textos para P&D: na análise de patentes e na análise de bases especializadas, sendo predominante, nesta última, o uso na área da saúde.
Information Retrieval (IR) is a subject of research and study in Information Science, and its techniques are present in a wide variety of processes. Text mining is one of the machine learning techniques applied to IR, and its use reaches even the most critical processes in companies and organizations, one of which is Research and Development (R&D). The study analyzes scientific articles that reported the use of text mining in R&D applications, drawn from three reference databases: Web of Science (WoS), SCOPUS, and LISA. Using bibliographic analysis, it presents aspects of the use of text mining, aspects of its application in R&D, and details of that application for the articles found relevant to the research objective. The main results are presented in tables that group the articles according to whether text mining is used for patent analysis, analysis of specialized databases, or analysis of the internet. Two major strands were observed in the use of text mining for R&D: patent analysis and the analysis of specialized databases, the latter predominantly in the health area.