NEW TOOL TO BUILD AND ANALYSE DATABASES ON SCIENTIFIC CAREERS AVAILABLE

SiSOB data extraction and codification: A tool to analyze scientific careers is the title of an article published by the journal Research Policy, which «describes the methodology and software tool used to build a database on the careers and productivity of academics, using public information available on the Internet». This innovative software (SiSOB) providing outputs that can be used for either econometric research or data representation for policy analysis, is released under free software GNU General Public License. The methodology and software tool is validated for a sample of US and UK biomedical scientists, but can be applied to any countries where scientists’ CVs are available in English.

This article, written by Aldo Geuna (Department of Economics and Statistics Cognetti De Martiis, University of Turin, Italy and BRICK, Collegio Carlo Alberto, Moncalieri, Italy), Rodrigo Kataishi (Department of Economics and Statistics Cognetti De Martiis, University of Turin, Italy and BRICK, Collegio Carlo Alberto, Moncalieri, Italy), Manuel Toselli (Department of Economics and Statistics Cognetti De Martiis, University of Turin, Italy and BRICK, Collegio Carlo Alberto, Moncalieri, Italy), Eduardo Guzmán (Department of Languages and Computer Science, University of Malaga, Spain), Cornelia Lawson (Department of Economics and Statistics Cognetti De Martiis, University of Turin, Italy; BRICK, Collegio Carlo Alberto, Moncalieri, Italy and School of Sociology and Social Policy, University of Nottingham, UK) Ana Fernandez-Zubieta (Institute for Advanced Social Studies – Spanish Council for Scientific Research, Spain) and Beatriz Barros (Department of Languages and Computer Science, University of Malaga, Spain), offers a «first analysis of the data collected for a sample of 360 US scientists funded by the National Institute of Health (NIH) and 291 UK scientists funded by the Biotechnology and Biological Sciences Research Council (BBSRC)». It also provides «an overview of the motivations for constructing the database, and the data crawling and data mining techniques used to transform webpage-based information and CV information into a relational database». Finally, the paper describes the database and the effectiveness of our algorithms and provides suggestions for further improvements.

SiSOB is available here.

Esta entrada fue publicada en Metodología sociológica, Sociología de la investigación y etiquetada , , , , . Guarda el enlace permanente.