Title:

Generating large-scale network analyses of scientific landscapes in seconds using Dimensions on Google BigQuery

Year:

2022

Abstract:

The growth of large, programmatically accessible bibliometrics databases presents new opportunities for complex analyses of publication metadata. In addition to providing a wealth of information about authors and institutions, databases such as those provided by Dimensions also provide conceptual information and links to entities such as grants, funders and patents. However, data is not the only challenge in evaluating patterns in scholarly work: these large datasets can be challenging to integrate, particularly for those unfamiliar with the complex schemas necessary for accommodating such heterogeneous information, and those most comfortable with data mining may not be as experienced in data visualisation. Here, we present an open-source Python library that streamlines the process of accessing and diagramming subsets of the Dimensions on Google BigQuery database, and we demonstrate its use on the freely available Dimensions COVID-19 dataset. We are optimistic that this tool will expand access to this valuable information by streamlining what would otherwise be multiple complex technical tasks, enabling more researchers to examine patterns in research focus and collaboration over time.
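
As a rough illustration of the kind of network analysis the abstract describes, the sketch below counts how often pairs of concepts appear together in the same publication, which is one common way to derive weighted edges for a concept network. It is not the API of the library presented in the paper: the `publications` records, the `cooccurrence_edges` function and the `min_weight` parameter are all hypothetical names chosen for this example, and the sample data is invented. In practice the records would come from a query against the Dimensions on Google BigQuery tables.

```python
from collections import Counter
from itertools import combinations

# Hypothetical sample records (invented for illustration). Real data would
# come from a query against Dimensions on Google BigQuery, not shown here.
publications = [
    {"id": "pub1", "concepts": ["covid-19", "vaccine", "immunity"]},
    {"id": "pub2", "concepts": ["covid-19", "vaccine"]},
    {"id": "pub3", "concepts": ["covid-19", "immunity"]},
]


def cooccurrence_edges(records, min_weight=1):
    """Count how often each pair of concepts appears in the same publication.

    Returns a dict mapping (concept_a, concept_b) to a co-occurrence count,
    keeping only pairs seen at least `min_weight` times.
    """
    counts = Counter()
    for record in records:
        # Sort and de-duplicate so (a, b) and (b, a) map to the same edge.
        concepts = sorted(set(record["concepts"]))
        counts.update(combinations(concepts, 2))
    return {pair: weight for pair, weight in counts.items() if weight >= min_weight}


edges = cooccurrence_edges(publications)
for (source, target), weight in sorted(edges.items()):
    print(f"{source} -- {target} (weight {weight})")
```

The resulting edge list could then be passed to a graph or visualisation package of the reader's choice; raising `min_weight` is a simple way to prune weak links before diagramming a large network.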

Full reference:

Michele Pasin, Richard Abdill. Generating large-scale network analyses of scientific landscapes in seconds using Dimensions on Google BigQuery. International Conference on Science, Technology and Innovation Indicators (STI 2022), Granada, September 2022.

See also:

2024

paper Dimensions: Calculating Disruption Indices at Scale

Quantitative Science Studies, Sep 2024. https://doi.org/10.48550/arXiv.2309.06120

2023

blog Paperpile: a PDF manager with Google Drive backend

2022

paper Generating large-scale network analyses of scientific landscapes in seconds using Dimensions on Google BigQuery

International Conference on Science, Technology and Innovation Indicators (STI 2022), Granada, Sep 2022.

2021

blog A static site generator using Django, Wget and Github Pages

2020

blog More Jupyter notebooks: pyvis and networkx

blog Getting to grips with Google Colab

2019

blog Introducing DimCli: a Python CLI for the Dimensions API

paper Interlinking SciGraph and DBpedia datasets using Link Discovery and Named Entity Recognition Techniques

Second biennial conference on Language, Data and Knowledge (LDK 2019), Leipzig, Germany, May 2019.

2018

blog Exploring scholarly publications using DBPedia concepts: an experiment

2017

paper Using Linked Open Data to Bootstrap a Knowledge Base of Classical Texts

WHiSe 2017 - 2nd Workshop on Humanities in the Semantic web (colocated with ISWC17), Vienna, Austria, Oct 2017.

blog Exploring SciGraph data using JSON-LD, Elastic Search and Kibana

2015

blog Is wikipedia a valid source of scientific knowledge?

2014

blog Dereference a DOI using python

paper Linked data experience at Macmillan: Building discovery services for scientific and scholarly content on top of a semantic data model

International Semantic Web Conference (ISWC-14), Riva del Garda, Italy, Oct 2014.

2013

blog Annual Bliss Classification Association Lecture: using faceted browsers in the DH

2012

blog Using Mendeley and Dropbox to sync your pdf library across computers

2011

blog Using Impromptu to visualize RSS feeds

2009

blog Using jquery's autocomplete in Django's admin: what about 'inlines'?

blog Using Django-MPTT: lessons learned...

blog Roman Port Networks project

blog Understanding Google's BigTable

blog SoundManager 2 makes it easier to play sounds using Javascript

blog Using the OS X Terminal Application

blog Google Flags Whole Internet As Malware

2007

blog Google-map goes down to the humans..