Skip to main content

Table 2 Operational network datasets implemented in the main comparison's referred research

From: Influential nodes identification in complex networks: a comprehensive literature review

Networks dataset

Common abbreviation

Description

LFR benchmark

LFR

Lancichinetti–Fortunato–Radicchi benchmark (An artificial network produced by the LFR algorithm that resembles a real-world network).

Zebra

ZBR

Animal network that contains interactions between 28 Grévy's zebras (Equus grevyi) in Kenya. Zebras are represented by nodes, and an edge between two zebras indicates that there was interaction between them during the study.

Zachary karate club

ZKC

Human Social network of university of karate club that gathers students of the club of karate by Wayne Zachary in 1977. Each node represents a member of the club, and each edge represents a tie between two members of the club.

Contiguous

CTG

The contiguous zone, the marin boundary between 12NM (Nautical miles) and 24NM.

Dolphins

DLP

A social network of bottlenose dolphins. The nodes are the bottlenose dolphins (genus Tursiops) of a bottlenose dolphin community living off Doubtful Sound, a fjord in New Zealand (spelled fiord in New Zealand). An edge indicates a frequent association. The dolphins were observed between 1994 and 2001.

Copperfield

CPF

Network of common word (adjacencies between noun and adjectives) for the novel David Copperfield by Charles Dickens. Nodes represent the most commonly occurring adjectives and nouns in the book. Edges connect any pair of words that occur in adjacent position in the text of the book.

Co authorship in network science

NTS

Co-authorship of scientists in network theory and experiments.

Caenorhabditis elegans

ELG

Neural network of neurons and synapses in C. elegans, a type of worm. It consists of around 1000 cells including 302 neurons.

Euroroad

ERD

A international E-road network located mostly in Europe. Network includes cities, and an edge connecting two cities indicates that they are linked. It contains 1174 cities.

Chicago

CCG

Contains a comprehensive list of all current City of Chicago workers with details.

Hamsterster

HMS

Network is of the friendships and family links between users of the website http://www.hamsterster.com. It is an independent site created in 2003 or 2004. Hamsterster appears to have been shut down as of October 2014.

US power grid

UG

Undirected infrastructure network provides data concerning the Western States of the USA of America's power grid. An edge represents a power supply line. A node is either a generator, a transformator or a substation.

Pretty good privacy

PGP

An online contact network or an interaction network of users of the pretty good privacy (PGP) algorithm. The network contains only the giant connected component of the network.

Astro physics

ASP

Collaboration or cooperation network based on the e-print arXiv and includes scientific partnerships between authors of articles submitted to the Astro Physics field. If an author i co-authored a paper with author j, the graph contains a undirected edge from i to j. The data covers papers in the period from January 1993 to April 2003 (124 months). It begins within a few months of the inception of the arXiv, and thus represents essentially the complete history of its ASTRO-PH section.

Enron email network

ENR

The Enron email dataset comprises about 500,000 emails sent by Enron Corporation employees. This data was originally made public, and posted to the web, by the Federal Energy Regulatory Commission during its investigation. Nodes of the network are email addresses and if an address i sent at least one email to address j, the graph contains an undirected edge from i to j.

Jazz musicians

JZ

Collaboration network between Jazz artists. Each node represents a Jazz artist, and each edge indicates that two artists have collaborated in a band. Two levels of collaborations are studied. First, the collaboration network between individuals, where two musicians are connected if they have played in the same band and second, the collaboration between bands, where two bands are connected if they have a musician in common.

Email network of URV

URV

The email communication network of the University Rovira I Virgili in Tarragona, Catalonia, Spain. Nodes are users and each edge represents that at least one email was sent. The direction of emails and the number of emails between two persons are not stored.

BLOGS

BG

Communication network between users of MSN’s (windows live) blog. It’s composed of 3982 nodes and 6803 edges.

COND-MAT (condense matter physics)

CoundMath

Collaboration network based on the e-print arXiv and includes research partnerships between authors who have submitted articles to the Condense Matter category. If an author i co-authored a paper with author j, the graph contains a undirected edge from i to j. If the paper is co-authored by k authors this generates a completely connected (sub) graph on k nodes. The data covers papers in the period from January 1993 to April 2003 (124 months). It begins within a few months of the inception of the arXiv, and thus represents essentially the complete history of its COND-MAT section.

Live journal

LJ

Free online blogging community with almost 10 million members where individuals express their friendship toward others. LiveJournal allows members to maintain journals, individual and group blogs, and it allows people to declare which other members are their friends they belong.

Contact network of inpatients

CNI

Presents link between two inpatients if they have both been admitted to the same hospital.

Internet Movie database actors in adult films

IMDB

Network of connections between actors who have co-starred in films, whose genre has been labeled by the Internet Movie Database as ‘adult’. The dataset is a bipartite graph in which each node either corresponds to an actor or to a movie. Edges go from a movie to each actor in the movie. It also provides metadata for the nodes like movie/actor name, year of the movie, and genre of the movie.

Email contact network

EM

The network of email contacts is formed on email messages sent and received at University College London's Computer Sciences Department.

The Internet at the router level (RL)

RL

The nodes of the RL Internet network are the Internet routers. Two routers are connected if there exists a physical connection between them.

The Internet at the autonomous system level (AS)

AS

The nodes are autonomous systems that are linked if there is a real connection beyond them. graph of routers comprising the Internet can be organized into sub-graphs called Autonomous Systems (AS). Each AS exchanges traffic flows with some neighbors (peers). We can construct a communication network of who-talks-to- whom from the BGP (Border Gateway Protocol) logs. The data was collected from University of Oregon Route Views Project—Online data and reports. The dataset contains 733 daily instances which span an interval of 785 days from November 8 1997 to January 2 2000. In contrast to citation networks, where nodes and edges only get added (not deleted) over time, the AS dataset also exhibits both the addition and deletion of the nodes and edges over time.

Product space of economic goods

PS

Is a network that formalizes the idea of relatedness between products traded in the global economy. Proximity network between products according to Ref.

Word

WAN

Represents an adjacency relation in English text.

E. coliproteins

ECP

Presents the problem of identifying E.coli proteins based on amino acid sequences in cell localization regions. It contains 336 E.coli proteins split into 8 different classes.

Tandem affinity purification

TAP

Yeast protein–protein binding network generated by tandem affinity purification experiments.

Yeast 2 hybrid

Y2H

Yeast protein–protein binding network generated using yeast two hybridization. It is originally created by Fields and Song. Is a genetic system wherein the interaction between two proteins of interest is detected via the reconstitution of a transcription factor and the subsequent activation of reporter genes under the control of this transcription factor.

Power

PWR

Connections between power stations.

Internet (router level)

Int

Symmetrized snapshot of the Internet ‘s structure at the level of autonomous systems, the network size is 22963.

Facebook

FB

This dataset consists of friends lists from Facebook. Nodes represents actors or friends and edge represent the relationship between them.

Twitter

TW

Microblogging social network operated by the company Twitter Inc. It allows a user to send free text messages, called tweets, over the internet, by instant messaging or by SMS.

The John Padgett—Florentine Families Dataset

JPFF

Multiplex network with 2 edge types representing marriage alliances and business relationships between Florentine families during the Italian Renaissance. Data hosted by Manlio De Domenico. Marriage and commercial links between Renaissance Florentine families are represented in this dataset.

Delicious.com

DLC

Feature network. This dataset includes labeled web pages obtained from the website delicious.com. Left nodes represent tags, right nodes represent URLs and an edge shows that a URL was tagged with a tag.

UsairPort

UP

Network of direct flights linking US airports in 2010. Each edge represents a connection from one airport to another, and the weight of an edge shows the number of flights on that connection in the given direction, in 2010.

AirLines

AL

Flight arrival and departure data for all commercial flights from 1987 to 2008.

American College Football Network

ACF

Interaction network that represents Football games between Division IA institutions during the regular season in the Fall 2000.

Yeast

YST

Metabolic network. The dataset consists of a protein–protein interaction network. Research showed that proteins with a high degree were more important for the survival of the yeast than others. A node represents a protein and an edge represents a metabolic interaction between two proteins. The network contains loops.

Router

RTR

Routing network composed of 5022 nodes and 12 516 connections.

Human protein

HP

A network of protein–protein interactions that includes physical contacts between proteins that have been experimentally demonstrated in humans, such as metabolic enzyme-coupled interactions and signaling interactions. Nodes represent human proteins and edges represent physical interaction between proteins in a human cell.

General relativity and quantum cosmology collaboration network

CA-GrQc

The collaboration network derives from the e-print arXiv and contains scientific partnerships between authors on articles submitted to the category of General Relativity and Quantum Cosmology. If an author i co-authored a paper with author j, the graph contains a undirected edge from i to j. The data covers papers in the period from January 1993 to April 2003 (124 months). It begins within a few months of the inception of the arXiv, and thus represents essentially the complete history of its GR-QC section.

High energy physics theory collaboration network

Ca-HepTh

collaboration network is from the e-print arXiv and covers scientific collaborations between authors papers submitted to High Energy Physics—Theory category. If an author i co-authored a paper with author j, the graph contains a undirected edge from i to j. If the paper is co-authored by k authors this generates a completely connected (sub)graph on k nodes. The data covers papers in the period from January 1993 to April 2003 (124 months). It begins within a few months of the inception of the arXiv, and thus represents essentially the complete history of its HEP-TH section.

Groad

GRD

Highway network of 1168 nodes.