From: Influential nodes identification in complex networks: a comprehensive literature review
Networks dataset | Common abbreviation | Description |
---|---|---|
LFR benchmark | LFR | Lancichinetti–Fortunato–Radicchi benchmark (An artificial network produced by the LFR algorithm that resembles a real-world network). |
Zebra | ZBR | Animal network that contains interactions between 28 Grévy's zebras (Equus grevyi) in Kenya. Zebras are represented by nodes, and an edge between two zebras indicates that there was interaction between them during the study. |
Zachary karate club | ZKC | Human social network of a university karate club, studied by Wayne Zachary in 1977. Each node represents a member of the club, and each edge represents a tie between two members of the club. |
Contiguous | CTG | The contiguous zone, the marine boundary between 12 NM (nautical miles) and 24 NM. |
Dolphins | DLP | A social network of bottlenose dolphins. The nodes are the bottlenose dolphins (genus Tursiops) of a bottlenose dolphin community living off Doubtful Sound, a fjord in New Zealand (spelled fiord in New Zealand). An edge indicates a frequent association. The dolphins were observed between 1994 and 2001. |
Copperfield | CPF | Network of common word adjacencies (between nouns and adjectives) in the novel David Copperfield by Charles Dickens. Nodes represent the most commonly occurring adjectives and nouns in the book. Edges connect any pair of words that occur in adjacent positions in the text of the book. |
Co authorship in network science | NTS | Co-authorship of scientists in network theory and experiments. |
Caenorhabditis elegans | ELG | Neural network of neurons and synapses in C. elegans, a type of worm. It consists of around 1000 cells including 302 neurons. |
Euroroad | ERD | An international E-road network located mostly in Europe. Nodes are cities, and an edge connecting two cities indicates that they are linked by an E-road. It contains 1174 cities. |
Chicago | CCG | Contains a comprehensive list of all current City of Chicago workers with details. |
Hamsterster | HMS | Network of the friendships and family links between users of the website http://www.hamsterster.com, an independent site created in 2003 or 2004. Hamsterster appears to have been shut down as of October 2014. |
US power grid | UG | Undirected infrastructure network representing the power grid of the Western States of the United States of America. An edge represents a power supply line. A node is either a generator, a transformer or a substation. |
Pretty good privacy | PGP | An online contact (interaction) network of users of the Pretty Good Privacy (PGP) algorithm. The dataset contains only the giant connected component of the network. |
Astro physics | ASP | Collaboration or cooperation network based on the e-print arXiv and includes scientific partnerships between authors of articles submitted to the Astro Physics field. If an author i co-authored a paper with author j, the graph contains an undirected edge from i to j. The data covers papers in the period from January 1993 to April 2003 (124 months). It begins within a few months of the inception of the arXiv, and thus represents essentially the complete history of its ASTRO-PH section. |
Enron email network | ENR | The Enron email dataset comprises about 500,000 emails sent by Enron Corporation employees. This data was originally made public, and posted to the web, by the Federal Energy Regulatory Commission during its investigation. Nodes of the network are email addresses and if an address i sent at least one email to address j, the graph contains an undirected edge from i to j. |
Jazz musicians | JZ | Collaboration network between Jazz artists. Each node represents a Jazz artist, and each edge indicates that two artists have collaborated in a band. Two levels of collaborations are studied. First, the collaboration network between individuals, where two musicians are connected if they have played in the same band and second, the collaboration between bands, where two bands are connected if they have a musician in common. |
Email network of URV | URV | The email communication network of the University Rovira i Virgili in Tarragona, Catalonia, Spain. Nodes are users and each edge indicates that at least one email was sent. The direction of emails and the number of emails between two persons are not stored. |
BLOGS | BG | Communication network between users of the MSN (Windows Live) blog. It is composed of 3982 nodes and 6803 edges. |
COND-MAT (condensed matter physics) | CoundMath | Collaboration network based on the e-print arXiv and includes research partnerships between authors who have submitted articles to the Condensed Matter category. If an author i co-authored a paper with author j, the graph contains an undirected edge from i to j. If the paper is co-authored by k authors this generates a completely connected (sub)graph on k nodes. The data covers papers in the period from January 1993 to April 2003 (124 months). It begins within a few months of the inception of the arXiv, and thus represents essentially the complete history of its COND-MAT section. |
Live journal | LJ | Free online blogging community with almost 10 million members where individuals express their friendship toward others. LiveJournal allows members to maintain journals, individual and group blogs, and it allows people to declare which other members are their friends. |
Contact network of inpatients | CNI | Contains a link between two inpatients if they have both been admitted to the same hospital. |
Internet Movie database actors in adult films | IMDB | Network of connections between actors who have co-starred in films, whose genre has been labeled by the Internet Movie Database as ‘adult’. The dataset is a bipartite graph in which each node either corresponds to an actor or to a movie. Edges go from a movie to each actor in the movie. It also provides metadata for the nodes like movie/actor name, year of the movie, and genre of the movie. |
Email contact network | EM | Network of email contacts built from email messages sent and received at University College London's Computer Science Department. |
The Internet at the router level (RL) | RL | The nodes of the RL Internet network are the Internet routers. Two routers are connected if there exists a physical connection between them. |
The Internet at the autonomous system level (AS) | AS | The nodes are autonomous systems, which are linked if there is a real connection between them. The graph of routers comprising the Internet can be organized into sub-graphs called Autonomous Systems (AS). Each AS exchanges traffic flows with some neighbors (peers). We can construct a communication network of who-talks-to-whom from the BGP (Border Gateway Protocol) logs. The data was collected from the University of Oregon Route Views Project - Online data and reports. The dataset contains 733 daily instances which span an interval of 785 days from November 8 1997 to January 2 2000. In contrast to citation networks, where nodes and edges only get added (not deleted) over time, the AS dataset exhibits both the addition and deletion of nodes and edges over time. |
Product space of economic goods | PS | A network that formalizes the idea of relatedness between products traded in the global economy. Proximity network between products according to Ref. |
Word | WAN | Represents an adjacency relation in English text. |
E. coli proteins | ECP | Presents the problem of identifying E. coli proteins based on amino acid sequences in cell localization regions. It contains 336 E. coli proteins split into 8 different classes. |
Tandem affinity purification | TAP | Yeast protein–protein binding network generated by tandem affinity purification experiments. |
Yeast 2 hybrid | Y2H | Yeast protein–protein binding network generated using the yeast two-hybrid method, originally created by Fields and Song. It is a genetic system in which the interaction between two proteins of interest is detected via the reconstitution of a transcription factor and the subsequent activation of reporter genes under the control of this transcription factor. |
Power | PWR | Connections between power stations. |
Internet (router level) | Int | Symmetrized snapshot of the Internet's structure at the level of autonomous systems; the network size is 22963. |
Facebook | FB | This dataset consists of friends lists from Facebook. Nodes represent actors or friends and edges represent the relationships between them. |
Twitter | TW | Microblogging social network operated by the company Twitter Inc. It allows a user to send free text messages, called tweets, over the internet, by instant messaging or by SMS. |
The John Padgett—Florentine Families Dataset | JPFF | Multiplex network with 2 edge types representing marriage alliances and business relationships between Florentine families during the Italian Renaissance. Data hosted by Manlio De Domenico. Marriage and commercial links between Renaissance Florentine families are represented in this dataset. |
Delicious.com | DLC | Feature network. This dataset includes labeled web pages obtained from the website delicious.com. Left nodes represent tags, right nodes represent URLs and an edge shows that a URL was tagged with a tag. |
UsairPort | UP | Network of direct flights linking US airports in 2010. Each edge represents a connection from one airport to another, and the weight of an edge shows the number of flights on that connection in the given direction, in 2010. |
AirLines | AL | Flight arrival and departure data for all commercial flights from 1987 to 2008. |
American College Football Network | ACF | Interaction network that represents football games between Division IA institutions during the regular season of Fall 2000. |
Yeast | YST | Metabolic network. The dataset consists of a protein–protein interaction network. Research showed that proteins with a high degree were more important for the survival of the yeast than others. A node represents a protein and an edge represents a metabolic interaction between two proteins. The network contains loops. |
Router | RTR | Routing network composed of 5022 nodes and 12 516 connections. |
Human protein | HP | A network of protein–protein interactions that includes physical contacts between proteins that have been experimentally demonstrated in humans, such as metabolic enzyme-coupled interactions and signaling interactions. Nodes represent human proteins and edges represent physical interaction between proteins in a human cell. |
General relativity and quantum cosmology collaboration network | CA-GrQc | The collaboration network derives from the e-print arXiv and contains scientific partnerships between authors on articles submitted to the category of General Relativity and Quantum Cosmology. If an author i co-authored a paper with author j, the graph contains an undirected edge from i to j. The data covers papers in the period from January 1993 to April 2003 (124 months). It begins within a few months of the inception of the arXiv, and thus represents essentially the complete history of its GR-QC section. |
High energy physics theory collaboration network | Ca-HepTh | The collaboration network is from the e-print arXiv and covers scientific collaborations between authors of papers submitted to the High Energy Physics - Theory category. If an author i co-authored a paper with author j, the graph contains an undirected edge from i to j. If the paper is co-authored by k authors this generates a completely connected (sub)graph on k nodes. The data covers papers in the period from January 1993 to April 2003 (124 months). It begins within a few months of the inception of the arXiv, and thus represents essentially the complete history of its HEP-TH section. |
Groad | GRD | Highway network of 1168 nodes. |
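
The arXiv collaboration datasets in the table (ASP, CoundMath, CA-GrQc, Ca-HepTh) share one construction rule: an undirected edge joins authors i and j if they co-authored a paper, so a paper with k authors yields a completely connected subgraph on k nodes. The following is a minimal Python sketch of that rule. The input format (one list of author names per paper), the function name `coauthorship_edges`, and the sample data are assumptions made for illustration; they are not taken from the datasets or from any tooling distributed with them.

```python
from itertools import combinations


def coauthorship_edges(papers):
    """Return the set of undirected co-authorship edges.

    `papers` is a hypothetical input: an iterable of author lists,
    one list per paper. A paper with k distinct authors contributes
    a clique on k nodes, i.e. k * (k - 1) / 2 edges before duplicates
    across papers are merged.
    """
    edges = set()
    for authors in papers:
        # Sorting the distinct authors gives each undirected edge a
        # single canonical form (a, b) with a < b, so the pair (i, j)
        # and the pair (j, i) are stored only once.
        for a, b in combinations(sorted(set(authors)), 2):
            edges.add((a, b))
    return edges


# Made-up sample data, for illustration only.
sample_papers = [
    ["i", "j"],       # two authors: one edge
    ["i", "j", "k"],  # three authors: a triangle
    ["m"],            # single author: no edge
]

print(sorted(coauthorship_edges(sample_papers)))
# → [('i', 'j'), ('i', 'k'), ('j', 'k')]
```

Storing edges in a set means repeated collaborations collapse into a single unweighted edge, matching the table's description of these graphs; a weighted variant would need a counter instead.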