User Manual

Overview

The Semantic Network Summary is a Cytoscape plugin that generates a word tag cloud from a user-defined node selection, summarizing an attribute of choice. For instance, if the selected nodes are proteins and the string attribute "full protein name" is chosen, every string is broken down into words, which are plotted on a panel with a size proportional to their frequency.
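The idea of breaking attribute strings into words and sizing them by frequency can be sketched in a few lines of Python. This is only an illustration, not the plugin's code: the function names, the tokenization rule (split on non-alphanumeric characters, lower-case everything) and the maximum font size are all assumptions made for the example.

```python
import re
from collections import Counter

def word_frequencies(attribute_values):
    """Count how often each word occurs across the given attribute strings."""
    counts = Counter()
    for value in attribute_values:
        # ASSUMPTION: words are runs of letters/digits, compared case-insensitively.
        counts.update(w for w in re.split(r"[^A-Za-z0-9]+", value.lower()) if w)
    return counts

def font_sizes(counts, max_size=48):
    """Give each word a size proportional to its count (most frequent word = max_size)."""
    if not counts:
        return {}
    top = max(counts.values())
    return {word: max_size * count / top for word, count in counts.items()}

# Hypothetical "full protein name" values for three selected nodes.
names = [
    "Origin Recognition Complex 1",
    "Origin Recognition Complex 2",
    "DNA polymerase alpha",
]
counts = word_frequencies(names)
sizes = font_sizes(counts)
print(sizes["origin"], sizes["dna"])  # 48.0 24.0
```

Here "origin" occurs in two of the three names and "dna" in one, so "origin" is drawn twice as large.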

Example_Cloud1.png

It is also possible to use the plugin to cluster words that appear together in the selected nodes. For instance, if node A has the name attribute "Origin Recognition Complex 1" and node B has the name attribute "Origin Recognition Complex 2", then the words "Origin", "Recognition" and "Complex" will be clustered together, following the order in which they appear. The plugin operates on any network and on any selected attribute, although it has been specifically designed for string attributes such as gene names or gene ontology annotations.

Example_Cloud2.png

Installation

The Semantic Network Summary Plugin requires Cytoscape version 2.7.x. If you don't have Cytoscape, or have an older version (2.6 or older), please download the latest release from http://www.cytoscape.org/ and install it on your computer.

Quick Start Guide

Creating a Semantic Summary Word Cloud

After loading a Cytoscape network and selecting the nodes of interest, there are three locations from which you can create a Semantic Summary Word Cloud.

All of these methods work with the default parameter values. For a more careful choice of parameter settings, please see the Full User Guide.

Exploring the Semantic Summary Word Cloud

Advanced Tips


Full User Guide

Tips on Parameter Choice

Attribute Choice

You can choose which Cytoscape attribute to build your word cloud from in the Semantic Analysis section of the Input panel. Your options include the node ID (the default option) and all currently available attributes of type String or List. Note that for an attribute of type List, only those entries that are Strings will be used to build the word cloud. After changing the attribute that a cloud is built from, you must click the "Update" button to see the updated results for the current cloud.
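The handling of List attributes can be pictured with the following minimal sketch. It is not taken from the plugin; the function name and the use of Python lists to stand in for Cytoscape List attributes are assumptions for illustration only.

```python
def strings_for_cloud(attribute_value):
    """Return the text entries of one node's attribute that would feed the word cloud."""
    if isinstance(attribute_value, str):
        # A plain String attribute is used as-is.
        return [attribute_value]
    if isinstance(attribute_value, (list, tuple)):
        # For a List attribute, keep only the String entries; numbers etc. are skipped.
        return [item for item in attribute_value if isinstance(item, str)]
    # Any other attribute type contributes nothing.
    return []

# Hypothetical List attribute mixing annotation strings with numeric entries.
annotations = ["kinase activity", 42, "ATP binding", 3.5]
print(strings_for_cloud(annotations))  # ['kinase activity', 'ATP binding']
```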

Network Normalization

The size of any word in a tag cloud is a reflection of how often that word appears in the selected nodes of the network, normalized over how often that word appears in the entire network. The Network Normalization parameter determines how much weight to give the word counts over the entire network when determining word size.
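To get a feel for what this parameter does, consider the sketch below. The manual does not state the exact formula, so the one used here (selection frequency divided by network frequency raised to the normalization value) is an assumption chosen purely for illustration, as are the function and argument names.

```python
def normalized_score(sel_count, sel_total, net_count, net_total, normalization):
    """Score a word by its frequency in the selection, discounted by network frequency.

    ASSUMPTION: illustrative formula only. normalization = 0 ignores the network
    entirely; larger values give the network-wide counts more weight.
    """
    sel_freq = sel_count / sel_total   # how common the word is in the selected nodes
    net_freq = net_count / net_total   # how common it is in the whole network
    # net_count is never zero here: selected nodes are part of the network.
    return sel_freq / (net_freq ** normalization)

# Two hypothetical words, each found in 4 of 10 selected nodes.
# "complex" is common network-wide (40 of 100 nodes); "origin" is rare (4 of 100).
for k in (0, 1):
    common = normalized_score(4, 10, 40, 100, k)
    rare = normalized_score(4, 10, 4, 100, k)
    print(k, rare > common)
```

With a normalization of 0 both words score the same, because only the selection matters. With a normalization of 1 the word that is rare in the network but frequent in the selection scores higher and would be drawn larger.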

Word Aggregation Cutoff

Word Aggregation Cutoff is a parameter used only with cloud layouts that incorporate word clustering. The Semantic Network Summary Plugin uses a greedy algorithm in combination with hierarchical clustering to create the word clusters that a user will see. The word clusters are built by aggregating pairs of words. Words are aggregated in such a way that their order in the cluster reflects which words appear next to each other in the selected nodes in the network.

The algorithm used for word aggregation assigns a probability value to every ordered pair of words that appear next to each other in at least one selected node in the network. This value is the ratio of the observed joint probability of the two words appearing next to each other to the probability of them appearing next to each other if their occurrences were independent. Only word pairs whose value is greater than or equal to the Word Aggregation Cutoff can appear next to each other in a single cluster.
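The ratio described above can be sketched as follows. How the plugin estimates the individual probabilities is not specified in this manual, so the estimates used here (pair counts over all adjacent pairs, word counts over all words in the selected nodes) and the function name are assumptions for illustration.

```python
from collections import Counter

def adjacency_ratios(tokenized_nodes):
    """Map each ordered adjacent pair (a, b) to P(a, b) / (P(a) * P(b))."""
    word_counts = Counter()
    pair_counts = Counter()
    for words in tokenized_nodes:
        word_counts.update(words)
        # zip pairs each word with its right-hand neighbour, preserving order.
        pair_counts.update(zip(words, words[1:]))
    total_words = sum(word_counts.values())
    total_pairs = sum(pair_counts.values())
    ratios = {}
    for (a, b), n in pair_counts.items():
        joint = n / total_pairs  # observed probability of "a b"
        independent = (word_counts[a] / total_words) * (word_counts[b] / total_words)
        ratios[(a, b)] = joint / independent
    return ratios

# Hypothetical selected nodes, already split into words.
nodes = [
    ["origin", "recognition", "complex", "1"],
    ["origin", "recognition", "complex", "2"],
    ["dna", "polymerase"],
]
ratios = adjacency_ratios(nodes)
print(("origin", "recognition") in ratios)  # True
print(("recognition", "origin") in ratios)  # False
```

Because the pairs are ordered, "origin recognition" receives a value while "recognition origin", which never occurs, does not.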

Smaller values for this parameter will usually translate to larger clusters of words. If this parameter is sufficiently large, every word will appear in its own cluster.
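The effect of the cutoff on cluster size can be seen in the simplified sketch below. It is not the plugin's actual algorithm: it is one possible greedy scheme, assumed here for illustration, in which the strongest pairs are joined first and a pair is only joined when the first word ends one cluster and the second word starts another. The function name and the pair values are hypothetical.

```python
def aggregate_words(words, pair_ratios, cutoff):
    """Greedily chain words into ordered clusters (illustrative scheme only).

    ASSUMPTION: every word that appears in pair_ratios is also listed in words.
    """
    cluster_of = {w: [w] for w in words}  # each word starts in its own cluster
    # Visit the strongest pairs first; ties are broken alphabetically.
    for (a, b), ratio in sorted(pair_ratios.items(), key=lambda kv: (-kv[1], kv[0])):
        if ratio < cutoff:
            break  # every remaining pair is weaker than the cutoff
        left, right = cluster_of[a], cluster_of[b]
        # Join only if a ends one cluster and b starts a different one,
        # so the merged cluster keeps the observed word order.
        if left is not right and left[-1] == a and right[0] == b:
            left.extend(right)
            for w in right:
                cluster_of[w] = left
    # Collect each distinct cluster once, in order of first appearance.
    seen, clusters = set(), []
    for w in words:
        cluster = cluster_of[w]
        if id(cluster) not in seen:
            seen.add(id(cluster))
            clusters.append(cluster)
    return clusters

words = ["origin", "recognition", "complex", "dna", "polymerase"]
pair_ratios = {  # hypothetical values for ordered adjacent pairs
    ("origin", "recognition"): 7.0,
    ("recognition", "complex"): 6.0,
    ("dna", "polymerase"): 2.0,
}
for cutoff in (1.0, 6.5, 100.0):
    print(cutoff, aggregate_words(words, pair_ratios, cutoff))
```

With a cutoff of 1.0 the words form two clusters ("origin recognition complex" and "dna polymerase"), with 6.5 only "origin recognition" is joined, and with 100.0 every word stays in its own cluster.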

Cloud Layout

The choice of cloud layout style will affect the look and feel of the word tag cloud that appears. This is also where a user will determine if they want their word tag cloud to include clustering. Below are examples of the same word cloud with the different layout style options.

Clustered-Standard:

Style-Clustered-Standard.jpg

Clustered-Boxes:

Style-Clustered-Boxes.jpg

Non-Clustered:

Style-Non-Clustered.png

The Input Panel

[ATTACH]

1. Cloud List - A list of all clouds that exist for the current network. If no network is currently selected, this section will display "No Network Loaded". A cloud can be renamed by selecting it in the list, right-clicking, and selecting "Edit Cloud Name". No two clouds for the same network can have the same name. By default, clouds are named using sequential numbering.

2. Semantic Analysis - Allows the user to specify which attribute to use when building the word cloud.

3. Advanced Parameters - The user can specify the maximum number of words to display in the word cloud, as well as the Network Normalization and Word Aggregation Cutoff.

4. Word Exclusion List - A user can add or remove words to be ignored when building the word cloud. The list of words to filter out is applicable at the network level. Hence, any words added or removed while a particular cloud is selected in the cloud list will affect all future clouds created or updated from the network associated with that cloud.

5. Cloud Layout - The user has a choice of several different styles for the layout of the word tag cloud. This is where a user will select whether or not they want to see a layout that includes clustering.

6. Actions - The user has three choices: Delete (deletes the currently selected cloud), Update (takes in all parameters, the set of currently selected nodes, and all nodes in the network and builds an updated word cloud), and Create (takes in all parameters, the set of currently selected nodes, and all nodes in the network and builds a new word cloud).

The Cloud Display Panel

Default and Valid Parameter Values

Node ID/Attribute:

Max Num of Words:

Network Normalization:

Word Aggregation Cutoff:

Style:

Software/WordCloudPlugin/UserManual (last edited 2010-07-17 18:17:58 by LaylaOesper)
