Cell-Cell Interaction Database
Overview
This page describes the automated construction of a cell-cell interaction database by filtering existing curated protein-protein interaction (PPI) data.
 
 
Defining Receptors and Ligands
Receptors
Receptor genes were defined as the union of the annotations from:
- the following set of Gene Ontology (GO) terms:
  - GO:0043235 - receptor complex
  - GO:0008305 - integrin complex
  - GO:0072657 - protein localization to membrane
  - GO:0043113 - receptor clustering
  - GO:0004872 - receptor activity
  - GO:0009897 - external side of plasma membrane
- UniProt annotations:
  - search term - "Receptor [KW-0675]" go:0005886 organism:human
This created a set of 4364 receptor genes (prior to manual curation).
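The union step described above can be sketched as follows. This is a minimal illustration, not the pipeline actually used to build the database: the function name `genes_for_terms`, the gene names and the toy annotation tables are all hypothetical placeholders.

```python
from typing import Dict, Iterable, Set


def genes_for_terms(annotations: Dict[str, Set[str]], terms: Iterable[str]) -> Set[str]:
    """Return the union of genes annotated to any of the given GO terms."""
    genes: Set[str] = set()
    for term in terms:
        # Terms with no annotations in the table contribute nothing.
        genes |= annotations.get(term, set())
    return genes


# Toy GO annotation table (made-up gene names, not real data).
go_annotations = {
    "GO:0043235": {"GENE_A", "GENE_B"},
    "GO:0008305": {"GENE_B", "GENE_C"},
}

# Toy stand-in for the result of the UniProt keyword search.
uniprot_receptors = {"GENE_C", "GENE_D"}

receptor_terms = ["GO:0043235", "GO:0008305", "GO:0004872"]

# A gene is a candidate receptor if it appears in either source.
receptors = genes_for_terms(go_annotations, receptor_terms) | uniprot_receptors
print(sorted(receptors))
```

Because the sources are combined with a set union, a gene annotated by several terms or by both sources is counted once.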
Ligands
Ligand genes were defined as the union of the following annotations:
- the GO term:
  - GO:0005102 - receptor binding
- the set of proteins labelled as secreted in the Secretome dataset (http://www.proteinatlas.org/humanproteome/secretome) (4)
This created a set of 3209 ligand genes (prior to manual curation).
Extracellular Matrix
Extracellular Matrix (ECM) genes were defined as the union of the annotations from:
- the following GO terms:
  - GO:0031012 - extracellular matrix
  - GO:0005578 - proteinaceous extracellular matrix
  - GO:0005201 - extracellular matrix structural constituent
  - GO:1990430 - extracellular matrix protein binding
  - GO:0035426 - extracellular matrix-cell signaling
This created a set of 433 ECM genes (prior to manual curation).
Manual Curation
The ECM, receptor and ligand lists were manually curated:
- genes that were neither receptors nor ligands were removed
- misclassified genes were moved to the correct list (e.g. receptors found on the ligand list, or vice versa)
After curation, the resulting ligand, receptor and ECM sets consisted of:
- Receptors - 1851 genes
- Ligands - 1593 genes
- ECM - 433 genes
The sets are not mutually exclusive: a gene can belong to more than one set (e.g. a gene can be both ECM and ligand at the same time).
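Since the three sets overlap, each gene can be given a combined classification. The sketch below shows one way to derive such labels; the function name `classify`, the toy gene names and the fixed Ligand/ECM/Receptor label order are assumptions made for illustration, and the label spelling in the actual download tables may differ.

```python
from typing import Dict, Set


def classify(receptors: Set[str], ligands: Set[str], ecm: Set[str]) -> Dict[str, str]:
    """Map each gene to a combined label such as 'Ligand/ECM'."""
    labels: Dict[str, str] = {}
    for gene in receptors | ligands | ecm:
        parts = []
        # Fixed order (an assumption) so the same combination always
        # produces the same label.
        if gene in ligands:
            parts.append("Ligand")
        if gene in ecm:
            parts.append("ECM")
        if gene in receptors:
            parts.append("Receptor")
        labels[gene] = "/".join(parts)
    return labels


# Toy sets with made-up gene names.
toy_receptors = {"R1", "X", "Y"}
toy_ligands = {"L1", "X", "Z"}
toy_ecm = {"E1", "X", "Z"}

labels = classify(toy_receptors, toy_ligands, toy_ecm)
for gene in sorted(labels):
    print(gene, labels[gene])
```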
Interaction Data
Protein interactions were downloaded from:
- iRefIndex (version 14) (5). All BioGrid interactions were excluded from the iRefIndex set because BioGrid was imported directly from its original source.
- Pathway Commons (version 8) (6).
- BioGrid (version 3.4.147) (7).
The combined interaction set was filtered to retain only receptor-ligand, receptor-receptor, ligand-ligand, receptor-ECM, ligand-ECM or ECM-ECM interactions, where the receptors, ligands and ECM genes are those defined by the lists above.
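Because every pairing of receptor, ligand and ECM is allowed, the filter amounts to keeping an interaction only when both partners appear in one of the curated lists. A minimal sketch under that reading is shown here; the function name `filter_interactions`, the tuple layout and the toy data are hypothetical and do not reflect the file formats of the source databases.

```python
from typing import Dict, Iterable, List, Tuple


def filter_interactions(
    interactions: Iterable[Tuple[str, str]],
    protein_types: Dict[str, str],
) -> List[Tuple[str, str, str, str]]:
    """Keep interactions whose two partners both have a curated type."""
    kept = []
    for a, b in interactions:
        if a in protein_types and b in protein_types:
            # Record the type of each partner alongside the pair.
            kept.append((a, b, protein_types[a], protein_types[b]))
    return kept


# Toy classification table and interaction list (made-up names).
toy_types = {"L1": "Ligand", "R1": "Receptor", "E1": "ECM"}
toy_interactions = [("L1", "R1"), ("R1", "OTHER"), ("E1", "E1")]

filtered = filter_interactions(toy_interactions, toy_types)
for row in filtered:
    print(row)
```

The pair ("R1", "OTHER") is dropped because "OTHER" is not in any curated list.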
The resulting sets of interactions from the three datasets are outlined in the Venn diagram below. The diagram is not a perfect representation, because only interactions that matched exactly were counted as overlapping: A-B in one dataset matches A-B in another, but A-B and B-A were not considered overlapping.
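The effect of exact matching on the overlap counts can be illustrated with a small sketch. The two functions and the toy datasets below are hypothetical; the order-insensitive variant is shown only for comparison and is not the method used for the diagram.

```python
from typing import FrozenSet, Iterable, Set, Tuple

Pair = Tuple[str, str]


def exact_overlap(x: Iterable[Pair], y: Iterable[Pair]) -> Set[Pair]:
    """Pairs present in both datasets with the same partner order."""
    return set(x) & set(y)


def unordered_overlap(x: Iterable[Pair], y: Iterable[Pair]) -> Set[FrozenSet[str]]:
    """Pairs present in both datasets, ignoring partner order."""
    return {frozenset(p) for p in x} & {frozenset(p) for p in y}


# Toy datasets: the second lists C-D in reversed order.
db1 = [("A", "B"), ("C", "D")]
db2 = [("A", "B"), ("D", "C")]

print(len(exact_overlap(db1, db2)))      # only A-B matches exactly
print(len(unordered_overlap(db1, db2)))  # A-B and C-D both match
```

With exact matching the reversed pair is missed, so the overlaps shown in the diagram are a lower bound on the true number of shared interactions.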
 
 
Download Data
- Ligands - table of ligands (contains HGNC symbol and classification: Ligand, Ligand/ECM, Ligand/Receptor, Ligand/ECM/Receptor)
- Receptors - table of receptors (contains HGNC symbol and classification: Receptor, Receptor/ECM, Ligand/Receptor, Ligand/ECM/Receptor)
- ECM - table of ECM genes (contains HGNC symbol and classification: ECM, ECM/Receptor, ECM/Ligand, Ligand/ECM/Receptor)
- Protein types - table of the unique set of receptor, ligand and ECM genes (contains HGNC symbol and classification: Receptor, Ligand, ECM, ECM/Receptor, ECM/Ligand, Receptor/Ligand, Ligand/ECM/Receptor)
- Ligand - Receptor interaction set
References
1. Qiao W, Wang W, Laurenti E, Turinsky AL, Wodak SJ, Bader GD, Dick JE, Zandstra PW. Intercellular network structure and regulatory motifs in the human hematopoietic system.
   Pubmed: http://www.ncbi.nlm.nih.gov/pubmed/25028490
2. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000 May;25(1):25-9.
   Pubmed: http://www.ncbi.nlm.nih.gov/pubmed/10802651
3. The Gene Ontology Consortium. Expansion of the Gene Ontology knowledgebase and resources. Nucleic Acids Res. 2017 Jan 4;45(D1):D331-D338.
   Pubmed: http://www.ncbi.nlm.nih.gov/pubmed/27899567
4. Uhlén M, Fagerberg L, Hallström BM, Lindskog C, Oksvold P, Mardinoglu A, Sivertsson Å, Kampf C, Sjöstedt E, Asplund A, Olsson I, Edlund K, Lundberg E, Navani S, Szigyarto CA, Odeberg J, Djureinovic D, Takanen JO, Hober S, Alm T, Edqvist PH, Berling H, Tegel H, Mulder J, Rockberg J, Nilsson P, Schwenk JM, Hamsten M, von Feilitzen K, Forsberg M, Persson L, Johansson F, Zwahlen M, von Heijne G, Nielsen J, Pontén F. Proteomics. Tissue-based map of the human proteome. Science. 2015 Jan 23;347(6220).
   Pubmed: http://www.ncbi.nlm.nih.gov/pubmed/25613900
5. Razick S, Magklaras G, Donaldson IM. iRefIndex: a consolidated protein interaction database with provenance. BMC Bioinformatics. 2008 Sep 30;9:405.
   Pubmed: http://www.ncbi.nlm.nih.gov/pubmed/18823568
6. Cerami EG, Gross BE, Demir E, Rodchenkov I, Babur O, Anwar N, Schultz N, Bader GD, Sander C. Pathway Commons, a web resource for biological pathway data. Nucleic Acids Res. 2011 Jan;39(Database issue):D685-90. Epub 2010 Nov 10.
   Pubmed: http://www.ncbi.nlm.nih.gov/pubmed/21071392
7. Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M. BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 2006 Jan 1;34(Database issue):D535-9.
   Pubmed: http://www.ncbi.nlm.nih.gov/pubmed/16381927
