Goals
- Predict the specificity of peptide recognition domains from their primary amino acid sequence.
- Analyze PDZ, WW, and then SH3 domains.
Strategy
Status
Tasks
- Learn SVN and the Brain code (ResidueResidueCorrelation).
- Literature review related to domain specificity (background activity).
- Run the ResidueResidue correlation analysis on PDZ domain data: start with the 1-1 version, then try others, e.g. 1-2 (requires PDZ profiles from Gary).
- Implement a new feature, amino acid groups (learn the amino acid groups), and run it on PDZ data.
- Think about new PDZ domain features that can be used for prediction.
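As a rough illustration of the correlation and amino acid group tasks above, the following Python sketch scores one domain position against one peptide position using mutual information. This is not the Brain ResidueResidueCorrelation code: the choice of mutual information as the measure, the group table `AA_GROUPS`, and the function names are all assumptions made for illustration, and the groups shown are a hand-picked physicochemical partition rather than learned groups.

```python
from collections import Counter
from math import log2

# Hypothetical amino acid groups (an assumption for illustration only;
# the project plans to learn its groups from data).
AA_GROUPS = {
    "hydrophobic": "AVLIMC",
    "aromatic": "FWY",
    "polar": "STNQ",
    "positive": "KRH",
    "negative": "DE",
    "special": "GP",
}
GROUP_OF = {aa: name for name, members in AA_GROUPS.items() for aa in members}


def to_group(residue):
    """Map a residue to its group label; unknown symbols map to themselves."""
    return GROUP_OF.get(residue, residue)


def mutual_information(xs, ys):
    """Mutual information in bits between two equally long symbol columns."""
    if len(xs) != len(ys):
        raise ValueError("columns must have the same length")
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * log2((c / n) / ((px[x] / n) * (py[y] / n)))
        for (x, y), c in pxy.items()
    )


def position_correlation(pairs, domain_pos, peptide_pos, grouped=False):
    """Correlate one aligned domain position with one peptide position.

    `pairs` is a list of (domain_sequence, peptide_sequence) tuples.
    With grouped=True, residues are replaced by group labels first.
    """
    dom = [d[domain_pos] for d, _ in pairs]
    pep = [p[peptide_pos] for _, p in pairs]
    if grouped:
        dom = [to_group(a) for a in dom]
        pep = [to_group(a) for a in pep]
    return mutual_information(dom, pep)
```

Calling `position_correlation(pairs, i, j)` for every pair of positions would give a domain-by-peptide correlation matrix. Grouping residues first reduces the alphabet, which may help when only a few binding peptides are known per domain.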
Ideas
- Use of structural data (PDZ domain structures) (may require homology modeling)
- Use of machine learning methods (SVM for classification and boosted decision trees for an interpretable learning model)
- Analysis of correlation within the domain and the peptide (inter-residue correlation), possibly using correspondence analysis
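Machine learning methods such as an SVM expect fixed-length numeric input, so the domain sequences would first need to be encoded. A minimal sketch of one common option, one-hot encoding of an aligned sequence, is shown below. The encoding choice and the names `AMINO_ACIDS` and `one_hot` are assumptions for illustration, not part of the project code.

```python
# The 20 standard amino acids in a fixed order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def one_hot(sequence):
    """Encode an aligned sequence as a flat 0/1 vector, 20 entries per position.

    Gaps and unknown symbols are left as all-zero blocks.
    """
    width = len(AMINO_ACIDS)
    vector = [0] * (len(sequence) * width)
    for pos, aa in enumerate(sequence):
        i = INDEX.get(aa)
        if i is not None:
            vector[pos * width + i] = 1
    return vector
```

All sequences must come from the same alignment so that each vector position refers to the same domain position. The resulting vectors could then be passed to a classifier of choice.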
Team
- Shirley Hui
- Gary Bader
Documents
[wiki:/Log Development Log]
Background Literature
- The Structure and Function of Proline Recognition Domains, Zarrinpar et al., 2003 attachment:Structure_Function_Pro_Recog_Domains_Zarrinpar_et_al_2003.pdf
