Talk:Fabry:Sequence-based mutation analysis
From Bioinformatikpedia
Praise
- Nice tables
- Nice visualizations of the mutated residues. Is this pymol?
- I don't think so. They were generated by HOPE.
- Plenty of physicochemical properties.
Criticism
- Does the secondary structure prediction refer to the sequence before or after the mutation?
--> Since we used the files from Task 3 (see Journal), it is from the native structure
- What are your reasons for taking into account the substitution score of PAM1 and PAM250? Is the evolutionary distance not too short and too far, respectively?
--> We were said to do that. At least that's how I understood it
- I think the substitution scores of the three matrices are not directly comparable. PAM1 should have lower scores than PAM250 relative to the maximum score.
--> We díd compare them relatively not the absolute values
- What do the entries in the PSSM table mean? It is obviously not the substitution score. In case of P40S: is P(P|40) = 0.81 or P(S|40) = 0.81?
- It's the "weighted observed percentages rounded down" value as shown here. As I understood, it means that, e.g. for P40S, in 81% of the sequences, the wild type is conserved whereas in 2% of the sequences, the mutation (P → S) is observed.
- You classify the features either as disease causing or not disease causing. But what if a feature is in between?
--> There are more severe forms of the disease and less severe forms, but either you are ill or not... What else is there in between?
- You are weighting each feature equally. For instance PAM1, PAM25, and BLOSUM62 are treated equally. The three secondary structure prediction are of course not independent.
--> but sometimes different. A weighting function would have actually been a good idea, but since we are not done yet, it would have made no sense to develop one.