Clustering techniques find groups of proteins that may be of clinical significance
Breast cancer is the most common form of cancer in women, and most of us probably know somebody who has been affected by it. So when another data scientist suggested I look at the breast cancer proteome data on Kaggle, I thought it was worth doing. I'm not a biologist, but I know that cell behaviour involves complex networks of interacting proteins, so I thought clustering would be a good way of uncovering these networks. I was pleased to find that the resulting protein clusters seemed to be predictive of clinical outcomes.
I hope this work might be useful to clinical researchers. If you are interested in it, please contact me.