Lung cancer dataset and visualization

less than 1 minute read

Published:

======

I found this blog when searching for cancer datasets and she used the same dataset that I used for machine learning project and application of machine learning in cancer and diease prediction

This page is overview of cancer genomics

What I have learned from this blog:

  • impute missing data using mice package

  • PCA analysis

  • using regression, classification to build the model (caret package)

I found the Pan-lung cancer dataset from cbio-portal with very nice data visualization of lung cancer data

Example of the most 10 mutation genes and frequency

lung-cancer