Analysis of data is an integral part of biology, both in academic research and the Industry. With the advent of high-throughput techniques, biological data analysis has crossed the realm of classical statistical techniques and now involves techniques used by the wider data analytic and machine learning community. It is now expected that every biology student is acquainted with the key concepts and tools of data analysis. This course is designed specifically for biology students to learn the key concepts, applications, and limitations of commonly used data analysis techniques. This course emphasizes visualization and analysis of higher-dimensional data, like clustering, classification, and dimensionality reduction.PRE-REQUISITE:NilINDUSTRY SUPPORT:Data analysis is an essential component in any bio-pharma/healthcare industry. Data analysis in biology has already moved out of the domain of conventional statistics, and it is expected that a student of biology is acquainted with basic concepts of modern data analysis tools.INTENDED AUDIENCE:Students of different areas of Biology, Biotechnology, and allied subjects
Week 1:Basic concepts of probability and statisticsWeek 2:Basic concepts of linearalgebraWeek 3:Basics of RWeek 4:Data visualizationWeek 5:Correlation and regressionWeek 6:Clustering and classification, Correlation and regressionWeek 7:Clustering and classificationWeek 8:Analysis of higher-dimensional data