Coursera Genomic Data Science and Clustering (Bioinformatics V)

University of California, San Diego via Coursera

  • Overview
  1. Coursera
    Platform:
    Coursera
    Provider:
    University of California, San Diego
    Effort:
    4-10 hours a week
    Length:
    3 weeks
    Language:
    English
    Credentials:
    Paid Certificate Available
    Part of:
    Bioinformatics Specialization
    Overview
    How do we infer which genes orchestrate various processes in the cell? How did humans migrate out of Africa and spread around the world? In this class, we will see that these two seemingly different questions can be addressed using similar algorithmic and machine learning techniques arising from the general problem of dividing data points into distinct clusters.

    In the first half of the course, we will introduce algorithms for clustering a group of objects into a collection of clusters based on their similarity, a classic problem in data science, and see how these algorithms can be applied to gene expression data.

    In the second half of the course, we will introduce another classic tool in data science called principal components analysis that can be used to preprocess multidimensional data before clustering in an effort to greatly reduce the number dimensions without losing much of the "signal" in the data.

    Finally, you will learn how to apply popular bioinformatics software tools to solve a real problem in clustering.

    Syllabus
    • Week 1: Introduction to Clustering Algorithms
    • Week 2: Advanced Clustering Techniques
    • Week 3: Introductory Algorithms in Population Genetics
    Taught by
    Pavel Pevzner and Phillip Compeau

Share This Page



  1. This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
    By continuing to use this site, you are consenting to our use of cookies.
    Dismiss Notice