1. To support our site, MoocLab may be compensated by some course providers through affiliate links.

FutureLearn Managing Big Data with R and Hadoop

Partnership for Advanced Computing in Europe via FutureLearn

  • Overview
  1. FutureLearn
    Platform:
    FutureLearn
    Provider:
    Partnership for Advanced Computing in Europe
    Length:
    5 weeks
    Effort:
    4 hours/week
    Language:
    English
    Credentials:
    Paid Certificate Available
    Overview
    This course will give you access to a virtual environment with installations of Hadoop, R and Rstudio to get hands-on experience with big data management. Several unique examples from statistical learning and related R code for map-reduce operations will be available for testing and learning.

    Those with basic knowledge in statistical learning and R will better understand the methods behind and how to run them in parallel using map-reduce functions and Hadoop data storage. At the end of the course you will get access to RHadoop on a supercomputer at University of Ljubljana.

    This course is designed for people interested in data science, computational statistics and machine learning and have basic experiences with them. It will be also useful for advanced undergraduate students and first year PhD students in data analysis, statistics or bioinformatics, who wish to understand how to manage big data with Hadoop using R programming language.

    We expect that the learners will also have basic experiences with linux, bash and R and are capable to download and run virtual machine.

    All software needed to actively participate the course is provided within the virtual machine that the followers are supposed to download and run on the local machine. No extra software is needed. You will need a modest local machine with 15GB free disk space and 2GB RAM.

    What topics will you cover?
    • Welcome to BIG DATA
    • Working with Hadoop
    • First steps in R and RHadoop
    • Statistical learning with RHadoop: clustering
    • Statistical learning with RHadoop: regression and classification
    Taught by
    Janez Povh, Biljana Mileva Boshkoska and Leon Kos

Share This Page



  1. This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
    By continuing to use this site, you are consenting to our use of cookies.
    Dismiss Notice