TrueschoTruescho
All Courses
Big Data Science with the BD2K-LINCS Data Coordination and Integration Center
Coursera
Course
Unknown

Big Data Science with the BD2K-LINCS Data Coordination and Integration Center

Icahn School of Medicine at Mount Sinai

Study molecular network responses in human cells to diverse perturbations using integrated big data analysis.

Unknown14 weeksEnglish6,473 enrolled

About this Course

The Library of Integrative Network-based Cellular Signatures (LINCS) was an NIH Common Fund program that lasted for 10 years from 2012-2021. The idea behind the LINCS program was to perturb different types of human cells with many different types of perturbations such as drugs and other small molecules, genetic manipulations such as single gene knockdown, knockout, or overexpression, manipulation of the extracellular microenvironment conditions, for example, growing cells on different surfaces, and more. These perturbations are applied to various types of human cells including cancer cell lines or induced pluripotent stem cells (iPSCs) from patients, differentiated into various lineages such as neurons or cardiomyocytes. Then, to better understand the molecular networks that are affected by these perturbations, changes in levels of many different molecules within the human cells were measured including: mRNAs, proteins, and metabolites, as well as cellular phenotypic changes such as cell morphology. The BD2K-LINCS Data Coordination and Integration Center (DCIC) was commissioned to organize, analyze, visualize, and integrate this data with other publicly available relevant resources. In this course, we introduce the LINCS DCIC and the various Data and Signature Generation Centers (DSGCs) that collected data for LINCS. We then cover the LINCS metadata, and how the metadata is linked to ontologies and dictionaries. We then present the data processing and data normalization methods used to clean and harmonize the LINCS data. This follows by discussions about how the LINCS data is served with RESTful APIs. Most importantly, the course covers computational bioinformatics methods that can be applied to other multi-omics datasets and projects including dimensionality reduction, clustering, gene-set enrichment analysis, interactive data visualization, and supervised learning. Finally, we introduce crowdsourcing/citizen-science projects where students can work together in teams to extract gene expression signatures from public databases, and then query such collections of signatures against the LINCS data for predicting small molecules as potential therapeutics for a collection of complex human diseases

What You'll Learn

  • Understand molecular network alterations due to perturbations
  • Measure molecular level changes in cells
  • Analyze phenotypic cellular changes
  • Coordinate and manage big data
  • Integrate data with additional relevant resources

Prerequisites

  • Basic topic familiarity
  • Readiness for applied exercises

Instructors

A

Avi Ma’ayan, PhD

Director, Mount Sinai Center for Bioinformatics

Topics

Basic Science
Health
Data Analysis
Data Science
Data Integration
Unsupervised Learning
Bioinformatics
Big Data
Metadata Management
Machine Learning

Course Info

PlatformCoursera
LevelUnknown
PacingUnknown
PriceFree

Skills

العلوم الأساسية
الصحة
تحليل البيانات
علوم البيانات
دمج البيانات
التعلم غير المراقب
المعلوماتية الحيوية
البيانات الضخمة
Metadata Management
Machine Learning

Start Learning Now