Stat 240 Lab 07 Dr Lloyd T. Elliott March 16, 2020 Visualiza

Question

This assignment requires creating a Shiny app that visualizes gene expression data. It replicates a live-coded demo but uses the dataset GSE21935 from NCBI, which contains gene expression profiles from subjects with and without schizophrenia. The task involves downloading the dataset, extracting the relevant data, constructing data structures in R, and developing an interactive visualization with density plots that compare gene expression distributions between the schizophrenic and non-schizophrenic groups. A simpler alternative version allows using a different dataset with a similar binary grouping and continuous variables.

Dr. Jack HW Helper · Accepted Answer

Understanding the relationship between gene expression and psychiatric conditions such as schizophrenia has been a significant focus of biomedical research. Shiny applications in R make it possible to explore such datasets interactively. This answer walks through creating an interactive density plot visualization for gene expression data, replicating the live coding demonstration with the GSE21935 dataset from NCBI.

The first step is acquiring and preparing the data. The GSE21935 dataset, available through NCBI's Gene Expression Omnibus (GEO), contains microarray gene expression measurements for individuals diagnosed with schizophrenia and for control subjects. The GEOquery package can download the dataset directly into R, giving access to the expression data along with clinical indicators such as schizophrenia diagnosis. Note that the expression matrix extracted from a GEO series has features (probes or genes) in rows and samples in columns, so it has to be transposed to obtain the layout used here, with rows representing samples and columns representing genes.

For simplicity and illustration, select the first ten genes listed in the dataset, along with their gene names. Construct a data frame 'x' in which each row corresponds to a sample and each column is named after one of the selected genes, containing the expression level for each sample-gene pair. Alongside it, create a vector 'y' encoding the diagnosis: '1' for schizophrenic subjects and '0' for controls, with each entry aligned to the corresponding row of 'x'. This alignment is crucial, because the density plots split the values in 'x' by the labels in 'y'.

The core component of the project is a Shiny app that lets users select a gene and examine its expression distribution in the two groups. The app features a dropdown menu populated with the gene names taken from the columns of 'x'. Upon selection
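The data structures and the dropdown-driven density plot described above could be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the lab's reference solution: the values are simulated stand-ins rather than GSE21935, the gene names `gene_1` to `gene_10` are hypothetical, and the helper names `plot_gene_density` and `gene_density_app` are made up for illustration. The commented `GEOquery::getGEO` lines only indicate where the real download would go; which column of the phenotype table holds the diagnosis has to be checked by inspecting the data.

```r
# Stand-in data: SIMULATED values, not GSE21935. With the real dataset one
# would instead start from something like
#   gse  <- GEOquery::getGEO("GSE21935", GSEMatrix = TRUE)[[1]]
#   expr <- Biobase::exprs(gse)   # features in rows, samples in columns
# and derive y from Biobase::pData(gse) after inspecting its columns.
set.seed(240)
n_samples  <- 40
gene_names <- paste0("gene_", 1:10)         # hypothetical gene names
y <- rep(c(0, 1), each = n_samples / 2)     # 0 = control, 1 = schizophrenia
x <- as.data.frame(matrix(rnorm(n_samples * 10, mean = 8), nrow = n_samples,
                          dimnames = list(NULL, gene_names)))
x$gene_1 <- x$gene_1 + 0.8 * y              # give one gene a group shift

# Overlay the two group-wise kernel density estimates for one gene.
plot_gene_density <- function(x, y, gene) {
  stopifnot(gene %in% names(x), nrow(x) == length(y))
  d0 <- density(x[[gene]][y == 0])
  d1 <- density(x[[gene]][y == 1])
  plot(d0, col = "steelblue", lwd = 2, main = gene,
       xlab = "Expression level",
       xlim = range(d0$x, d1$x), ylim = range(0, d0$y, d1$y))
  lines(d1, col = "firebrick", lwd = 2)
  legend("topright", legend = c("Control (y = 0)", "Schizophrenia (y = 1)"),
         col = c("steelblue", "firebrick"), lwd = 2, bty = "n")
  invisible(list(control = d0, schizophrenia = d1))
}

# Wrap the plot in a Shiny app with a gene dropdown (needs the shiny package).
gene_density_app <- function(x, y) {
  ui <- shiny::fluidPage(
    shiny::titlePanel("Gene expression by diagnosis"),
    shiny::selectInput("gene", "Gene", choices = names(x)),
    shiny::plotOutput("density")
  )
  server <- function(input, output) {
    # Redrawn automatically whenever the selected gene changes.
    output$density <- shiny::renderPlot(plot_gene_density(x, y, input$gene))
  }
  shiny::shinyApp(ui = ui, server = server)
}
```

In an interactive session, `shiny::runApp(gene_density_app(x, y))` would launch the app. Keeping the plotting logic in a plain function, separate from the Shiny wrapper, means it can be checked on its own before any reactive code is involved.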
