Create And Present Data Analysis Situation

Question

Create And Presenting Data Analysissituationanalyze The Data Gathered Create and presenting data analysis Situation: Analyze the data gathered for the Center for Disease Control and Prevention (CDC) social vulnerability data and data dictionary (CDC, 2018a; CDC, 2018b), in use for determining the resiliency of communities within specific states: Alabama, Nebraska, and Georgia. Objective: Explore the dataset, considering the state, counties, and population, and four categories: socioeconomic features, household and composition disability features, and minority status and language limitations, and housing types and transportation. In the interest of clarity, I will specify the variates associated with these categories Socioeconomic o Persons below the poverty estimate o Civilian unemployed estimate o Per capita income estimate o Persons with no high school diploma Household and composition disability features o Ages 65 and older o Ages 17 and under o Persons with a disability, over the age of 5 o Single-parent households Minority status and language limitations o Persons with minority status o Persons with no or minimal use of the English language Housing types and transportation o Multi-unit dwellings (10 or more units) o Mobile homes o Homes with more residents than a home is designed for o Homes with no vehicle o Group quarters or institutionalized quarters Note: Do not use the columns that are follow-on calculations of these columns. These are the columns with the prefix “E_â€. Consider the following research questions: How do these factors relate to the measure of social vulnerability (in the data set at RPL_THEMES) metric analytically? By the CDC standards, the closer the value is to one, the higher the vulnerability (CDC, 2018b). What patterns can be found when looking at different aspects of the data features? • How do different characteristics of the data relate? • How well do these variates represent the vulnerability? • Which characteristics have a more sig

Dr. Jack HW Helper · Accepted Answer

This analysis aims to explore the CDC's social vulnerability data to understand the factors influencing community resilience in Alabama, Nebraska, and Georgia. The social vulnerability index (SVI), a composite measure capturing various social, economic, and housing factors, serves as the primary outcome variable. The goal is to develop a novel analytical approach to assess how different socio-demographic and infrastructural features relate to community vulnerability, moving beyond the CDC's standard calculations. The dataset comprises multiple variables categorized into four groups: socioeconomic features, household and disability features, minority status and language limitations, and housing types and transportation. Key variables include poverty levels, unemployment rates, income, educational attainment, age demographics, disability prevalence, minority status, English proficiency, housing types, vehicle availability, and group quarters, among others. The analysis begins with data collection from the CDC's publicly available datasets, followed by a rigorous data cleaning process focused on preserving data integrity without transforming, removing outliers, or deleting NAs, aligning with the instructions to justify any such actions. The initial phase involves exploratory data analysis (EDA) to identify patterns and relationships among variables. I will perform at least five different types of visualizations and analyses, such as correlation matrices, scatterplots, boxplots, and heatmaps, each interpreted thoroughly to elucidate critical trends and interconnectedness among features. These explorations aim to understand the data's structure, outliers, and potential multicollinearity, providing a foundation for subsequent modeling. Following the EDA, the analysis proceeds with model development. The dataset will be split into training (80%) and testing (20%) sets to validate the models effectively. A random forest model will be developed to identify and quantify the i

Create And Present Data Analysis Situation

Create And Presenting Data Analysissituationanalyze The Data Gathered

Paper For Above instruction

References

Create And Presenting Data Analysissituationanalyze The Data Gathered

Paper For Above instruction

References

Related Assignments