DSBA/MBAD 6211 Assignment 1 Due: 11:59pm On 2/18/2021

Question

In the fall of 2019, the administration of a large private university requested that the Office of Enrollment Management and the Office of Institutional Research work together to identify prospective students who would most likely enroll as new freshmen in the Fall 2020 semester. Historically, inquiries numbered about 90,000+ students, and the university enrolled from 2400 to 2800 new freshmen each Fall semester. It was decided that inquiries for Fall 2019 would be used to build the model to help shape the Fall 2020 freshman class. The data set INQ2019 was built over a period of several months in consultation with Enrollment Management.

Please carefully explore all variables and build a predictive model for better enrollment management. Please apply regression and decision tree models to analyze the data.

Variable and model naming requirements: Please include your name initials in the data frame names as well as the model names in your R coding. For instance, in my coding, I would name the data frames dfKZ, dfKZ.train, and dfKZ.valid. I would also name the models regressionKZ, treeKZ, etc.

Please submit a Word document including:

1. A table showing the overall structure of the dataset, including variable names, data types, and whether the variables will be used in your analyses. Also, please answer questions c, d, e.

a. The nominal variables ACADEMIC_INTEREST_1, ACADEMIC_INTEREST_2, and IRSCHOOL were rejected because they were replaced by the interval variables INT1RAT, INT2RAT, and HSCRAT, respectively. For example, academic interest codes 1 and 2 were replaced by the percentage of inquirers over the past five years who indicated those interest codes and then enrolled. The variable IRSCHOOL is the high school code of the student, and it was replaced by the percentage of inquirers from that high school over the last five years who enrolled.

b. CONTACT_CODE1 and CONTACT_DATE1 are also rejected due to

Dr. Jack HW Helper · Accepted Answer

The growing demand for higher education continues to push universities toward data-driven strategies for attracting prospective students. In this context, the administration of a large private university asked the Office of Enrollment Management and the Office of Institutional Research to work together on a predictive model that identifies students likely to enroll as freshmen in the Fall 2020 semester. The model was built on inquiries for Fall 2019. Historically, the university has received more than 90,000 inquiries and enrolled between 2400 and 2800 new freshmen each fall.

Understanding the Dataset

The dataset INQ2019 contains multiple variables describing each prospective student's inquiry. These include academic interest codes, contact details, and measures such as SATSCORE and HSCRAT, among others. To comply with the assignment's requirements, the first step was to examine the overall structure of the data: the variable names, their data types, and which variables would be used in the predictive analyses.

The nominal variables ACADEMIC_INTEREST_1, ACADEMIC_INTEREST_2, and IRSCHOOL were replaced by the interval variables INT1RAT, INT2RAT, and HSCRAT, respectively. Each replacement is the percentage of inquirers over the past five years with that interest code, or from that high school, who went on to enroll, so the models work with a historical enrollment rate instead of a raw nominal code. The variables CONTACT_CODE1 and CONTACT_DATE1 were also rejected as irrelevant, on the advice of Enrollment Management.

The target variable is ENROLL, which indicates whether a student enrolled (1) or not (0) in Fall 2019. In the next phase, I assessed whether any additional variables should be rejected. After careful consideration, I concluded that the variable DISTANCE could also be excluded from the analyses, since it may not provide significant insights into


Paper For Above Instructions

Load necessary libraries

Load dataset

Data pre-processing

Handling missing values
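
The pre-processing and missing-value steps might look roughly like the R sketch below. The INQ2019 file is not available here, so the data frame is synthetic and every value is invented. The name dfXX follows the assignment's initials convention with placeholder initials, and structureXX is a hypothetical helper table. Median imputation with a missing-value indicator is one common choice and is an assumption, not necessarily what the original answer did.

```r
# Synthetic stand-in for INQ2019 (all values are invented for illustration).
set.seed(6211)
n <- 200
dfXX <- data.frame(
  ENROLL   = rbinom(n, 1, 0.3),            # target: 1 = enrolled, 0 = not
  INT1RAT  = runif(n, 0, 0.2),             # interval input named in the assignment
  HSCRAT   = runif(n, 0, 0.5),             # interval input named in the assignment
  SATSCORE = round(rnorm(n, 1100, 150))    # named in the answer text
)
dfXX$SATSCORE[sample(n, 40)] <- NA         # inject missing values to impute later

# Structure table: variable name, data type, and number of missing values.
structureXX <- data.frame(
  variable  = names(dfXX),
  type      = vapply(dfXX, function(x) class(x)[1], character(1)),
  n_missing = vapply(dfXX, function(x) sum(is.na(x)), integer(1)),
  row.names = NULL
)
print(structureXX)

# Median imputation, with an indicator column recording which rows were imputed.
for (v in names(dfXX)) {
  if (is.numeric(dfXX[[v]]) && anyNA(dfXX[[v]])) {
    dfXX[[paste0("M_", v)]] <- as.integer(is.na(dfXX[[v]]))
    dfXX[[v]][is.na(dfXX[[v]])] <- median(dfXX[[v]], na.rm = TRUE)
  }
}
```

Keeping the indicator column (here M_SATSCORE) lets a later model pick up on whether the fact that a value was missing is itself predictive.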

Regression model

Summary of Regression Model
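
The regression step and its summary could be sketched as follows. Because ENROLL is binary, the sketch assumes that "regression" means logistic regression. The data are synthetic, the relationship between HSCRAT and ENROLL is invented, and the 60/40 split and 0.5 cutoff are assumptions. The names dfXX.train, dfXX.valid, and regressionXX follow the assignment's naming convention with placeholder initials.

```r
# Synthetic stand-in for INQ2019 (all values are invented for illustration).
set.seed(6211)
n <- 500
dfXX <- data.frame(
  HSCRAT   = runif(n, 0, 0.5),
  INT1RAT  = runif(n, 0, 0.2),
  SATSCORE = rnorm(n, 1100, 150)
)
# Invented relationship so the model has a signal to find.
dfXX$ENROLL <- rbinom(n, 1, plogis(-2 + 6 * dfXX$HSCRAT))

# 60/40 training/validation partition (the ratio is an assumption).
train_idx  <- sample(n, size = round(0.6 * n))
dfXX.train <- dfXX[train_idx, ]
dfXX.valid <- dfXX[-train_idx, ]

# Logistic regression: binary target, so use the binomial family.
regressionXX <- glm(ENROLL ~ HSCRAT + INT1RAT + SATSCORE,
                    data = dfXX.train, family = binomial)
summary(regressionXX)   # coefficients, standard errors, and z-tests

# Validation accuracy at a 0.5 probability cutoff.
p_valid   <- predict(regressionXX, newdata = dfXX.valid, type = "response")
pred      <- as.integer(p_valid > 0.5)
acc_valid <- mean(pred == dfXX.valid$ENROLL)
print(acc_valid)
```

With a target as rare as enrollment among inquirers, a 0.5 cutoff may classify almost everyone as a non-enrollee, so a lower cutoff or a ranking-based measure may be more informative on the real data.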

Decision Tree Model
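
A classification tree for the same task could be sketched with the rpart package, which ships with standard R distributions. As before, the data are synthetic, the relationship is invented, and treeXX uses placeholder initials. The cp and minbucket settings are illustrative assumptions, not values from the original answer.

```r
library(rpart)  # recommended package distributed with R

# Synthetic stand-in for INQ2019 (all values are invented for illustration).
set.seed(6211)
n <- 500
dfXX <- data.frame(
  HSCRAT  = runif(n, 0, 0.5),
  INT1RAT = runif(n, 0, 0.2)
)
# A factor target makes rpart build a classification tree.
dfXX$ENROLL <- factor(rbinom(n, 1, plogis(-2 + 6 * dfXX$HSCRAT)))

# 60/40 training/validation partition (the ratio is an assumption).
train_idx  <- sample(n, size = round(0.6 * n))
dfXX.train <- dfXX[train_idx, ]
dfXX.valid <- dfXX[-train_idx, ]

# Fit the tree; cp and minbucket limit how far the tree grows.
treeXX <- rpart(ENROLL ~ ., data = dfXX.train, method = "class",
                control = rpart.control(cp = 0.01, minbucket = 10))
print(treeXX)

# Confusion matrix on the validation partition.
pred_class <- predict(treeXX, newdata = dfXX.valid, type = "class")
confusion  <- table(actual = dfXX.valid$ENROLL, predicted = pred_class)
print(confusion)
```

Comparing this confusion matrix with the regression model's validation results on the same partition is one straightforward way to choose between the two models.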
