INSS 662 Project You Are Required To Use Weka Or Other Open

Question

Inss 662 Project You Are Required To Usewekaor Other Open Sourced Inss 662 Project : You are required to use Weka or other open source data mining software, not Watson. 1. Find an open dataset on the Internet, see below 2. Conduct appropriate data mining activities and report the processes and outcomes . You need to use at least three competitive algorithms from the same or different classes of Data mining or Machine Learning techniques. See the lecture slides for Chapter 5 for algorithms and techniques under course materials. 3. Present your results. Possible data sets to choose from: Data from Hackathon such as the Lord of the Machines - Data Science Hackathon , DataHack Premier League , Mckinsey Analytics Online Hackathon , etc. (Be brave to take challenging problem ...) Possible sources are: a. b. c. (Datasets for Data Mining and Data Science) d. open government dataset @ e. Dataset from URL: OR f. Other datasets after getting approval from the instructors.

Dr. Jack HW Helper · Accepted Answer

The project assigned in INSS 662 requires students to employ open source data mining tools, such as Weka or alternatives, to analyze an openly available dataset. The core objective is to conduct meaningful data mining activities utilizing at least three diverse algorithms from different classes of machine learning or data mining techniques. This comprehensive process involves identifying a suitable dataset, performing data preprocessing, applying multiple algorithms, and analyzing the results to draw insightful conclusions. Selection of Dataset The first step involves choosing an appropriate dataset. Students are encouraged to explore datasets from hackathons, government portals, or other credible sources. For example, datasets from events like the Lord of the Machines - Data Science Hackathon, DataHack Premier League, or McKinsey Analytics Online Hackathon are suitable, especially if they pose challenging problems that can demonstrate the versatility of different algorithms. Additional options include open government datasets, datasets from specific URLs, or other datasets approved by instructors. The key is to select a dataset that is rich enough to allow the application of multiple algorithms and generate meaningful insights. Data Mining Activities and Methodology The core of this project revolves around performing a series of data mining activities that include data cleaning, transformation, and feature selection, followed by the application of multiple algorithms. The student should document each step meticulously, including rationale for preprocessing techniques, parameter settings, and choice of algorithms. The algorithms selected should belong to different classes, such as decision trees, neural networks, clustering algorithms, or ensemble methods, to exhibit a broad coverage of techniques. For example, applying a decision tree classifier like C4.5, a clustering algorithm like K-Means, and a neural network such as Multi-Layer Perceptron demonstrates a divers

INSS 662 Project You Are Required To Use Weka Or Other Open

Inss 662 Project You Are Required To Usewekaor Other Open Sourced

Paper For Above instruction

References

Inss 662 Project You Are Required To Usewekaor Other Open Sourced

Paper For Above instruction

References

Related Assignments