You Have Been Hired By A North American Trucking Firm To Hel

You Have Been Hired By A North American Trucking Firm To Help Reduce T

You have been hired by a North American trucking firm to help reduce the number of deliveries that show up to customers later than their expected due date. The company is expecting you to develop a project that will use the company’s own data to solve this problem. Using the CRISP- DM framework, answer the questions below.

Part 1

You spend several hours in a meeting with the company learning how their business works. You learn that the company receives its orders from a freight brokerage, which tells them where to pick up and where to drop off every load. Drivers are fined every time a load is delivered later than is expected. This is an example of which stage in the CRISP-DM framework? Choose the best answer.

  • Business Understanding
  • Evaluation and/or Deployment
  • Data Understanding
  • Modeling
  • Data Preparation

Part 2

The data that you receive from the company makes no sense! There are many suspicious outliers. According to some of the entries, some loads were delivered 4000 hours early, others were 9200 hours late?! What does that even mean? You decide to go through the data and delete entries that seem suspicious. This is an example of which stage in the CRISP-DM framework? Choose the best answer.

  • Business Understanding
  • Evaluation and/or Deployment
  • Data Understanding
  • Modeling
  • Data Preparation

Part 3

You decide to identify groups of drivers according to the features available to you in the data provided to you. To identify these groups, you decide on an unsupervised learning approach. You decide that a clustering algorithm will work very well here. This is an example of which stage in the CRISP-DM framework? Choose the best answer.

  • Business Understanding
  • Evaluation and/or Deployment
  • Data Understanding
  • Modeling
  • Data Preparation

Part 4

The company invites you to their headquarters to share with them how you think that they can reduce the number of late deliveries. They are excited to learn from you and to change their business practices according to the results of your model and your insights. This is an example of which stage in the CRISP-DM framework? Choose the best answer.

  • Business Understanding
  • Evaluation and/or Deployment
  • Data Understanding
  • Modeling
  • Data Preparation

Part 5

You present to them your analysis. You have discovered that there are 3 distinct clusters of drivers working at the company. In one cluster are 1050 drivers who are old and come from American Midwestern states. In another cluster, there are 630 drivers who are young and come from Western states. In the final cluster are 129 drivers of all ages from Canada. Given the business goals of this project, did your cluster analysis address the company’s concerns? Choose the best answer.

  • No
  • Yes

Practice Assignment 2

Part 1

According to the lecture, a database can be best described as a collection of tables. True or False.

  • True
  • False

Part 2

In crow's foot notation, which relationship is indicated by the notation above? Choose the best answer.

  • One (and only one)
  • Zero or one
  • One or many
  • Zero or many

Part 3

Primary keys are attributes stored in a dependent entity which show how records in the dependent entity are related to the independent identity. True or False.

  • True
  • False

Part 4

In a relational database, rows will always be retrieved from the database and presented in the same chronological order that they were input. True or False.

  • True
  • False

Part 5

An entity is an object, concept or event that is being modelled in the database as a table. True or False.

  • True
  • False

References

  • Chapman, P., & Cooper, J. (2019). Data Analytics for Lean Six Sigma. CRC Press.
  • Fayyad, U., Piatetsky-Shapiro, G., & Smyth, P. (1996). From Data Mining to Knowledge Discovery in Databases. AI Magazine, 17(3), 37–54.
  • Kimball, R., & Ross, M. (2013). The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling. John Wiley & Sons.
  • Marchette, S. (2015). Data Science from Scratch: First Principles with Python. O'Reilly Media.
  • Murphy, K. P. (2012). Machine Learning: A Probabilistic Perspective. MIT Press.
  • Rouse, M. (2021). What Is CRISP-DM? [Definition, Phases & Models]. SearchBusinessAnalytics. https://searchbusinessanalytics.techtarget.com/definition/CRISP-DM
  • Shmueli, G., Bruce, P. C., Gedeck, P., & Ritchie, S. (2020). Data Mining for Business Analytics: Concepts, Techniques, and Applications in Python. Wiley.
  • Tan, P.-N., Steinbach, M., & Kumar, V. (2018). Introduction to Data Mining (2nd Edition). Pearson.
  • Wilkinson, L., & Friendly, M. (2009). The history of the table graphical display in statistics. The American Statistician, 63(2), 97-108.
  • Wirth, R., & Hipp, H. (2000)./crisp-DM: Towards a standard process model for data mining. ACM SIGKDD Explorations Newsletter, 2(2), 1-8.