Research And Answer The Questions Submit Responses In a Separate Doc

Research and answer the questions. Submit responses in a separate document. Be sure to label the questions correctly. Choose 4 of the 5 problems.

1. The RIPPER algorithm (by Cohen [1]) is an extension of an earlier algorithm called IREP (by Fürnkranz and Widmer). Both algorithms apply the reduced-error pruning method to decide whether a rule should be pruned. Reduced-error pruning uses a validation set to estimate the generalization error of a classifier. Consider the following pair of rules:

R1: A → C
R2: A ∧ B → C

R2 is obtained by adding a new conjunct, B, to the left-hand side of R1. For this question, you will determine whether R2 is preferred over R1 from the perspectives of rule growing and rule pruning.

To determine whether a rule should be pruned, IREP computes the measure v_IREP = (p + (N − n)) / (P + N), where P is the total number of positive examples in the validation set, N is the total number of negative examples in the validation set, p is the number of positive examples in the validation set covered by the rule, and n is the number of negative examples in the validation set covered by the rule. v_IREP is the classification accuracy on the validation set, treating examples covered by the rule as predicted positive and uncovered examples as predicted negative; IREP favors rules with higher v_IREP. In contrast, RIPPER applies the measure v_RIPPER = (p − n) / (p + n). Do (a), (b), and (c) below:

(a) Suppose R1 covers 350 positive examples and 150 negative examples, while R2 covers 300 positive examples and 50 negative examples. Compute FOIL's information gain for the rule R2 with respect to R1.
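As a quick numeric check, FOIL's information gain for the extended rule can be sketched in Python (the function name is illustrative; the formula is the standard gain of the new rule relative to the old one):

```python
import math

def foil_gain(p0, n0, p1, n1):
    """FOIL's information gain when a rule covering p0 positives and
    n0 negatives is extended into one covering p1 positives and n1 negatives:
    p1 * (log2(p1/(p1+n1)) - log2(p0/(p0+n0)))."""
    return p1 * (math.log2(p1 / (p1 + n1)) - math.log2(p0 / (p0 + n0)))

# R1 covers 350 positives and 150 negatives; R2 covers 300 and 50.
gain = foil_gain(350, 150, 300, 50)
print(round(gain, 2))  # ~87.65
```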

(b) Consider a validation set with 500 positive examples and 500 negative examples. Suppose R1 covers 200 positive examples and 50 negative examples, while R2 covers 100 positive examples and 5 negative examples. Compute v_IREP for both rules and determine which rule IREP prefers.
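A sketch of this computation, assuming the standard IREP validation-set accuracy measure v_IREP = (p + (N − n)) / (P + N):

```python
def v_irep(p, n, P=500, N=500):
    # Accuracy on the validation set: covered positives (p) are correct,
    # and uncovered negatives (N - n) are correct.
    return (p + (N - n)) / (P + N)

v1 = v_irep(200, 50)   # R1
v2 = v_irep(100, 5)    # R2
print(v1, v2)          # IREP prefers the rule with the larger value
```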

(c) Compute v_RIPPER for both rules in the previous scenario. Which rule does RIPPER prefer?
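The corresponding sketch for RIPPER's pruning measure, using the v_RIPPER = (p − n) / (p + n) formula given above:

```python
def v_ripper(p, n):
    # RIPPER's pruning metric: margin of positives over negatives
    # among the validation examples the rule covers.
    return (p - n) / (p + n)

v1 = v_ripper(200, 50)   # R1
v2 = v_ripper(100, 5)    # R2
print(round(v1, 4), round(v2, 4))
```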

2. C4.5rules is an implementation of an indirect method for generating rules from a decision tree. RIPPER is an implementation of a direct method that extracts rules directly from the data. (Do both (a) and (b) below.)

(a) Discuss the strengths and weaknesses of both methods.

(b) Consider a data set with a large class imbalance (some classes are much bigger than others). Which method (C4.5rules or RIPPER) is better for finding high-accuracy rules for the small classes?

3. Consider a training set with 100 positive examples and 400 negative examples. For each of the following candidate rules:

R1: A → + (covers 4 positive and 1 negative examples)

R2: B → + (covers 30 positive and 10 negative examples)

R3: C → + (covers 100 positive and 90 negative examples)

Determine which is the best and worst candidate rule according to:

(a) Rule accuracy

(b) FOIL’s information gain

(c) The likelihood ratio statistic

(d) The Laplace measure

(e) The m-estimate measure (with k=2 and p+ = 0.2)
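The five measures above can be sketched for all three rules as follows. The likelihood-ratio statistic here uses the natural log (the usual chi-square form; some texts use log base 2, which only rescales the values), and the Laplace and m-estimate forms assume k = 2 classes:

```python
import math

P, N = 100, 400                                   # training set totals
rules = {"R1": (4, 1), "R2": (30, 10), "R3": (100, 90)}  # (p, n) covered

def accuracy(p, n):
    return p / (p + n)

def foil_gain(p, n):
    # Gain relative to the empty rule, which covers all P positives and N negatives.
    return p * (math.log2(p / (p + n)) - math.log2(P / (P + N)))

def likelihood_ratio(p, n):
    # 2 * sum_i f_i * ln(f_i / e_i), with expected counts from class priors.
    e_pos = (p + n) * P / (P + N)
    e_neg = (p + n) * N / (P + N)
    return 2 * (p * math.log(p / e_pos) + n * math.log(n / e_neg))

def laplace(p, n, k=2):
    return (p + 1) / (p + n + k)

def m_estimate(p, n, k=2, p_plus=0.2):
    return (p + k * p_plus) / (p + n + k)

for name, (p, n) in rules.items():
    print(name, round(accuracy(p, n), 3), round(foil_gain(p, n), 2),
          round(likelihood_ratio(p, n), 2), round(laplace(p, n), 3),
          round(m_estimate(p, n), 3))
```

Comparing each column across R1, R2, and R3 identifies the best and worst rule under each measure.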

(Note: These are optional extra credit questions; provide detailed calculations and reasoning.)

4. Given the Bayesian belief network in Figure 1 and the data in Table 1, perform the following:

(a) Draw the probability tables for each node.

(b) Using the network, compute P(Engine = Bad, Air Conditioner = Broken).
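For part (a), each node's probability table is estimated from the relative frequencies in Table 1. A minimal sketch of that counting step, using made-up records in place of Table 1 (the column layout, attribute names, and values here are hypothetical; substitute the actual data):

```python
from collections import Counter

# Hypothetical stand-ins for Table 1 rows: (Mileage, Engine, AirConditioner).
records = [
    ("Hi", "Good", "Working"), ("Hi", "Bad", "Broken"),
    ("Lo", "Good", "Working"), ("Lo", "Good", "Broken"),
    ("Hi", "Bad", "Working"), ("Lo", "Bad", "Broken"),
]

def cpt(child_idx, parent_idx):
    """Estimate P(child | parent) as relative frequencies over the records."""
    pair = Counter((r[parent_idx], r[child_idx]) for r in records)
    parent = Counter(r[parent_idx] for r in records)
    return {pv_cv: cnt / parent[pv_cv[0]] for pv_cv, cnt in pair.items()}

print(cpt(child_idx=1, parent_idx=0))  # e.g. P(Engine | Mileage)
```

For part (b), once the tables are filled in, P(Engine = Bad, Air Conditioner = Broken) is obtained by multiplying the relevant entries and summing out the remaining variables according to the structure in Figure 1.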

5. For the Bayesian network detailed below, compute:

(a) P(B=good, F=empty, G=empty, S=yes)

(b) P(B=bad, F=empty, G=not empty, S=no)

(c) Given B=bad, compute the probability the car will start.
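The three parts all reduce to factoring the joint as a product of each node's probability given its parents, then (for part (c)) marginalizing and normalizing. The sketch below assumes, purely for illustration, a structure in which G depends on B and F and S depends on G, with made-up CPT values; replace both the structure and the numbers with those given in the problem:

```python
# Hypothetical priors and CPTs -- illustration only.
p_b_good = 0.9                      # P(B = good)
p_f_not_empty = 0.8                 # P(F = not empty)
# P(G = empty | B, F), keyed by (b_good, f_not_empty) -- assumed values.
p_g_empty = {(True, True): 0.05, (True, False): 0.95,
             (False, True): 0.2,  (False, False): 0.99}
# P(S = yes | G), keyed by g_empty -- assumed values.
p_s_yes = {True: 0.1, False: 0.95}

def joint(b_good, f_empty, g_empty, s_yes):
    """P(B, F, G, S) = P(B) P(F) P(G | B, F) P(S | G) under the assumed structure."""
    pb = p_b_good if b_good else 1 - p_b_good
    pf = (1 - p_f_not_empty) if f_empty else p_f_not_empty
    pg = p_g_empty[(b_good, not f_empty)]
    if not g_empty:
        pg = 1 - pg
    ps = p_s_yes[g_empty] if s_yes else 1 - p_s_yes[g_empty]
    return pb * pf * pg * ps

# Part (a)-style query with the hypothetical numbers:
print(joint(b_good=True, f_empty=True, g_empty=True, s_yes=True))

# Part (c)-style conditional: P(S = yes | B = bad), marginalizing F and G.
num = sum(joint(False, f, g, True) for f in (True, False) for g in (True, False))
print(num / (1 - p_b_good))
```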

References

  • Cohen, W. W. (1995). Fast Effective Rule Induction. Proceedings of the Twelfth International Conference on Machine Learning.
  • Fürnkranz, J., & Widmer, G. (1994). Incremental reduced error pruning. Proceedings of the Eleventh International Conference on Machine Learning.
  • Quinlan, J. R. (1993). C4.5: Programs for Machine Learning. Morgan Kaufmann.
  • Friedman, N. (2004). Inferring causal relationships from observational data. Proceedings of the National Academy of Sciences.
  • Kohavi, R. (1995). The power of simplicity: A review of decision tree classifiers. Machine Learning Journal.
  • Friedman, N., et al. (1997). Bayesian network classifiers. Machine Learning.
  • Koller, D., & Friedman, N. (2009). Probabilistic Graphical Models: Principles and Techniques. MIT Press.
  • Murphy, K. P. (2012). Machine Learning: A Probabilistic Perspective. MIT Press.
  • Russell, S., & Norvig, P. (2016). Artificial Intelligence: A Modern Approach. Pearson.