CMSC 510 Fall 2020: Homework Assignment 3


Implement and test a logistic regression model with L1 regularization using proximal gradient descent. Use TensorFlow to develop your solution. Your implementation should classify two digits from the MNIST dataset. Incorporate soft-thresholding as the proximal operator for the L1 penalty. Experiment with various gradient step sizes and regularization parameters. Document your code and results thoroughly: test the model with decreasing training set sizes, specify which two digits you used as classes, and provide a final model trained on the full-size training set. Submit your final report as a PDF, and include your name in a comment at the top of the code file.

Paper for the Above Instructions

In this assignment, the goal is to develop a logistic regression classifier with L1 regularization, trained by proximal gradient descent and implemented in TensorFlow. The classifier distinguishes two specific digits from the MNIST dataset, such as '3' and '7', in order to evaluate how effectively L1 regularization produces sparse solutions and performs feature selection.

Understanding the Problem and Theoretical Foundations

Logistic regression (LR) is a widely used supervised learning algorithm for binary classification. It models the probability that a given input belongs to a class through the logistic (sigmoid) function. Mathematically, LR predicts the probability p of class 1 as:

p = sigmoid(w^T x + b)

where w is the weight vector, and b is the bias term. The loss function used during training is the logistic loss (also known as cross-entropy loss):

L(w, b) = -∑ [ y_i log p_i + (1 - y_i) log (1 - p_i) ]

However, to induce sparsity in the model coefficients and perform feature selection, L1 regularization is added:

L_total(w, b) = L(w, b) + λ ||w||_1

where λ is a regularization parameter controlling the sparsity level. Notably, the L1 norm is not differentiable at zero, which makes standard gradient descent unsuitable.
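
As a concrete illustration, the smooth loss and the full regularized objective can be written in a few lines of TensorFlow. The following is a minimal sketch assuming TensorFlow 2.x; the names (logistic_loss, l1_objective, lam) are illustrative and not part of the provided course code, and the loss is averaged over samples rather than summed, which only rescales λ and the step size.

```python
import tensorflow as tf

def logistic_loss(w, b, X, y):
    """Smooth part: mean logistic (cross-entropy) loss."""
    logits = tf.linalg.matvec(X, w) + b  # w^T x + b for every row of X
    # Numerically stable form of -[y log p + (1 - y) log(1 - p)]
    return tf.reduce_mean(
        tf.nn.sigmoid_cross_entropy_with_logits(labels=y, logits=logits))

def l1_objective(w, b, X, y, lam):
    """Full non-smooth objective: logistic loss + lam * ||w||_1."""
    return logistic_loss(w, b, X, y) + lam * tf.reduce_sum(tf.abs(w))
```

In proximal gradient descent only the smooth logistic loss is differentiated; the L1 term is handled by the proximal step described next.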

Proximal Gradient Descent for L1 Regularization

Proximal gradient descent extends standard gradient methods to objectives that contain a non-smooth term, such as the L1 penalty. Each iteration involves two steps:

  1. Gradient step on the smooth part (logistic loss).
  2. Proximal (or soft-thresholding) step on the non-smooth regularizer.

The proximal operator for the L1 norm is the soft-thresholding function:

S_{λ}(z) = sign(z) * max(|z| - λ, 0)

which shrinks coefficients toward zero. In proximal gradient descent, the threshold applied at each iteration is the product of the gradient step size and λ, so repeated iterations drive many weights exactly to zero and yield a sparse solution.
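
In TensorFlow, the soft-thresholding operator is an elementwise one-liner. This is a minimal sketch with illustrative names; as noted above, the threshold passed in during training is typically the product of the step size and λ.

```python
import tensorflow as tf

def soft_threshold(z, threshold):
    """Proximal operator of threshold * ||.||_1, applied elementwise:
    sign(z) * max(|z| - threshold, 0)."""
    return tf.sign(z) * tf.maximum(tf.abs(z) - threshold, 0.0)
```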

Implementation Details

The key components of the implementation involve:

  • Using TensorFlow to perform gradient computations. Define placeholders or datasets for features and labels.
  • Calculating the logistic loss and its gradient with respect to weights.
  • Implementing the proximal step after each gradient update using soft-thresholding.
  • Experimenting with multiple step sizes (learning rates) and λ values to observe their impacts.
  • Testing on MNIST data, specifically selecting two digits to form a binary classification task (a minimal end-to-end sketch follows this list).
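
The pieces above fit together as follows. This is a minimal end-to-end sketch, not the provided course scripts: it assumes TensorFlow 2.x, and the digit pair, step size, λ, and iteration count are illustrative choices that you should vary in your own experiments.

```python
# Minimal sketch: L1-regularized logistic regression on two MNIST digits,
# trained by proximal gradient descent (gradient step + soft-thresholding).
import numpy as np
import tensorflow as tf

DIGIT_A, DIGIT_B = 3, 7      # the two classes; any distinct pair of digits works
STEP_SIZE = 0.1              # gradient step size (learning rate)
LAM = 1e-3                   # L1 regularization strength (lambda)
N_ITER = 500                 # number of proximal gradient iterations

# Load MNIST and keep only the two chosen digits as a binary problem.
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()

def make_binary(x, y):
    mask = (y == DIGIT_A) | (y == DIGIT_B)
    x = x[mask].reshape(-1, 28 * 28).astype(np.float32) / 255.0
    y = (y[mask] == DIGIT_B).astype(np.float32)   # 0 for DIGIT_A, 1 for DIGIT_B
    return tf.constant(x), tf.constant(y)

X_tr, y_tr = make_binary(x_train, y_train)
X_te, y_te = make_binary(x_test, y_test)

w = tf.Variable(tf.zeros([28 * 28]), name="weights")
b = tf.Variable(0.0, name="bias")

def soft_threshold(z, threshold):
    return tf.sign(z) * tf.maximum(tf.abs(z) - threshold, 0.0)

for it in range(N_ITER):
    with tf.GradientTape() as tape:
        logits = tf.linalg.matvec(X_tr, w) + b
        loss = tf.reduce_mean(
            tf.nn.sigmoid_cross_entropy_with_logits(labels=y_tr, logits=logits))
    grad_w, grad_b = tape.gradient(loss, [w, b])

    # Step 1: gradient step on the smooth logistic loss.
    w_tmp = w - STEP_SIZE * grad_w
    b.assign_sub(STEP_SIZE * grad_b)              # the bias is not regularized

    # Step 2: proximal (soft-thresholding) step on the L1 term.
    w.assign(soft_threshold(w_tmp, STEP_SIZE * LAM))

# Evaluate accuracy and sparsity on the held-out test digits.
pred = tf.cast(tf.sigmoid(tf.linalg.matvec(X_te, w) + b) > 0.5, tf.float32)
accuracy = tf.reduce_mean(tf.cast(pred == y_te, tf.float32))
n_zero = int(tf.reduce_sum(tf.cast(tf.equal(w, 0.0), tf.int32)))
print(f"test accuracy: {float(accuracy):.4f}, zero weights: {n_zero} / 784")
```

Only the weight vector passes through the soft-thresholding operator; the bias receives a plain gradient step because it is not penalized.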

Using and Modifying Existing Code

Leverage the provided code snippets: tensorflow_minimizeF.py, which performs projected gradient descent on simple functions by alternating gradient steps with projections onto a feasible set, and tensorflow_leastSquares.py, which solves a regression task. Adapt these scripts to logistic regression and incorporate the proximal step with soft-thresholding.

Experimentation and Testing

Assess the performance of your model as the training set size decreases (e.g., 5000, 2000, 1000 samples, and smaller). For each size, record metrics such as accuracy, sparsity of the weight vector (the number of zero entries), and convergence behavior. Clearly specify the two digits chosen for the binary classification task.
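
One way to organize these runs is a simple loop over subset sizes. In the sketch below, train_and_evaluate is a hypothetical wrapper around the proximal gradient loop shown earlier: it trains on the given subset with the given hyperparameters and returns the test accuracy and the number of zero weights; X_tr, y_tr, X_te, y_te are the tensors built in that sketch, and the sizes and hyperparameters are illustrative.

```python
import numpy as np
import tensorflow as tf

# `train_and_evaluate` is a hypothetical wrapper around the proximal gradient
# loop above; it returns (test_accuracy, number_of_zero_weights).
rng = np.random.default_rng(0)                    # fixed seed for repeatability
for n_samples in [12000, 5000, 2000, 1000, 500]:  # from (near) full size downwards
    idx = rng.choice(len(y_tr), size=min(n_samples, len(y_tr)), replace=False)
    acc, n_zero = train_and_evaluate(tf.gather(X_tr, idx), tf.gather(y_tr, idx),
                                     X_te, y_te, step_size=0.1, lam=1e-3)
    print(f"n={n_samples:5d}  accuracy={acc:.4f}  zero weights={n_zero}")
```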

Deliverables

  • Python code implementing the described model, with your name in a comment at the top.
  • A comprehensive PDF report detailing:
      • The two digits selected for classification.
      • The results obtained on different training sizes.
      • The effects of varying step sizes and λ on model sparsity and accuracy.
      • Conclusions drawn from the experiments.

Summary

This assignment combines theoretical understanding of L1 regularization and proximal methods with practical implementation in TensorFlow. By carefully experimenting with hyperparameters and analyzing their effects on sparsity and classification performance, you will deepen your understanding of sparse models and regularization techniques in machine learning.


Implementing this methodology involves coding in Python, applying TensorFlow for gradient calculations, and performing iterative updates with soft-thresholding for the L1 penalty. Emphasis should be placed on modular code structure, hyperparameter tuning, and detailed experimentation notes demonstrating the influence of different step sizes and regularization parameters.

Note

Ensure your code is your own work, starting from provided examples and adapting them thoughtfully. Conduct thorough testing and clearly document all experimental results, including failure cases and insights into the sparsity versus accuracy trade-offs.

End of Assignment Instructions