Novel gradient-based methods for data distribution and privacy in data science

Kuru, Nurdan (2019) Novel gradient-based methods for data distribution and privacy in data science. [Thesis]


Abstract

With the growing need to store data at different locations, designing algorithms that can analyze distributed data is becoming more important. In this thesis, we present several gradient-based algorithms that are customized for data distribution and privacy. First, we propose a provably convergent, second-order, incremental, and inherently parallel algorithm that works with distributed data. By using a local quadratic approximation, the algorithm exploits curvature information to speed up convergence. We also illustrate that the parallel implementation of our algorithm performs better than a parallel stochastic gradient descent method on a large-scale data science problem. This first algorithm solves the problem of using data that resides at different locations. However, this setting is not necessarily enough for data privacy. To guarantee the privacy of the data, we propose differentially private optimization algorithms in the second part of the thesis. The first of these employs a smoothing approach based on weighted averages of the history of gradients. This approach helps to decrease the variance of the noise. This reduction in variance is important for iterative optimization algorithms, since increasing the amount of noise in the algorithm can harm performance. We also present differentially private versions of a recent multistage accelerated algorithm. These extensions use noise-related parameter selection, and the proposed step sizes are proportional to the variance of the noisy gradient. The numerical experiments show that our algorithms perform better than some well-known differentially private algorithms.
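
The variance-reduction claim for the smoothing approach can be illustrated with a small sketch. This is not the thesis's algorithm: it only assumes that the privacy noise added at each iteration is independent Gaussian noise with standard deviation sigma, and it uses exponentially decaying weights as one possible weighting. The names exponential_weights, weighted_average_variance, and empirical_variance, and the values beta = 0.9 and steps = 20, are hypothetical choices made for illustration.

```python
import random
import statistics


def exponential_weights(beta, steps):
    """Normalized exponentially decaying weights (an assumed weighting scheme)."""
    raw = [beta ** k for k in range(steps)]
    total = sum(raw)
    return [r / total for r in raw]


def weighted_average_variance(weights, sigma):
    """Variance of sum_i w_i * z_i for independent z_i ~ N(0, sigma^2)."""
    return sigma ** 2 * sum(w * w for w in weights)


def empirical_variance(weights, sigma, trials=20000, seed=0):
    """Monte Carlo estimate of the variance of the weighted noise average."""
    rng = random.Random(seed)
    samples = []
    for _ in range(trials):
        noise = [rng.gauss(0.0, sigma) for _ in weights]
        samples.append(sum(w * z for w, z in zip(weights, noise)))
    return statistics.pvariance(samples)


sigma = 1.0  # noise level of a single noisy gradient (assumed)
weights = exponential_weights(beta=0.9, steps=20)
theory = weighted_average_variance(weights, sigma)
measured = empirical_variance(weights, sigma)

print(f"single-gradient noise variance: {sigma ** 2:.4f}")
print(f"averaged noise variance (formula): {theory:.4f}")
print(f"averaged noise variance (simulated): {measured:.4f}")
```

Because the weights are nonnegative and sum to one, the sum of their squares is at most one, so the averaged noise never has larger variance than a single noisy gradient. The sketch isolates the noise term only; in an actual optimization run the past gradients are evaluated at earlier iterates, so averaging them also changes the search direction.
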
Item Type: Thesis
Uncontrolled Keywords: Large-scale optimization. -- Differential privacy. -- Momentum-based algorithms. -- Büyük ölçekli eniyileme. -- Diferansiyel mahremiyet. -- Momentum tabanlı algoritmalar.
Subjects: T Technology > T Technology (General) > T055.4-60.8 Industrial engineering. Management engineering
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Industrial Engineering
Faculty of Engineering and Natural Sciences
Depositing User: IC-Cataloging
Date Deposited: 14 Feb 2020 09:52
Last Modified: 26 Apr 2022 10:32
URI: https://research.sabanciuniv.edu/id/eprint/39652
