In collaboration with Prof. Van den Broeck (CS, UCLA), we have an exciting new research on using coding methods to make ML algorithms more robust. Initial results from this work were presented at 2016 Allerton Conference.
|Robust Channel Coding Strategies for Machine Learning Data (I)|
|Van den Broeck, Guy||UCLA|
|Dolecek, Lara||Univ. of California, Los Angeles|
Keywords: Coding Theory, Data Analytics, Machine Learning and Learning Theory
Abstract: Two important recent trends are the proliferation of learning algorithms along with the massive increase of data stored on unreliable storage mediums. These trends impact each other; noisy data can have an undesirable effect on the results provided by learning algorithms. Although traditional tools exist to improve the reliability of data storage devices, these tools operate at a different abstraction level and therefore ignore the data application, leading to an inefficient use of resources. In this paper we propose taking the operation of learning algorithms into account when deciding how to best protect data. Specifically, we examine several learning algorithms that operate on data that is stored on noisy mediums and protected by error-correcting codes with a limited budget of redundancy; we develop a principled way to allocate resources so that the harm on the output of the learning algorithm is minimized.