Timezone: »
Tensor decompositions are powerful tools for dimensionality reduction and feature interpretation of multidimensional data such as signals. Existing tensor decomposition objectives (e.g., Frobenius norm) are designed for fitting raw data under statistical assumptions, which may not align with downstream classification tasks. In practice, raw input tensor can contain irrelevant information while data augmentation techniques may be used to smooth out class-irrelevant noise in samples. This paper addresses the above challenges by proposing augmented tensor decomposition (ATD), which effectively incorporates data augmentations and self-supervised learning (SSL) to boost downstream classification. To address the non-convexity of the new augmented objective, we develop an iterative method that enables the optimization to follow an alternating least squares (ALS) fashion. We evaluate our proposed ATD on multiple datasets. It can achieve 0.8%~2.5% accuracy gain over tensor-based baselines. Also, our ATD model shows comparable or better performance (e.g., up to 15% in accuracy) over self-supervised and autoencoder baselines while using less than 5% of learnable parameters of these baseline models.
Author Information
Chaoqi Yang (University of Illinois Urbana Champaign)
Cheng Qian (IQVIA)
Navjot Singh (University of Illinois, Urbana Champaign)
Cao (Danica) Xiao (Relativity)
M Westover (Massachusetts General Hospital, Harvard University)
Edgar Solomonik (University of Illinois, Urbana Champaign)
Jimeng Sun (University of Illinois, Urbana Champaign)
More from the Same Authors
-
2021 : Therapeutics Data Commons: Machine Learning Datasets and Tasks for Drug Discovery and Development »
Kexin Huang · Tianfan Fu · Wenhao Gao · Yue Zhao · Yusuf Roohani · Jure Leskovec · Connor Coley · Cao Xiao · Jimeng Sun · Marinka Zitnik -
2022 : Recommendation for New Drugs with Limited Prescription Data »
Zhenbang Wu · Huaxiu Yao · Zhe Su · David Liebovitz · Lucas Glass · James Zou · Chelsea Finn · Jimeng Sun -
2022 : A source data privacy framework for synthetic clinical trial data »
Afrah Shafquat · Jason Mezey · Mandis Beigi · Jimeng Sun · Jacob Aptekar -
2022 Poster: Reinforced Genetic Algorithm for Structure-based Drug Design »
Tianfan Fu · Wenhao Gao · Connor Coley · Jimeng Sun -
2022 Poster: Cost-efficient Gaussian tensor network embeddings for tensor-structured inputs »
Linjian Ma · Edgar Solomonik -
2022 Poster: TransTab: Learning Transferable Tabular Transformers Across Tables »
Zifeng Wang · Jimeng Sun -
2022 Poster: Sample Efficiency Matters: A Benchmark for Practical Molecular Optimization »
Wenhao Gao · Tianfan Fu · Jimeng Sun · Connor Coley -
2022 Poster: Conformal Prediction with Temporal Quantile Adjustments »
Zhen Lin · Shubhendu Trivedi · Jimeng Sun -
2021 Poster: Fast and accurate randomized algorithms for low-rank tensor decompositions »
Linjian Ma · Edgar Solomonik