Skip to yearly menu bar Skip to main content


Tensor Attention Training: Provably Efficient Learning of Higher-order Transformers

Yingyu Liang ⋅ Zhenmei Shi ⋅ Zhao Song ⋅ Yufa Zhou

Abstract

Chat is not available.