Workshop
First Workshop on Quantum Tensor Networks in Machine Learning
XiaoYang Liu · Qibin Zhao · Jacob Biamonte · Cesar F Caiafa · Paul Pu Liang · Nadav Cohen · Stefan Leichenauer
Quantum tensor networks in machine learning (QTNML) are envisioned to have great potential to advance AI technologies. Quantum machine learning promises quantum advantages (potentially exponential speedups in training, quadratic speedup in convergence, etc.) over classical machine learning, while tensor networks provide powerful simulations of quantum machine learning algorithms on classical computers. As a rapidly growing interdisciplinary area, QTNML may serve as an amplifier for computational intelligence, a transformer for machine learning innovations, and a propeller for AI industrialization.
Tensor networks, a contracted network of factor tensors, have arisen independently in several areas of science and engineering. Such networks appear in the description of physical processes and an accompanying collection of numerical techniques have elevated the use of quantum tensor networks into a variational model of machine learning. Underlying these algorithms is the compression of highdimensional data needed to represent quantum states of matter. These compression techniques have recently proven ripe to apply to many traditional problems faced in deep learning. Quantum tensor networks have shown significant power in compactly representing deep neural networks, and efficient training and theoretical understanding of deep neural networks. More potential QTNML technologies are rapidly emerging, such as approximating probability functions, and probabilistic graphical models. However, the topic of QTNML is relatively young and many open problems are still to be explored.
Quantum algorithms are typically described by quantum circuits (quantum computational networks). These networks are indeed a class of tensor networks, creating an evident interplay between classical tensor network contraction algorithms and executing tensor contractions on quantum processors. The modern field of quantum enhanced machine learning has started to utilize several tools from tensor network theory to create new quantum models of machine learning and to better understand existing ones.
The interplay between tensor networks, machine learning and quantum algorithms is rich. Indeed, this interplay is based not just on numerical methods but on the equivalence of tensor networks to various quantum circuits, rapidly developing algorithms from the mathematics and physics communities for optimizing and transforming tensor networks, and connections to lowrank methods for learning. A merger of tensor network algorithms with stateoftheart approaches in deep learning is now taking place. A new community is forming, which this workshop aims to foster.
Schedule
Fri 6:00 a.m.  6:05 a.m.

Opening Remarks
(
Opening
)
A short introduction 
XiaoYang Liu 🔗 
Fri 6:05 a.m.  6:35 a.m.

Invited Talk 1: Tensor Networks as a Data Structure in Probabilistic Modeling and for Learning Dynamical Laws from Data
(
Talk
)
SlidesLive Video Recent years have enjoyed a significant interest in exploiting tensor networks in the context of machine learning, both as a tool for the formulation of new learning algorithms and for enhancing the mathematical understanding of existing methods. In this talk, we will explore two readings of such a connection. On the one hand, we will consider the task of identifying the underlying nonlinear governing equations, required both for obtaining an understanding and making future predictions. We will see that this problem can be addressed in a scalable way making use of tensor network based parameterizations for the governing equations. On the other hand, we will investigate the expressive power of tensor networks in probabilistic modelling. Inspired by the connection of tensor networks and machine learning, and the natural correspondence between tensor networks and probabilistic graphical models, we will provide a rigorous analysis of the expressive power of various tensornetwork factorizations of discrete multivariate probability distributions. Joint work with A. Goeßmann, M. Götte, I. Roth, R. Sweke, G. Kutyniok, I. Glasser, N. Pancotti, J. I. Cirac. 
Jens Eisert 🔗 
Fri 6:35 a.m.  6:45 a.m.

Invited Talk 1 Q&A by Jens
(
Q&A
)

Jens Eisert 🔗 
Fri 6:45 a.m.  7:17 a.m.

Invited Talk 2: Expressiveness in Deep Learning via Tensor Networks and Quantum Entanglement
(
Talk
)
SlidesLive Video Understanding deep learning calls for addressing three fundamental questions: expressiveness, optimization and generalization. This talk will describe a series of works aimed at unraveling some of the mysteries behind expressiveness. I will begin by showing that state of the art deep learning architectures, such as convolutional networks, can be represented as tensor networks  a prominent computational model for quantum manybody simulations. This connection will inspire the use of quantum entanglement for defining measures of data dependencies modeled by deep networks. Next, I will turn to derive a quantum maxflow / mincut theorem characterizing the entanglement captured by deep networks. The theorem will give rise to new results that shed light on expressiveness in deep learning, and in addition, provide new tools for deep network design. Works covered in the talk were in collaboration with Yoav Levine, Or Sharir, Ronen Tamari, David Yakira and Amnon Shashua. 
Nadav Cohen 🔗 
Fri 7:17 a.m.  7:25 a.m.

Invited Talk 2 Q&A by Cohen
(
Q&A
)

Nadav Cohen 🔗 
Fri 7:25 a.m.  7:55 a.m.

Invited Talk 3: Tensor Networks and Counting Problems on the Lattice
(
Talk
)
SlidesLive Video An overview will be given of counting problems on the lattice, such as the calculation of the hard square constant and of the residual entropy of ice. Unlike Monte Carlo techniques which have difficulty in calculating such quantities, we will demonstrate that tensor networks provide a natural framework for tackling these problems. We will also show that tensor networks reveal nonlocal hidden symmetries in those systems, and that the typical critical behaviour is witnessed by matrix product operators which form representations of tensor fusion categories. 
Frank Verstraete 🔗 
Fri 7:55 a.m.  8:05 a.m.

Invited Talk 3 Q&A by Frank
(
Q&A
)

Frank Verstraete 🔗 
Fri 8:05 a.m.  8:50 a.m.

Invited Talk 4: Quantum in ML and ML in Quantum
(
Talk
)
SlidesLive Video In this talk, I will cover recent results in two areas: 1) Using quantuminspired methods in machine learning, including using lowentanglement states (matrix product states/tensor train decompositions) for different regression and classification tasks. 2) Using machine learning methods for efficient classical simulation of quantum systems. I will cover our results on simulating quantum circuits on parallel computers using graphbased algorithms, and also efficient numerical methods for optimization using tensortrains for the computational of large number (up to B=100) on GPUs. The code is a combination of classical linear algebra algorithms, Riemannian optimization methods and efficient software implementation in TensorFlow.

Ivan Oseledets 🔗 
Fri 8:50 a.m.  9:00 a.m.

Invited Talk 4 Q&A by Ivan
(
Q&A
)

Ivan Oseledets 🔗 
Fri 9:00 a.m.  9:40 a.m.

Invited Talk 5: Live Presentation of TensorLy By Jean Kossaifi
(
Talk
)
Live Presentation 
Animashree Anandkumar · Jean Kossaifi 🔗 
Fri 9:40 a.m.  10:07 a.m.

Invited Talk 6: A Century of the Tensor Network Formulation from the Ising Model
(
Talk
)
SlidesLive Video A hundred years have passed since Ising model was proposed by Lenz in 1920. One finds that the square lattice Ising model is already an example of twodimensional tensor network (TN), which is formed by contracting 4leg tensors. In 1941, Kramers and Wannier assumed a variational state in the form of the matrix product state (MPS), and they optimized it `numerically'. Baxter reached the concept of the cornertransfer matrix (CTM), and performed a variational computation in 1968. Independently from these statistical studies, MPS was introduced by Affleck, Lieb, Kennedy and Tasaki (AKLT) in 1987 for the study of onedimensional quantum spin chain, by Derrida for asymetric exclusion processes, and also (implicitly) by the establishment of the density matrix renormalization group (DMRG) by White in 1992. After a brief (?) introduction of these prehistories, I'll speak about my contribution to this area, the applications of DMRG and CTMRG methods to twodimensional statistical models, including those on hyperbolic lattices, fractal systems, and random spin models. Analysis of the spinglass state, which is related to learning processes, from the view point of the entanglement structure would be a target of future studies in this direction. 
Tomotoshi Nishino 🔗 
Fri 10:07 a.m.  10:15 a.m.

Invited Talk 6 Q&A by Tomotoshi
(
Q&A
)

Tomotoshi Nishino 🔗 
Fri 10:15 a.m.  10:18 a.m.

Poster 1: MultiGraph Tensor Networks by Yao Lei Xu
(
Poster Talk
)

Yao Lei Xu 🔗 
Fri 10:18 a.m.  10:21 a.m.

Poster 2: High Performance SingleSite Finite DMRG on GPUs by Hao Hong
(
Poster Talk
)

Hong Hao 🔗 
Fri 10:21 a.m.  10:24 a.m.

Poster 3: Variational Quantum Circuit Model for Knowledge Graph Embeddings by Yunpu Ma
(
Poster Talk
)

Yunpu Ma 🔗 
Fri 10:24 a.m.  10:27 a.m.

Poster 4: Hybrid quantumclassical classifier based on tensor network and variational quantum circuit by Samuel YenChi Chen
(
Poster Talk
)

YenChi Chen 🔗 
Fri 10:27 a.m.  10:30 a.m.

Poster 5: A Neural Matching Model based on Quantum Interference and Quantum Manybody System
(
Poster Talk
)

Hui Gao 🔗 
Fri 10:30 a.m.  10:40 a.m.

Contributed Talk 1: Paper 3: Tensor network approaches for datadriven identification of nonlinear dynamical laws
(
Talk
)
SlidesLive Video To date, scalable methods for datadriven identification of nonlinear governing equations do not exploit or offer insight into fundamental underlying physical structure. In this work, we show that various physical constraints can be captured via tensor network based parameterizations for the governing equation, which naturally ensures scalability. In addition to providing analytic results motivating the use of such models for realistic physical systems, we demonstrate that efficient rankadaptive optimization algorithms can be used to learn optimal tensor network models without requiring a~priori knowledge of the exact tensor ranks. 
Alex Goeßmann 🔗 
Fri 10:40 a.m.  10:50 a.m.

Contributed Talk 2: Paper 6: Anomaly Detections with Tensor Networks
(
Talk
)
SlidesLive Video Originating from condensed matter physics, tensor networks are compact representations of highdimensional tensors. In this paper, the prowess of tensor networks is demonstrated on the particular task of oneclass anomaly detection. We exploit the memory and computational efficiency of tensor networks to learn a linear transformation over a space with dimension exponential in the number of original features. The linearity of our model enables us to ensure a tight fit around training instances by penalizing the model's global tendency to predict normality via its Frobenius norma task that is infeasible for most deep learning models. Our method outperforms deep and classical algorithms on tabular datasets and produces competitive results on image datasets, despite not exploiting the locality of images. 
Jinhui Wang 🔗 
Fri 10:50 a.m.  11:00 a.m.

Contributed Talk 3: Paper 32: Highorder Learning Model via Fractional Tensor Network Decomposition
(
Talk
)
SlidesLive Video
We consider highorder learning models, of which the weight tensor is represented by (symmetric) tensor network~(TN) decomposition. Although such models have been widely used on various tasks, it is challenging to determine the optimal order in complex systems (e.g., deep neural networks). To tackle this issue, we introduce a new notion of \emph{fractional tensor network~(FrTN)} decomposition, which generalizes the conventional TN models with an integer order by allowing the order to be an arbitrary fraction. Due to the density of fractions in the field of real numbers, the order of the model can be formulated as a learnable parameter and simply optimized by stochastic gradient descent~(SGD) and its variants. Moreover, it is uncovered that FrTN strongly connects to wellknown methods such as $\ell_p$pooling~\cite{gulcehre2014learned} and ``squeezeandexcitation''~\cite{hu2018squeeze} operations in the deep learning studies. On the numerical side, we apply the proposed model to enhancing the classic ResNet26/50~\cite{he2016deep} and MobileNetv2~\cite{sandler2018mobilenetv2} on both CIFAR10 and ILSVRC12 classification tasks, and the results demonstrate the effectiveness brought by the learnable order parameters in FrTN.

Chao Li 🔗 
Fri 11:00 a.m.  11:45 a.m.

Panel Discussion 1: Theoretical, Algorithmic and Physical
(
Discussion Pannel
)
Theoretical, Algorithmic and Physical Discussions of Quantum Tensor Networks in Machine Learning. 
Jacob Biamonte · Ivan Oseledets · Jens Eisert · Nadav Cohen · Guillaume Rabusseau · XiaoYang Liu 🔗 
Fri 11:45 a.m.  12:00 p.m.

Break
(
Break
)

🔗 
Fri 12:00 p.m.  12:45 p.m.

Panel Discussion 2: Software and High Performance Implementation
(
Discussion Pannel
)
Software and High Performance Implementation discussion of Quantum Tensor Networks in Machine Learning. 
Glen Evenbly · Martin Ganahl · Paul Springer · XiaoYang Liu 🔗 
Fri 12:45 p.m.  1:00 p.m.

Break
(
Break
)

🔗 
Fri 1:00 p.m.  1:28 p.m.

Invited Talk 7: cuTensor: HighPerformance CUDA Tensor Primitives
(
Talk
)
SlidesLive Video This talk discusses cuTENSOR, a highperformance CUDA library for tensor operations that efficiently handles the ubiquitous presence of highdimensional arrays (i.e., tensors) in today's HPC and DL workloads. This library supports highly efficient tensor operations such as tensor contractions, elementwise tensor operations such as tensor permutations, and tensor reductions. While providing high performance, cuTENSOR also enables users to express their mathematical equations for tensors in a straightforward way that hides the complexity of dealing with these highdimensional objects behind an easytouse API. 
Paul Springer 🔗 
Fri 1:28 p.m.  1:35 p.m.

Invited Talk 7 Q&A by Paul
(
Q&A
)

Paul Springer 🔗 
Fri 1:35 p.m.  2:05 p.m.

Invited Talk 8: TensorNetwork: A Python Package for Tensor Network Computations
(
Talk
)
SlidesLive Video TensorNetwork is an open source python package for tensor network computations. It has been designed with the goal in mind to help researchers and engineers with rapid development of highly efficient tensor network algorithms for physics and machine learning applications. After a brief introduction to tensor networks, I will discuss some of the main design principles of the TensorNetwork package, and show how one can use it to speed up tensor network algorithms by running them on accelerated hardware, or by exploiting tensor sparsity. 
Martin Ganahl 🔗 
Fri 2:05 p.m.  2:15 p.m.

Invited Talk 8 Q&A by Martin
(
Q&A
)

Martin Ganahl 🔗 
Fri 2:15 p.m.  2:51 p.m.

Invited Talk 9: Tensor Network Models for Structured Data
(
Talk
)
SlidesLive Video In this talk, I will present uniform tensor network models (also known translation invariant tensor networks) which are particularly suited for modelling structured data such as sequences and trees. Uniform tensor networks are tensor networks where the core tensors appearing in the decomposition of a given tensor are all equal, which can be seen as a weight sharing mechanism in tensor networks. In the first part of the talk, I will show how uniform tensor networks are particularly suited to represent functions defined over sets of structured objects such as sequences and trees. I will then present how these models are related to classical computational models such as hidden Markov models, weighted automata, secondorder recurrent neural networks and context free grammars. In the second part of the talk, I will present a classical learning algorithm for weighted automata and show how and it can be interpreted as a mean to convert nonuniform tensor networks to uniform ones. Lastly, I will present ongoing work leveraging the tensor network formalism to design efficient and versatile probabilistic models for sequence data. 
Guillaume Rabusseau 🔗 
Fri 2:51 p.m.  3:00 p.m.

Invited Talk 9 Q&A by Guillaume
(
Q&A
)

Guillaume Rabusseau 🔗 
Fri 3:00 p.m.  3:30 p.m.

Invited Talk 10: Getting Started with Tensor Networks
(
Talk
)
SlidesLive Video I will provide an overview of the tensor network formalism and its applications, and discuss the key operations, such as tensor contractions, required for building tensor network algorithms. I will also demonstrate the TensorTrace graphical interface, a software tool which is designed to allow users to implement and code tensor network routines easily and effectively. Finally, the utility of tensor networks towards tasks in machine learning will be briefly discussed. 
Glen Evenbly 🔗 
Fri 3:30 p.m.  3:40 p.m.

Invited Talk 10 Q&A by Evenbly
(
Q&A
)

Glen Evenbly 🔗 
Fri 3:40 p.m.  3:50 p.m.

Contributed Talk 4: Paper 27: Limitations of gradientbased Born Machine over tensornetworks on learning quantum nonlocality
(
Talk
)
SlidesLive Video Nonlocality is an important constituent of quantum physics which lies at the heart of many striking features of quantum states such as entanglement. An important category of highly entangled quantum states are GreenbergerHorneZeilinger (GHZ) states which play key roles in various quantumbased technologies and are particularly of interest in benchmarking noisy quantum hardwares. A novel quantum inspired generative model known as Born Machine which leverages on probabilistic nature of quantum physics has shown a great success in learning classical and quantum data over tensor network (TN) architecture. To this end, we investigate the task of training the Born Machine for learning the GHZ state over two different architectures of tensor networks. Our result indicates that gradientbased training schemes over TN Born Machine fails to learn the nonlocal information of the coherent superposition (or parity) of the GHZ state. This leads to an important question of what kind of architecture design, initialization and optimization schemes would be more suitable to learn the nonlocal information hidden in the quantum state and whether we can adapt quantuminspired training algorithms to learn such quantum states. 
Khadijeh Najafi 🔗 
Fri 3:50 p.m.  4:00 p.m.

Contributed Talk 5: Paper 19: Deep convolutional tensor network
(
Talk
)
SlidesLive Video Neural networks have achieved state of the art results in many areas, supposedly due to parameter sharing, locality, and depth. Tensor networks (TNs) are linear algebraic representations of quantum manybody states based on their entanglement structure. TNs have found use in machine learning. We devise a novel TN based model called Deep convolutional tensor network (DCTN) for image classification, which has parameter sharing, locality, and depth. It is based on the Entangled plaquette states (EPS) TN. We show how EPS can be implemented as a backpropagatable layer. We test DCTN on MNIST, FashionMNIST, and CIFAR10 datasets. A shallow DCTN performs well on MNIST and FashionMNIST and has a small parameter count. Unfortunately, depth increases overfitting and thus decreases test accuracy. Also, DCTN of any depth performs badly on CIFAR10 due to overfitting. It is to be determined why. We discuss how the hyperparameters of DCTN affect its training and overfitting. 
Philip Blagoveschensky 🔗 
Fri 4:00 p.m.  4:04 p.m.

Poster 6: Paper 16: Quantum Tensor Networks for Variational Reinforcement Learning
(
Poster Talk
)

Yiming Fang 🔗 
Fri 4:04 p.m.  4:07 p.m.

Poster 7: Paper 13: Quantum Tensor Networks, Stochastic Processes, and Weighted Automata
(
Poster Talk
)

Sandesh Adhikary 🔗 
Fri 4:07 p.m.  4:10 p.m.

Poster 8: Paper 24: Modeling Natural Language via Quantum Manybody Wave Function and Tensor Network,
(
Poster Talk
)

YITONG YAO 🔗 
Fri 4:10 p.m.  4:32 p.m.

Invited Talk 11: Tensor Methods for Efficient and Interpretable Spatiotemporal Learning
(
Talk
)
SlidesLive Video Multivariate spatiotemporal data is ubiquitous in science and engineering, from climate science to sports analytics, to neuroscience. Such data contain higherorder correlations and can be represented as a tensor. Tensor latent factor models provide a powerful tool for reducing dimensionality and discovering higherorder structures. However, existing tensor models are often slow or fail to yield interpretable latent factors. In this talk, I will demonstrate advances in tensor methods to generate interpretable latent factors for highdimensional spatiotemporal data. We provide theoretical guarantees and demonstrate their applications to realworld climate, basketball, and neuroscience data. 
Rose Yu 🔗 
Fri 4:32 p.m.  4:40 p.m.

Invited Talk 11 Q&A by Rose
(
Q&A
)

Rose Yu 🔗 
Fri 4:40 p.m.  5:10 p.m.

Invited Talk 12: Learning Quantum Channels with Tensor Networks
(
Talk
)
SlidesLive Video We present a new approach to quantum process tomography, the reconstruction of an unknown quantum channel from measurement data. Specifically, we combine a tensornetwork representation of the Choi matrix (a complete description of a quantum channel), with unsupervised machine learning of singleshot projective measurement data. We show numerical experiments for both unitary and noisy quantum circuits, for a number of qubits well beyond the reach of standard process tomography techniques. 
Giacomo Torlai 🔗 
Fri 5:10 p.m.  5:20 p.m.

Invited Talk 12: Q&A
(
Q&A
)

Giacomo Torlai 🔗 
Fri 5:20 p.m.  5:25 p.m.

Closing Remarks
(
Talk
)
TBD 
XiaoYang Liu 🔗 