Poster
Parallelizing Support Vector Machines on Distributed Computers
Edward Y Chang · Kaihua Zhu · Hao Wang · hongjie Bai · Jian Li · Zhihuan Qiu · Hang Cui
[
Abstract
]
Abstract:
Support Vector Machines (SVMs) suffer from a widely recognized scalability problem in both memory use and computational time. To improve scalability, we have developed a parallel SVM algorithm (PSVM), which reduces memory use through performing a row-based, approximate matrix factorization, and which loads only essential data to each machine to perform parallel computation. Let nn denote the number of training instances, pp the reduced matrix dimension after factorization (pp is significantly smaller than nn), and mm the number of machines. PSVM reduces the memory requirement from \MO\MO(n2n2) to \MO\MO(np/mnp/m), and improves computation time to \MO\MO(np2/mnp2/m). Empirical studies on up to 500500 computers shows PSVM to be effective.
Live content is unavailable. Log in and register to view live content