Timezone: »

 
Poster
Fuzzy Clustering with Similarity Queries
Wasim Huleihel · Arya Mazumdar · Soumyabrata Pal

Fri Dec 10 08:30 AM -- 10:00 AM (PST) @
The fuzzy or soft $k$-means objective is a popular generalization of the well-known $k$-means problem, extending the clustering capability of the $k$-means to datasets that are uncertain, vague and otherwise hard to cluster. In this paper, we propose a semi-supervised active clustering framework, where the learner is allowed to interact with an oracle (domain expert), asking for the similarity between a certain set of chosen items. We study the query and computational complexities of clustering in this framework. We prove that having a few of such similarity queries enables one to get a polynomial-time approximation algorithm to an otherwise conjecturally NP-hard problem. In particular, we provide algorithms for fuzzy clustering in this setting that ask $O(\mathsf{poly}(k)\log n)$ similarity queries and run with polynomial-time-complexity, where $n$ is the number of items. The fuzzy $k$-means objective is nonconvex, with $k$-means as a special case, and is equivalent to some other generic nonconvex problem such as non-negative matrix factorization. The ubiquitous Lloyd-type algorithms (or alternating-minimization algorithms) can get stuck at a local minima. Our results show that by making few similarity queries, the problem becomes easier to solve. Finally, we test our algorithms over real-world datasets, showing their effectiveness in real-world applications.

Author Information

Wasim Huleihel (Tel-Aviv University)
Arya Mazumdar (University of California, San Diego)
Soumyabrata Pal (University of Massachusetts Amherst)

I am a fourth year grad student in the Department of Computer Science at the University of Massachusetts Amherst.

More from the Same Authors

  • 2021 Poster: Support Recovery of Sparse Signals from a Mixture of Linear Measurements »
    Soumyabrata Pal · Arya Mazumdar · Venkata Gandikota
  • 2020 Poster: Recovery of sparse linear classifiers from mixture of responses »
    Venkata Gandikota · Arya Mazumdar · Soumyabrata Pal
  • 2019 : Poster Session »
    Gergely Flamich · Shashanka Ubaru · Charles Zheng · Josip Djolonga · Kristoffer Wickstrøm · Diego Granziol · Konstantinos Pitas · Jun Li · Robert Williamson · Sangwoong Yoon · Kwot Sin Lee · Julian Zilly · Linda Petrini · Ian Fischer · Zhe Dong · Alexander Alemi · Bao-Ngoc Nguyen · Rob Brekelmans · Tailin Wu · Aditya Mahajan · Alexander Li · Kirankumar Shiragur · Yair Carmon · Linara Adilova · SHIYU LIU · Bang An · Sanjeeb Dash · Oktay Gunluk · Arya Mazumdar · Mehul Motani · Julia Rosenzweig · Michael Kamp · Marton Havasi · Leighton P Barnes · Zhengqing Zhou · Yi Hao · Dylan Foster · Yuval Benjamini · Nati Srebro · Michael Tschannen · Paul Rubenstein · Sylvain Gelly · John Duchi · Aaron Sidford · Robin Ru · Stefan Zohren · Murtaza Dalal · Michael A Osborne · Stephen J Roberts · Moses Charikar · Jayakumar Subramanian · Xiaodi Fan · Max Schwarzer · Nicholas Roberts · Simon Lacoste-Julien · Vinay Prabhu · Aram Galstyan · Greg Ver Steeg · Lalitha Sankar · Yung-Kyun Noh · Gautam Dasarathy · Frank Park · Ngai-Man (Man) Cheung · Ngoc-Trung Tran · Linxiao Yang · Ben Poole · Andrea Censi · Tristan Sylvain · R Devon Hjelm · Bangjie Liu · Jose Gallego-Posada · Tyler Sypherd · Kai Yang · Jan Nikolas Morshuis
  • 2019 : Poster Session »
    Jonathan Scarlett · Piotr Indyk · Ali Vakilian · Adrian Weller · Partha P Mitra · Benjamin Aubin · Bruno Loureiro · Florent Krzakala · Lenka Zdeborová · Kristina Monakhova · Joshua Yurtsever · Laura Waller · Hendrik Sommerhoff · Michael Moeller · Rushil Anirudh · Shuang Qiu · Xiaohan Wei · Zhuoran Yang · Jayaraman Thiagarajan · Salman Asif · Michael Gillhofer · Johannes Brandstetter · Sepp Hochreiter · Felix Petersen · Dhruv Patel · Assad Oberai · Akshay Kamath · Sushrut Karmalkar · Eric Price · Ali Ahmed · Zahra Kadkhodaie · Sreyas Mohan · Eero Simoncelli · Carlos Fernandez-Granda · Oscar Leong · Wesam Sakla · Rebecca Willett · Stephan Hoyer · Jascha Sohl-Dickstein · Sam Greydanus · Gauri Jagatap · Chinmay Hegde · Michael Kellman · Jonathan Tamir · Nouamane Laanait · Ousmane Dia · Mirco Ravanelli · Jonathan Binas · Negar Rostamzadeh · Shirin Jalali · Tiantian Fang · Alex Schwing · Sébastien Lachapelle · Philippe Brouillard · Tristan Deleu · Simon Lacoste-Julien · Stella Yu · Arya Mazumdar · Ankit Singh Rawat · Yue Zhao · Jianshu Chen · Xiaoyang Li · Hubert Ramsauer · Gabrio Rizzuti · Nikolaos Mitsakos · Dingzhou Cao · Thomas Strohmer · Yang Li · Pei Peng · Gregory Ongie
  • 2019 Poster: Superset Technique for Approximate Recovery in One-Bit Compressed Sensing »
    Larkin Flodin · Venkata Gandikota · Arya Mazumdar
  • 2019 Poster: Sample Complexity of Learning Mixture of Sparse Linear Regressions »
    Akshay Krishnamurthy · Arya Mazumdar · Andrew McGregor · Soumyabrata Pal
  • 2019 Poster: Same-Cluster Querying for Overlapping Clusters »
    Wasim Huleihel · Arya Mazumdar · Muriel Medard · Soumyabrata Pal
  • 2017 : The Geometric Block Model (poster). »
    Soumyabrata Pal
  • 2017 Poster: Clustering with Noisy Queries »
    Arya Mazumdar · Barna Saha
  • 2017 Poster: Semisupervised Clustering, AND-Queries and Locally Encodable Source Coding »
    Arya Mazumdar · Soumyabrata Pal
  • 2017 Spotlight: Semisupervised Clustering, AND-Queries and Locally Encodable Source Coding »
    Arya Mazumdar · Soumyabrata Pal
  • 2017 Poster: Query Complexity of Clustering with Side Information »
    Arya Mazumdar · Barna Saha
  • 2015 Poster: Associative Memory via a Sparse Recovery Model »
    Arya Mazumdar · Ankit Singh Rawat