CtrlGen: Controllable Generative Modeling in Language and Vision

Workshop

Steven Y. Feng · Dor Arad Hudson · Tatsunori Hashimoto · DONGYEOP Kang · Varun Prashant Gangal · Anusha Balakrishnan · Joel Tetreault

Mon 13 Dec, 8 a.m. PST

Over the past few years, interest in language and image generation has grown within the community. As text generated by models like GPT-3 begins to sound more fluent and natural, and images and videos generated by GAN models appear more realistic, researchers have begun focusing on qualitative properties of the generated content, such as the ability to control its style and structure or to incorporate information from external sources into the output. Such aims are extremely important for making language and image generation useful for human-machine interaction and other real-world applications, including machine co-creativity, entertainment, reducing biases or toxicity, and improving conversational agents and personal assistants.

Achieving these ambitious but important goals introduces challenges not only from NLP and Vision perspectives, but also ones that pertain to Machine Learning as a whole, which has witnessed a growing body of research in relevant domains such as interpretability, disentanglement, robustness, and representation learning. We believe that progress towards the realization of human-like language and image generation may benefit greatly from insights and progress in these and other ML areas.

In this workshop, we propose to bring together researchers from the NLP, Vision, and ML communities to discuss the current challenges and to explore potential directions for controllable generation and for improving its quality, correctness, and diversity. As excitement about language and image generation has increased significantly in recent years thanks to the advent and improvement of language models, Transformers, and GANs, we feel this is an opportune time to hold a new workshop on the subject. We hope CtrlGen will foster discussion and interaction across communities and cultivate fruitful cross-domain relations that open the door to enhanced controllability in language and image generation.

Timezone: America/Los_Angeles

Schedule

Mon 8:00 a.m. - 8:10 a.m.	Opening Remarks (Short Intro)
Mon 8:10 a.m. - 8:30 a.m.	Invited Talk #1 - Control in Dialogue: When does it work? (Jason Weston) (Invited Talk)
Mon 8:30 a.m. - 8:35 a.m.	Invited Talk #1 Q&A (Short Q&A)
Mon 8:35 a.m. - 8:55 a.m.	Invited Talk #2 - Disentangling Faithfulness and Extractiveness in Abstractive Summarization (He He) (Invited Talk)
Mon 8:55 a.m. - 9:00 a.m.	Invited Talk #2 Q&A (Short Q&A)
Mon 9:00 a.m. - 9:25 a.m.	Invited Talk #3 - Disentanglement for Controllable Image Generation (Irina Higgins) (Invited Talk)
Mon 9:25 a.m. - 9:30 a.m.	Invited Talk #3 Q&A (Short Q&A)
Mon 9:30 a.m. - 9:50 a.m.	Invited Talk #4 - Neuro-Logic and Differentiable Controls (Yejin Choi) (Invited Talk)
Mon 9:50 a.m. - 9:55 a.m.	Invited Talk #4 Q&A (Short Q&A)
Mon 9:55 a.m. - 10:10 a.m.	Virtual Coffee/Networking Break
Mon 10:10 a.m. - 11:30 a.m.	Discussion Panel and QA Session (Discussion Panel)
Mon 11:30 a.m. - 12:30 p.m.	Virtual Poster Session #1 (Poster Session)
Mon 12:30 p.m. - 1:30 p.m.	Lunch Break
Mon 1:30 p.m. - 1:50 p.m.	Demonstrations (Live-Streamed Demos)
Mon 1:50 p.m. - 2:10 p.m.	Invited Talk #5 - Off the Beaten Path: Domain-Agnostic ML for Controllable Generation and Beyond (Alex Tamkin) (Invited Talk)
Mon 2:10 p.m. - 2:15 p.m.	Invited Talk #5 Q&A (Short Q&A)
Mon 2:15 p.m. - 2:35 p.m.	Invited Talk #6 - Generating and Editing Images Using StyleGAN and CLIP (Or Patashnik) (Invited Talk)
Mon 2:35 p.m. - 2:40 p.m.	Invited Talk #6 Q&A (Short Q&A)
Mon 2:40 p.m. - 3:00 p.m.	Virtual Coffee/Networking Break
Mon 3:00 p.m. - 4:20 p.m.	Virtual Poster Session #2 (Poster Session)
Mon 4:20 p.m. - 4:40 p.m.	Invited Talk #7 - Controllable Text Generation with Multiple Constraints (Yulia Tsvetkov) (Invited Talk)
Mon 4:40 p.m. - 4:45 p.m.	Invited Talk #7 Q&A (Short Q&A)
Mon 4:45 p.m. - 5:00 p.m.	Best Paper Awards and Closing Remarks (Closing Remarks)
Mon 5:00 p.m. - 12:00 a.m.	GatherTown Open for Continued Socializing (Networking and Socializing)
-	Sound-Guided Semantic Image Manipulation (Poster)	SEUNG HYUN LEE · Sang Ho Yoon · Jinkyu Kim · Sangpil Kim
-	PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding (Poster)	Antoine Chaffin · Vincent Claveau · Ewa Kijak
-	Hamiltonian prior to Disentangle Content and Motion in Image Sequences (Poster)	Asif Khan · Amos Storkey
-	Controllable Paraphrase Generation with Multiple Types of Constraints (Poster)	Gwénolé Lecorvé
-	Controlled Cue Generation for Play Scripts (Poster)	Alara Dirik · Hilal Dönmez · Pinar Yanardag
-	Controlling Conditional Language Models with Distributional Policy Gradients (Poster)	Tomasz Korbak · Hady Elsahar · Germán Kruszewski · Marc Dymetman
-	Towards Unsupervised Content Disentanglement in Sentence Representations via Syntactic Roles (Poster)	Ghazi FELHI · Joseph Roux · Djame Seddah
-	Diamond in the rough: Improving image realism by traversing the GAN latent space (Poster)	Jeffrey Wen · Fabian Benitez-Quiroz · Qianli Feng · Aleix Martinez
-	Sampling from Discrete Energy-Based Models with Quality/Efficiency Trade-offs (Poster)	Bryan Eikema · Germán Kruszewski · Hady Elsahar · Marc Dymetman
-	XCI-Sketch: Extraction of Color Information from Images for Generation of Colored Outlines and Sketches (Poster)	V MANUSHREE · Sameer Saxena · Parna Chowdhury · MANISIMHA VARMA MANTHENA · Harsh Rathod · Ankita Ghosh · Sahil Khose
-	Continuous Emotion Transfer Using Kernels (Poster)	Alex Lambert · Sanjeel Parekh · Zoltan Szabo · Florence d'Alché-Buc
-	Self-supervised Enhancement of Latent Discovery in GANs (Poster)	ADARSH KAPPIYATH · Silpa Vadakkeeveetil Sreelatha ·
-	Learning to Compose Visual Relations (Poster)	Nan Liu · Shuang Li · Yilun Du
-	Learning Representations for Zero-Shot Image Generation without Text (Poster)	Gautam Singh · Fei Deng · Sungjin Ahn
-	C^3: Contrastive Learning for Cross-domain Correspondence in Few-shot Image Generation (Poster)	Hyukgi Lee · Gi-Cheon Kang · Chang-Hoon Jeong · Hanwool Sul · Byoung-Tak Zhang
-	MIDI-DDSP: Hierarchical Modeling of Music for Detailed Control (Poster)	Yusong Wu · Ethan Manilow · Kyle Kastner · Tim Cooijmans · Aaron Courville · Cheng-Zhi Anna Huang · Jesse Engel
-	Neural Abstructions: Abstractions that Support Construction for Grounded Language Learning (Poster)	Kaylee Burns · Christopher D Manning · Li Fei-Fei
-	Fair Data Generation using Language Models with Hard Constraints (Poster)	SK Mainul Islam · Abhinav Nagpal · Balaji Ganesan · Pranay Lohia
-	Robust Text Generation using Sequence-to-Sequence Pre-Training (Poster)	Nishtha Madaan · · Srikanta Bedathur
-	LUMINOUS: Indoor Scene Generation for Embodied AI Challenges (Poster)	Yizhou Zhao · Kaixiang Lin · Zhiwei Jia · Qiaozi Gao · Govindarajan Thattai · Jesse Thomason · Gaurav Sukhatme