Archive

2023⁴⁰⁴

March¹⁴³

Learning and Verification of Task Structure in Instructional Videos

March 23, 2023 · 812 words · Medhini Narasimhan, Licheng Yu, Sean Bell, Ning Zhang, Trevor Darrell

DreamBooth3D: Subject-Driven Text-to-3D Generation

March 23, 2023 · 881 words · Amit Raj, Srinivas Kaza, Ben Poole, Michael Niemeyer, Nataniel Ruiz and 7 others

The Quantization Model of Neural Scaling

March 23, 2023 · 1469 words · Eric J. Michaud, Ziming Liu, Uzay Girit, Max Tegmark

Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense

March 23, 2023 · 1378 words · Kalpesh Krishna, Yixiao Song, Marzena Karpinska, John Wieting, Mohit Iyyer

3D-POP – An automated annotation approach to facilitate markerless 2D-3D tracking of freely moving birds with marker-based motion capture

March 23, 2023 · 1337 words · Hemal Naik, Alex Hoi Hang Chan, Junran Yang, Mathilde Delacoux, Iain D. Couzin and 2 others

Reinforcement Learning with Exogenous States and Rewards

March 22, 2023 · 1466 words · George Trimponias, Thomas G. Dietterich

Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions

March 22, 2023 · 921 words · Ayaan Haque, Matthew Tancik, Alexei A. Efros, Aleksander Holynski, Angjoo Kanazawa

FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models

March 22, 2023 · 930 words · Jianglong Ye, Naiyan Wang, Xiaolong Wang

LFM-3D: Learnable Feature Matching Across Wide Baselines Using 3D Signals

March 22, 2023 · 811 words · Arjun Karpur, Guilherme Perrotta, Ricardo Martin-Brualla, Howard Zhou, Andre Araujo

Can we trust the evaluation on ChatGPT?

March 22, 2023 · 389 words · Rachith Aiyappa, Jisun An, Haewoon Kwak, Yong-Yeol Ahn

Adaptive Conformal Prediction by Reweighting Nonconformity Score

March 22, 2023 · 790 words · Salim I. Amoukou, Nicolas J. B Brunel

RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation

March 22, 2023 · 1082 words · Fengji Zhang, Bei Chen, Yue Zhang, Jin Liu, Daoguang Zan and 3 others

RaBit: Parametric Modeling of 3D Biped Cartoon Characters with a Topological-consistent Dataset

March 22, 2023 · 1024 words · Zhongjin Luo, Shengcai Cai, Jinguo Dong, Ruibo Ming, Liangdong Qiu and 2 others

MEGA: Multilingual Evaluation of Generative AI

March 22, 2023 · 1011 words · Kabir Ahuja, Rishav Hada, Millicent Ochieng, Prachi Jain, Harshita Diddee and 6 others

Large Language Models Can Be Used to Estimate the Ideologies of Politicians in a Zero-Shot Learning Setting

March 21, 2023 · 522 words · Patrick Y. Wu, Joshua A. Tucker, Jonathan Nagler, Solomon Messing

Visual Representation Learning from Unlabeled Video using Contrastive Masked Autoencoders

March 21, 2023 · 879 words · Jefferson Hernandez, Ruben Villegas, Vicente Ordonez

Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models

March 21, 2023 · 724 words · Lukas Höllein, Ang Cao, Andrew Owens, Justin Johnson, Matthias Nießner

Probabilistic Domain Adaptation for Biomedical Image Segmentation

March 21, 2023 · 523 words · Anwai Archit, Constantin Pape

ExtremeNeRF: Few-shot Neural Radiance Fields Under Unconstrained Illumination

March 21, 2023 · 830 words · SeokYeong Lee, JunYong Choi, Seungryong Kim, Ig-Jae Kim, Junghyun Cho

Equiangular Basis Vectors

March 21, 2023 · 964 words · Yang Shen, Xuhao Sun, Xiu-Shen Wei

Learning Context-aware Classifier for Semantic Segmentation

March 21, 2023 · 759 words · Zhuotao Tian, Jiequan Cui, Li Jiang, Xiaojuan Qi, Xin Lai and 3 others

Novel Class Discovery for 3D Point Cloud Semantic Segmentation

March 21, 2023 · 918 words · Luigi Riz, Cristiano Saltori, Elisa Ricci, Fabio Poiesi

Inversion by Direct Iteration: An Alternative to Denoising Diffusion for Image Restoration

March 20, 2023 · 942 words · Mauricio Delbracio, Peyman Milanfar

MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action

March 20, 2023 · 507 words · Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Ehsan Azarnasab and 5 others

Reflexion: an autonomous agent with dynamic memory and self-reflection

March 20, 2023 · 1182 words · Noah Shinn, Beck Labash, Ashwin Gopinath

Zero-1-to-3: Zero-shot One Image to 3D Object

March 20, 2023 · 1147 words · Ruoshi Liu, Rundi Wu, Basile Van Hoorick, Pavel Tokmakov, Sergey Zakharov and 1 others

Context-faithful Prompting for Large Language Models

March 20, 2023 · 661 words · Wenxuan Zhou, Sheng Zhang, Hoifung Poon, Muhao Chen

SVDiff: Compact Parameter Space for Diffusion Fine-Tuning

March 20, 2023 · 970 words · Ligong Han, Yinxiao Li, Han Zhang, Peyman Milanfar, Dimitris Metaxas and 1 others

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking

March 20, 2023 · 600 words · Yukang Chen, Jianhui Liu, Xiangyu Zhang, Xiaojuan Qi, Jiaya Jia

A Survey on Oversmoothing in Graph Neural Networks

March 20, 2023 · 670 words · T. Konstantin Rusch, Michael M. Bronstein, Siddhartha Mishra

PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing

March 20, 2023 · 1182 words · Xiaozhe Ren, Pingyi Zhou, Xinfan Meng, Xinjing Huang, Yadao Wang and 12 others

NeRF-LOAM: Neural Implicit Representation for Large-Scale Incremental LiDAR Odometry and Mapping

March 19, 2023 · 1000 words · Junyuan Deng, Xieyuanli Chen, Songpengcheng Xia, Zhen Sun, Guoqing Liu and 2 others

Improving Uncertainty Quantification of Deep Classifiers via Neighborhood Conformal Prediction: Novel Algorithm and Theoretical Analysis

March 19, 2023 · 808 words · Subhankar Ghosh, Taha Belkhouja, Yan Yan, Janardhan Rao Doppa

Two Kinds of Recall

March 19, 2023 · 610 words · Yoav Goldberg

Can AI-Generated Text be Reliably Detected?

March 17, 2023 · 946 words · Vinu Sankar Sadasivan, Aounon Kumar, Sriram Balasubramanian, Wenxiao Wang, Soheil Feizi

On the De-duplication of LAION-2B

March 17, 2023 · 681 words · Ryan Webster, Julien Rabin, Loic Simon, Frederic Jurie

A Recipe for Watermarking Diffusion Models

March 17, 2023 · 922 words · Yunqing Zhao, Tianyu Pang, Chao Du, Xiao Yang, Ngai-Man Cheung and 1 others

GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models

March 17, 2023 · 1727 words · Tyna Eloundou, Sam Manning, Pamela Mishkin, Daniel Rock

A Robustness Analysis of Blind Source Separation

March 17, 2023 · 1735 words · Alexander Schell

$α$Surf: Implicit Surface Reconstruction for Semi-Transparent and Thin Objects with Decoupled Geometry and Opacity

March 17, 2023 · 1071 words · Tianhao Wu, Hanxue Liang, Fangcheng Zhong, Gernot Riegler, Shimon Vainer and 1 others

Towards a Foundation Model for Neural Network Wavefunctions

March 17, 2023 · 944 words · Michael Scherbela, Leon Gerard, Philipp Grohs

Adversarial Counterfactual Visual Explanations

March 17, 2023 · 875 words · Guillaume Jeanneret, Loïc Simon, Frédéric Jurie

A Dynamic Multi-Scale Voxel Flow Network for Video Prediction

March 17, 2023 · 731 words · Xiaotao Hu, Zhewei Huang, Ailin Huang, Jun Xu, Shuchang Zhou

Trained on 100 million words and still in shape: BERT meets British National Corpus

March 17, 2023 · 1248 words · David Samuel, Andrey Kutuzov, Lilja Øvrelid, Erik Velldal

CoLT5: Faster Long-Range Transformers with Conditional Computation

March 17, 2023 · 652 words · Joshua Ainslie, Tao Lei, Michiel de Jong, Santiago Ontañón, Siddhartha Brahma and 7 others

Efficient Diffusion Training via Min-SNR Weighting Strategy

March 16, 2023 · 705 words · Tiankai Hang, Shuyang Gu, Chen Li, Jianmin Bao, Dong Chen and 3 others

LERF: Language Embedded Radiance Fields

March 16, 2023 · 1075 words · Justin Kerr, Chung Min Kim, Ken Goldberg, Angjoo Kanazawa, Matthew Tancik

SemDeDup: Data-efficient learning at web-scale through semantic deduplication

March 16, 2023 · 1135 words · Amro Abbas, Kushal Tirumala, Dániel Simig, Surya Ganguli, Ari S. Morcos

FateZero: Fusing Attentions for Zero-shot Text-based Video Editing

March 16, 2023 · 709 words · Chenyang Qi, Xiaodong Cun, Yong Zhang, Chenyang Lei, Xintao Wang and 2 others

$P+$: Extended Textual Conditioning in Text-to-Image Generation

March 16, 2023 · 796 words · Andrey Voynov, Qinghao Chu, Daniel Cohen-Or, Kfir Aberman

Jump to Conclusions: Short-Cutting Transformers With Linear Transformations

March 16, 2023 · 1017 words · Alexander Yom Din, Taelin Karidi, Leshem Choshen, Mor Geva

NeRFMeshing: Distilling Neural Radiance Fields into Geometrically-Accurate 3D Meshes

March 16, 2023 · 888 words · Marie-Julie Rakotosaona, Fabian Manhardt, Diego Martin Arroyo, Michael Niemeyer, Abhijit Kundu and 1 others

Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation

March 16, 2023 · 829 words · Yiyang Ma, Huan Yang, Wenjing Wang, Jianlong Fu, Jiaying Liu

Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation

March 16, 2023 · 719 words · Lingting Zhu, Xian Liu, Xuanyu Liu, Rui Qian, Ziwei Liu and 1 others

GLEN: General-Purpose Event Detection for Thousands of Types

March 16, 2023 · 1186 words · Qiusi Zhan, Sha Li, Kathryn Conger, Martha Palmer, Heng Ji and 1 others

Secret-Keeping in Question Answering

March 16, 2023 · 1035 words · Nathaniel W. Rollings, Kent O'Sullivan, Sakshum Kulshrestha

Translating Radiology Reports into Plain Language using ChatGPT and GPT-4 with Prompt Learning: Promising Results, Limitations, and Potential

March 16, 2023 · 860 words · Qing Lyu, Josh Tan, Mike E. Zapadka, Janardhana Ponnatapuram, Chuang Niu and 2 others

ART: Automatic multi-step reasoning and tool-use for large language models

March 16, 2023 · 1092 words · Bhargavi Paranjape, Scott Lundberg, Sameer Singh, Hannaneh Hajishirzi, Luke Zettlemoyer and 1 others

UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation

March 15, 2023 · 930 words · Daixuan Cheng, Shaohan Huang, Junyu Bi, Yuefeng Zhan, Jianfeng Liu and 5 others

Rotation-Invariant Transformer for Point Cloud Matching

March 14, 2023 · 912 words · Hao Yu, Zheng Qin, Ji Hou, Mahdi Saleh, Dongsheng Li and 2 others

Allegro-Legato: Scalable, Fast, and Robust Neural-Network Quantum Molecular Dynamics via Sharpness-Aware Minimization

March 14, 2023 · 1020 words · Hikaru Ibayashi, Taufeq Mohammed Razakh, Liqiu Yang, Thomas Linker, Marco Olguin and 6 others

Blind Video Deflickering by Neural Filtering with a Flawed Atlas

March 14, 2023 · 687 words · Chenyang Lei, Xuanchi Ren, Zhaoxiang Zhang, Qifeng Chen

Simfluence: Modeling the Influence of Individual Training Examples by Simulating Training Runs

March 14, 2023 · 945 words · Kelvin Guu, Albert Webson, Ellie Pavlick, Lucas Dixon, Ian Tenney and 1 others

A Theory of Emergent In-Context Learning as Implicit Structure Induction

March 14, 2023 · 1364 words · Michael Hahn, Navin Goyal

I$^2$-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs

March 14, 2023 · 907 words · Jingsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li and 6 others

FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization

March 13, 2023 · 723 words · Jiawei Yang, Marco Pavone, Yue Wang

Erasing Concepts from Diffusion Models

March 13, 2023 · 989 words · Rohit Gandikota, Joanna Materzynska, Jaden Fiotto-Kaufman, David Bau

Meet in the Middle: A New Pre-training Paradigm

March 13, 2023 · 803 words · Anh Nguyen, Nikos Karampatziakis, Weizhu Chen

Scaling Vision-Language Models with Sparse Mixture of Experts

March 13, 2023 · 766 words · Sheng Shen, Zhewei Yao, Chunyuan Li, Trevor Darrell, Kurt Keutzer and 1 others

High-throughput Generative Inference of Large Language Models with a Single GPU

March 13, 2023 · 1201 words · Ying Sheng, Lianmin Zheng, Binhang Yuan, Zhuohan Li, Max Ryabinin and 9 others

Universal Instance Perception as Object Discovery and Retrieval

March 12, 2023 · 1096 words · Bin Yan, Yi Jiang, Jiannan Wu, Dong Wang, Ping Luo and 2 others

One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale

March 12, 2023 · 774 words · Fan Bao, Shen Nie, Kaiwen Xue, Chongxuan Li, Shi Pu and 5 others

Prefix-tree Decoding for Predicting Mass Spectra from Molecules

March 11, 2023 · 824 words · Samuel Goldman, John Bradshaw, Jiayi Xin, Connor W. Coley

Probing neural representations of scene perception in a hippocampally dependent task using artificial neural networks

March 11, 2023 · 893 words · Markus Frey, Christian F. Doeller, Caswell Barry

Resurrecting Recurrent Neural Networks for Long Sequences

March 11, 2023 · 1266 words · Antonio Orvieto, Samuel L Smith, Albert Gu, Anushan Fernando, Caglar Gulcehre and 2 others

Stabilizing Transformer Training by Preventing Attention Entropy Collapse

March 11, 2023 · 743 words · Shuangfei Zhai, Tatiana Likhomanenko, Etai Littwin, Dan Busbridge, Jason Ramapuram and 3 others

StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces

March 10, 2023 · 916 words · Shuai Yang, Liming Jiang, Ziwei Liu, Chen Change Loy

Rewarding Chatbots for Real-World Engagement with Millions of Users

March 10, 2023 · 1275 words · Robert Irvine, Douglas Boubert, Vyas Raina, Adian Liusie, Vineet Mudupalli and 8 others

MVImgNet: A Large-scale Dataset of Multi-view Images

March 10, 2023 · 977 words · Xianggang Yu, Mutian Xu, Yidan Zhang, Haolin Liu, Chongjie Ye and 8 others

Understanding and Constructing Latent Modality Structures in Multi-modal Representation Learning

March 10, 2023 · 853 words · Qian Jiang, Changyou Chen, Han Zhao, Liqun Chen, Qing Ping and 4 others

Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction

March 10, 2023 · 950 words · Mingfang Zhang, Jinglu Wang, Xiao Li, Yifei Huang, Yoichi Sato and 1 others

Product Jacobi-Theta Boltzmann machines with score matching

March 10, 2023 · 358 words · Andrea Pasquale, Daniel Krefl, Stefano Carrazza, Frank Nielsen

Self-NeRF: A Self-Training Pipeline for Few-Shot Neural Radiance Fields

March 10, 2023 · 684 words · Jiayang Bai, Letian Huang, Wen Gong, Jie Guo, Yanwen Guo

An Overview on Language Models: Recent Developments and Outlook

March 10, 2023 · 1963 words · Chengwei Wei, Yun-Cheng Wang, Bin Wang, C. -C. Jay Kuo

Scaling up GANs for Text-to-Image Synthesis

March 9, 2023 · 1104 words · Minguk Kang, Jun-Yan Zhu, Richard Zhang, Jaesik Park, Eli Shechtman and 2 others

Users are the North Star for AI Transparency

March 9, 2023 · 1066 words · Alex Mei, Michael Saxon, Shiyu Chang, Zachary C. Lipton, William Yang Wang

Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning

March 9, 2023 · 1473 words · Mitsuhiko Nakamoto, Yuexiang Zhai, Anikait Singh, Max Sobol Mark, Yi Ma and 3 others

Personalisation within bounds: A risk taxonomy and policy framework for the alignment of large language models with personalised feedback

March 9, 2023 · 939 words · Hannah Rose Kirk, Bertie Vidgen, Paul Röttger, Scott A. Hale

Kernel Regression with Infinite-Width Neural Networks on Millions of Examples

March 9, 2023 · 1076 words · Ben Adlam, Jaehoon Lee, Shreyas Padhy, Zachary Nado, Jasper Snoek

Brain-Diffuser: Natural scene reconstruction from fMRI signals using generative latent diffusion

March 9, 2023 · 933 words · Furkan Ozcelik, Rufin VanRullen

Revisiting Rotation Averaging: Uncertainties and Robust Losses

March 9, 2023 · 626 words · Ganlin Zhang, Viktor Larsson, Daniel Barath

DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation

March 9, 2023 · 790 words · Yiqun Duan, Xianda Guo, Zheng Zhu

Bayesian at heart: Towards autonomic outflow estimation via generative state-space modelling of heart rate dynamics

March 8, 2023 · 1011 words · Fernando E. Rosas, Diego Candia-Rivera, Andrea I Luppi, Yike Guo, Pedro A. M. Mediano

X-Avatar: Expressive Human Avatars

March 8, 2023 · 695 words · Kaiyue Shen, Chen Guo, Manuel Kaufmann, Juan Jose Zarate, Julien Valentin and 2 others

Ewald-based Long-Range Message Passing for Molecular Graphs

March 8, 2023 · 723 words · Arthur Kosmala, Johannes Gasteiger, Nicholas Gao, Stephan Günnemann

Video-P2P: Video Editing with Cross-attention Control

March 8, 2023 · 694 words · Shaoteng Liu, Yuechen Zhang, Wenbo Li, Zhe Lin, Jiaya Jia

Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference

March 8, 2023 · 849 words · Chi Wang, Susan Xueqing Liu, Ahmed H. Awadallah

The Descriptive Complexity of Graph Neural Networks

March 8, 2023 · 1742 words · Martin Grohe

Magnushammer: A Transformer-based Approach to Premise Selection

March 8, 2023 · 642 words · Maciej Mikuła, Szymon Antoniak, Szymon Tworkowski, Albert Qiaochu Jiang, Jin Peng Zhou and 4 others

The Lie-Group Bayesian Learning Rule

March 8, 2023 · 1256 words · Eren Mehmet Kıral, Thomas Möllenhoff, Mohammad Emtiyaz Khan

DroNeRF: Real-time Multi-agent Drone Pose Optimization for Computing Neural Radiance Fields

March 8, 2023 · 609 words · Dipam Patel, Phu Pham, Aniket Bera

TRACT: Denoising Diffusion Models with Transitive Closure Time-Distillation

March 7, 2023 · 737 words · David Berthelot, Arnaud Autef, Jierui Lin, Dian Ang Yap, Shuangfei Zhai and 4 others

How Do Transformers Learn Topic Structure: Towards a Mechanistic Understanding

March 7, 2023 · 1421 words · Yuchen Li, Yuanzhi Li, Andrej Risteski

Computing with Categories in Machine Learning

March 7, 2023 · 545 words · Eli Sennesh, Tom Xu, Yoshihiro Maruyama

OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception

March 7, 2023 · 911 words · Xiaofeng Wang, Zheng Zhu, Wenbo Xu, Yunpeng Zhang, Yi Wei and 5 others

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

March 7, 2023 · 968 words · Hugo Laurençon, Lucile Saulnier, Thomas Wang, Christopher Akiki, Albert Villanova del Moral and 49 others

New Perspectives on Regularization and Computation in Optimal Transport-Based Distributionally Robust Optimization

March 7, 2023 · 1535 words · Soroosh Shafieezadeh-Abadeh, Liviu Aolaritei, Florian Dörfler, Daniel Kuhn

Selecting Features for Markov Modeling: A Case Study on HP35

March 7, 2023 · 1219 words · Daniel Nagel, Sofia Sartore, Gerhard Stock

Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles

March 7, 2023 · 1239 words · Zhiwei Tang, Dmitry Rybin, Tsung-Hui Chang

Run, Don’t Walk: Chasing Higher FLOPS for Faster Neural Networks

March 7, 2023 · 1090 words · Jierun Chen, Shiu-hong Kao, Hao He, Weipeng Zhuo, Song Wen and 2 others

Towards a Complete Analysis of Langevin Monte Carlo: Beyond Poincaré Inequality

March 7, 2023 · 1036 words · Alireza Mousavi-Hosseini, Tyler Farghly, Ye He, Krishnakumar Balasubramanian, Murat A. Erdogdu

Structured Kernel Estimation for Photon-Limited Deconvolution

March 6, 2023 · 692 words · Yash Sanghvi, Zhiyuan Mao, Stanley H. Chan

PaLM-E: An Embodied Multimodal Language Model

March 6, 2023 · 1187 words · Danny Driess, Fei Xia, Mehdi S. M. Sajjadi, Corey Lynch, Aakanksha Chowdhery and 17 others

Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language

March 6, 2023 · 691 words · Philipp Seidl, Andreu Vall, Sepp Hochreiter, Günter Klambauer

Faithfulness-Aware Decoding Strategies for Abstractive Summarization

March 6, 2023 · 957 words · David Wan, Mengwen Liu, Kathleen McKeown, Markus Dreyer, Mohit Bansal

Convergence Rates for Non-Log-Concave Sampling and Log-Partition Estimation

March 6, 2023 · 1539 words · David Holzmüller, Francis Bach

OpenICL: An Open-Source Framework for In-context Learning

March 6, 2023 · 521 words · Zhenyu Wu, YaoXiang Wang, Jiacheng Ye, Jiangtao Feng, Jingjing Xu and 2 others

Prismer: A Vision-Language Model with An Ensemble of Experts

March 4, 2023 · 900 words · Shikun Liu, Linxi Fan, Edward Johns, Zhiding Yu, Chaowei Xiao and 1 others

Unleashing Text-to-Image Diffusion Models for Visual Perception

March 3, 2023 · 877 words · Wenliang Zhao, Yongming Rao, Zuyan Liu, Benlin Liu, Jie Zhou and 1 others

Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners

March 3, 2023 · 620 words · Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng and 3 others

Towards Democratizing Joint-Embedding Self-Supervised Learning

March 3, 2023 · 962 words · Florian Bordes, Randall Balestriero, Pascal Vincent

Mixture of Soft Prompts for Controllable Data Generation

March 2, 2023 · 804 words · Derek Chen, Celine Lee, Yunan Lu, Domenic Rosati, Zhou Yu

Chemically Transferable Generative Backmapping of Coarse-Grained Proteins

March 2, 2023 · 901 words · Soojung Yang, Rafael Gómez-Bombarelli

Dropout Reduces Underfitting

March 2, 2023 · 737 words · Zhuang Liu, Zhiqiu Xu, Joseph Jin, Zhiqiang Shen, Trevor Darrell

Understanding plasticity in neural networks

March 2, 2023 · 1174 words · Clare Lyle, Zeyu Zheng, Evgenii Nikishin, Bernardo Avila Pires, Razvan Pascanu and 1 others

Auxiliary Functions as Koopman Observables: Data-Driven Polynomial Optimization for Dynamical Systems

March 2, 2023 · 1874 words · Jason J. Bramburger, Giovanni Fantuzzi

Consistency Models

March 2, 2023 · 1121 words · Yang Song, Prafulla Dhariwal, Mark Chen, Ilya Sutskever

WiCE: Real-World Entailment for Claims in Wikipedia

March 2, 2023 · 1137 words · Ryo Kamoi, Tanya Goyal, Juan Diego Rodriguez, Greg Durrett

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

March 2, 2023 · 1520 words · Yu Zhang, Wei Han, James Qin, Yongqiang Wang, Ankur Bapna and 22 others

X&Fuse: Fusing Visual Information in Text-to-Image Generation

March 2, 2023 · 741 words · Yuval Kirstain, Omer Levy, Adam Polyak

ParaFormer: Parallel Attention Transformer for Efficient Feature Matching

March 2, 2023 · 697 words · Xiaoyong Lu, Yaping Yan, Bin Kang, Songlin Du

Disentangling Linkage and Population Structure in Association Mapping

March 2, 2023 · 566 words · Hanbin Lee, Moo Hyuk Lee

Grounded Decoding: Guiding Text Generation with Grounded Models for Robot Control

March 1, 2023 · 1068 words · Wenlong Huang, Fei Xia, Dhruv Shah, Danny Driess, Andy Zeng and 6 others

UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers

March 1, 2023 · 844 words · Jon Saad-Falcon, Omar Khattab, Keshav Santhanam, Radu Florian, Martin Franz and 4 others

Improved Segmentation of Deep Sulci in Cortical Gray Matter Using a Deep Learning Framework Incorporating Laplace’s Equation

March 1, 2023 · 787 words · Sadhana Ravikumar, Ranjit Ittyerah, Sydney Lim, Long Xie, Sandhitsu Das and 27 others

StraIT: Non-autoregressive Generation with Stratified Image Transformer

March 1, 2023 · 720 words · Shengju Qian, Huiwen Chang, Yuanzhen Li, Zizhao Zhang, Jiaya Jia and 1 others

R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents

March 1, 2023 · 1620 words · Daniel D. Johnson, Daniel Tarlow, Christian Walder

Finding the right XAI method – A Guide for the Evaluation and Ranking of Explainable AI Methods in Climate Science

March 1, 2023 · 1156 words · Philine Bommer, Marlene Kretschmer, Anna Hedström, Dilyara Bareeva, Marina M. -C. Höhne

An Information-Theoretic Perspective on Variance-Invariance-Covariance Regularization

March 1, 2023 · 1250 words · Ravid Shwartz-Ziv, Randall Balestriero, Kenji Kawaguchi, Tim G. J. Rudner, Yann LeCun

FAIR-Ensemble: When Fairness Naturally Emerges From Deep Ensembling

March 1, 2023 · 609 words · Wei-Yin Ko, Daniel D'souza, Karina Nguyen, Randall Balestriero, Sara Hooker

Unlimited-Size Diffusion Restoration

March 1, 2023 · 769 words · Yinhuai Wang, Jiwen Yu, Runyi Yu, Jian Zhang

Collage Diffusion

March 1, 2023 · 952 words · Vishnu Sarukkai, Linden Li, Arden Ma, Christopher Ré, Kayvon Fatahalian

Almanac: Knowledge-Grounded Language Models for Clinical Medicine

March 1, 2023 · 519 words · Cyril Zakka, Akash Chaurasia, Rohan Shad, William Hiesinger

February¹¹⁶

Methods and measures for investigating microscale motility

February 28, 2023 · 575 words · Karen Grace Bondoc-Naumovitz, Hannah Laeverenz-Schlogelhofer, Rebecca N. Poon, Alexander K. Boggon, Samuel A. Bentley and 2 others

EvoPrompting: Language Models for Code-Level Neural Architecture Search

February 28, 2023 · 767 words · Angelica Chen, David M. Dohan, David R. So

Monocular Depth Estimation using Diffusion Models

February 28, 2023 · 776 words · Saurabh Saxena, Abhishek Kar, Mohammad Norouzi, David J. Fleet

Membership Inference Attack for Beluga Whales Discrimination

February 28, 2023 · 1469 words · Voncarlos Marcelo Araújo, Sébastien Gambs, Clément Chion, Robert Michaud, Léo Schneider and 1 others

Learning Hidden Markov Models Using Conditional Samples

February 28, 2023 · 2094 words · Sham M. Kakade, Akshay Krishnamurthy, Gaurav Mahajan, Cyril Zhang

Is Japanese CCGBank empirically correct? A case study of passive and causative constructions

February 28, 2023 · 423 words · Daisuke Bekki, Hitomi Yanaka

In-Context Instruction Learning

February 28, 2023 · 500 words · Seonghyeon Ye, Hyeonbin Hwang, Sohee Yang, Hyeongu Yun, Yireun Kim and 1 others

H-AES: Towards Automated Essay Scoring for Hindi

February 28, 2023 · 1018 words · Shubhankar Singh, Anirudh Pupneja, Shivaansh Mital, Cheril Shah, Manish Bawkar and 5 others

Large Language Models Are State-of-the-Art Evaluators of Translation Quality

February 28, 2023 · 739 words · Tom Kocmi, Christian Federmann

An Algorithm and Complexity Results for Causal Unit Selection

February 28, 2023 · 863 words · Haiying Huang, Adnan Darwiche

Information-Restricted Neural Language Models Reveal Different Brain Regions’ Sensitivity to Semantics, Syntax and Context

February 28, 2023 · 1117 words · Alexandre Pasquiou, Yair Lakretz, Bertrand Thirion, Christophe Pallier

Im2Hands: Learning Attentive Implicit Representation of Interacting Two-Hand Shapes

February 28, 2023 · 881 words · Jihyun Lee, Minhyuk Sung, Honggyu Choi, Tae-Kyun Kim

HelixSurf: A Robust and Efficient Neural Implicit Surface Learning of Indoor Scenes with Iterative Intertwined Regularization

February 28, 2023 · 1043 words · Zhihao Liang, Zhangjin Huang, Changxing Ding, Kui Jia

Goal Driven Discovery of Distributional Differences via Language Descriptions

February 28, 2023 · 1020 words · Ruiqi Zhong, Peter Zhang, Steve Li, Jinwoo Ahn, Dan Klein and 1 others

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning

February 27, 2023 · 800 words · Antoine Yang, Arsha Nagrani, Paul Hongsuck Seo, Antoine Miech, Jordi Pont-Tuset and 3 others

Language Is Not All You Need: Aligning Perception with Language Models

February 27, 2023 · 470 words · Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal and 12 others

The ROOTS Search Tool: Data Transparency for LLMs

February 27, 2023 · 595 words · Aleksandra Piktus, Christopher Akiki, Paulo Villegas, Hugo Laurençon, Gérard Dupont and 3 others

Causal isotonic calibration for heterogeneous treatment effects

February 27, 2023 · 1018 words · Lars van der Laan, Ernesto Ulloa-Pérez, Marco Carone, Alex Luedtke

Optimistic Planning by Regularized Dynamic Programming

February 27, 2023 · 925 words · Antoine Moulin, Gergely Neu

LLaMA: Open and Efficient Foundation Language Models

February 27, 2023 · 1252 words · Hugo Touvron, Thibaut Lavril, Gautier Izacard, Xavier Martinet, Marie-Anne Lachaux and 9 others

Inseq: An Interpretability Toolkit for Sequence Generation Models

February 27, 2023 · 751 words · Gabriele Sarti, Nils Feldhus, Ludwig Sickert, Oskar van der Wal

ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation

February 27, 2023 · 765 words · Yuxiang Wei, Yabo Zhang, Zhilong Ji, Jinfeng Bai, Lei Zhang and 1 others

The Role of Pre-training Data in Transfer Learning

February 27, 2023 · 1286 words · Rahim Entezari, Mitchell Wortsman, Olga Saukh, M. Moein Shariatnia, Hanie Sedghi and 1 others

OccDepth: A Depth-Aware Method for 3D Semantic Scene Completion

February 27, 2023 · 694 words · Ruihang Miao, Weizhou Liu, Mingrui Chen, Zheng Gong, Weixin Xu and 2 others

Navigating the Grey Area: Expressions of Overconfidence and Uncertainty in Language Models

February 26, 2023 · 1068 words · Kaitlyn Zhou, Dan Jurafsky, Tatsunori Hashimoto

Jointly Optimizing Translations and Speech Timing to Improve Isochrony in Automatic Dubbing

February 25, 2023 · 956 words · Alexandra Chronopoulou, Brian Thompson, Prashant Mathur, Yogesh Virkar, Surafel M. Lakew and 1 others

SplineCam: Exact Visualization and Characterization of Deep Network Geometry and Decision Boundaries

February 24, 2023 · 939 words · Ahmed Imtiaz Humayun, Randall Balestriero, Guha Balakrishnan, Richard Baraniuk

Fairness in Language Models Beyond English: Gaps and Challenges

February 24, 2023 · 1011 words · Krithika Ramesh, Sunayana Sitaram, Monojit Choudhury

Model-Based Uncertainty in Value Functions

February 24, 2023 · 1301 words · Carlos E. Luis, Alessandro G. Bottero, Julia Vinogradska, Felix Berkenkamp, Jan Peters

MUX-PLMs: Pre-training Language Models with Data Multiplexing

February 24, 2023 · 788 words · Vishvak Murahari, Ameet Deshpande, Carlos E. Jimenez, Izhak Shafran, Mingqiu Wang and 2 others

ProofNet: Autoformalizing and Formally Proving Undergraduate-Level Mathematics

February 24, 2023 · 622 words · Zhangir Azerbayev, Bartosz Piotrowski, Hailey Schoelkopf, Edward W. Ayers, Dragomir Radev and 1 others

Flexible Phase Dynamics for Bio-Plausible Contrastive Learning

February 24, 2023 · 1112 words · Ezekiel Williams, Colin Bredenberg, Guillaume Lajoie

In What Languages are Generative Language Models the Most Formal? Analyzing Formality Distribution across Languages

February 23, 2023 · 1019 words · Asım Ersoy, Gerson Vizcarra, Tasmiah Tahsin Mayeesha, Benjamin Muller

DisCO: Portrait Distortion Correction with Perspective-Aware 3D GANs

February 23, 2023 · 892 words · Zhixiang Wang, Yu-Lun Liu, Jia-Bin Huang, Shin'ichi Satoh, Sizhuo Ma and 2 others

Active Prompting with Chain-of-Thought for Large Language Models

February 23, 2023 · 966 words · Shizhe Diao, Pengcheng Wang, Yong Lin, Tong Zhang

Improving Adaptive Conformal Prediction Using Self-Supervised Learning

February 23, 2023 · 890 words · Nabeel Seedat, Alan Jeffares, Fergus Imrie, Mihaela van der Schaar

DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models

February 23, 2023 · 922 words · Jamie Wynn, Daniyar Turmukhambetov

Aligning Text-to-Image Models using Human Feedback

February 23, 2023 · 654 words · Kimin Lee, Hao Liu, Moonkyung Ryu, Olivia Watkins, Yuqing Du and 4 others

More than you’ve asked for: A Comprehensive Analysis of Novel Prompt Injection Threats to Application-Integrated Large Language Models

February 23, 2023 · 795 words · Kai Greshake, Sahar Abdelnabi, Shailesh Mishra, Christoph Endres, Thorsten Holz and 1 others

ProsAudit, a prosodic benchmark for self-supervised speech models

February 23, 2023 · 683 words · Maureen de Seyssel, Marvin Lavechin, Hadrien Titeux, Arthur Thomas, Gwendal Virlet and 4 others

Liquidity Providers Greeks and Impermanent Gain

February 23, 2023 · 778 words · Niccolò Bardoscia, Alessandro Nodari

Controlled and Conditional Text to Image Generation with Diffusion Prior

February 23, 2023 · 1276 words · Pranav Aggarwal, Hareesh Ravi, Naveen Marri, Sachin Kelkar, Fengbin Chen and 10 others

Bayes meets Bernstein at the Meta Level: an Analysis of Fast Rates in Meta-Learning with PAC-Bayes

February 23, 2023 · 1242 words · Charles Riou, Pierre Alquier, Badr-Eddine Chérief-Abdellatif

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving

February 22, 2023 · 1623 words · Zhuohan Li, Lianmin Zheng, Yinmin Zhong, Vincent Liu, Ying Sheng and 6 others

Modular Deep Learning

February 22, 2023 · 1698 words · Jonas Pfeiffer, Sebastian Ruder, Ivan Vulić, Edoardo Maria Ponti

How Does In-Context Learning Help Prompt Tuning?

February 22, 2023 · 524 words · Simeng Sun, Yang Liu, Dan Iter, Chenguang Zhu, Mohit Iyyer

Guiding Large Language Models via Directional Stimulus Prompting

February 22, 2023 · 888 words · Zekun Li, Baolin Peng, Pengcheng He, Michel Galley, Jianfeng Gao and 1 others

Regularised neural networks mimic human insight

February 22, 2023 · 1510 words · Anika T. Löwe, Léo Touzo, Paul S. Muhle-Karbe, Andrew M. Saxe, Christopher Summerfield and 1 others

On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective

February 22, 2023 · 821 words · Jindong Wang, Xixu Hu, Wenxin Hou, Hao Chen, Runkai Zheng and 8 others

$PC^2$: Projection-Conditioned Point Cloud Diffusion for Single-Image 3D Reconstruction

February 21, 2023 · 728 words · Luke Melas-Kyriazi, Christian Rupprecht, Andrea Vedaldi

RealFusion: 360° Reconstruction of Any Object from a Single Image

February 21, 2023 · 1006 words · Luke Melas-Kyriazi, Christian Rupprecht, Iro Laina, Andrea Vedaldi

Optical Transformers

February 20, 2023 · 1228 words · Maxwell G. Anderson, Shi-Yuan Ma, Tianyu Wang, Logan G. Wright, Peter L. McMahon

Meta-World Conditional Neural Processes

February 20, 2023 · 740 words · Suzan Ece Ada, Emre Ugur

Over-Parameterization Exponentially Slows Down Gradient Descent for Learning a Single Neuron

February 20, 2023 · 1506 words · Weihang Xu, Simon S. Du

A Large Scale Homography Benchmark

February 20, 2023 · 640 words · Daniel Barath, Dmytro Mishkin, Michal Polic, Wolfgang Förstner, Jiri Matas

TBPos: Dataset for Large-Scale Precision Visual Localization

February 20, 2023 · 714 words · Masud Fahim, Ilona Söchting, Luca Ferranti, Juho Kannala, Jani Boutellier

Composer: Creative and Controllable Image Synthesis with Composable Conditions

February 20, 2023 · 937 words · Lianghua Huang, Di Chen, Yu Liu, Yujun Shen, Deli Zhao and 1 others

A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT

February 18, 2023 · 1855 words · Ce Zhou, Qian Li, Chen Li, Jun Yu, Yixin Liu and 14 others

Machine Love

February 18, 2023 · 1423 words · Joel Lehman

Scalable Prompt Generation for Semi-supervised Learning with Language Models

February 18, 2023 · 935 words · Yuhang Zhou, Suraj Maharjan, Beiye Liu

On Equivalent Optimization of Machine Learning Methods

February 17, 2023 · 1012 words · William T. Redman, Juan M. Bello-Rivas, Maria Fonoberova, Ryan Mohr, Ioannis G. Kevrekidis and 1 others

JANA: Jointly Amortized Neural Approximation of Complex Bayesian Models

February 17, 2023 · 1156 words · Stefan T. Radev, Marvin Schmitt, Valentin Pratz, Umberto Picchini, Ullrich Köthe and 1 others

Text-driven Visual Synthesis with Latent Diffusion Prior

February 16, 2023 · 580 words · Ting-Hsuan Liao, Songwei Ge, Yiran Xu, Yao-Chih Lee, Badour AlBahar and 1 others

Auditing large language models: a three-layered approach

February 16, 2023 · 1545 words · Jakob Mökander, Jonas Schuett, Hannah Rose Kirk, Luciano Floridi

Large Language Models Fail on Trivial Alterations to Theory-of-Mind Tasks

February 16, 2023 · 498 words · Tomer Ullman

Tuning computer vision models with task rewards

February 16, 2023 · 799 words · André Susano Pinto, Alexander Kolesnikov, Yuge Shi, Lucas Beyer, Xiaohua Zhai

A Survey of Geometric Optimization for Deep Learning: From Euclidean Space to Riemannian Manifold

February 16, 2023 · 1635 words · Yanhong Fei, Xian Wei, Yingjie Liu, Zhengyu Li, Mingsong Chen

Empirical Investigation of Neural Symbolic Reasoning Strategies

February 16, 2023 · 527 words · Yoichi Aoki, Keito Kudo, Tatsuki Kuribayashi, Ana Brassard, Masashi Yoshikawa and 2 others

À-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable Prompting

February 15, 2023 · 673 words · Benjamin Bowman, Alessandro Achille, Luca Zancato, Matthew Trager, Pramuditha Perera and 2 others

Topological Neural Discrete Representation Learning à la Kohonen

February 15, 2023 · 629 words · Kazuki Irie, Róbert Csordás, Jürgen Schmidhuber

The Expressive Power of Tuning Only the Norm Layers

February 15, 2023 · 1196 words · Angeliki Giannou, Shashank Rajput, Dimitris Papailiopoulos

Denoising Diffusion Probabilistic Models for Robust Image Super-Resolution in the Wild

February 15, 2023 · 774 words · Hshmat Sahak, Daniel Watson, Chitwan Saharia, David Fleet

Augmented Language Models: a Survey

February 15, 2023 · 1298 words · Grégoire Mialon, Roberto Dessì, Maria Lomeli, Christoforos Nalmpantis, Ram Pasunuru and 8 others

Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction

February 15, 2023 · 979 words · Yuanhui Huang, Wenzhao Zheng, Yunpeng Zhang, Jie Zhou, Jiwen Lu

The Capacity for Moral Self-Correction in Large Language Models

February 15, 2023 · 921 words · Deep Ganguli, Amanda Askell, Nicholas Schiefer, Thomas Liao, Kamilė Lukošiūtė and 43 others

How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval

February 15, 2023 · 1078 words · Sheng-Chieh Lin, Akari Asai, Minghan Li, Barlas Oguz, Jimmy Lin and 3 others

Energy Transformer

February 14, 2023 · 1282 words · Benjamin Hoover, Yuchen Liang, Bao Pham, Rameswar Panda, Hendrik Strobelt and 3 others

Silences, Spikes and Bursts: Three-Part Knot of the Neural Code

February 14, 2023 · 1661 words · Richard Naud, Zachary Friedenberger, Katalin Toth

Universal Guidance for Diffusion Models

February 14, 2023 · 828 words · Arpit Bansal, Hong-Min Chu, Avi Schwarzschild, Soumyadip Sengupta, Micah Goldblum and 2 others

Statistically Optimal Force Aggregation for Coarse-Graining Molecular Dynamics

February 14, 2023 · 882 words · Andreas Krämer, Aleksander P. Durumeric, Nicholas E. Charron, Yaoyi Chen, Cecilia Clementi and 1 others

Do Deep Learning Methods Really Perform Better in Molecular Conformation Generation?

February 14, 2023 · 406 words · Gengmo Zhou, Zhifeng Gao, Zhewei Wei, Hang Zheng, Guolin Ke

AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models

February 14, 2023 · 739 words · Alexandra Chronopoulou, Matthew E. Peters, Alexander Fraser, Jesse Dodge

A modern look at the relationship between sharpness and generalization

February 14, 2023 · 807 words · Maksym Andriushchenko, Francesco Croce, Maximilian Müller, Matthias Hein, Nicolas Flammarion

A Review of the Role of Causality in Developing Trustworthy AI Systems

February 14, 2023 · 701 words · Niloy Ganguly, Dren Fazlija, Maryam Badar, Marco Fisichella, Sandipan Sikdar and 7 others

Concentration Bounds for Discrete Distribution Estimation in KL Divergence

February 14, 2023 · 594 words · Clément L. Canonne, Ziteng Sun, Ananda Theertha Suresh

SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains

February 14, 2023 · 592 words · Koustava Goswami, Lukas Lange, Jun Araki, Heike Adel

EspalomaCharge: Machine learning-enabled ultra-fast partial charge assignment

February 14, 2023 · 551 words · Yuanqing Wang, Iván Pulido, Kenichiro Takaba, Benjamin Kaminow, Jenke Scheen and 2 others

Guiding Pretraining in Reinforcement Learning with Large Language Models

February 13, 2023 · 1039 words · Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell and 3 others

Symbolic Discovery of Optimization Algorithms

February 13, 2023 · 992 words · Xiangning Chen, Chen Liang, Da Huang, Esteban Real, Kaiyuan Wang and 7 others

Event-based Backpropagation for Analog Neuromorphic Hardware

February 13, 2023 · 763 words · Christian Pehle, Luca Blessing, Elias Arnold, Eric Müller, Johannes Schemmel

Stitchable Neural Networks

February 13, 2023 · 669 words · Zizheng Pan, Jianfei Cai, Bohan Zhuang

Sources of Richness and Ineffability for Phenomenally Conscious States

February 13, 2023 · 1488 words · Xu Ji, Eric Elmoznino, George Deane, Axel Constant, Guillaume Dumas and 3 others

Stabilized In-Context Learning with Pre-trained Language Models for Few Shot Dialogue State Tracking

February 12, 2023 · 1349 words · Derek Chen, Kun Qian, Zhou Yu

From high-dimensional & mean-field dynamics to dimensionless ODEs: A unifying approach to SGD in two-layers networks

February 12, 2023 · 751 words · Luca Arnaboldi, Ludovic Stephan, Florent Krzakala, Bruno Loureiro

Compositional Exemplars for In-context Learning

February 11, 2023 · 946 words · Jiacheng Ye, Zhiyong Wu, Jiangtao Feng, Tao Yu, Lingpeng Kong

Evaluating the Robustness of Discrete Prompts

February 11, 2023 · 765 words · Yoichi Ishibashi, Danushka Bollegala, Katsuhito Sudoh, Satoshi Nakamura

Zero-Knowledge Mechanisms

February 11, 2023 · 645 words · Ran Canetti, Amos Fiat, Yannai A. Gonczarowski

Adding Conditional Control to Text-to-Image Diffusion Models

February 10, 2023 · 1116 words · Lvmin Zhang, Maneesh Agrawala

Thermodynamic AI and the fluctuation frontier

February 9, 2023 · 1810 words · Patrick J. Coles

Languages are Rewards: Hindsight Finetuning using Human Feedback

February 6, 2023 · 774 words · Hao Liu, Carmelo Sferrazza, Pieter Abbeel

A Modified CTGAN-Plus-Features Based Method for Optimal Asset Allocation

February 5, 2023 · 862 words · José-Manuel Peña, Fernando Suárez, Omar Larré, Domingo Ramírez, Arturo Cifuentes

The unreasonable effectiveness of few-shot learning for machine translation

February 2, 2023 · 810 words · Xavier Garcia, Yamini Bansal, Colin Cherry, George Foster, Maxim Krikun and 3 others

Effective Robustness against Natural Distribution Shifts for Models with Different Training Data

February 2, 2023 · 867 words · Zhouxing Shi, Nicholas Carlini, Ananth Balashankar, Ludwig Schmidt, Cho-Jui Hsieh and 2 others

Dreamix: Video Diffusion Models are General Video Editors

February 2, 2023 · 666 words · Eyal Molad, Eliahu Horwitz, Dani Valevski, Alex Rav Acha, Yossi Matias and 3 others

Accelerating Large Language Model Decoding with Speculative Sampling

February 2, 2023 · 614 words · Charlie Chen, Sebastian Borgeaud, Geoffrey Irving, Jean-Baptiste Lespiau, Laurent Sifre and 1 others

Double Permutation Equivariance for Knowledge Graph Completion

February 2, 2023 · 1013 words · Jianfei Gao, Yangze Zhou, Bruno Ribeiro

Explaining wall-bounded turbulence through deep learning

February 2, 2023 · 796 words · Andres Cremades, Sergio Hoyas, Pedro Quintero, Martin Lellep, Moritz Linkmann and 1 others

Causal Lifting and Link Prediction

February 2, 2023 · 1832 words · Leonardo Cotta, Beatrice Bevilacqua, Nesreen Ahmed, Bruno Ribeiro

Multimodal Chain-of-Thought Reasoning in Language Models

February 2, 2023 · 1070 words · Zhuosheng Zhang, Aston Zhang, Mu Li, Hai Zhao, George Karypis and 1 others

Collaborating with language models for embodied reasoning

February 1, 2023 · 673 words · Ishita Dasgupta, Christine Kaeser-Chen, Kenneth Marino, Arun Ahuja, Sheila Babayan and 2 others

Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data

February 1, 2023 · 952 words · Alon Albalak, Colin Raffel, William Yang Wang

Continuous U-Net: Faster, Greater and Noiseless

February 1, 2023 · 924 words · Chun-Wun Cheng, Christina Runkel, Lihao Liu, Raymond H Chan, Carola-Bibiane Schönlieb and 1 others

Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models

February 1, 2023 · 625 words · Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan and 1 others

Zero Shot Transfer of Legal Judgement Prediction as Article-aware Entailment for the European Court of Human Rights

February 1, 2023 · 1154 words · Santosh T. Y. S. S, Oana Ichim, Matthias Grabmair

Automatically Marginalized MCMC in Probabilistic Programming

February 1, 2023 · 1313 words · Jinlin Lai, Javier Burroni, Hui Guan, Daniel Sheldon

Width and Depth Limits Commute in Residual Networks

February 1, 2023 · 1141 words · Soufiane Hayou, Greg Yang

January¹⁴⁵

Emerging Trends in Droplet Microfluidic Screens for Biotechnology

January 31, 2023 · 511 words · Carlos Vidal-Céspedes, Tobias Wenzel

Learning Universal Policies via Text-Guided Video Generation

January 31, 2023 · 927 words · Yilun Du, Mengjiao Yang, Bo Dai, Hanjun Dai, Ofir Nachum and 3 others

In-Context Retrieval-Augmented Language Models

January 31, 2023 · 876 words · Ori Ram, Yoav Levine, Itay Dalmedigos, Dor Muhlgay, Amnon Shashua and 2 others

Transformers Meet Directed Graphs

January 31, 2023 · 980 words · Simon Geisler, Yujia Li, Daniel Mankowitz, Ali Taylan Cemgil, Stephan Günnemann and 1 others

Mathematical Capabilities of ChatGPT

January 31, 2023 · 824 words · Simon Frieder, Luca Pinchetti, Ryan-Rhys Griffiths, Tommaso Salvatori, Thomas Lukasiewicz and 3 others

Benchmarking Large Language Models for News Summarization

January 31, 2023 · 800 words · Tianyi Zhang, Faisal Ladhak, Esin Durmus, Percy Liang, Kathleen McKeown and 1 others

Grounding Language Models to Images for Multimodal Generation

January 31, 2023 · 813 words · Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried

Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models

January 31, 2023 · 865 words · Hila Chefer, Yuval Alaluf, Yael Vinker, Lior Wolf, Daniel Cohen-Or

Patch Gradient Descent: Training Neural Networks on Very Large Images

January 31, 2023 · 807 words · Deepak K. Gupta, Gowreesh Mago, Arnav Chavan, Dilip K. Prasad

Differentially Private Distributed Bayesian Linear Regression with MCMC

January 31, 2023 · 1021 words · Barış Alparslan, Sinan Yıldırım, Ş. İlker Birbil

The passive symmetries of machine learning

January 31, 2023 · 988 words · Soledad Villar, David W. Hogg, Weichi Yao, George A. Kevrekidis, Bernhard Schölkopf

The Flan Collection: Designing Data and Methods for Effective Instruction Tuning

January 31, 2023 · 403 words · Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung and 6 others

Learning Data Representations with Joint Diffusion Models

January 31, 2023 · 1023 words · Kamil Deja, Tomasz Trzcinski, Jakub M. Tomczak

Robust Linear Regression: Gradient-descent, Early-stopping, and Beyond

January 31, 2023 · 1378 words · Meyer Scetbon, Elvis Dohmatob

Scaling laws for single-agent reinforcement learning

January 31, 2023 · 1417 words · Jacob Hilton, Jie Tang, John Schulman

Faithful Chain-of-Thought Reasoning

January 31, 2023 · 987 words · Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao and 3 others

A Bias-Variance-Privacy Trilemma for Statistical Estimation

January 30, 2023 · 1577 words · Gautam Kamath, Argyris Mouzakis, Matthew Regehr, Vikrant Singhal, Thomas Steinke and 1 others

LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization

January 30, 2023 · 491 words · Kalpesh Krishna, Erin Bransom, Bailey Kuehl, Mohit Iyyer, Pradeep Dasigi and 2 others

Looped Transformers as Programmable Computers

January 30, 2023 · 1024 words · Angeliki Giannou, Shashank Rajput, Jy-yong Sohn, Kangwook Lee, Jason D. Lee and 1 others

Quantifying Context Mixing in Transformers

January 30, 2023 · 971 words · Hosein Mohebbi, Willem Zuidema, Grzegorz Chrupała, Afra Alishahi

Guiding Online Reinforcement Learning with Action-Free Offline Pretraining

January 30, 2023 · 950 words · Deyao Zhu, Yuhui Wang, Jürgen Schmidhuber, Mohamed Elhoseiny

Crawling the Internal Knowledge-Base of Language Models

January 30, 2023 · 642 words · Roi Cohen, Mor Geva, Jonathan Berant, Amir Globerson

Equivariant Architectures for Learning in Deep Weight Spaces

January 30, 2023 · 1481 words · Aviv Navon, Aviv Shamsian, Idan Achituve, Ethan Fetaya, Gal Chechik and 1 others

SingSong: Generating musical accompaniments from singing

January 30, 2023 · 1299 words · Chris Donahue, Antoine Caillon, Adam Roberts, Ethan Manilow, Philippe Esling and 6 others

REPLUG: Retrieval-Augmented Black-Box Language Models

January 30, 2023 · 930 words · Weijia Shi, Sewon Min, Michihiro Yasunaga, Minjoon Seo, Rich James and 3 others

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

January 30, 2023 · 906 words · Junnan Li, Dongxu Li, Silvio Savarese, Steven Hoi

Sample Efficient Deep Reinforcement Learning via Local Planning

January 29, 2023 · 1233 words · Dong Yin, Sridhar Thiagarajan, Nevena Lazic, Nived Rajaraman, Botao Hao and 1 others

A Discerning Several Thousand Judgments: GPT-3 Rates the Article + Adjective + Numeral + Noun Construction

January 29, 2023 · 650 words · Kyle Mahowald

Generating Novel, Designable, and Diverse Protein Structures by Equivariantly Diffusing Oriented Residue Clouds

January 29, 2023 · 867 words · Yeqing Lin, Mohammed AlQuraishi

SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient

January 27, 2023 · 1188 words · Max Ryabinin, Tim Dettmers, Michael Diskin, Alexander Borzunov

Call for Papers – The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus

January 27, 2023 · 332 words · Alex Warstadt, Leshem Choshen, Aaron Mueller, Adina Williams, Ethan Wilcox and 1 others

Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion

January 27, 2023 · 1249 words · Flavio Schneider, Zhijing Jin, Bernhard Schölkopf

Unsupervised Volumetric Animation

January 26, 2023 · 940 words · Aliaksandr Siarohin, Willi Menapace, Ivan Skorokhodov, Kyle Olszewski, Jian Ren and 3 others

MusicLM: Generating Music From Text

January 26, 2023 · 1253 words · Andrea Agostinelli, Timo I. Denk, Zalán Borsos, Jesse Engel, Mauro Verzetti and 8 others

Cut and Learn for Unsupervised Object Detection and Instance Segmentation

January 26, 2023 · 907 words · Xudong Wang, Rohit Girdhar, Stella X. Yu, Ishan Misra

Open Problems in Applied Deep Learning

January 26, 2023 · 655 words · Maziar Raissi

DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature

January 26, 2023 · 674 words · Eric Mitchell, Yoonho Lee, Alexander Khazatsky, Christopher D. Manning, Chelsea Finn

Text-To-4D Dynamic Scene Generation

January 26, 2023 · 847 words · Uriel Singer, Shelly Sheynin, Adam Polyak, Oron Ashual, Iurii Makarov and 6 others

Deep Laplacian-based Options for Temporally-Extended Exploration

January 26, 2023 · 994 words · Martin Klissarov, Marlos C. Machado

Finding Regions of Counterfactual Explanations via Robust Optimization

January 26, 2023 · 1061 words · Donato Maragno, Jannis Kurtz, Tabea E. Röber, Rob Goedhart, Ş. Ilker Birbil and 1 others

simple diffusion: End-to-end diffusion for high resolution images

January 26, 2023 · 1016 words · Emiel Hoogeboom, Jonathan Heek, Tim Salimans

On the Importance of Noise Scheduling for Diffusion Models

January 26, 2023 · 626 words · Ting Chen

Break It Down: Evidence for Structural Compositionality in Neural Networks

January 26, 2023 · 1144 words · Michael A. Lepori, Thomas Serre, Ellie Pavlick

Distilling Text into Circuits

January 25, 2023 · 1209 words · Vincent Wang-Mascianica, Jonathon Liu, Bob Coecke

E(n)-equivariant Graph Neural Cellular Automata

January 25, 2023 · 905 words · Gennaro Gala, Daniele Grattarola, Erik Quaeghebeur

Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute

January 25, 2023 · 791 words · Michiel de Jong, Yury Zemlyanskiy, Nicholas FitzGerald, Joshua Ainslie, Sumit Sanghai and 2 others

Using novel data and ensemble models to improve automated labeling of Sustainable Development Goals

January 25, 2023 · 680 words · Dirk U. Wulff, Dominik S. Meier, Rui Mata

Editing Language Model-based Knowledge Graph Embeddings

January 25, 2023 · 722 words · Siyuan Cheng, Ningyu Zhang, Bozhong Tian, Zelin Dai, Feiyu Xiong and 2 others

Data Consistent Deep Rigid MRI Motion Correction

January 25, 2023 · 410 words · Nalini M. Singh, Neel Dey, Malte Hoffmann, Bruce Fischl, Elfar Adalsteinsson and 3 others

ClimaX: A foundation model for weather and climate

January 24, 2023 · 1508 words · Tung Nguyen, Johannes Brandstetter, Ashish Kapoor, Jayesh K. Gupta, Aditya Grover

K-Planes: Explicit Radiance Fields in Space, Time, and Appearance

January 24, 2023 · 792 words · Sara Fridovich-Keil, Giacomo Meanti, Frederik Warburg, Benjamin Recht, Angjoo Kanazawa

A Watermark for Large Language Models

January 24, 2023 · 1234 words · John Kirchenbauer, Jonas Geiping, Yuxin Wen, Jonathan Katz, Ian Miers and 1 others

From Inclusive Language to Gender-Neutral Machine Translation

January 24, 2023 · 777 words · Andrea Piergentili, Dennis Fucci, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri

PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and Development

January 23, 2023 · 331 words · Avirup Sil, Jaydeep Sen, Bhavani Iyer, Martin Franz, Kshitij Fadnis and 10 others

Noisy Parallel Data Alignment

January 23, 2023 · 1089 words · Ruoyu Xie, Antonios Anastasopoulos

InfiniCity: Infinite-Scale City Synthesis

January 23, 2023 · 707 words · Chieh Hubert Lin, Hsin-Ying Lee, Willi Menapace, Menglei Chai, Aliaksandr Siarohin and 2 others

Prediction-Powered Inference

January 23, 2023 · 1287 words · Anastasios N. Angelopoulos, Stephen Bates, Clara Fannjiang, Michael I. Jordan, Tijana Zrnic

Fully transformer-based biomarker prediction from colorectal cancer histology: a large-scale multicentric study

January 23, 2023 · 1256 words · Sophia J. Wagner, Daniel Reisenbüchler, Nicholas P. West, Jan Moritz Niehues, Gregory Patrick Veldhuizen and 26 others

Zorro: the masked multimodal transformer

January 23, 2023 · 824 words · Adrià Recasens, Jason Lin, Joāo Carreira, Drew Jaegle, Luyu Wang and 6 others

StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis

January 23, 2023 · 872 words · Axel Sauer, Tero Karras, Samuli Laine, Andreas Geiger, Timo Aila

DiffSDS: A language diffusion model for protein backbone inpainting under geometric conditions and constraints

January 22, 2023 · 874 words · Zhangyang Gao, Cheng Tan, Stan Z. Li

Poor Man’s Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference

January 21, 2023 · 1192 words · Vilém Zouhar, Shehzaad Dhuliawala, Wangchunshu Zhou, Nico Daheim, Tom Kocmi and 2 others

The Pipeline for the Continuous Development of Artificial Intelligence Models – Current State of Research and Practice

January 21, 2023 · 1615 words · Monika Steidl, Michael Felderer, Rudolf Ramler

SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction

January 21, 2023 · 1109 words · Zhiqi Lin, Youshan Miao, Guodong Liu, Xiaoxiang Shi, Quanlu Zhang and 8 others

Regeneration Learning: A Learning Paradigm for Data Generation

January 21, 2023 · 517 words · Xu Tan, Tao Qin, Jiang Bian, Tie-Yan Liu, Yoshua Bengio

Explainable Multilayer Graph Neural Network for Cancer Gene Prediction

January 20, 2023 · 1005 words · Michail Chatzianastasis, Michalis Vazirgiannis, Zijun Zhang

Is ChatGPT A Good Translator? A Preliminary Study

January 20, 2023 · 366 words · Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Xing Wang, Zhaopeng Tu

Multiview Compressive Coding for 3D Reconstruction

January 19, 2023 · 1089 words · Chao-Yuan Wu, Justin Johnson, Jitendra Malik, Christoph Feichtenhofer, Georgia Gkioxari

Everything is Connected: Graph Neural Networks

January 19, 2023 · 327 words · Petar Veličković

AtMan: Understanding Transformer Predictions Through Memory Efficient Attention Manipulation

January 19, 2023 · 708 words · Mayukh Deb, Björn Deiseroth, Samuel Weinbach, Patrick Schramowski, Kristian Kersting

Batch Prompting: Efficient Inference with Large Language Model APIs

January 19, 2023 · 869 words · Zhoujun Cheng, Jungo Kasai, Tao Yu

Self Supervision Does Not Help Natural Language Supervision at Scale

January 19, 2023 · 1278 words · Floris Weers, Vaishaal Shankar, Angelos Katharopoulos, Yinfei Yang, Tom Gunter

Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection

January 18, 2023 · 952 words · Weijia Xu, Sweta Agrawal, Eleftheria Briakou, Marianna J. Martindale, Marine Carpuat

Learning-Rate-Free Learning by D-Adaptation

January 18, 2023 · 1259 words · Aaron Defazio, Konstantin Mishchenko

Discrete Latent Structure in Neural Networks

January 18, 2023 · 2375 words · Vlad Niculae, Caio F. Corro, Nikita Nangia, Tsvetomila Mihaylova, André F. T. Martins

Towards Models that Can See and Read

January 18, 2023 · 747 words · Roy Ganz, Oren Nuriel, Aviad Aberdam, Yair Kittenplon, Shai Mazor and 1 others

Hierarchical Bayesian inference for community detection and connectivity of functional brain networks

January 18, 2023 · 1042 words · Lingbin Bian, Nizhuan Wang, Leonardo Novelli, Jonathan Keith, Adeel Razi

Data thinning for convolution-closed distributions

January 18, 2023 · 1193 words · Anna Neufeld, Ameer Dharamshi, Lucy L. Gao, Daniela Witten

EPiC-GAN: Equivariant Point Cloud Generation for Particle Jets

January 17, 2023 · 1107 words · Erik Buhmann, Gregor Kasieczka, Jesse Thaler

GLIGEN: Open-Set Grounded Text-to-Image Generation

January 17, 2023 · 1021 words · Yuheng Li, Haotian Liu, Qingyang Wu, Fangzhou Mu, Jianwei Yang and 3 others

Transformers as Algorithms: Generalization and Implicit Model Selection in In-context Learning

January 17, 2023 · 721 words · Yingcong Li, M. Emrullah Ildiz, Dimitris Papailiopoulos, Samet Oymak

Dissociating language and thought in large language models: a cognitive perspective

January 16, 2023 · 1326 words · Kyle Mahowald, Anna A. Ivanova, Idan A. Blank, Nancy Kanwisher, Joshua B. Tenenbaum and 1 others

Msanii: High Fidelity Music Synthesis on a Shoestring Budget

January 16, 2023 · 1296 words · Kinyugo Maina

Drug Synergistic Combinations Predictions via Large-Scale Pre-Training and Graph Structure Learning

January 14, 2023 · 743 words · Zhihang Hu, Qinze Yu, Yucheng Guo, Taifeng Wang, Irwin King and 3 others

Recent advances in artificial intelligence for retrosynthesis

January 14, 2023 · 1097 words · Zipeng Zhong, Jie Song, Zunlei Feng, Tiantao Liu, Lingxiang Jia and 3 others

YOLOv6 v3.0: A Full-Scale Reloading

January 13, 2023 · 395 words · Chuyi Li, Lulu Li, Yifei Geng, Hongliang Jiang, Meng Cheng and 4 others

Designing losses for data-free training of normalizing flows on Boltzmann distributions

January 13, 2023 · 1138 words · Loris Felardos, Jérôme Hénin, Guillaume Charpiat

Guiding Text-to-Image Diffusion Model Towards Grounded Generation

January 12, 2023 · 1042 words · Ziyi Li, Qinye Zhou, Xiaoyun Zhang, Ya Zhang, Yanfeng Wang and 1 others

Tracr: Compiled Transformers as a Laboratory for Interpretability

January 12, 2023 · 1434 words · David Lindner, János Kramár, Matthew Rahtz, Thomas McGrath, Vladimir Mikulik

Taking Search to Task

January 12, 2023 · 1648 words · Chirag Shah, Ryen W. White, Paul Thomas, Bhaskar Mitra, Shawon Sarkar and 1 others

Everyone’s Voice Matters: Quantifying Annotation Disagreement Using Demographic Information

January 12, 2023 · 650 words · Ruyuan Wan, Jaehyung Kim, Dongyeop Kang

We are Going to the Space – Part 1: Which device to deploy in a satellite?

January 12, 2023 · 1020 words · Robert Bayer, Julian Priest, Pınar Tözün

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images

January 12, 2023 · 1037 words · Ryota Tanaka, Kyosuke Nishida, Kosuke Nishida, Taku Hasegawa, Itsumi Saito and 1 others

Causal Abstraction for Faithful Model Interpretation

January 11, 2023 · 2528 words · Atticus Geiger, Chris Potts, Thomas Icard

Does progress on ImageNet transfer to real-world datasets?

January 11, 2023 · 1311 words · Alex Fang, Simon Kornblith, Ludwig Schmidt

Dynamics of a data-driven low-dimensional model of turbulent minimal Couette flow

January 11, 2023 · 1069 words · Alec J. Linot, Michael D. Graham

Continual Few-Shot Learning Using HyperTransformers

January 11, 2023 · 1142 words · Max Vladymyrov, Andrey Zhmoginov, Mark Sandler

Quantifying the Technological Foundations of Economic Complexity

January 11, 2023 · 1008 words · Hardik Rajpal, Omar A Guerrero

Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing

January 11, 2023 · 826 words · Shruthi Bannur, Stephanie Hyland, Qianchu Liu, Fernando Perez-Garcia, Maximilian Ilse and 11 others

ChatGPT is not all you need. A State of the Art Review of large Generative AI models

January 11, 2023 · 823 words · Roberto Gozalo-Brizuela, Eduardo C. Garrido-Merchan

An Analysis of Quantile Temporal-Difference Learning

January 11, 2023 · 1705 words · Mark Rowland, Rémi Munos, Mohammad Gheshlaghi Azar, Yunhao Tang, Georg Ostrovski and 4 others

GPT as Knowledge Worker: A Zero-Shot Evaluation of (AI)CPA Capabilities

January 11, 2023 · 868 words · Jillian Bommarito, Michael Bommarito, Daniel Martin Katz, Jessica Katz

Data Distillation: A Survey

January 11, 2023 · 833 words · Noveen Sachdeva, Julian McAuley

Does Localization Inform Editing? Surprising Differences in Causality-Based Localization vs. Knowledge Editing in Language Models

January 10, 2023 · 1093 words · Peter Hase, Mohit Bansal, Been Kim, Asma Ghandeharioun

Mastering Diverse Domains through World Models

January 10, 2023 · 890 words · Danijar Hafner, Jurgis Pasukonis, Jimmy Ba, Timothy Lillicrap

Neural Radiance Field Codebooks

January 10, 2023 · 1129 words · Matthew Wallingford, Aditya Kusupati, Alex Fang, Vivek Ramanujan, Aniruddha Kembhavi and 2 others

On the Robustness of AlphaFold: A COVID-19 Case Study

January 10, 2023 · 1001 words · Ismail Alkhouri, Sumit Jha, Andre Beckus, George Atia, Alvaro Velasquez and 3 others

RedMule: A Mixed-Precision Matrix-Matrix Operation Engine for Flexible and Energy-Efficient On-Chip Linear Algebra and TinyML Training Acceleration

January 10, 2023 · 1197 words · Yvan Tortorella, Luca Bertaccini, Luca Benini, Davide Rossi, Francesco Conti

How Data Scientists Review the Scholarly Literature

January 10, 2023 · 1920 words · Sheshera Mysore, Mahmood Jasim, Haoru Song, Sarah Akbar, Andre Kenneth Chase Randall and 1 others

Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling

January 9, 2023 · 1006 words · Keyu Tian, Yi Jiang, Qishuai Diao, Chen Lin, Liwei Wang and 1 others

Doc2Query–: When Less is More

January 9, 2023 · 419 words · Mitko Gospodinov, Sean MacAvaney, Craig Macdonald

A Survey on Transformers in Reinforcement Learning

January 8, 2023 · 1010 words · Wenzhe Li, Hao Luo, Zichuan Lin, Chongjie Zhang, Zongqing Lu and 1 others

Generative Time Series Forecasting with Diffusion, Denoise, and Disentanglement

January 8, 2023 · 886 words · Yan Li, Xinjiang Lu, Yaqing Wang, Dejing Dou

DeepMatcher: A Deep Transformer-based Network for Robust and Accurate Local Feature Matching

January 8, 2023 · 1154 words · Tao Xie, Kun Dai, Ke Wang, Ruifeng Li, Lijun Zhao

Perceptual-Neural-Physical Sound Matching

January 7, 2023 · 913 words · Han Han, Vincent Lostanlen, Mathieu Lagrange

Why do Nearest Neighbor Language Models Work?

January 7, 2023 · 846 words · Frank F. Xu, Uri Alon, Graham Neubig

Modeling Scattering Coefficients using Self-Attentive Complex Polynomials with Image-based Representation

January 6, 2023 · 1175 words · Andrew Cohen, Weiping Dou, Jiang Zhu, Slawomir Koziel, Peter Renner and 5 others

‘No, to the Right’ – Online Language Corrections for Robotic Manipulation via Shared Autonomy

January 6, 2023 · 873 words · Yuchen Cui, Siddharth Karamcheti, Raj Palleti, Nidhya Shivakumar, Percy Liang and 1 others

Automatic segmentation of clear cell renal cell tumors, kidney, and cysts in patients with von Hippel-Lindau syndrome using U-net architecture on magnetic resonance images

January 6, 2023 · 800 words · Pouria Yazdian Anari, Nathan Lay, Aditi Chaurasia, Nikhil Gopal, Safa Samimi and 11 others

Better Differentially Private Approximate Histograms and Heavy Hitters using the Misra-Gries Sketch

January 6, 2023 · 888 words · Christian Janos Lebeda, Jakub Tětek

Myths and Legends in High-Performance Computing

January 6, 2023 · 227 words · Satoshi Matsuoka, Jens Domke, Mohamed Wahib, Aleksandr Drozd, Torsten Hoefler

TrojanPuzzle: Covertly Poisoning Code-Suggestion Models

January 6, 2023 · 1410 words · Hojjat Aghakhani, Wei Dai, Andre Manoel, Xavier Fernandes, Anant Kharkar and 5 others

Training trajectories, mini-batch losses and the curious role of the learning rate

January 5, 2023 · 828 words · Mark Sandler, Andrey Zhmoginov, Max Vladymyrov, Nolan Miller

HyperReel: High-Fidelity 6-DoF Video with Ray-Conditioned Sampling

January 5, 2023 · 969 words · Benjamin Attal, Jia-Bin Huang, Christian Richardt, Michael Zollhoefer, Johannes Kopf and 2 others

Teaching Computer Vision for Ecology

January 5, 2023 · 479 words · Elijah Cole, Suzanne Stathatos, Björn Lütjens, Tarun Sharma, Justin Kay and 3 others

Reprogramming Pretrained Language Models for Protein Sequence Representation Learning

January 5, 2023 · 1425 words · Ria Vinod, Pin-Yu Chen, Payel Das

Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers

January 5, 2023 · 910 words · Chengyi Wang, Sanyuan Chen, Yu Wu, Ziqiang Zhang, Long Zhou and 8 others

DepthP+P: Metric Accurate Monocular Depth Estimation using Planar and Parallax

January 5, 2023 · 824 words · Sadra Safadoust, Fatma Güney

Semantic match: Debugging feature attribution methods in XAI for healthcare

January 5, 2023 · 509 words · Giovanni Cinà, Tabea E. Röber, Rob Goedhart, Ş. İlker Birbil

Explain to Me: Towards Understanding Privacy Decisions

January 5, 2023 · 1020 words · Gonul Ayci, Pınar Yolum, Arzucan Özgür, Murat Şensoy

InPars-v2: Large Language Models as Efficient Dataset Generators for Information Retrieval

January 4, 2023 · 340 words · Vitor Jeronymo, Luiz Bonifacio, Hugo Abonizio, Marzieh Fadaee, Roberto Lotufo and 2 others

Unsupervised Manifold Linearizing and Clustering

January 4, 2023 · 420 words · Tianjiao Ding, Shengbang Tong, Kwan Ho Ryan Chan, Xili Dai, Yi Ma and 1 others

PACO: Parts and Attributes of Common Objects

January 4, 2023 · 1105 words · Vignesh Ramanathan, Anmol Kalia, Vladan Petrovic, Yi Wen, Baixue Zheng and 9 others

Iterated Decomposition: Improving Science Q&A by Supervising Reasoning Processes

January 4, 2023 · 891 words · Justin Reppert, Ben Rachbach, Charlie George, Luke Stebbing, Jungwon Byun and 2 others

A compositional account of motifs, mechanisms, and dynamics in biochemical regulatory networks

January 4, 2023 · 1202 words · Rebekah Aduddell, James Fairbanks, Amit Kumar, Pablo S. Ocal, Evan Patterson and 1 others

A Succinct Summary of Reinforcement Learning

January 3, 2023 · 1171 words · Sanjeevan Ahilan

Identifying Exoplanets with Deep Learning. V. Improved Light Curve Classification for TESS Full Frame Image Observations

January 3, 2023 · 1751 words · Evan Tey, Dan Moldovan, Michelle Kunimoto, Chelsea X. Huang, Avi Shporer and 6 others

Large Language Models as Corporate Lobbyists

January 3, 2023 · 377 words · John J. Nay

Language Models are Drummers: Drum Composition with Natural Language Pre-Training

January 3, 2023 · 866 words · Li Zhang, Chris Callison-Burch

Deep Learning and Computational Physics (Lecture Notes)

January 3, 2023 · 2811 words · Deep Ray, Orazio Pinti, Assad A. Oberai

Causal Inference in Recommender Systems: A Survey of Strategies for Bias Mitigation, Explanation, and Generalization

January 3, 2023 · 1808 words · Yaochen Zhu, Jing Ma, Jundong Li

Understanding Political Polarisation using Language Models: A dataset and method

January 2, 2023 · 616 words · Samiran Gode, Supreeth Bare, Bhiksha Raj, Hyungon Yoo

ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders

January 2, 2023 · 784 words · Sanghyun Woo, Shoubhik Debnath, Ronghang Hu, Xinlei Chen, Zhuang Liu and 2 others

Massive Language Models Can Be Accurately Pruned in One-Shot

January 2, 2023 · 913 words · Elias Frantar, Dan Alistarh

Muse: Text-To-Image Generation via Masked Generative Transformers

January 2, 2023 · 1265 words · Huiwen Chang, Han Zhang, Jarred Barber, AJ Maschinot, Jose Lezama and 7 others

2022²³⁵

December¹⁷⁴

Rethinking with Retrieval: Faithful Large Language Model Inference

December 31, 2022 · 834 words · Hangfeng He, Hongming Zhang, Dan Roth

DensePose From WiFi

December 31, 2022 · 1224 words · Jiaqi Geng, Dong Huang, Fernando De la Torre

Nowcasting Stock Implied Volatility with Twitter

December 31, 2022 · 1201 words · Thomas Dierckx, Jesse Davis, Wim Schoutens

Design on Matroids: Diversity vs. Meritocracy

December 31, 2022 · 818 words · Isa E. Hafalir, Fuhito Kojima, M. Bumin Yenmez, Koji Yokote

A Survey for In-context Learning

December 31, 2022 · 1051 words · Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Zhiyong Wu and 5 others

Efficient Market Design with Distributional Objectives

December 31, 2022 · 927 words · Isa E. Hafalir, Fuhito Kojima, M. Bumin Yenmez

Effective Brain Connectome: the whole-brain effective connectivity from neural perturbational inference

December 31, 2022 · 1890 words · Zixiang Luo, Zhichao Liang, Chenyu Xu, Changsong Zhou, Quanying Liu

Integrated information theory (IIT) 4.0: Formulating the properties of phenomenal existence in physical terms

December 30, 2022 · 1188 words · Larissa Albantakis, Leonardo Barbosa, Graham Findlay, Matteo Grasso, Andrew M Haun and 11 others

MAUVE Scores for Generative Models: Theory and Practice

December 30, 2022 · 1786 words · Krishna Pillutla, Lang Liu, John Thickstun, Sean Welleck, Swabha Swayamdipta and 4 others

Learning 3D Human Pose Estimation from Dozens of Datasets using a Geometry-Aware Autoencoder to Bridge Between Skeleton Formats

December 29, 2022 · 999 words · István Sárándi, Alexander Hermans, Bastian Leibe

GPT Takes the Bar Exam

December 29, 2022 · 737 words · Michael Bommarito II, Daniel Martin Katz

Learning One Abstract Bit at a Time Through Self-Invented Experiments Encoded as Neural Networks

December 29, 2022 · 699 words · Vincent Herrmann, Louis Kirsch, Jürgen Schmidhuber

‘Real Attackers Don’t Compute Gradients’: Bridging the Gap Between Adversarial ML Research and Practice

December 29, 2022 · 1955 words · Giovanni Apruzzese, Hyrum S. Anderson, Savino Dambra, David Freeman, Fabio Pierazzi and 1 others

What Estimators Are Unbiased For Linear Models?

December 29, 2022 · 1734 words · Lihua Lei, Jeffrey Wooldridge

Cramming: Training a Language Model on a Single GPU in One Day

December 28, 2022 · 1055 words · Jonas Geiping, Tom Goldstein

Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP

December 28, 2022 · 912 words · Omar Khattab, Keshav Santhanam, Xiang Lisa Li, David Hall, Percy Liang and 2 others

A System-Level View on Out-of-Distribution Data in Robotics

December 28, 2022 · 849 words · Rohan Sinha, Apoorva Sharma, Somrita Banerjee, Thomas Lew, Rachel Luo and 4 others

Feature learning in neural networks and kernel machines that recursively learn features

December 28, 2022 · 1479 words · Adityanarayanan Radhakrishnan, Daniel Beaglehole, Parthe Pandit, Mikhail Belkin

Sparse Coding in a Dual Memory System for Lifelong Learning

December 28, 2022 · 1021 words · Fahad Sarfraz, Elahe Arani, Bahram Zonooz

NeRN – Learning Neural Representations for Neural Networks

December 27, 2022 · 1042 words · Maor Ashkenazi, Zohar Rimon, Ron Vainshtein, Shir Levi, Elad Richardson and 2 others

Building a Culture of Reproducibility in Academic Research

December 27, 2022 · 766 words · Jimmy Lin

A Generalization of ViT/MLP-Mixer to Graphs

December 27, 2022 · 907 words · Xiaoxin He, Bryan Hooi, Thomas Laurent, Adam Perold, Yann LeCun and 1 others

The Forward-Forward Algorithm: Some Preliminary Investigations

December 27, 2022 · 1284 words · Geoffrey Hinton

Structure-based drug discovery with deep learning

December 26, 2022 · 765 words · Rıza Özçelik, Derek van Tilborg, José Jiménez-Luna, Francesca Grisoni

Fully Differentiable RANSAC

December 26, 2022 · 820 words · Tong Wei, Yash Patel, Jiri Matas, Daniel Barath

Large Language Models Encode Clinical Knowledge

December 26, 2022 · 733 words · Karan Singhal, Shekoofeh Azizi, Tao Tu, S. Sara Mahdavi, Jason Wei and 25 others

TextBox 2.0: A Text Generation Library with Pre-trained Language Models

December 26, 2022 · 553 words · Tianyi Tang, Junyi Li, Zhipeng Chen, Yiwen Hu, Zhuohao Yu and 7 others

Sitting Posture Recognition Using a Spiking Neural Network

December 25, 2022 · 988 words · Jianquan Wang, Basim Hafidh, Haiwei Dong, Abdulmotaleb El Saddik

Closed-form control with spike coding networks

December 25, 2022 · 1106 words · Filip S. Slijkhuis, Sander W. Keemink, Pablo Lanillos

GraphCast: Learning skillful medium-range global weather forecasting

December 24, 2022 · 918 words · Remi Lam, Alvaro Sanchez-Gonzalez, Matthew Willson, Peter Wirnsberger, Meire Fortunato and 13 others

Detecting Objects with Graph Priors and Graph Refinement

December 23, 2022 · 727 words · Aritra Bhowmik, Martin R. Oswald, Yu Wang, Nora Baka, Cees G. M. Snoek

SuperGF: Unifying Local and Global Features for Visual Localization

December 23, 2022 · 1070 words · Wenzheng Song, Ran Yan, Boshu Lei, Takayuki Okatani

Stop using the elbow criterion for k-means and how to choose the number of clusters instead

December 23, 2022 · 715 words · Erich Schubert

The Onset of Variance-Limited Behavior for Networks in the Lazy and Rich Regimes

December 23, 2022 · 1174 words · Alexander Atanasov, Blake Bordelon, Sabarish Sainathan, Cengiz Pehlevan

Dubbing in Practice: A Large Scale Study of Human Localization With Insights for Automatic Dubbing

December 23, 2022 · 1333 words · William Brannon, Yogesh Virkar, Brian Thompson

Why Does Surprisal From Larger Transformer-Based Language Models Provide a Poorer Fit to Human Reading Times?

December 23, 2022 · 810 words · Byung-Doh Oh, William Schuler

How different are self and nonself?

December 22, 2022 · 93 words · Andreas Mayer, Christopher J. Russo, Quentin Marcou, William Bialek, Benjamin D. Greenbaum

Deep learning for size-agnostic inverse design of random-network 3D printed mechanical metamaterials

December 22, 2022 · 1165 words · Helda Pahlavani, Kostas Tsifoutis-Kazolis, Prerak Mody, Jie Zhou, Mohammad J. Mirzaali and 1 others

Scalable Adaptive Computation for Iterative Generation

December 22, 2022 · 746 words · Allan Jabri, David Fleet, Ting Chen

Shakes on a Plane: Unsupervised Depth Estimation from Unstabilized Photography

December 22, 2022 · 943 words · Ilya Chugunov, Yuxuan Zhang, Felix Heide

Beyond SOT: It’s Time to Track Multiple Generic Objects at Once

December 22, 2022 · 1283 words · Christoph Mayer, Martin Danelljan, Ming-Hsuan Yang, Vittorio Ferrari, Luc Van Gool and 1 others

Impossibility Theorems for Feature Attribution

December 22, 2022 · 1610 words · Blair Bilodeau, Natasha Jaques, Pang Wei Koh, Been Kim

GOOD: Exploring Geometric Cues for Detecting Objects in an Open World

December 22, 2022 · 577 words · Haiwen Huang, Andreas Geiger, Dan Zhang

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

December 22, 2022 · 704 words · Jay Zhangjie Wu, Yixiao Ge, Xintao Wang, Weixian Lei, Yuchao Gu and 4 others

Local Policy Improvement for Recommender Systems

December 22, 2022 · 787 words · Dawen Liang, Nikos Vlassis

ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement

December 21, 2022 · 1041 words · Wei-Ning Hsu, Tal Remez, Bowen Shi, Jacob Donley, Yossi Adi

Contrastive Distillation Is a Sample-Efficient Self-Supervised Loss Policy for Transfer Learning

December 21, 2022 · 1458 words · Chris Lengerich, Gabriel Synnaeve, Amy Zhang, Hugh Leather, Kurt Shuster and 2 others

Generalized Decoding for Pixel, Image, and Language

December 21, 2022 · 1077 words · Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li and 9 others

Training language models for deeper understanding improves brain alignment

December 21, 2022 · 867 words · Khai Loong Aw, Mariya Toneva

From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models

December 21, 2022 · 914 words · Jiaxian Guo, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Boyang Li and 2 others

Multi-modal Molecule Structure-text Model for Text-based Retrieval and Editing

December 21, 2022 · 588 words · Shengchao Liu, Weili Nie, Chengpeng Wang, Jiarui Lu, Zhuoran Qiao and 4 others

Hierarchically branched diffusion models for efficient and interpretable multi-class conditional generation

December 21, 2022 · 990 words · Alex M. Tseng, Tommaso Biancalani, Max Shen, Gabriele Scalia

MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning

December 21, 2022 · 923 words · Zhiyang Xu, Ying Shen, Lifu Huang

Beyond Contrastive Learning: A Variational Generative Model for Multilingual Retrieval

December 21, 2022 · 1232 words · John Wieting, Jonathan H. Clark, William W. Cohen, Graham Neubig, Taylor Berg-Kirkpatrick

Hidden Poison: Machine Unlearning Enables Camouflaged Poisoning Attacks

December 21, 2022 · 990 words · Jimmy Z. Di, Jack Douglas, Jayadev Acharya, Gautam Kamath, Ayush Sekhari

There’s Plenty of Room Right Here: Biological Systems as Evolved, Overloaded, Multi-scale Machines

December 20, 2022 · 1525 words · Joshua Bongard, Michael Levin

Does unsupervised grammar induction need pixels?

December 20, 2022 · 601 words · Boyi Li, Rodolfo Corona, Karttikeya Mangalam, Catherine Chen, Daniel Flaherty and 5 others

Debiasing NLP Models Without Demographic Information

December 20, 2022 · 980 words · Hadas Orgad, Yonatan Belinkov

Character-Aware Models Improve Visual Text Rendering

December 20, 2022 · 1045 words · Rosanne Liu, Dan Garrette, Chitwan Saharia, William Chan, Adam Roberts and 5 others

Parsel: A Unified Natural Language Framework for Algorithmic Reasoning

December 20, 2022 · 1610 words · Eric Zelikman, Qian Huang, Gabriel Poesia, Noah D. Goodman, Nick Haber

Self-Instruct: Aligning Language Model with Self Generated Instructions

December 20, 2022 · 771 words · Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith and 2 others

Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers

December 20, 2022 · 637 words · Damai Dai, Yutao Sun, Li Dong, Yaru Hao, Zhifang Sui and 1 others

DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines

December 20, 2022 · 884 words · Prakhar Gupta, Yang Liu, Di Jin, Behnam Hedayatnia, Spandana Gella and 4 others

PairReranker: Pairwise Reranking for Natural Language Generation

December 20, 2022 · 707 words · Dongfu Jiang, Bill Yuchen Lin, Xiang Ren

A Length-Extrapolatable Transformer

December 20, 2022 · 644 words · Yutao Sun, Li Dong, Barun Patra, Shuming Ma, Shaohan Huang and 4 others

RangeAugment: Efficient Online Augmentation with Range Learning

December 20, 2022 · 1157 words · Sachin Mehta, Saeid Naderiparizi, Fartash Faghri, Maxwell Horton, Lailin Chen and 3 others

Detoxifying Text with MaRCo: Controllable Revision with Experts and Anti-Experts

December 20, 2022 · 610 words · Skyler Hallinan, Alisa Liu, Yejin Choi, Maarten Sap

A Survey of Deep Learning for Mathematical Reasoning

December 20, 2022 · 1382 words · Pan Lu, Liang Qiu, Wenhao Yu, Sean Welleck, Kai-Wei Chang

Trustworthy Social Bias Measurement

December 20, 2022 · 982 words · Rishi Bommasani, Percy Liang

Is GPT-3 a Psychopath? Evaluating Large Language Models from a Psychological Perspective

December 20, 2022 · 951 words · Xingxuan Li, Yutong Li, Linlin Liu, Lidong Bing, Shafiq Joty

Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions

December 20, 2022 · 790 words · Harsh Trivedi, Niranjan Balasubramanian, Tushar Khot, Ashish Sabharwal

Mini-Model Adaptation: Efficiently Extending Pretrained Models to New Languages via Aligned Shallow Training

December 20, 2022 · 826 words · Kelly Marchisio, Patrick Lewis, Yihong Chen, Mikel Artetxe

A Measure-Theoretic Characterization of Tight Language Models

December 20, 2022 · 1204 words · Li Du, Lucas Torroba Hennigen, Tiago Pimentel, Clara Meister, Jason Eisner and 1 others

Precise Zero-Shot Dense Retrieval without Relevance Labels

December 20, 2022 · 785 words · Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan

LAMBADA: Backward Chaining for Automated Reasoning in Natural Language

December 20, 2022 · 892 words · Seyed Mehran Kazemi, Najoung Kim, Deepti Bhatia, Xin Xu, Deepak Ramachandran

Controllable Text Generation with Language Constraints

December 20, 2022 · 860 words · Howard Chen, Huihan Li, Danqi Chen, Karthik Narasimhan

SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization

December 20, 2022 · 804 words · Hyunwoo Kim, Jack Hessel, Liwei Jiang, Ximing Lu, Youngjae Yu and 6 others

MULTI3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue

December 20, 2022 · 888 words · Nikita Moghe, Evgeniia Razumovskaia, Liane Guillou, Ivan Vulić, Anna Korhonen and 1 others

Recycling diverse models for out-of-distribution generalization

December 20, 2022 · 913 words · Alexandre Ramé, Kartik Ahuja, Jianyu Zhang, Matthieu Cord, Léon Bottou and 1 others

HouseCat6D – A Large-Scale Multi-Modal Category Level 6D Object Pose Dataset with Household Objects in Realistic Scenarios

December 20, 2022 · 1381 words · HyunJun Jung, Shun-Cheng Wu, Patrick Ruhkamp, Hannah Schieber, Pengyuan Wang and 6 others

Settling the Reward Hypothesis

December 20, 2022 · 1049 words · Michael Bowling, John D. Martin, David Abel, Will Dabney

Quantifying Local Extrinsic Curvature in Neural Manifolds

December 20, 2022 · 844 words · Francisco E. Acosta, Sophia Sanborn, Khanh Dao Duc, Manu Madhav, Nina Miolane

Reinforced Clarification Question Generation with Defeasibility Rewards for Disambiguating Social and Moral Situations

December 20, 2022 · 914 words · Valentina Pyatkin, Jena D. Hwang, Vivek Srikumar, Ximing Lu, Liwei Jiang and 2 others

Towards Reasoning in Large Language Models: A Survey

December 20, 2022 · 790 words · Jie Huang, Kevin Chen-Chuan Chang

Extrinsic Evaluation of Machine Translation Metrics

December 20, 2022 · 1106 words · Nikita Moghe, Tom Sherborne, Mark Steedman, Alexandra Birch

High-resolution canopy height map in the Landes forest (France) based on GEDI, Sentinel-1, and Sentinel-2 data with a deep learning approach

December 20, 2022 · 2157 words · Martin Schwartz, Philippe Ciais, Catherine Ottlé, Aurelien De Truchis, Cedric Vega and 9 others

ReCode: Robustness Evaluation of Code Generation Models

December 20, 2022 · 914 words · Shiqi Wang, Zheng Li, Haifeng Qian, Chenghao Yang, Zijian Wang and 9 others

On the Role of Parallel Data in Cross-lingual Transfer Learning

December 20, 2022 · 635 words · Machel Reid, Mikel Artetxe

Multi-asset market making under the quadratic rough Heston

December 20, 2022 · 1218 words · Mathieu Rosenbaum, Jianfei Zhang

Goal-oriented Autonomous Driving

December 20, 2022 · 766 words · Yihan Hu, Jiazhi Yang, Li Chen, Keyu Li, Chonghao Sima and 11 others

Large Language Models Are Reasoning Teachers

December 20, 2022 · 833 words · Namgyu Ho, Laura Schmid, Se-Young Yun

Language Modeling with Latent Situations

December 20, 2022 · 968 words · Belinda Z. Li, Maxwell Nye, Jacob Andreas

CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context

December 20, 2022 · 870 words · Yangruibo Ding, Zijian Wang, Wasi Uddin Ahmad, Murali Krishna Ramanathan, Ramesh Nallapati and 3 others

(QA)$^2$: Question Answering with Questionable Assumptions

December 20, 2022 · 885 words · Najoung Kim, Phu Mon Htut, Samuel R. Bowman, Jackson Petty

Defending Against Poisoning Attacks in Open-Domain Question Answering

December 20, 2022 · 676 words · Orion Weller, Aleem Khan, Nathaniel Weir, Dawn Lawrie, Benjamin Van Durme

Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters

December 20, 2022 · 760 words · Boshi Wang, Sewon Min, Xiang Deng, Jiaming Shen, You Wu and 2 others

Tokenization Consistency Matters for Generative Models on Extractive NLP Tasks

December 19, 2022 · 527 words · Kaiser Sun, Peng Qi, Yuhao Zhang, Lan Liu, William Yang Wang and 1 others

Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance

December 19, 2022 · 1093 words · Kelvin Xu, Zheyuan Hu, Ria Doshi, Aaron Rovinsky, Vikash Kumar and 2 others

Policy learning ‘without’’ overlap: Pessimism and generalized empirical Bernstein’s inequality

December 19, 2022 · 2355 words · Ying Jin, Zhimei Ren, Zhuoran Yang, Zhaoran Wang

Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training

December 19, 2022 · 1489 words · Jing Huang, Zhengxuan Wu, Kyle Mahowald, Christopher Potts

Training Trajectories of Language Models Across Scales

December 19, 2022 · 679 words · Mengzhou Xia, Mikel Artetxe, Chunting Zhou, Xi Victoria Lin, Ramakanth Pasunuru and 3 others

Scalable Diffusion Models with Transformers

December 19, 2022 · 797 words · William Peebles, Saining Xie

Evaluating Human-Language Model Interaction

December 19, 2022 · 1257 words · Mina Lee, Megha Srivastava, Amelia Hardy, John Thickstun, Esin Durmus and 13 others

DSI++: Updating Transformer Memory with New Documents

December 19, 2022 · 1325 words · Sanket Vaibhav Mehta, Jai Gupta, Yi Tay, Mostafa Dehghani, Vinh Q. Tran and 4 others

One Embedder, Any Task: Instruction-Finetuned Text Embeddings

December 19, 2022 · 1312 words · Hongjin Su, Weijia Shi, Jungo Kasai, Yizhong Wang, Yushi Hu and 5 others

Speaking Style Conversion With Discrete Self-Supervised Units

December 19, 2022 · 730 words · Gallil Maimon, Yossi Adi

KNIFE: Knowledge Distillation with Free-Text Rationales

December 19, 2022 · 1053 words · Aaron Chan, Zhiyuan Zeng, Wyatt Lake, Brihi Joshi, Hanjie Chen and 1 others

The case for 4-bit precision: k-bit Inference Scaling Laws

December 19, 2022 · 1046 words · Tim Dettmers, Luke Zettlemoyer

Continual Learning for Instruction Following from Realtime Feedback

December 19, 2022 · 992 words · Alane Suhr, Yoav Artzi

Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor

December 19, 2022 · 1265 words · Or Honovich, Thomas Scialom, Omer Levy, Timo Schick

A Natural Bias for Language Generation Models

December 19, 2022 · 726 words · Clara Meister, Wojciech Stokowiec, Tiago Pimentel, Lei Yu, Laura Rimell and 1 others

Multilingual Sequence-to-Sequence Models for Hebrew NLP

December 19, 2022 · 513 words · Matan Eyal, Hila Noga, Roee Aharoni, Idan Szpektor, Reut Tsarfaty

MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering

December 19, 2022 · 755 words · Fangyu Liu, Francesco Piccinno, Syrine Krichene, Chenxi Pang, Kenton Lee and 4 others

Visconde: Multi-document QA with GPT-3 and Neural Reranking

December 19, 2022 · 567 words · Jayr Pereira, Robson Fidalgo, Roberto Lotufo, Rodrigo Nogueira

NusaCrowd: Open Source Initiative for Indonesian NLP Resources

December 19, 2022 · 1090 words · Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie and 42 others

Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation

December 19, 2022 · 1091 words · Nuno M. Guerreiro, Pierre Colombo, Pablo Piantanida, André F. T. Martins

Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models

December 19, 2022 · 1056 words · Yong Cheng, Yu Zhang, Melvin Johnson, Wolfgang Macherey, Ankur Bapna

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting

December 19, 2022 · 1128 words · Zheng-Xin Yong, Hailey Schoelkopf, Niklas Muennighoff, Alham Fikri Aji, David Ifeoluwa Adelani and 9 others

Multi-View Knowledge Distillation from Crowd Annotations for Out-of-Domain Generalization

December 19, 2022 · 773 words · Dustin Wright, Isabelle Augenstein

StyleTRF: Stylizing Tensorial Radiance Fields

December 19, 2022 · 1142 words · Rahul Goel, Sirikonda Dhawal, Saurabh Saini, P. J. Narayanan

Transferring General Multimodal Pretrained Models to Text Recognition

December 19, 2022 · 477 words · Junyang Lin, Xuancheng Ren, Yichang Zhang, Gao Liu, Peng Wang and 2 others

APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning

December 19, 2022 · 855 words · Soumya Sanyal, Yichong Xu, Shuohang Wang, Ziyi Yang, Reid Pryzant and 3 others

Multi hash embeddings in spaCy

December 19, 2022 · 1029 words · Lester James Miranda, Ákos Kádár, Adriane Boyd, Sofie Van Landeghem, Anders Søgaard and 1 others

Discovering Language Model Behaviors with Model-Written Evaluations

December 19, 2022 · 1169 words · Ethan Perez, Sam Ringer, Kamilė Lukošiūtė, Karina Nguyen, Edwin Chen and 58 others

Natural Language to Code Generation in Interactive Data Science Notebooks

December 19, 2022 · 1069 words · Pengcheng Yin, Wen-Ding Li, Kefan Xiao, Abhishek Rao, Yeming Wen and 7 others

Emergent Analogical Reasoning in Large Language Models

December 19, 2022 · 1428 words · Taylor Webb, Keith J. Holyoak, Hongjing Lu

Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale

December 18, 2022 · 1133 words · Hritik Bansal, Karthik Gopalakrishnan, Saket Dingliwal, Sravan Bodapati, Katrin Kirchhoff and 1 others

Language model acceptability judgements are not always robust to context

December 18, 2022 · 706 words · Koustuv Sinha, Jon Gauthier, Aaron Mueller, Kanishka Misra, Keren Fuentes and 2 others

Beyond the C: Retargetable Decompilation using Neural Machine Translation

December 17, 2022 · 1533 words · Iman Hosseini, Brendan Dolan-Gavitt

Are We Ready for Vision-Centric Driving Streaming Perception? The ASAP Benchmark

December 17, 2022 · 975 words · Xiaofeng Wang, Zheng Zhu, Yunpeng Zhang, Guan Huang, Yun Ye and 3 others

Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy

December 17, 2022 · 943 words · Long Lian, Zhirong Wu, Stella X. Yu

Improving Cross-task Generalization of Unified Table-to-text Models with Compositional Task Configurations

December 17, 2022 · 678 words · Jifan Chen, Yuhao Zhang, Lan Liu, Rui Dong, Xinchi Chen and 3 others

Point-E: A System for Generating 3D Point Clouds from Complex Prompts

December 16, 2022 · 1192 words · Alex Nichol, Heewoo Jun, Prafulla Dhariwal, Pamela Mishkin, Mark Chen

Neural Story Planning

December 16, 2022 · 1050 words · Anbang Ye, Christopher Cui, Taiwei Shi, Mark O. Riedl

‘Rarely’ a problem? Language models exhibit inverse scaling in their predictions following ‘few’-type quantifiers

December 16, 2022 · 471 words · James A. Michaelov, Benjamin K. Bergen

Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models

December 16, 2022 · 969 words · Qiucheng Wu, Yujian Liu, Handong Zhao, Ajinkya Kale, Trung Bui and 4 others

Attentive Mask CLIP

December 16, 2022 · 964 words · Yifan Yang, Weiquan Huang, Yixuan Wei, Houwen Peng, Xinyang Jiang and 6 others

Connecting Permutation Equivariant Neural Networks and Partition Diagrams

December 16, 2022 · 2488 words · Edward Pearce-Crump

Efficient Conditionally Invariant Representation Learning

December 16, 2022 · 985 words · Roman Pogodin, Namrata Deka, Yazhe Li, Danica J. Sutherland, Victor Veitch and 1 others

Brauer’s Group Equivariant Neural Networks

December 16, 2022 · 962 words · Edward Pearce-Crump

MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation

December 16, 2022 · 1185 words · Swarnadeep Saha, Xinyan Velocity Yu, Mohit Bansal, Ramakanth Pasunuru, Asli Celikyilmaz

Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better

December 16, 2022 · 885 words · David Dale, Elena Voita, Loïc Barrault, Marta R. Costa-jussà

Biomedical image analysis competitions: The state of current participation practice

December 16, 2022 · 988 words · Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee and 350 others

Fake it till you make it: Learning(s) from a synthetic ImageNet clone

December 16, 2022 · 1136 words · Mert Bulent Sariyildiz, Karteek Alahari, Diane Larlus, Yannis Kalantidis

Teaching Small Language Models to Reason

December 16, 2022 · 864 words · Lucie Charlotte Magister, Jonathan Mallinson, Jakub Adamek, Eric Malmi, Aliaksei Severyn

How to disagree well: Investigating the dispute tactics used on Wikipedia

December 16, 2022 · 985 words · Christine de Kock, Tom Stafford, Andreas Vlachos

ALERT: Adapting Language Models to Reasoning Tasks

December 16, 2022 · 780 words · Ping Yu, Tianlu Wang, Olga Golovneva, Badr Alkhamissy, Gargi Ghosh and 2 others

SADM: Sequence-Aware Diffusion Model for Longitudinal Medical Image Generation

December 16, 2022 · 608 words · Jee Seok Yoon, Chenghao Zhang, Heung-Il Suk, Jia Guo, Xiaoxiao Li

Economic impacts of AI-augmented R&D

December 15, 2022 · 1977 words · Tamay Besiroglu, Nicholas Emery-Xu, Neil Thompson

Improving Chess Commentaries by Combining Language Models with Symbolic Reasoning Engines

December 15, 2022 · 1083 words · Andrew Lee, David Wu, Emily Dinan, Mike Lewis

FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference

December 15, 2022 · 864 words · Michiel de Jong, Yury Zemlyanskiy, Joshua Ainslie, Nicholas FitzGerald, Sumit Sanghai and 2 others

Efficient Long Sequence Modeling via State Space Augmented Transformer

December 15, 2022 · 967 words · Simiao Zuo, Xiaodong Liu, Jian Jiao, Denis Charles, Eren Manavoglu and 2 others

On Second Thought, Let’s Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning

December 15, 2022 · 959 words · Omar Shaikh, Hongxin Zhang, William Held, Michael Bernstein, Diyi Yang

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units

December 15, 2022 · 1632 words · Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang and 5 others

DAMP: Doubly Aligned Multilingual Parser for Task-Oriented Dialogue

December 15, 2022 · 954 words · William Held, Christopher Hidey, Fei Liu, Eric Zhu, Rahul Goel and 2 others

Objaverse: A Universe of Annotated 3D Objects

December 15, 2022 · 901 words · Matt Deitke, Dustin Schwenk, Jordi Salvador, Luca Weihs, Oscar Michel and 5 others

Image-and-Language Understanding from Pixels Only

December 15, 2022 · 1127 words · Michael Tschannen, Basil Mustafa, Neil Houlsby

Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models

December 15, 2022 · 971 words · Bernd Bohnet, Vinh Q. Tran, Pat Verga, Roee Aharoni, Daniel Andor and 15 others

FlexiViT: One Model for All Patch Sizes

December 15, 2022 · 1204 words · Lucas Beyer, Pavel Izmailov, Alexander Kolesnikov, Mathilde Caron, Simon Kornblith and 5 others

Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation

December 15, 2022 · 1307 words · Yixin Liu, Alexander R. Fabbri, Pengfei Liu, Yilun Zhao, Linyong Nan and 6 others

ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning

December 15, 2022 · 553 words · Olga Golovneva, Moya Chen, Spencer Poff, Martin Corredor, Luke Zettlemoyer and 2 others

Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers

December 15, 2022 · 777 words · Harry Coppock, George Nicholson, Ivan Kiskin, Vasiliki Koutra, Kieran Baker and 20 others

Multimodal Teacher Forcing for Reconstructing Nonlinear Dynamical Systems

December 15, 2022 · 609 words · Manuel Brenner, Georgia Koppe, Daniel Durstewitz

Manifestations of Xenophobia in AI Systems

December 15, 2022 · 1606 words · Nenad Tomasev, Jonathan Leader Maynard, Iason Gabriel

Protein Structure Prediction until CASP15

December 15, 2022 · 467 words · Arne Elofsson

Transformers learn in-context by gradient descent

December 15, 2022 · 952 words · Johannes von Oswald, Eyvind Niklasson, Ettore Randazzo, João Sacramento, Alexander Mordvintsev and 2 others

RT-1: Robotics Transformer for Real-World Control at Scale

December 13, 2022 · 1320 words · Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Joseph Dabis and 46 others

Multi-Concept Customization of Text-to-Image Diffusion

December 8, 2022 · 745 words · Nupur Kumari, Bingliang Zhang, Richard Zhang, Eli Shechtman, Jun-Yan Zhu

ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation

December 7, 2022 · 1561 words · Yufei Xu, Jing Zhang, Qiming Zhang, Dacheng Tao

Robust Speech Recognition via Large-Scale Weak Supervision

December 6, 2022 · 1590 words · Alec Radford, Jong Wook Kim, Tao Xu, Greg Brockman, Christine McLeavey and 1 others

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

December 6, 2022 · 898 words · Yi Wang, Kunchang Li, Yizhuo Li, Yinan He, Bingkun Huang and 12 others

Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution

December 3, 2022 · 1494 words · Wentong Li, Wenyu Liu, Jianke Zhu, Miaomiao Cui, Risheng Yu and 2 others

Scaling Language-Image Pre-training via Masking

December 1, 2022 · 449 words · Yanghao Li, Haoqi Fan, Ronghang Hu, Christoph Feichtenhofer, Kaiming He

Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation

December 1, 2022 · 853 words · Haochen Wang, Xiaodan Du, Jiahao Li, Raymond A. Yeh, Greg Shakhnarovich

November⁸

Paint by Example: Exemplar-based Image Editing with Diffusion Models

November 23, 2022 · 620 words · Binxin Yang, Shuyang Gu, Bo Zhang, Ting Zhang, Xuejin Chen and 3 others

DAMO-YOLO : A Report on Real-Time Object Detection Design

November 23, 2022 · 509 words · Xianzhe Xu, Yiqi Jiang, Weihua Chen, Yilun Huang, Yuan Zhang and 1 others

DiffusionDet: Diffusion Model for Object Detection

November 17, 2022 · 1045 words · Shoufa Chen, Peize Sun, Yibing Song, Ping Luo

VeLO: Training Versatile Learned Optimizers by Scaling Up

November 17, 2022 · 2452 words · Luke Metz, James Harrison, C. Daniel Freeman, Amil Merchant, Lucas Beyer and 6 others

DeepPrivacy2: Towards Realistic Full-Body Anonymization

November 17, 2022 · 1063 words · Håkon Hukkelås, Frank Lindseth

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model

November 15, 2022 · 999 words · Xingqian Xu, Zhangyang Wang, Eric Zhang, Kai Wang, Humphrey Shi

EVA: Exploring the Limits of Masked Visual Representation Learning at Scale

November 14, 2022 · 968 words · Yuxin Fang, Wen Wang, Binhui Xie, Quan Sun, Ledell Wu and 4 others

MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model

November 1, 2022 · 610 words · Junde Wu, Huihui Fang, Yu Zhang, Yehui Yang, Yanwu Xu

October¹⁴

NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields

October 24, 2022 · 1081 words · Antoni Rosinol, John J. Leonard, Luca Carlone

MetaFormer Baselines for Vision

October 24, 2022 · 614 words · Weihao Yu, Chenyang Si, Pan Zhou, Mi Luo, Yichen Zhou and 3 others

High Fidelity Neural Audio Compression

October 24, 2022 · 1126 words · Alexandre Défossez, Jade Copet, Gabriel Synnaeve, Yossi Adi

FaceDancer: Pose- and Occlusion-Aware High Fidelity Face Swapping

October 19, 2022 · 1084 words · Felix Rosberg, Eren Erdal Aksoy, Fernando Alonso-Fernandez, Cristofer Englund

Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective

October 16, 2022 · 540 words · Ping Yang, Junjie Wang, Ruyi Gan, Xinyu Zhu, Lin Zhang and 4 others

Deep Differentiable Logic Gate Networks

October 15, 2022 · 969 words · Felix Petersen, Christian Borgelt, Hilde Kuehne, Oliver Deussen

Utilizing supervised models to infer consensus labels and their quality from data with multiple annotators

October 13, 2022 · 697 words · Hui Wen Goh, Ulyana Tkachenko, Jonas Mueller

Point Transformer V2: Grouped Vector Attention and Partition-based Pooling

October 11, 2022 · 682 words · Xiaoyang Wu, Yixing Lao, Li Jiang, Xihui Liu, Hengshuang Zhao

Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts

October 7, 2022 · 853 words · Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados

GNM: A General Navigation Model to Drive Any Robot

October 7, 2022 · 850 words · Dhruv Shah, Ajay Sridhar, Arjun Bhorkar, Noriaki Hirose, Sergey Levine

Binding Language Models in Symbolic Languages

October 6, 2022 · 945 words · Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni and 7 others

GLM-130B: An Open Bilingual Pre-trained Model

October 5, 2022 · 2026 words · Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai and 13 others

SHINE-Mapping: Large-Scale 3D Mapping Using Sparse Hierarchical Implicit Neural Representations

October 5, 2022 · 776 words · Xingguang Zhong, Yue Pan, Jens Behley, Cyrill Stachniss

Is Reinforcement Learning (Not) for Natural Language Processing?: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization

October 3, 2022 · 981 words · Rajkumar Ramamurthy, Prithviraj Ammanabrolu, Kianté Brantley, Jack Hessel, Rafet Sifa and 3 others

September⁷

DreamFusion: Text-to-3D using 2D Diffusion

September 29, 2022 · 1290 words · Ben Poole, Ajay Jain, Jonathan T. Barron, Ben Mildenhall

Personalizing Text-to-Image Generation via Aesthetic Gradients

September 25, 2022 · 164 words · Victor Gallego

A Closer Look at Learned Optimization: Stability, Robustness, and Inductive Biases

September 22, 2022 · 1097 words · James Harrison, Luke Metz, Jascha Sohl-Dickstein

GET3D: A Generative Model of High Quality 3D Textured Shapes Learned from Images

September 22, 2022 · 766 words · Jun Gao, Tianchang Shen, Zian Wang, Wenzheng Chen, Kangxue Yin and 4 others

Efficient Few-Shot Learning Without Prompts

September 22, 2022 · 745 words · Lewis Tunstall, Nils Reimers, Unso Eun Seo Jo, Luke Bates, Daniel Korat and 2 others

Delving into the Devils of Bird’s-eye-view Perception: A Review, Evaluation and Recipe

September 12, 2022 · 2173 words · Hongyang Li, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu and 20 others

A Survey on Generative Diffusion Model

September 6, 2022 · 2992 words · Hanqun Cao, Cheng Tan, Zhangyang Gao, Guangyong Chen, Pheng-Ann Heng and 1 others

August⁷

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

August 25, 2022 · 902 words · Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Yael Pritch, Michael Rubinstein and 1 others

Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned

August 23, 2022 · 1111 words · Deep Ganguli, Liane Lovitt, Jackson Kernion, Amanda Askell, Yuntao Bai and 31 others

Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise

August 19, 2022 · 1243 words · Arpit Bansal, Eitan Borgnia, Hong-Min Chu, Jie S. Li, Hamid Kazemi and 4 others

BoW3D: Bag of Words for Real-Time Loop Closing in 3D LiDAR SLAM

August 15, 2022 · 1001 words · Yunge Cui, Xieyuanli Chen, Yinlong Zhang, Jiahua Dong, Qingxiao Wu and 1 others

LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale

August 15, 2022 · 1219 words · Tim Dettmers, Mike Lewis, Younes Belkada, Luke Zettlemoyer

TotalSegmentator: robust segmentation of 104 anatomical structures in CT images

August 11, 2022 · 691 words · Jakob Wasserthal, Manfred Meyer, Hanns-Christian Breit, Joshy Cyriac, Shan Yang and 1 others

An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion

August 2, 2022 · 1167 words · Rinon Gal, Yuval Alaluf, Yuval Atzmon, Or Patashnik, Amit H. Bermano and 2 others

July⁴

Neural Density-Distance Fields

July 29, 2022 · 897 words · Itsuki Ueda, Yoshihiro Fukuhara, Hirokatsu Kataoka, Hiroaki Aizawa, Hidehiko Shishido and 1 others

HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions

July 28, 2022 · 865 words · Yongming Rao, Wenliang Zhao, Yansong Tang, Jie Zhou, Ser-Nam Lim and 1 others

Patchwork++: Fast and Robust Ground Segmentation Solving Partial Under-Segmentation Using 3D Point Cloud

July 25, 2022 · 1192 words · Seungjae Lee, Hyungtae Lim, Hyun Myung

YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

July 6, 2022 · 1009 words · Chien-Yao Wang, Alexey Bochkovskiy, Hong-Yuan Mark Liao

June⁶

Feature Refinement to Improve High Resolution Image Inpainting

June 27, 2022 · 334 words · Prakhar Kulshreshtha, Brian Pugh, Salma Jiddi

Towards Robust Blind Face Restoration with Codebook Lookup Transformer

June 22, 2022 · 844 words · Shangchen Zhou, Kelvin C. K. Chan, Chongyi Li, Chen Change Loy

EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine

June 21, 2022 · 918 words · Jiayi Weng, Min Lin, Shengyi Huang, Bo Liu, Denys Makoviichuk and 7 others

Global Context Vision Transformers

June 20, 2022 · 768 words · Ali Hatamizadeh, Hongxu Yin, Jan Kautz, Pavlo Molchanov

MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge

June 17, 2022 · 971 words · Linxi Fan, Guanzhi Wang, Yunfan Jiang, Ajay Mandlekar, Yuncong Yang and 5 others

VPIT: Real-time Embedded Single Object 3D Tracking Using Voxel Pseudo Images

June 6, 2022 · 1096 words · Illia Oleksiienko, Paraskevi Nousi, Nikolaos Passalis, Anastasios Tefas, Alexandros Iosifidis

May⁴

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

May 27, 2022 · 816 words · Tri Dao, Daniel Y. Fu, Stefano Ermon, Atri Rudra, Christopher Ré

GIT: A Generative Image-to-text Transformer for Vision and Language

May 27, 2022 · 866 words · Jianfeng Wang, Zhengyuan Yang, Xiaowei Hu, Linjie Li, Kevin Lin and 4 others

KERPLE: Kernelized Relative Positional Embedding for Length Extrapolation

May 20, 2022 · 874 words · Ta-Chung Chi, Ting-Han Fan, Peter J. Ramadge, Alexander I. Rudnicky

Vectorized and performance-portable Quicksort

May 12, 2022 · 1132 words · Mark Blacher, Joachim Giesen, Peter Sanders, Jan Wassenberg

April²

Fast Sampling of Diffusion Models with Exponential Integrator

April 29, 2022 · 1042 words · Qinsheng Zhang, Yongxin Chen

Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

April 16, 2022 · 740 words · Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei and 35 others

March³

A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation

March 18, 2022 · 683 words · Rachel M. Bittner, Juan José Bosch, David Rubinstein, Gabriel Meseguer-Brocal, Sebastian Ewert

BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis

March 10, 2022 · 1019 words · Haiyang Liu, Zihao Zhu, Naoya Iwamoto, Yichen Peng, Zhengqing Li and 3 others

Training language models to follow instructions with human feedback

March 4, 2022 · 973 words · Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright and 15 others

February¹

Pseudo Numerical Methods for Diffusion Models on Manifolds

February 20, 2022 · 860 words · Luping Liu, Yi Ren, Zhijie Lin, Zhou Zhao

January⁵

BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

January 28, 2022 · 1228 words · Junnan Li, Dongxu Li, Caiming Xiong, Steven Hoi

Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning

January 28, 2022 · 1637 words · Lianmin Zheng, Zhuohan Li, Hao Zhang, Yonghao Zhuang, Zhifeng Chen and 7 others

Instant Neural Graphics Primitives with a Multiresolution Hash Encoding

January 16, 2022 · 1215 words · Thomas Müller, Alex Evans, Christoph Schied, Alexander Keller

PromptBERT: Improving BERT Sentence Embeddings with Prompts

January 12, 2022 · 1070 words · Ting Jiang, Jian Jiao, Shaohan Huang, Zihan Zhang, Deqing Wang and 5 others

DM-VIO: Delayed Marginalization Visual-Inertial Odometry

January 11, 2022 · 1006 words · Lukas von Stumberg, Daniel Cremers

2021¹²

December⁴

High-Resolution Image Synthesis with Latent Diffusion Models

December 20, 2021 · 1054 words · Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer

Image Segmentation Using Text and Image Prompts

December 18, 2021 · 768 words · Timo Lüddecke, Alexander S. Ecker

ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction

December 6, 2021 · 997 words · Xiaoming Zhao, Xingming Wu, Jinyu Miao, Weihai Chen, Peter C. Y. Chen and 1 others

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

December 4, 2021 · 966 words · Edresson Casanova, Julian Weber, Christopher Shulby, Arnaldo Candido Junior, Eren Gölge and 1 others

November¹

LiT: Zero-Shot Transfer with Locked-image text Tuning

November 15, 2021 · 1140 words · Xiaohua Zhai, Xiao Wang, Basil Mustafa, Andreas Steiner, Daniel Keysers and 2 others

October¹

Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel Training

October 28, 2021 · 1134 words · Shenggui Li, Jiarui Fang, Zhengda Bian, Hongxin Liu, Yuliang Liu and 3 others

May¹

Diffusion Models Beat GANs on Image Synthesis

May 11, 2021 · 1142 words · Prafulla Dhariwal, Alex Nichol

April¹

RoFormer: Enhanced Transformer with Rotary Position Embedding

April 20, 2021 · 981 words · Jianlin Su, Yu Lu, Shengfeng Pan, Ahmed Murtadha, Bo Wen and 1 others

March¹

GLM: General Language Model Pretraining with Autoregressive Blank Infilling

March 18, 2021 · 860 words · Zhengxiao Du, Yujie Qian, Xiao Liu, Ming Ding, Jiezhong Qiu and 2 others

February³

Improved Denoising Diffusion Probabilistic Models

February 18, 2021 · 1164 words · Alex Nichol, Prafulla Dhariwal

FedBN: Federated Learning on Non-IID Features via Local Batch Normalization

February 15, 2021 · 1169 words · Xiaoxiao Li, Meirui Jiang, Xiaofei Zhang, Michael Kamp, Qi Dou

Ivy: Templated Deep Learning for Inter-Framework Portability

February 4, 2021 · 1217 words · Daniel Lenton, Fabio Pardo, Fabian Falck, Stephen James, Ronald Clark

2020⁵

October¹

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

October 22, 2020 · 927 words · Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai and 7 others

July¹

Gender Classification and Bias Mitigation in Facial Images

July 13, 2020 · 1138 words · Wenying Wu, Pavlos Protopapas, Zheng Yang, Panagiotis Michalatos

May¹

Smooth Exploration for Robotic Reinforcement Learning

May 12, 2020 · 774 words · Antonin Raffin, Jens Kober, Freek Stulp

March¹

Efficient Content-Based Sparse Attention with Routing Transformers

March 12, 2020 · 1464 words · Aurko Roy, Mohammad Saffar, Ashish Vaswani, David Grangier

February¹

GLU Variants Improve Transformer

February 12, 2020 · 414 words · Noam Shazeer

2019³

December²

nnAudio: An on-the-fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolution Neural Networks

December 27, 2019 · 1807 words · Kin Wai Cheuk, Hans Anderson, Kat Agres, Dorien Herremans

Libri-Light: A Benchmark for ASR with Limited or No Supervision

December 17, 2019 · 754 words · Jacob Kahn, Morgane Rivière, Weiyi Zheng, Evgeny Kharitonov, Qiantong Xu and 10 others

September¹

Fine-Tuning Language Models from Human Preferences

September 18, 2019 · 1109 words · Daniel M. Ziegler, Nisan Stiennon, Jeffrey Wu, Tom B. Brown, Alec Radford and 3 others

2018²

April¹

SpatioTemporal Feature Integration and Model Fusion for Full Reference Video Quality Assessment

April 13, 2018 · 1138 words · Christos G. Bampis, Zhi Li, Alan C. Bovik

March¹

Path Aggregation Network for Instance Segmentation

March 5, 2018 · 885 words · Shu Liu, Lu Qi, Haifang Qin, Jianping Shi, Jiaya Jia

2017²

June²

Multi-scale Multi-band DenseNets for Audio Source Separation

June 29, 2017 · 722 words · Naoya Takahashi, Yuki Mitsufuji

Attention Is All You Need

June 12, 2017 · 1044 words · Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones and 3 others

2016¹

September¹

Image-to-Markup Generation with Coarse-to-Fine Attention

September 16, 2016 · 770 words · Yuntian Deng, Anssi Kanervisto, Jeffrey Ling, Alexander M. Rush

2015¹

March¹

FaceNet: A Unified Embedding for Face Recognition and Clustering

March 12, 2015 · 1097 words · Florian Schroff, Dmitry Kalenichenko, James Philbin

2013¹

June¹

Sinkhorn Distances: Lightspeed Computation of Optimal Transportation Distances

June 4, 2013 · 636 words · Marco Cuturi

2023 404

March 143

Learning and Verification of Task Structure in Instructional Videos

DreamBooth3D: Subject-Driven Text-to-3D Generation

The Quantization Model of Neural Scaling

Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense

3D-POP – An automated annotation approach to facilitate markerless 2D-3D tracking of freely moving birds with marker-based motion capture

Reinforcement Learning with Exogenous States and Rewards

Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions

FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models

LFM-3D: Learnable Feature Matching Across Wide Baselines Using 3D Signals

Can we trust the evaluation on ChatGPT?

Adaptive Conformal Prediction by Reweighting Nonconformity Score

RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation

RaBit: Parametric Modeling of 3D Biped Cartoon Characters with a Topological-consistent Dataset

MEGA: Multilingual Evaluation of Generative AI

Large Language Models Can Be Used to Estimate the Ideologies of Politicians in a Zero-Shot Learning Setting

Visual Representation Learning from Unlabeled Video using Contrastive Masked Autoencoders

Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models

Probabilistic Domain Adaptation for Biomedical Image Segmentation

ExtremeNeRF: Few-shot Neural Radiance Fields Under Unconstrained Illumination

Equiangular Basis Vectors

Learning Context-aware Classifier for Semantic Segmentation

Novel Class Discovery for 3D Point Cloud Semantic Segmentation

Inversion by Direct Iteration: An Alternative to Denoising Diffusion for Image Restoration

MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action

Reflexion: an autonomous agent with dynamic memory and self-reflection

Zero-1-to-3: Zero-shot One Image to 3D Object

Context-faithful Prompting for Large Language Models

SVDiff: Compact Parameter Space for Diffusion Fine-Tuning

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking

A Survey on Oversmoothing in Graph Neural Networks

PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing

NeRF-LOAM: Neural Implicit Representation for Large-Scale Incremental LiDAR Odometry and Mapping

Improving Uncertainty Quantification of Deep Classifiers via Neighborhood Conformal Prediction: Novel Algorithm and Theoretical Analysis

Two Kinds of Recall

Can AI-Generated Text be Reliably Detected?

On the De-duplication of LAION-2B

A Recipe for Watermarking Diffusion Models

GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models

A Robustness Analysis of Blind Source Separation

$α$Surf: Implicit Surface Reconstruction for Semi-Transparent and Thin Objects with Decoupled Geometry and Opacity

Towards a Foundation Model for Neural Network Wavefunctions

Adversarial Counterfactual Visual Explanations

A Dynamic Multi-Scale Voxel Flow Network for Video Prediction

Trained on 100 million words and still in shape: BERT meets British National Corpus

CoLT5: Faster Long-Range Transformers with Conditional Computation

Efficient Diffusion Training via Min-SNR Weighting Strategy

LERF: Language Embedded Radiance Fields

SemDeDup: Data-efficient learning at web-scale through semantic deduplication

FateZero: Fusing Attentions for Zero-shot Text-based Video Editing

$P+$: Extended Textual Conditioning in Text-to-Image Generation

Jump to Conclusions: Short-Cutting Transformers With Linear Transformations

NeRFMeshing: Distilling Neural Radiance Fields into Geometrically-Accurate 3D Meshes

Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation

Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation

GLEN: General-Purpose Event Detection for Thousands of Types

Secret-Keeping in Question Answering

Translating Radiology Reports into Plain Language using ChatGPT and GPT-4 with Prompt Learning: Promising Results, Limitations, and Potential

ART: Automatic multi-step reasoning and tool-use for large language models

UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation

Rotation-Invariant Transformer for Point Cloud Matching

Allegro-Legato: Scalable, Fast, and Robust Neural-Network Quantum Molecular Dynamics via Sharpness-Aware Minimization

Blind Video Deflickering by Neural Filtering with a Flawed Atlas

Simfluence: Modeling the Influence of Individual Training Examples by Simulating Training Runs

A Theory of Emergent In-Context Learning as Implicit Structure Induction

I$^2$-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs

FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization

Erasing Concepts from Diffusion Models

Meet in the Middle: A New Pre-training Paradigm

Scaling Vision-Language Models with Sparse Mixture of Experts

High-throughput Generative Inference of Large Language Models with a Single GPU

Universal Instance Perception as Object Discovery and Retrieval

One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale

Prefix-tree Decoding for Predicting Mass Spectra from Molecules

Probing neural representations of scene perception in a hippocampally dependent task using artificial neural networks

Resurrecting Recurrent Neural Networks for Long Sequences

Stabilizing Transformer Training by Preventing Attention Entropy Collapse

StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces

Rewarding Chatbots for Real-World Engagement with Millions of Users

2023⁴⁰⁴

March¹⁴³

February¹¹⁶