HumBugDB: A Large-scale Acoustic Mosquito Dataset
Ivan Kiskin · Marianne Sinka · Adam Cobb · Waqas Rafique · Lawrence Wang · Davide Zilli · Benjamin Gutteridge · Rinita Dam · Theodoros Marinos · Yunpeng Li · Dickson Msaky · Emmanuel Kaindoa · Gerard Killeen · Eva Herreros-Moya · Kathy Willis · Stephen J Roberts

Fri Dec 10 12:30 AM -- 12:40 AM (PST)

This paper presents the first large-scale multi-species dataset of acoustic recordings of mosquitoes tracked continuously in free flight. We present 20 hours of audio recordings that we have expertly labelled and tagged precisely in time. Significantly, 18 hours of recordings contain annotations from 36 different species. Mosquitoes are well-known carriers of diseases such as malaria, dengue and yellow fever. Collecting this dataset is motivated by the need to assist applications which utilise mosquito acoustics to conduct surveys to help predict outbreaks and inform intervention policy. The task of detecting mosquitoes from the sound of their wingbeats is challenging due to the difficulty in collecting recordings from realistic scenarios. To address this, as part of the HumBug project, we conducted global experiments to record mosquitoes ranging from those bred in culture cages to mosquitoes captured in the wild. Consequently, the audio recordings vary in signal-to-noise ratio and contain a broad range of indoor and outdoor background environments from Tanzania, Thailand, Kenya, the USA and the UK. In this paper we describe in detail how we collected, labelled and curated the data. The data is provided from a PostgreSQL database, which captures important metadata such as the capture method, age, feeding status and gender of the mosquitoes. Additionally, we provide code to extract features and train Bayesian convolutional neural networks for two key tasks: the identification of mosquitoes from their corresponding background environments, and the classification of detected mosquitoes into species. Our extensive dataset is both challenging to machine learning researchers focusing on acoustic identification, and critical to entomologists, geo-spatial modellers and other domain experts to understand mosquito behaviour, model their distribution, and manage the threat they pose to humans.
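One reason to use Bayesian networks for the detection and species-classification tasks is that they can report how uncertain each prediction is. The sketch below is illustrative only and is not taken from the released code. It assumes the classifier has been evaluated several times on the same audio clip with stochastic forward passes (for example with dropout left on at test time, which is one common approximation and not necessarily the authors' method), and that each pass yields a vector of class probabilities. The function name `predictive_summary` and the toy inputs are hypothetical.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of one probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def predictive_summary(samples):
    """Summarise T stochastic predictions for a single audio clip.

    samples: list of T probability vectors, one per stochastic forward pass
             (hypothetical input; each vector should sum to 1).
    Returns (mean_probs, predictive_entropy, mutual_information).
    """
    n_samples = len(samples)
    n_classes = len(samples[0])

    # Predictive mean: average class probabilities over the T passes.
    mean_probs = [
        sum(sample[c] for sample in samples) / n_samples
        for c in range(n_classes)
    ]

    # Total uncertainty: entropy of the averaged prediction.
    predictive_entropy = entropy(mean_probs)

    # Expected entropy of the individual passes (data uncertainty).
    expected_entropy = sum(entropy(s) for s in samples) / n_samples

    # Mutual information: the part of the uncertainty caused by
    # disagreement between passes (model uncertainty).
    mutual_information = predictive_entropy - expected_entropy
    return mean_probs, predictive_entropy, mutual_information


# Toy example: two passes that agree, and two passes that disagree.
agree = predictive_summary([[0.9, 0.1], [0.9, 0.1]])
disagree = predictive_summary([[1.0, 0.0], [0.0, 1.0]])
```

When the passes agree, the mutual information is close to zero even if the prediction itself is not fully confident. When they disagree, it grows, which can be used to flag clips (for example, noisy field recordings) for manual review.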

Author Information

Ivan Kiskin (University of Oxford)
Marianne Sinka
Adam Cobb (US Army Research Lab)
Waqas Rafique (University College London)
Lawrence Wang
Davide Zilli (Mind Foundry Ltd.)
Benjamin Gutteridge (University of Oxford)
Rinita Dam
Theodoros Marinos
Yunpeng Li (University of Oxford)
Dickson Msaky
Emmanuel Kaindoa (Ifakara Health Institute)
Gerard Killeen
Eva Herreros-Moya
Kathy Willis
Stephen J Roberts (University of Oxford)

More from the Same Authors

  • 2021 : Relaxed-Responsibility Hierarchical Discrete VAEs »
    Matthew Willetts · Xenia Miscouridou · Stephen J Roberts · Chris C Holmes
  • 2021 : On-the-fly Strategy Adaptation for ad-hoc Agent Coordination »
    Jaleh Zand · Jack Parker-Holder · Stephen J Roberts
  • 2022 : Panel on Open Problems in Machine Learning Systems »
    Ivana Dusparic · Stephen J Roberts · Morine Amutorine · Jerome White · Murtuza Shergadwala
  • 2021 Poster: Tuning Mixed Input Hyperparameters on the Fly for Efficient Population Based AutoRL »
    Jack Parker-Holder · Vu Nguyen · Shaan Desai · Stephen J Roberts
  • 2020 Poster: Effective Diversity in Population Based Reinforcement Learning »
    Jack Parker-Holder · Aldo Pacchiano · Krzysztof M Choromanski · Stephen J Roberts
  • 2020 Spotlight: Effective Diversity in Population Based Reinforcement Learning »
    Jack Parker-Holder · Aldo Pacchiano · Krzysztof M Choromanski · Stephen J Roberts
  • 2020 Poster: Explicit Regularisation in Gaussian Noise Injections »
    Alexander Camuto · Matthew Willetts · Umut Simsekli · Stephen J Roberts · Chris C Holmes
  • 2020 Poster: Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits »
    Jack Parker-Holder · Vu Nguyen · Stephen J Roberts
  • 2019 : Poster Session »
    Gergely Flamich · Shashanka Ubaru · Charles Zheng · Josip Djolonga · Kristoffer Wickstrøm · Diego Granziol · Konstantinos Pitas · Jun Li · Robert Williamson · Sangwoong Yoon · Kwot Sin Lee · Julian Zilly · Linda Petrini · Ian Fischer · Zhe Dong · Alexander Alemi · Bao-Ngoc Nguyen · Rob Brekelmans · Tailin Wu · Aditya Mahajan · Alexander Li · Kirankumar Shiragur · Yair Carmon · Linara Adilova · SHIYU LIU · Bang An · Sanjeeb Dash · Oktay Gunluk · Arya Mazumdar · Mehul Motani · Julia Rosenzweig · Michael Kamp · Marton Havasi · Leighton P Barnes · Zhengqing Zhou · Yi Hao · Dylan Foster · Yuval Benjamini · Nati Srebro · Michael Tschannen · Paul Rubenstein · Sylvain Gelly · John Duchi · Aaron Sidford · Robin Ru · Stefan Zohren · Murtaza Dalal · Michael A Osborne · Stephen J Roberts · Moses Charikar · Jayakumar Subramanian · Xiaodi Fan · Max Schwarzer · Nicholas Roberts · Simon Lacoste-Julien · Vinay Prabhu · Aram Galstyan · Greg Ver Steeg · Lalitha Sankar · Yung-Kyun Noh · Gautam Dasarathy · Frank Park · Ngai-Man (Man) Cheung · Ngoc-Trung Tran · Linxiao Yang · Ben Poole · Andrea Censi · Tristan Sylvain · R Devon Hjelm · Bangjie Liu · Jose Gallego-Posada · Tyler Sypherd · Kai Yang · Jan Nikolas Morshuis
  • 2019 : HumBug Zooniverse: a crowdsourced acoustic mosquito dataset »
    Ivan Kiskin
  • 2019 : Poster session »
    Michael Melese Woldeyohannis · Bernardt Duvenhage · Nyamos Waigama · Asaye Bir Senay · Claire Babirye · Tensaye Ayalew · Kelechi Ogueji · Vinay Prabhu · Prabu Ravindran · Fadilulah Wahab · ChukwuNonso H Nwokoye · Paul Duckworth · Hafte Abera · Abebe Mideksa · Loubna Benabbou · Anugraha Sinha · Ivan Kiskin · Robert Soden · Tupokigwe Isagah · Rehema Mwawado · Yimer Mohammed · Bryan Wilder · Daniel Omeiza · Sunayana Rane · Richard Mgaya · Samsun Knight · Jessenia Gonzalez Villarreal · Eyob Beyene · Monika Obrocka Tulinska · Luis Fernando Cantu Diaz de Leon · Joseph Aro · Michael T Smith · Michael Famoroti · Praneeth Vepakomma · Ramesh Raskar · Debjani Bhowmick · Chukwunonso H Nwokoye · Alejandro Noriega Campero · Hope Mbelwa · Anusua Trivedi
  • 2019 : Poster Session »
    Eduard Gorbunov · Alexandre d'Aspremont · Lingxiao Wang · Liwei Wang · Boris Ginsburg · Alessio Quaglino · Camille Castera · Saurabh Adya · Diego Granziol · Rudrajit Das · Raghu Bollapragada · Fabian Pedregosa · Martin Takac · Majid Jahani · Sai Praneeth Karimireddy · Hilal Asi · Balint Daroczy · Leonard Adolphs · Aditya Rawal · Nicolas Brandt · Minhan Li · Giuseppe Ughi · Orlando Romero · Ivan Skorokhodov · Damien Scieur · Kiwook Bae · Konstantin Mishchenko · Rohan Anil · Vatsal Sharan · Aditya Balu · Chao Chen · Zhewei Yao · Tolga Ergen · Paul Grigas · Chris Junchi Li · Jimmy Ba · Stephen J Roberts · Sharan Vaswani · Armin Eftekhari · Chhavi Sharma
  • 2017 : Cost-sensitive detection with variational autoencoders for environmental acoustic sensing »
    Yunpeng Li · Stephen J Roberts
  • 2017 : Posters »
    Biswarup Bhattacharya · Darius Lam · Sandeep Vidyapu · Shreya Shankar · Therese Anders · Bryan Wilder · Muhammad R Khan · Yunpeng Li · Nazmus Saquib · Varun Kshirsagar · Anthony Perez · Pengfei Zhang · Shahrzad Gholami · Rediet Abebe
  • 2017 : Contributed talk: Safe Policy Search with Gaussian Process Models »
    Kyriakos Polymenakos · Stephen J Roberts
  • 2014 Poster: Sampling for Inference in Probabilistic Models with Fast Bayesian Quadrature »
    Tom Gunter · Michael A Osborne · Roman Garnett · Philipp Hennig · Stephen J Roberts
  • 2012 Poster: Active Learning of Model Evidence Using Bayesian Quadrature »
    Michael A Osborne · David Duvenaud · Roman Garnett · Carl Edward Rasmussen · Stephen J Roberts · Zoubin Ghahramani
  • 2006 Poster: Bayesian Image Super-resolution, Continued »
    Lyndsey C Pickup · David Capel · Stephen J Roberts · Andrew Zisserman
  • 2006 Spotlight: Bayesian Image Super-resolution, Continued »
    Lyndsey C Pickup · David Capel · Stephen J Roberts · Andrew Zisserman