Sparse Training Theory for Scalable and Efficient Agents: Blue Sky Ideas Track

Decebal Constantin Mocanu, Elena Mocanu, Tiago Pinto, Selima Curci, Phuong H. Nguyen, Madeleine Gibescu, Damien Ernst, Zita A. Vale

Research output: Working paperPreprintAcademic

17 Downloads (Pure)

Abstract

A fundamental task for artificial intelligence is learning. Deep Neural Networks have proven to cope perfectly with all learning paradigms, i.e. supervised, unsupervised, and reinforcement learning. Nevertheless, traditional deep learning approaches make use of cloud computing facilities and do not scale well to autonomous agents with low computational resources. Even in the cloud, they suffer from computational and memory limitations, and they cannot be used to model adequately large physical worlds for agents which assume networks with billions of neurons. These issues are addressed in the last few years by the emerging topic of sparse training, which trains sparse networks from scratch. This paper discusses sparse training state-of-the-art, its challenges and limitations while introducing a couple of new theoretical research directions which has the potential of alleviating sparse training limitations to push deep learning scalability well beyond its current boundaries. Nevertheless, the theoretical advancements impact in complex multi-agents settings is discussed from a real-world perspective, using the smart grid case study.
Original languageEnglish
PublisherArXiv.org
Number of pages6
DOIs
Publication statusPublished - 2 Mar 2021

Keywords

  • cs.AI
  • cs.LG
  • cs.MA
  • cs.NE

Fingerprint

Dive into the research topics of 'Sparse Training Theory for Scalable and Efficient Agents: Blue Sky Ideas Track'. Together they form a unique fingerprint.
  • Sparse Training Theory for Scalable and Efficient Agents

    Mocanu, D. C., Mocanu, E., Pinto, T., Curci, S., Nguyen, P. H., Gibescu, M., Ernst, D. & Vale, Z., 3 May 2021, AAMAS '21: Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems. Dignum, F., Lomuscio, A., Endriss, U. & Nowé, A. (eds.). ACM Publishing, p. 34-38

    Research output: Chapter in Book/Report/Conference proceedingChapterAcademicpeer-review

    Open Access
    File

Cite this