Peer-to-peer deep learning with non-IID data

Šajina, Robert; Tanković, Nikola; Ipšić, Ivo

doi:10.1016/j.eswa.2022.119159

Bibliographic record number: 1230791

Peer-to-peer deep learning with non-IID data // Expert Systems with Applications, 214 (2023), 119159, 12 doi:10.1016/j.eswa.2022.119159 (international peer review, article, scientific)

CROSBI ID: 1230791

Title
Peer-to-peer deep learning with non-IID data

Authors
Šajina, Robert ; Tanković, Nikola ; Ipšić, Ivo

Source
Expert Systems with Applications (0957-4174) 214 (2023); 119159, 12

Type, subtype and category of work
Journal papers, article, scientific

Keywords
Peer-to-peer ; Gossip averaging ; Decentralized learning ; Batch normalization ; Neural network ; Machine learning

Abstract
Collaborative training of deep neural networks using edge devices has attracted substantial research interest recently. The two main architecture approaches for the training process are centrally orchestrated Federated Learning and fully decentralized peer-to-peer learning. In decentralized systems, edge devices, known as agents, collaborate in a peer-to-peer architecture, avoiding the need for a central system to orchestrate the process. Decentralized peer-to-peer (P2P) learning techniques are well researched under the assumption of independent and identically distributed (IID) data across the agents. IID data is seldom observed in real-world distributed systems, and the training performance varies significantly with non-IID data. This paper proposes a decentralized learning variant of the P2P gossip averaging method with Batch Normalization (BN) adaptation for P2P architectures. It is well-known that BN layers accelerate the convergence of the non-distributed deep learning models. Recent research confirms that Federated Learning methods benefit from using the BN method with some aggregation alterations. Our work demonstrated BN effectiveness in P2P architectures by mitigating the non-IID data characteristics across decentralized agents. We also introduce a variant of the early stopping technique that, combined with BN layers, acts as a fine-tuning technique for agent models. We validated our approach by conducting numerous simulations of different model-topology-communication combinations and comparing them to other decentralized baseline approaches. The evaluations were conducted on the next word prediction task using user comments from the Reddit and StackOverflow datasets representing comments from two different domains. Simulations showed that our approach, on average, achieves a mean relative top accuracy increase of 16.9% in ring (19.9% for Reddit, 13.9% for StackOverflow) and 29.8% in sparse (32.9% for Reddit, 26.6% for StackOverflow) communication topologies compared to the best baseline approach. Our code is available at https://github.com/fipu-lab/p2p_bn.
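
For readers unfamiliar with gossip averaging, the following is a minimal sketch of one synchronous averaging round on a ring topology. It is not the authors' implementation (that is in the repository linked above). The abstract does not say how the BN adaptation works, so the sketch assumes one plausible reading: BN parameters stay local to each agent and are excluded from averaging. The names `Params`, `is_bn_layer`, and `gossip_round`, and the convention that BN layers have names starting with "bn", are hypothetical.

```python
from typing import Dict, List

# Hypothetical representation: layer name -> flat list of weights.
Params = Dict[str, List[float]]


def is_bn_layer(name: str) -> bool:
    # Assumption: BN layers are identified by a "bn" name prefix.
    return name.startswith("bn")


def gossip_round(agents: List[Params]) -> List[Params]:
    """Run one synchronous gossip-averaging round on a ring.

    Each agent averages its non-BN parameters with those of its two
    ring neighbours. BN parameters are copied unchanged, which is an
    assumption of this sketch and not a detail taken from the abstract.
    """
    n = len(agents)
    updated: List[Params] = []
    for i, own in enumerate(agents):
        left = agents[(i - 1) % n]
        right = agents[(i + 1) % n]
        new: Params = {}
        for name, values in own.items():
            if is_bn_layer(name):
                new[name] = list(values)  # keep BN parameters local
            else:
                new[name] = [
                    (a + b + c) / 3.0
                    for a, b, c in zip(left[name], values, right[name])
                ]
        updated.append(new)
    return updated


# Three toy agents with one shared layer and one BN layer each.
agents = [
    {"dense": [0.0, 3.0], "bn": [1.0]},
    {"dense": [3.0, 0.0], "bn": [2.0]},
    {"dense": [6.0, 3.0], "bn": [3.0]},
]
agents = gossip_round(agents)
```

With three agents on a ring, every agent neighbours both others, so one round brings the shared `dense` layer to the global mean while each `bn` entry keeps its local value. On larger rings, several rounds would be needed before the shared layers approach consensus.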

Original language
English

Scientific fields
Computing, Information and communication sciences

RELATED INFORMATION

Institutions:
Sveučilište Jurja Dobrile u Puli,
Fakultet informatike i digitalnih tehnologija, Rijeka

Profiles:

Robert Šajina (author)

Ivo Ipšić (author)

Nikola Tanković (author)

Links to the full text of the work:

doi:10.1016/j.eswa.2022.119159, www.sciencedirect.com

Links to research data:

codeocean.com

Journal indexed in:

Current Contents Connect (CCC)
Web of Science Core Collection (WoSCC)

Science Citation Index Expanded (SCI-EXP)
SCI-EXP, SSCI and/or A&HCI

Scopus

Inclusion in other bibliographic databases:

INSPEC

CROSBI: Croatian Scientific Bibliography
