Use this identifier to cite or link to this record: http://hdl.handle.net/10071/32038
Full record
DC Field | Value | Language
dc.contributor.author | Susskind, Z. | -
dc.contributor.author | Arora, A. | -
dc.contributor.author | Miranda, I. D. S. | -
dc.contributor.author | Villon, L. A. Q. | -
dc.contributor.author | Katopodis, R. F. | -
dc.contributor.author | Araújo, L. S. | -
dc.contributor.author | Dutra, D. L. C. | -
dc.contributor.author | Lima, P. M. V. | -
dc.contributor.author | França, F. M. G. | -
dc.contributor.author | Breternitz Jr., M. | -
dc.contributor.author | John, L. K. | -
dc.contributor.editor | Andreas Kloeckner | -
dc.contributor.editor | José Moreira | -
dc.date.accessioned | 2024-07-12T10:01:51Z | -
dc.date.available | 2024-07-12T10:01:51Z | -
dc.date.issued | 2023 | -
dc.identifier.citation | Susskind, Z., Arora, A., Miranda, I. D. S., Villon, L. A. Q., Katopodis, R. F., Araújo, L. S., Dutra, D. L. C., Lima, P. M. V., França, F. M. G., Breternitz Jr., M., & John, L. K. (2023). Weightless neural networks for efficient edge inference. In A. Kloeckner, & J. Moreira (Eds.). PACT '22: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques (pp. 279–290). ACM - Association for Computing Machinery. https://doi.org/10.1145/3559009.3569680 | -
dc.identifier.isbn | 978-1-4503-9868-8 | -
dc.identifier.uri | http://hdl.handle.net/10071/32038 | -
dc.description.abstract | Weightless neural networks (WNNs) are a class of machine learning model which use table lookups to perform inference, rather than the multiply-accumulate operations typical of deep neural networks (DNNs). Individual weightless neurons are capable of learning non-linear functions of their inputs, a theoretical advantage over the linear neurons in DNNs, yet state-of-the-art WNN architectures still lag behind DNNs in accuracy on common classification tasks. Additionally, many existing WNN architectures suffer from high memory requirements, hindering implementation. In this paper, we propose a novel WNN architecture, BTHOWeN, with key algorithmic and architectural improvements over prior work, namely counting Bloom filters, hardware-friendly hashing, and Gaussian-based nonlinear thermometer encodings. These enhancements improve model accuracy while reducing size and energy per inference. BTHOWeN targets the large and growing edge computing sector by providing superior latency and energy efficiency to both prior WNNs and comparable quantized DNNs. Compared to state-of-the-art WNNs across nine classification datasets, BTHOWeN on average reduces error by more than 40% and model size by more than 50%. We demonstrate the viability of a hardware implementation of BTHOWeN by presenting an FPGA-based inference accelerator, and compare its latency and resource usage against similarly accurate quantized DNN inference accelerators, including multi-layer perceptron (MLP) and convolutional models. The proposed BTHOWeN models consume almost 80% less energy than the MLP models, with nearly 85% reduction in latency. In our quest for efficient ML on the edge, WNNs are clearly deserving of additional attention. | eng
dc.language.iso | eng | -
dc.publisher | ACM - Association for Computing Machinery | -
dc.relation | 3015.001/3016.001 | -
dc.relation | 1763848 | -
dc.relation | info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F04466%2F2020/PT | -
dc.relation | info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDP%2F04466%2F2020/PT | -
dc.relation | 310676/2019-3 | -
dc.relation | POCI-01-0247-FEDER-045912 | -
dc.relation.ispartof | PACT '22: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques | -
dc.rights | openAccess | -
dc.subject | Weightless Neural Networks | eng
dc.subject | WNN | eng
dc.subject | WiSARD | eng
dc.subject | Redes neuronais -- Neural networks | eng
dc.subject | Hardware acceleration | eng
dc.subject | Inferência -- Inference | eng
dc.subject | Edge computing | eng
dc.title | Weightless neural networks for efficient edge inference | eng
dc.type | conferenceObject | -
dc.event.title | International Conference on Parallel Architectures and Compilation Techniques | -
dc.event.type | Conferência | pt
dc.event.location | Chicago, Illinois | eng
dc.event.date | 2022 | -
dc.pagination | 279 - 290 | -
dc.peerreviewed | yes | -
dc.date.updated | 2024-07-12T10:39:27Z | -
dc.description.version | info:eu-repo/semantics/acceptedVersion | -
dc.identifier.doi | 10.1145/3559009.3569680 | -
iscte.identifier.ciencia | https://ciencia.iscte-iul.pt/id/ci-pub-92348 | -
iscte.alternateIdentifiers.wos | WOS:001071492700021 | -
iscte.alternateIdentifiers.scopus | 2-s2.0-85147333502 | -
Appears in collections: ISTAR-CRI - Comunicações a conferências internacionais

Files in this record:
File | Size | Format
conferenceObject_92348.pdf | 1.08 MB | Adobe PDF



All records in the repository are protected by copyright law, with all rights reserved.