Hierarchical Normalized Completely Random Measures to Cluster Grouped Data

dc.contributor.authorArgiento, Raffaele
dc.contributor.authorCremaschi, Andrea
dc.contributor.authorVannucci, Marina
dc.contributor.rorhttps://ror.org/02jjdwm75
dc.date.accessioned2026-05-27T10:28:46Z
dc.date.issued2019-05-17
dc.description.abstractIn this article, we propose a Bayesian nonparametric model for clustering grouped data. We adopt a hierarchical approach: at the highest level, each group of data is modeled according to a mixture, where the mixing distributions are conditionally independent normalized completely random measures (NormCRMs) centered on the same base measure, which is itself a NormCRM. The discreteness of the shared base measure implies that the processes at the data level share the same atoms. This desired feature allows to cluster together observations of different groups. We obtain a representation of the hierarchical clustering model by marginalizing with respect to the infinite dimensional NormCRMs. We investigate the properties of the clustering structure induced by the proposed model and provide theoretical results concerning the distribution of the number of clusters, within and between groups. Furthermore, we offer an interpretation in terms of generalized Chinese restaurant franchise process, which allows for posterior inference under both conjugate and nonconjugate models. We develop algorithms for fully Bayesian inference and assess performances by means of a simulation study and a real-data illustration. Supplementary materials for this article are available online.
dc.description.peerreviewedYes
dc.description.sponsorshipRaffaele Argiento gratefully acknowledges Collegio Carlo Alberto for partially funding this work.
dc.description.statusPublished
dc.formatapplication/pdf
dc.identifier.citationArgiento, R., Cremaschi, A., & Vannucci, M. (2020). Hierarchical normalized completely random measures to cluster grouped data. Journal of the American Statistical Association. https://doi.org/10.1080/01621459.2019.1594833
dc.identifier.doihttps://doi.org/10.1080/01621459.2019.1594833
dc.identifier.issn1537-274X
dc.identifier.officialurlhttps://www.tandfonline.com/doi/full/10.1080/01621459.2019.1594833
dc.identifier.urihttps://hdl.handle.net/20.500.14417/4373
dc.journal.titleJournal of the American Statistical Association
dc.language.isoeng
dc.page.total44
dc.publisherTaylor & Francis
dc.relation.entityIE University
dc.relation.schoolIE School of Science & Technology
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject.keywordsBayesian Nonparametrics
dc.subject.keywordsClustering
dc.subject.keywordsMixture Models
dc.subject.keywordsHierarchical Models
dc.subject.odsODS 3 - Salud y bienestar
dc.subject.unesco12 Matemáticas
dc.titleHierarchical Normalized Completely Random Measures to Cluster Grouped Data
dc.typeinfo:eu-repo/semantics/article
dc.version.typeinfo:eu-repo/semantics/acceptedVersion
dspace.entity.typePublication
relation.isAuthorOfPublication976c8dd3-a3ba-4b1a-9273-72c7ee16c39e
relation.isAuthorOfPublication.latestForDiscovery976c8dd3-a3ba-4b1a-9273-72c7ee16c39e

Bloque original

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
10.1080@01621459.2019.1594833.pdf
Tamaño:
1.08 MB
Formato:
Adobe Portable Document Format

Bloque de licencias

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
license.txt
Tamaño:
1.71 KB
Formato:
Item-specific license agreed to upon submission
Descripción: