Bayesian analysis for mixtures of discrete distributions with a non-parametric component


Bayesian finite mixture modelling is a flexible parametric approach to classification and density fitting. Many applications require distinguishing a signal component from a noise component. In practice, it is often difficult to justify a specific distribution for the signal component, so the signal distribution is usually modelled as a mixture of distributions in its own right. However, this is computationally non-trivial, both because the exact number of components is hard to justify and because of the label switching problem. This paper proposes using a non-parametric distribution to model the signal component. We consider the case of discrete data and show that this methodology leads to more accurate parameter estimation and a smaller false non-discovery rate. Moreover, it does not incur the label switching problem. We show an application of the method to data generated by ChIP-sequencing experiments.
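As a rough illustration of the signal/noise idea in the abstract, the sketch below computes the posterior probability that a single count belongs to the signal component when the signal is described by an arbitrary probability table over counts rather than by a named distribution. It is not the paper's estimation procedure: the Poisson noise model, the fixed parameter values, and the function names `poisson_pmf`, `signal_posterior` and `classify` are all assumptions made for this example, and the parameters are treated as known instead of being sampled.

```python
import math


def poisson_pmf(y, lam):
    """Poisson probability of count y; the Poisson noise model is an assumption."""
    return math.exp(-lam) * lam ** y / math.factorial(y)


def signal_posterior(y, noise_weight, lam, signal_pmf):
    """Posterior probability that count y comes from the signal component.

    signal_pmf is a dict mapping counts to probabilities, i.e. a
    non-parametric description of the signal (hypothetical values below).
    """
    noise = noise_weight * poisson_pmf(y, lam)
    signal = (1.0 - noise_weight) * signal_pmf.get(y, 0.0)
    total = noise + signal
    return signal / total if total > 0 else 0.0


def classify(counts, noise_weight, lam, signal_pmf, threshold=0.5):
    """Label each count as signal (True) or noise (False)."""
    return [
        signal_posterior(y, noise_weight, lam, signal_pmf) >= threshold
        for y in counts
    ]


# Hypothetical parameter values, chosen only to make the example concrete.
example_signal_pmf = {3: 0.1, 5: 0.2, 8: 0.3, 12: 0.4}
example_labels = classify([0, 1, 8, 12], noise_weight=0.9, lam=1.0,
                          signal_pmf=example_signal_pmf)
```

Because the signal is a single probability table, there is only one signal component to label. This is one way to see why such a model would avoid the label switching problem mentioned in the abstract.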

Bayesian analysis for mixtures of discrete distributions with a non-parametric component / Alhaji, B. B.; Dai, H.; Hayashi, Y.; Vinciotti, V.; Harrison, A.; Lausen, B.. - In: JOURNAL OF APPLIED STATISTICS. - ISSN 0266-4763. - 43:8(2016), pp. 1369-1385. [10.1080/02664763.2015.1100594]


Alhaji B. B.; Dai H.; Hayashi Y.; Vinciotti V.; Harrison A.; Lausen B.

2016-01-01



Date of publication: 2016
Journal title: JOURNAL OF APPLIED STATISTICS
Issue number and part: 8
DOI: https://dx.doi.org/10.1080/02664763.2015.1100594
Scopus identifier: 2-s2.0-84945335919
WOS identifier: WOS:000373938600001
All authors: Alhaji, B. B.; Dai, H.; Hayashi, Y.; Vinciotti, V.; Harrison, A.; Lausen, B.
Appears in the following types: 03.1 Journal article


Use this identifier to cite or link to this document: https://hdl.handle.net/11572/276056
