• xmlui.mirage2.page-structure.header.title
    • français
    • English
  • Help
  • Login
  • Language 
    • Français
    • English
View Item 
  •   BIRD Home
  • CEREMADE (UMR CNRS 7534)
  • CEREMADE : Publications
  • View Item
  •   BIRD Home
  • CEREMADE (UMR CNRS 7534)
  • CEREMADE : Publications
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Browse

BIRDResearch centres & CollectionsBy Issue DateAuthorsTitlesTypeThis CollectionBy Issue DateAuthorsTitlesType

My Account

LoginRegister

Statistics

Most Popular ItemsStatistics by CountryMost Popular Authors
Thumbnail

Missing data in a stochastic Dollo model for binary trait data, and its application to the dating of Proto-Indo-European

Nicholls, Geoff K; Ryder, Robin J. (2011), Missing data in a stochastic Dollo model for binary trait data, and its application to the dating of Proto-Indo-European, Journal of the Royal Statistical Society. Series C, Applied Statistics, 60, 1, p. 71-92. http://dx.doi.org/10.1111/j.1467-9876.2010.00743.x

View/Open
supplement.pdf (727.0Kb)
Type
Article accepté pour publication ou publié
Date
2011
Journal name
Journal of the Royal Statistical Society. Series C, Applied Statistics
Volume
60
Number
1
Publisher
Wiley
Pages
71-92
Publication identifier
http://dx.doi.org/10.1111/j.1467-9876.2010.00743.x
Metadata
Show full item record
Author(s)
Nicholls, Geoff K

Ryder, Robin J.
Abstract (EN)
Nicholls and Gray have described a phylogenetic model for trait data. They used their model to estimate branching times on Indo-European language trees from lexical data. Alekseyenko and co-workers extended the model and gave applications in genetics. We extend the inference to handle data missing at random. When trait data are gathered, traits are thinned in a way that depends on both the trait and the missing data content. Nicholls and Gray treated missing records as absent traits. Hittite has 12% missing trait records. Its age is poorly predicted in their cross-validation. Our prediction is consistent with the historical record. Nicholls and Gray dropped seven languages with too much missing data. We fit all 24 languages in the lexical data of Ringe and co-workers. To model spatiotemporal rate heterogeneity we add a catastrophe process to the model. When a language passes through a catastrophe, many traits change at the same time. We fit the full model in a Bayesian setting, via Markov chain Monte Carlo sampling. We validate our fit by using Bayes factors to test known age constraints. We reject three of 30 historically attested constraints. Our main result is a unimodal posterior distribution for the age of Proto-Indo-European centred at 8400 years before Present with 95% highest posterior density interval equal to 7100–9800 years before Present.
Subjects / Keywords
Bayesian inference; Dating methods; Markov chain Monte Carlo methods; Missing data; Phylogenetics; Proto-Indo-European; Rate heterogeneity

Related items

Showing items related by title and author.

  • Thumbnail
    TraitLab: A MatLab package for fitting and simulating binary tree-like data 
    Nicholls, Geoff K; Ryder, Robin J.; Welch, David (2011) Document de travail / Working paper
  • Thumbnail
    Phylogenetic models for Semitic core vocabularies 
    Ryder, Robin J.; Nicholls, Geoff K (2011) Communication / Conférence
  • Thumbnail
    Framing the issue of Multilevel Analysis of Networks vs. Multilevel Network Analysis Issue : how multilevel networks may be made to address missing data, the boundary specification issue and heterogeneity 
    Wang, Peng; Snijders, Tom; Robins, Garry; Lomi, Alessandro; Koskinen, Johan; Lazega, Emmanuel (2012-06) Communication / Conférence
  • Thumbnail
    Calibration procedures for approximate Bayesian credible sets 
    Lee, Jeong Eun; Nicholls, Geoff K; Ryder, Robin J. (2019) Article accepté pour publication ou publié
  • Thumbnail
    A Phylogenetic Model of the Evolution of Discrete Matrices for the Joint Inference of Lexical and Phonological Language Histories 
    Clarté, Grégoire; Ryder, Robin J. (2022) Document de travail / Working paper
Dauphine PSL Bibliothèque logo
Place du Maréchal de Lattre de Tassigny 75775 Paris Cedex 16
Phone: 01 44 05 40 94
Contact
Dauphine PSL logoEQUIS logoCreative Commons logo