ijsmr

ijsmr logo-pdf 1349088093

Comparison of Post Hoc Multiple Pairwise Testing Procedures as Applied to Small k-Group Logrank Tests
Pages 104-116
Moonseong Heo and Andrew C. Leon
DOI:
http://dx.doi.org/10.6000/1929-6029.2013.02.02.04
Published: 30 April 2013


Abstract: The logrank test is widely used to compare groups on distribution of survival time in the presence of censoring. There is no convention for post hoc pairwise comparisons after a significant omnibus k-group logrank test. This simulation study compares four post hoc pairwise testing procedures: Bonferroni, Dunn-Šidák,Hochberg, and unadjusted post hoc logrank test procedure. Evaluation criteria include, familywise type I error rate, correct decision rate, number of correctly rejected pairs, and false discovery rate. We demonstrated that when conditioned upon rejection of the omnibus test, multiplicity adjustments may be unnecessary and can be overly conservative when k is at most 4, or number of comparisons is no greater than 6. This is supported by the results that the performance of the unadjusted post hoclogrank test procedure is preferred over the others on all criteria except for the false discovery rate. The Hochberg procedure appears to be superior among the adjustments examined. Data from a clinical trial for suicide prevention illustrate these approaches where number of comparison groups is often limited.

Keywords: Logrank test, multiplicity adjustment, post hoc tests, survival analysis.
Download Full Article

ijsmr logo-pdf 1349088093

Observation-Driven Model for Zero-Inflated Daily Counts of Emergency Room Visit Data
Pages 220-228
Gary Sneddon, Wasimul Bari and M. Tariqul Hasan
DOI:
http://dx.doi.org/10.6000/1929-6029.2013.02.03.7
Published: 31 July 2013


Abstract: Time series data with excessive zeros frequently occur in medical and health studies. To analyze time series count data without excessive zeros, observation-driven Poisson regression models are commonly used in the literature. As handling excessive zeros in count data is not straightforward, observation-driven models are rarely used to analyze time series count data with excessive zeros. In this paper an observation-driven zero-inflated Poisson (ZIP) model for time series count data is proposed. This approach can accommodate an autoregressive serial dependence structure which commonly appears in time series. The estimation of the model parameters by using the quasi-likelihood estimating equation approach is discussed. To estimate the correlation parameters of the dependence structure, a moment approach is used. The proposed methodology is illustrated by applying it to a data set of daily emergency room visits due to bronchitis.

Keywords: Autocorrelation structure, non-stationary, observation-driven model, quasi-likelihood, zero-inflated Poisson.
Download Full Article

ijsmr logo-pdf 1349088093

The Methodology of Human Diseases Risk Prediction Tools
Pages 239-248
H. Mannan, R. Ahmed, M. Sanagou, S. Ivory and R. Wolfe
DOI:
http://dx.doi.org/10.6000/1929-6029.2013.02.03.9
Published: 31 July 2013


Abstract: Disease risk prediction tools are used for population screening and to guide clinical care. They identify which individuals have particularly elevated risk of disease. The development of a new risk prediction tool involves several methodological components including: selection of a general modelling framework and specific functional form for the new tool, making decisions about the inclusion of risk factors, dealing with missing data in those risk factors, and performing validation checks of a new tool’s performance. There have been many methodological developments of relevance to these issues in recent years. Developments of importance for disease detection in humans were reviewed and their uptake in risk prediction tool development illustrated. This review leads to guidance on appropriate methodology for future risk prediction development activities.

Keywords: Disease risk prediction, missing data, model validation, model updating, model utility.
Download Full Article

ijsmr logo-pdf 1349088093

Enriched-Data Problems and Essential Non-Identifiability
Pages 16-44
Geert Molenberghs, Edmund Njeru Njagi, Michael G. Kenward and Geert Verbeke
DOI:
http://dx.doi.org/10.6000/1929-6029.2012.01.01.02
Published: 24 September 2012


Abstract: There are two principal ways in which statistical models extend beyond the data available. First, the data may be coarsened, that is, what is actually observed is less detailed than what is planned, owing to, for example, attrition, censoring, grouping, or a combination of these. Second, the data may be augmented, that is, the observed data are hypothetically but conveniently supplemented with structures such as random effects, latent variables, latent classes, or component membership in mixture distributions. These two settings together will be referred to as enriched data. Reasons for modelling enriched data include the incorporation of substantive information, such as the need for predictions, advantages in interpretation, and mathematical and computational convenience. The fitting of models for enriched data combine evidence arising from empirical data with non-verifiable model components, i.e., that are purely assumption driven. This has important implications for the interpretation of statistical analyses in such settings. While widely known, the exploration and discussion of these issues is somewhat scattered. The user should be fully aware of the potential dangers and pitfalls that follows from this. Therefore, we provide a unified framework for enriched data and show in general that to any given model an entire class of models can be assigned, with all of its members producing the same fit to the observed data but arbitrary regarding the unobservable parts of the enriched data. The implications of this are explored for several specific settings, namely that of latent classes, finite mixtures, factor analysis, random-effects models, and incomplete data. The results are applied to a range of relevant examples.

Keywords: Compound-symmetry, empirical bayes, enriched data, exponential random effects, gamma random effects, linear mixed model, missing at random, missing completely at random, non-future dependence, pattern-mixture model, selection model, shared-parameter model..
Download Full Article