• About
  • Documentation

  • More Universes
  • Recent Updates
  • Leader board

  • All repositories
  • All packages
  • All articles
  • All datasets
  • All system Libraries
atanubhattacharjee
  • Builds
  • Packages
  • Articles
  • Datasets
  • Contribution
  • Badges
  • API
  • Feed

Links toatanubhattacharjee

highMLR - Machine Learning Feature Selection for High Dimensional Survival Data

A unified, flexible framework for high dimensional feature selection in the presence of a survival outcome. Provides multiple machine learning approaches (Cox elastic net, random survival forest, accelerated oblique random survival forest, gradient-boosted Cox, stability selection, classical univariate Cox screening, pseudo- observation bridging to arbitrary regression learners, and Fine-Gray competing risks selection) under a single interface. Adds causal survival forest estimation of heterogeneous treatment effects on survival (experimental), conformal survival prediction with finite- sample coverage guarantees, and time-dependent 'SHAP' explanations via 'SurvSHAP(t)'. Methodology is based on regularised Cox regression (2011) <doi:10.18637/jss.v039.i05>, random survival forests (2008) <doi:10.1214/08-AOAS169>, oblique random survival forests (2024) <doi:10.1080/10618600.2023.2231048>, stability selection (2010) <doi:10.1111/j.1467-9868.2010.00740.x>, causal survival forests (2023) <doi:10.1111/rssb.12538>, time-dependent survival explanations (2023) <doi:10.1016/j.knosys.2022.110234>, conformal survival prediction (2023) <doi:10.1093/biomet/asad043>, the Fine-Gray model for competing risks (1999) <doi:10.1080/01621459.1999.10474144>, and pseudo-observation regression (2010) <doi:10.1177/0962280209105020>.

Last updated

2.70 score 5 scripts 255 downloads

MIIPW - IPW and Mean Score Methods for Time-Course Missing Data

Contains functions for data analysis of Repeated measurement using GEE. Data may contain missing value in response and covariates. For parameter estimation through Fisher Scoring algorithm, Mean Score and Inverse Probability Weighted method combining with Multiple Imputation are used when there is missing value in covariates/response. Reference for mean score method, inverse probability weighted method is Wang et al(2007)<doi:10.1093/biostatistics/kxl024>.

Last updated

2.00 score 5 scripts 305 downloads

dscoreMSM - Survival Proximity Score Matching in Multi-State Survival Model

Implements survival proximity score matching in multi-state survival models. Includes tools for simulating survival data and estimating transition-specific coxph models with frailty terms. The primary methodological work on multistate censored data modeling using propensity score matching has been published by Bhattacharjee et al.(2024) <doi:10.1038/s41598-024-54149-y>.

Last updated

jagscpp

2.00 score 208 downloads

jmSurface - Semi-Parametric Association Surfaces for Joint Longitudinal-Survival Models

Implements interpretable multi-biomarker fusion in joint longitudinal-survival models via semi-parametric association surfaces. Provides a two-stage estimation framework where Stage 1 fits mixed-effects longitudinal models and extracts Best Linear Unbiased Predictors ('BLUP's), and Stage 2 fits transition-specific penalized Cox models with tensor-product spline surfaces linking latent biomarker summaries to transition hazards. Supports multi-state disease processes with transition-specific surfaces, Restricted Maximum Likelihood ('REML') smoothing parameter selection, effective degrees of freedom ('EDF') diagnostics, dynamic prediction of transition probabilities, and three interpretability visualizations (surface plots, contour heatmaps, marginal effect slices). Methods are described in Bhattacharjee (2025, under review).

Last updated

1.70 score 152 downloads

jmBIG - Joint Longitudinal and Survival Model for Big Data

Provides analysis tools for big data where the sample size is very large. It offers a suite of functions for fitting and predicting joint models, which allow for the simultaneous analysis of longitudinal and time-to-event data. This statistical methodology is particularly useful in medical research where there is often interest in understanding the relationship between a longitudinal biomarker and a clinical outcome, such as survival or disease progression. This can be particularly useful in a clinical setting where it is important to be able to predict how a patient's health status may change over time. Overall, this package provides a comprehensive set of tools for joint modeling of BIG data obtained as survival and longitudinal outcomes with both Bayesian and non-Bayesian approaches. Its versatility and flexibility make it a valuable resource for researchers in many different fields, particularly in the medical and health sciences.

Last updated

1.48 score 1 dependents 1 scripts 213 downloads

afthd - Accelerated Failure Time for High Dimensional Data with MCMC

Functions for Posterior estimates of Accelerated Failure Time(AFT) model with MCMC and Maximum likelihood estimates of AFT model without MCMC for univariate and multivariate analysis in high dimensional gene expression data are available in this 'afthd' package. AFT model with Bayesian framework for multivariate in high dimensional data has been proposed by Prabhash et al.(2016) <doi:10.21307/stattrans-2016-046>.

Last updated

jagscpp

1.48 score 1 dependents 7 scripts 304 downloads

JMbdirect - Joint Model for Longitudinal and Multiple Time to Events Data

Provides model fitting, prediction, and plotting for joint models of longitudinal and multiple time-to-event data, including methods from Rizopoulos (2012) <doi:10.1201/b12208>. Useful for handling complex survival and longitudinal data in clinical research.

Last updated

1.00 score 586 downloads

flassomsm - Penalized Estimation for Multi-State Models with Lasso and Fused Penalties

Provides a suite of methods for detecting influential subjects in longitudinal datasets, particularly when observations occur at irregular time points. The methods identify individuals whose response trajectories deviate significantly from the population pattern, enabling detection of anomalies or subjects exerting undue influence on model outcomes.

Last updated

1.00 score 144 downloads

ILRCM - Convert Irregular Longitudinal Data to Regular Intervals and Perform Clustering

Convert irregularly spaced longitudinal data into regular intervals for further analysis, and perform clustering using advanced machine learning techniques. The package is designed for handling complex longitudinal datasets, optimizing them for research in healthcare, demography, and other fields requiring temporal data modeling.

Last updated

1.00 score 194 downloads

MIGEE - Impute Missing Values and Fitting Linear Mixed Effect Model

Implements methods for estimating generalized estimating equations (GEE) with advanced options for flexible modeling and handling missing data. This package provides tools to fit and analyze GEE models for longitudinal data, allowing users to address missingness using a variety of imputation techniques. It supports both univariate and multivariate modeling, visualization of missing data patterns, and facilitates the transformation of data for efficient statistical analysis. Designed for researchers working with complex datasets, it ensures robust estimation and inference in longitudinal and clustered data settings.

Last updated

1.00 score 166 downloads

SurviMChd - High Dimensional Survival Data Analysis with Markov Chain Monte Carlo

High dimensional survival data analysis with Markov Chain Monte Carlo(MCMC). Currently supports frailty data analysis. Allows for Weibull and Exponential distribution. Includes function for interval censored data.

Last updated

jagscpp

1.00 score 194 downloads

QoLMiss - Scales Score Calculation from Quality of Life Data

There are three functions: qol, miss_qol and miss_patient takes input of the data set containing the answers of QOL questionnaire. It will compute the three types of domain based scale scores: Global, Functional, and Symptoms. In case of missing data, the miss_qol and miss_patient functions will make the required changes and then calculate the domain-wise scale scores. Finally, provide an output replacing the question columns with the domain-based scale scores in the original data set.

Last updated

1.00 score 184 downloads

designsize - Sample Size Calculation of Various Study Designs

Different sample size calculations with different study designs. These techniques are explained by Chow (2007) <doi:10.1201/9781584889830>.

Last updated

1.00 score 236 downloads

SurvHiDim - High Dimensional Survival Data Analysis

High dimensional time to events data analysis with variable selection technique. Currently support LASSO, clustering and Bonferroni's correction.

Last updated

1.00 score 3 scripts 195 downloads

longit - High Dimensional Longitudinal Data Analysis Using MCMC

High dimensional longitudinal data analysis with Markov Chain Monte Carlo(MCMC). Currently support mixed effect regression with or without missing observations by considering covariance structures. It provides estimates by missing at random and missing not at random assumptions. In this R package, we present Bayesian approaches that statisticians and clinical researchers can easily use. The functions' methodology is based on the book "Bayesian Approaches in Oncology Using R and OpenBUGS" by Bhattacharjee A (2020) <doi:10.1201/9780429329449-14>.

Last updated

jagscpp

1.00 score 193 downloads