项目作者: ngreifer

项目描述 :
Covariate Balance Tables and Plots - An R package for assessing covariate balance
高级语言: R
项目地址: git://github.com/ngreifer/cobalt.git
创建时间: 2016-07-14T21:07:03Z
项目社区:https://github.com/ngreifer/cobalt

开源协议:

下载


" class="reference-link">cobalt: Covariate Balance Tables and Plots

CRAN_Status_Badge CRAN_Downloads_Badge

Overview

Welcome to cobalt, which stands for Covariate Balance
Tables (and Plots). cobalt allows users to assess balance on
covariate distributions in preprocessed groups generated through
weighting, matching, or subclassification, such as by using the
propensity score. cobalt’s primary function is bal.tab(), which
stands for “balance table”, and is meant to replace (or supplement) the
balance assessment tools found in other R packages. To examine how
bal.tab() integrates with these packages and others, see the help file
for bal.tab() with ?bal.tab, which links to the methods used for
each package. Each page has examples of how bal.tab() is used with the
package. There are also five vignettes detailing the use of cobalt,
which can be accessed at vignette("cobalt"): one for basic uses of
cobalt, one for the use of cobalt with additional packages, one for
the use of cobalt with multiply imputed and/or clustered data, one for
the use of cobalt with longitudinal treatments, and one for the use of
cobalt to generate publication-ready plots. Currently, cobalt is
compatible with output from MatchIt, twang, Matching, optmatch,
CBPS, ebal, WeightIt, designmatch, sbw, MatchThem, and cem
as well as data not processed through these packages.

For more information, check out the cobalt
website!

Why cobalt?

Most of the major conditioning packages contain functions to assess
balance; so why use cobalt at all? cobalt arose out of several
desiderata when using these packages: to have standardized measures that
were consistent across all conditioning packages, to allow for
flexibility in the calculation and display of balance measures, and to
incorporate recent methodological recommendations in the assessment of
balance. In addition, cobalt has unique plotting capabilities that
make use of ggplot2 in R for balance assessment and reporting.

Because conditioning methods are spread across several packages which
each have their idiosyncrasies in how they report balance (if at all),
comparing the resulting balance from various conditioning methods can be
a challenge. cobalt unites these packages by providing a single,
flexible tool that intelligently processes output from any of the
conditioning packages and provides the user with both useful defaults
and customizable options for display and calculation. cobalt also
allows for balance assessment on data not generated through any of the
conditioning packages. In addition, cobalt has tools for assessing and
reporting balance for clustered data sets, data sets generated through
multiple imputation, and data sets with a continuous treatment variable,
all features that exist in very limited capacities or not at all in
other packages.

A large focus in developing cobalt was to streamline output so that
only the most useful, non-redundant, and complete information is
displayed, all at the user’s choice. Balance statistics are intuitive,
methodologically informed, and simple to interpret. Visual displays of
balance reflect the goals of balance assessment rather than being steps
removed. While other packages have focused their efforts on processing
data, cobalt only assesses balance, and does so particularly well.

New features are being added all the time, following the cutting edge of
methodological work on balance assessment. As new packages and methods
are developed, cobalt will be ready to integrate them to further our
goal of simple, unified balance assessment.

Examples

Below are examples of cobalt’s primary functions:

  1. library("cobalt")
  2. data("lalonde", package = "cobalt")
  3. #Nearest neighbor matching with MatchIt
  4. m.out <- MatchIt::matchit(treat ~ age + educ + race + married +
  5. nodegree + re74 + re75,
  6. data = lalonde)
  7. #Checking balance before and after matching:
  8. bal.tab(m.out, thresholds = c(m = .1), un = TRUE)
  1. #> Balance Measures
  2. #> Type Diff.Un Diff.Adj M.Threshold
  3. #> distance Distance 1.7941 0.9739
  4. #> age Contin. -0.3094 0.0718 Balanced, <0.1
  5. #> educ Contin. 0.0550 -0.1290 Not Balanced, >0.1
  6. #> race_black Binary 0.6404 0.3730 Not Balanced, >0.1
  7. #> race_hispan Binary -0.0827 -0.1568 Not Balanced, >0.1
  8. #> race_white Binary -0.5577 -0.2162 Not Balanced, >0.1
  9. #> married Binary -0.3236 -0.0216 Balanced, <0.1
  10. #> nodegree Binary 0.1114 0.0703 Balanced, <0.1
  11. #> re74 Contin. -0.7211 -0.0505 Balanced, <0.1
  12. #> re75 Contin. -0.2903 -0.0257 Balanced, <0.1
  13. #>
  14. #> Balance tally for mean differences
  15. #> count
  16. #> Balanced, <0.1 5
  17. #> Not Balanced, >0.1 4
  18. #>
  19. #> Variable with the greatest mean difference
  20. #> Variable Diff.Adj M.Threshold
  21. #> race_black 0.373 Not Balanced, >0.1
  22. #>
  23. #> Sample sizes
  24. #> Control Treated
  25. #> All 429 185
  26. #> Matched 185 185
  27. #> Unmatched 244 0
  1. #Examining distributional balance with plots:
  2. bal.plot(m.out, var.name = "educ")
  3. bal.plot(m.out, var.name = "distance",
  4. mirror = TRUE, type = "histogram")


  1. #Generating a Love plot to report balance:
  2. love.plot(m.out, stats = c("mean.diffs", "variance.ratios"),
  3. thresholds = c(m = .1, v = 2), abs = TRUE,
  4. binary = "std",
  5. var.order = "unadjusted")

Please remember to cite this package when using it to analyze data. For
example, in a manuscript, you could write: “Matching was performed using
the Matching package (Sekhon, 2011), and covariate balance was
assessed using cobalt (Greifer, 2024), both in R (R Core Team, 2023).”
Use citation("cobalt") to generate a bibliographic reference for the
cobalt package.

Bugs appear in cobalt occasionally, often found by users. Please
report any bugs at https://github.com/ngreifer/cobalt/issues. To
install the latest development version of cobalt, which may have
removed a bug you’re experiencing, use the following code:

  1. devtools::install_github("ngreifer/cobalt")