R>> vctrs>> 返回
项目作者: r-lib

项目描述 :
Generic programming with typed R vectors
高级语言: R
项目地址: git://github.com/r-lib/vctrs.git
创建时间: 2016-09-06T21:32:53Z
项目社区:https://github.com/r-lib/vctrs

开源协议:Other

下载


vctrs

Codecov test
coverage
Lifecycle:
maturing
R-CMD-check

There are three main goals to the vctrs package, each described in a
vignette:

  • To propose vec_size() and vec_ptype() as alternatives to
    length() and class(); vignette("type-size"). These definitions
    are paired with a framework for size-recycling and type-coercion.
    ptype should evoke the notion of a prototype, i.e. the original or
    typical form of something.

  • To define size- and type-stability as desirable function properties,
    use them to analyse existing base functions, and to propose better
    alternatives; vignette("stability"). This work has been
    particularly motivated by thinking about the ideal properties of
    c(), ifelse(), and rbind().

  • To provide a new vctr base class that makes it easy to create new
    S3 vectors; vignette("s3-vector"). vctrs provides methods for many
    base generics in terms of a few new vctrs generics, making
    implementation considerably simpler and more robust.

vctrs is a developer-focussed package. Understanding and extending vctrs
requires some effort from developers, but should be invisible to most
users. It’s our hope that having an underlying theory will mean that
users can build up an accurate mental model without explicitly learning
the theory. vctrs will typically be used by other packages, making it
easy for them to provide new classes of S3 vectors that are supported
throughout the tidyverse (and beyond). For that reason, vctrs has few
dependencies.

Installation

Install vctrs from CRAN with:

  1. install.packages("vctrs")

Alternatively, if you need the development version, install it with:

  1. # install.packages("pak")
  2. pak::pak("r-lib/vctrs")

Usage

  1. library(vctrs)
  2. # Sizes
  3. str(vec_size_common(1, 1:10))
  4. #> int 10
  5. str(vec_recycle_common(1, 1:10))
  6. #> List of 2
  7. #> $ : num [1:10] 1 1 1 1 1 1 1 1 1 1
  8. #> $ : int [1:10] 1 2 3 4 5 6 7 8 9 10
  9. # Prototypes
  10. str(vec_ptype_common(FALSE, 1L, 2.5))
  11. #> num(0)
  12. str(vec_cast_common(FALSE, 1L, 2.5))
  13. #> List of 3
  14. #> $ : num 0
  15. #> $ : num 1
  16. #> $ : num 2.5

Motivation

The original motivation for vctrs comes from two separate but related
problems. The first problem is that base::c() has rather undesirable
behaviour when you mix different S3 vectors:

  1. # combining factors makes integers
  2. c(factor("a"), factor("b"))
  3. #> [1] 1 1
  4. # combining dates and date-times gives incorrect values; also, order matters
  5. dt <- as.Date("2020-01-01")
  6. dttm <- as.POSIXct(dt)
  7. c(dt, dttm)
  8. #> [1] "2020-01-01" "4321940-06-07"
  9. c(dttm, dt)
  10. #> [1] "2019-12-31 19:00:00 EST" "1970-01-01 00:04:22 EST"

This behaviour arises because c() has dual purposes: as well as its
primary duty of combining vectors, it has a secondary duty of stripping
attributes. For example, ?POSIXct suggests that you should use c()
if you want to reset the timezone.

The second problem is that dplyr::bind_rows() is not extensible by
others. Currently, it handles arbitrary S3 classes using heuristics, but
these often fail, and it feels like we really need to think through the
problem in order to build a principled solution. This intersects with
the need to cleanly support more types of data frame columns, including
lists of data frames, data frames, and matrices.