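Create a variable that is 0 at the first non-NA value of another variable, then count up/down from 0 for the other rows, grouped by a third variable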

Author: 淋了一整夜的雨
Posted: 2025-07-09 09:48:05 (3 days ago)

3 replies

0#
句号了哦哦 | 2019-08-31 10-32

<div class="post-text" itemprop="text">
<p>We can find the row index at which the first non-NA <code>score</code> occurs in each group, then create a sequence from <code>1 - index</code> to <code>n() - index</code> for that group, so the row holding the first non-NA score gets 0.</p>
<pre><code>library(dplyr)

df %>%
  group_by(country) %>%
  # which.max() on a logical vector returns the position of the first TRUE,
  # i.e. the first non-NA score in the group
  mutate(index = which.max(!is.na(score)),
         years_from_implementation = (1 - index[1]):(n() - index[1])) %>%
  select(-index)

#    country  year score years_from_implementation
#    <chr>   <dbl> <dbl>                     <int>
#  1 US       1999    NA                        -4
#  2 US       2000    NA                        -3
#  3 US       2001    NA                        -2
#  4 US       2002    NA                        -1
#  5 US       2003   426                         0
#  6 US       2004    NA                         1
#  7 US       2005    NA                         2
#  8 US       2006   430                         3
#  9 US       2007    NA                         4
# 10 Mex      2000   450                         0
# 11 Mex      2001    NA                         1
</code></pre>
</div>
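The thread never shows how `df` is built, so the snippet above cannot be run as posted. Below is a minimal sketch that rebuilds the data from the printed output (the construction of `df` is an assumption, not the asker's original code) and computes the same offset with `row_number()` instead of an explicit sequence. The names `first_idx` and `result` are hypothetical, introduced only for this example.

```r
library(dplyr)

# Assumed sample data, reconstructed from the output printed in the answer
df <- data.frame(
  country = c(rep("US", 9), rep("Mex", 2)),
  year    = c(1999:2007, 2000:2001),
  score   = c(NA, NA, NA, NA, 426, NA, NA, 430, NA, 450, NA),
  stringsAsFactors = FALSE
)

result <- df %>%
  group_by(country) %>%
  # position of the first non-NA score within the group
  mutate(first_idx = which.max(!is.na(score)),
         # row position minus that index: 0 at the first non-NA score
         years_from_implementation = row_number() - first_idx) %>%
  ungroup() %>%
  select(-first_idx)

print(result)
```

Note that `which.max()` returns 1 when a group has no non-NA score at all, so a group with only missing scores would be counted from its first row.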

1#
My☀ | 2019-08-31 10-32

<div class="post-text" itemprop="text">
<p>Here is a <code>dplyr</code> option: within each group, subtract the position of the first non-NA <code>score</code> from the row number.</p>
<pre><code>library(dplyr)

df %>%
  group_by(country) %>%
  # which() returns every position that matches, so this assumes the first
  # non-NA score value does not occur again later in the same group
  mutate(years_from_implementation =
           1:n() - which(score == first(score[!is.na(score)]))) %>%
  ungroup()

## A tibble: 11 x 4
#    country  year score years_from_implementation
#    <chr>   <dbl> <dbl>                     <int>
#  1 US       1999    NA                        -4
#  2 US       2000    NA                        -3
#  3 US       2001    NA                        -2
#  4 US       2002    NA                        -1
#  5 US       2003   426                         0
#  6 US       2004    NA                         1
#  7 US       2005    NA                         2
#  8 US       2006   430                         3
#  9 US       2007    NA                         4
# 10 Mex      2000   450                         0
# 11 Mex      2001    NA                         1
</code></pre>
</div>
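One caveat with matching on the score value: if the first non-NA score appears more than once in a group, `which(score == ...)` returns several positions and the subtraction is recycled. The sketch below is a possible variant, not the answerer's code, that takes the position of the first non-NA entry directly. The data frame `df2` is hypothetical and made up to exercise two edge cases: a repeated score and a group with no scores.

```r
library(dplyr)

# Hypothetical data: 426 occurs twice in "US"; "CA" has no scores at all
df2 <- data.frame(
  country = c(rep("US", 4), rep("CA", 2)),
  year    = c(2001:2004, 2001:2002),
  score   = c(NA, 426, NA, 426, NA, NA),
  stringsAsFactors = FALSE
)

result2 <- df2 %>%
  group_by(country) %>%
  # which(...)[1] is the first non-NA position, or NA if the group has none
  mutate(years_from_implementation =
           row_number() - which(!is.na(score))[1]) %>%
  ungroup()

print(result2)
```

With this variant a group without any score gets `NA` in `years_from_implementation` rather than a count starting from an arbitrary row.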
