How to complete the code to replace NA with the median in R

Author: 陆离
Posted: 2024-06-08 12:35:48 (1 month ago)

5 replies

0#
不见你 | 2019-08-31 10-32

<div class="post-text" itemprop="text">
<p><code>library(data.table)</code></p>
<pre><code>dt <- data.table(
  title = c("Mr", "Mrs", "Miss", "Mrs", "Mr", "Mr", "Mr", "Master", "Mrs"),
  age = c(22, 38, 26, 35, 35, NA, 54, 2, 27)
)
# compute the median age within each title group
dt[, avg_age := median(age, na.rm = TRUE), by = "title"]
# fill missing ages with their group's median
dt[is.na(age), age := avg_age]
# drop the helper column
dt[, avg_age := NULL]
</code></pre>
</div>

1#
一生浮华 | 2019-08-31 10-32

<div class="post-text" itemprop="text">
<p>This may not be the most elegant way, but it works:</p>
<pre><code>title <- c("Mr", "Mrs", "Miss", "Mrs", "Mr", "Mr", "Mr", "Master", "Mrs")
age <- c(22, 38, 26, 35, 35, NA, 54, 2, 27)
df <- data.frame(title, age)

# get the medians by groups
medians <- aggregate(df$age, list(df$title), median, na.rm = TRUE)

# match the missing ages with the medians thanks to the groups
# (match() looks up each missing row's own group, so several NAs are handled too)
missing <- is.na(df$age)
df$age[missing] <- medians$x[match(df$title[missing], medians$Group.1)]
</code></pre>
</div>
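A minimal sketch of the same aggregate-based idea for the case where ages are missing in more than one group. It uses `match()` to look up the median for each missing row's own title. The second `NA` (on a "Mrs" row) is made-up test data added for illustration and is not part of the original post.

```r
# Sample data from the answer, with one extra NA added for illustration (assumption).
title <- c("Mr", "Mrs", "Miss", "Mrs", "Mr", "Mr", "Mr", "Master", "Mrs")
age   <- c(22, 38, 26, NA, 35, NA, 54, 2, 27)
df <- data.frame(title, age)

# Median age per title, ignoring missing values.
medians <- aggregate(df$age, list(df$title), median, na.rm = TRUE)

# For each row with a missing age, find the row of `medians` for its title
# and take that group's median.
missing <- is.na(df$age)
df$age[missing] <- medians$x[match(df$title[missing], medians$Group.1)]

print(df)
```

Comparing the group names with `==` instead would recycle the shorter vector, which lines up only when a single age is missing. `match()` avoids that by returning one index per missing row.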

2#
NetworkAttachedStorage | 2019-08-31 10-32

<div class="post-text" itemprop="text">
<p>Perhaps this <code>tidyverse</code> one-liner:</p>
<pre><code>library(dplyr)

agedata %>%
  group_by(title) %>%
  mutate(age = ifelse(is.na(age), median(age, na.rm = TRUE), age))
</code></pre>
</div>
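The one-liner above refers to `agedata`, which the answer does not define. Here is a self-contained sketch, assuming `agedata` has the same `title` and `age` columns used in the other replies and that the dplyr package is installed. The sample values are borrowed from those replies and the name `result` is only for illustration.

```r
library(dplyr)

# `agedata` is not defined in the answer; this sample is an assumption.
agedata <- data.frame(
  title = c("Mr", "Mrs", "Miss", "Mrs", "Mr", "Mr", "Mr", "Master", "Mrs"),
  age   = c(22, 38, 26, 35, 35, NA, 54, 2, 27)
)

result <- agedata %>%
  group_by(title) %>%
  # within each title, replace a missing age with that group's median
  mutate(age = ifelse(is.na(age), median(age, na.rm = TRUE), age)) %>%
  ungroup()  # drop the grouping so later steps work on the whole table

print(result)
```

If every age in a group is missing, `median(age, na.rm = TRUE)` is itself `NA`, so those rows stay missing.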

3#
蜡笔小辛 | 2019-08-31 10-32

<div class="post-text" itemprop="text">
<pre><code>library(plyr)

zz <- "group traits
BSPy01-10 NA
BSPy01-10 7.3
BSPy01-10 7.3
BSPy01-11 5.3
BSPy01-11 5.4
BSPy01-11 5.6
BSPy01-11 NA
BSPy01-11 NA
BSPy01-11 4.8
BSPy01-12 8.1
BSPy01-12 6.0
BSPy01-12 6.0
BSPy01-13 6.1"
Data <- read.table(text = zz, header = TRUE)

# replace missing values in x with fun() applied to the observed values
impute <- function(x, fun) {
  missing <- is.na(x)
  replace(x, missing, fun(x[!missing]))
}

# impute within each group
ddply(Data, ~ group, transform, traits = impute(traits, median))
</code></pre>
</div>
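If plyr is not available, the same `impute` helper can be applied per group with base R's `ave()`. This is a sketch of an alternative, not part of the original answer, and the small `Data` sample below is made up for illustration.

```r
# Same helper as in the answer: fill missing entries with fun() of the observed ones.
impute <- function(x, fun) {
  missing <- is.na(x)
  replace(x, missing, fun(x[!missing]))
}

# Small made-up sample (assumption): two groups with one NA each.
Data <- data.frame(
  group  = c("a", "a", "a", "b", "b", "b"),
  traits = c(NA, 7.3, 7.3, 5.3, NA, 5.6)
)

# ave() applies FUN within each group and returns a vector in the original row order.
Data$traits <- ave(Data$traits, Data$group, FUN = function(x) impute(x, median))

print(Data)
```

Because `ave()` preserves the row order, the result can be assigned straight back to the column without re-sorting or merging.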
