Programming Language/R

R ๊ธฐ์ดˆ ๋ช…๋ น์–ด(c(), factor(), class(), levels(), as.numeric(), is.numeric())

chaerlo127 2022. 4. 7. 23:49
728x90

ํ•™๊ต์—์„œ ๋ฐ์ดํ„ฐ ๋งˆ์ด๋‹์„ ๋ฐฐ์šฐ๋ฉด์„œ, R ์–ธ์–ด์—๋„ ๋ฐฐ์šฐ๊ณ  ์žˆ๋‹ค. ์ƒˆ๋กญ๊ฒŒ ์–ธ์–ด๋ฅผ ๋ฐฐ์šฐ๋‹ค ๋ณด๋‹ˆ ์–ด๋ ค์›€์„ ๋А๋ผ๊ณ  ์žˆ์–ด์„œ ๋ธ”๋กœ๊ทธ์— ์ž‘์„ฑํ•˜๋ฉด์„œ ๋ณต์Šตํ•˜๋Š” ์‹œ๊ฐ„์„ ๊ฐ€์ ธ๋ณด๊ณ ์ž ํ•œ๋‹ค.

 

โœจ R ์ด๋ž€

R์€ ํ†ต๊ณ„ ๊ณ„์‚ฐ๊ณผ ๊ทธ๋ž˜ํ”ฝ์„ ์œ„ํ•œ ํ”„๋กœ๊ทธ๋ž˜๋ฐ ์–ธ์–ด์ด์ž ์†Œํ”„ํŠธ์›จ์–ด ํ™˜๊ฒฝ

์˜คํ”ˆ์†Œ์Šค๋กœ ๋ฌด๋ฃŒ

๋ฐ์ดํ„ฐ ๋ถ„์„œ๊ธฐ์šฉ์œผ๋กœ, ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ, ํ†ต๊ณ„ ๋ถ„์„์—์„œ ์‚ฌ์šฉ

๋Œ€์†Œ๋ฌธ์ž ๊ตฌ๋ถ„

 

R์€ ๊ทธํŒจํ”ฝ ๊ธฐ๋Šฅ์œผ๋กœ ์ˆ˜ํ•™ ๊ธฐํ˜ธ๋ฅผ ํฌํ•จํ•  ์ˆ˜ ์žˆ๋Š” ์ถœํŒ๋ฌผ ์ˆ˜์ค€์˜ ๊ทธ๋ž˜ํ”„๋ฅผ ์ œ๊ณตํ•˜์—ฌ ๋„ํ‘œ๋ฅผ ๊ทธ๋ฆฌ๋Š”๋ฐ ์œ ์šฉํ•˜๋‹ค.

 

โœจ ๋ณ€์ˆ˜

- ์—ฐ์† ๋ณ€์ˆ˜ (Continuous variable)

 ์—ฐ์†์ ์ด๋ฉฐ, ํฌ๊ธฐ๋ฅผ ๋‚˜ํƒ€๋‚ธ๋‹ค.

= Numberic variable, quantitative variable (์–‘์  ๋ณ€์ˆ˜)

 

- ๋ฒ”์ฃผ ๋ณ€์ˆ˜ (Categorical variable)

 ๋Œ€์ƒ ๋ถ„๋ฅ˜ (์—ฌ์„ฑ/๋‚จ์„ฑ)

 ์ˆซ์ž ํ˜•ํƒœ์—ฌ๋„, ์‚ฐ์ˆ  ํ˜•ํƒœ๋กœ ๊ณ„์‚ฐ ์˜๋ฏธ๊ฐ€ ์—†์Œ

 R์—์„œ factor๋กœ ๋‚˜ํƒ€๋ƒ„

 = Nomical variable

 


โœจ R ๋ช…๋ น์–ด

 

1. ๋ณ€์ˆ˜ ์„ ์–ธ

# ๋ณ€์ˆ˜ ์„ ์–ธ 
# a values์— number 2 ์‚ฝ์ž…
a <- 2
b <- 2
a + b # 2 + 2 = 4

 

2. ๋ฒกํ„ฐ ๋ณ€์ˆ˜

a <- c(1, 2, 3)
b <- c(1, 3, 5)
a + b # ๊ฐ€๋Šฅ

b <- 2
a + b # ๊ฐ€๋Šฅ


# 1์—์„œ ๋ถ€ํ„ฐ ๊ฐ™์€ ์ฐจ์ด๋กœ 6๊นŒ์ง€ ๋“ค์–ด๊ฐ„๋‹ค. (1, 2, 3, 4, 5, 6)
d <- c(1:6)
a <- seq(1,6)
b <- seql(1, 6, by =2) # ๋‘ ์นธ ์”ฉ ๋„์›Œ์ง. 1, 3, 5

# numeric variable
a <- c(1, 2, 3)
# factor variable (categorical variable)
b <- factor(c(1, 2, 3, 4, 5))

c <- c("hi", "nice", "to", "meet", "you")
paste(c, collapse = " ,") # collapse
  • paste(c, collapse = " ,") # collapse

 

3. ๋ณ€์ˆ˜ ํƒ€์ž…/factor ๊ตฌ์„ฑ ๋ฒ”์ฃผ ํ™•์ธ

# ๋ณ€์ˆ˜ ํƒ€์ž… ํ™•์ธ
a <- factor(c(1, 2, 3, 4, 5))
class(a) # factor
a <- 1
class(a) # numeric

# factor ๋ณ€์ˆ˜ ๊ตฌ์„ฑ ๋ฒ”์ฃผ ํ™•์ธ
levels(a) # [1] "1" "2" "3" "4" "5"

 

4. ๋ณ€์ˆ˜ ํƒ€์ž… ๋ณ€ํ™˜

  • as.numeric(x) : numeric ์œผ๋กœ
  • as.factor(x) : factor๋กœ
  • as.character(x) : charactor๋กœ
  • as.Date(x) : date๋กœ
  • as.data.frame(x) : data frame์œผ๋กœ
  • as.array(x) : ๋ฐฐ์—ด๋กœ
  • as.matrix: ํ–‰๋ ฌ๋กœ
a <- factor(c(1, 2, 3, 4, 5))
class(a) # factor

a <- as.numeric(a) # factor -> numeric
class(a) # numeric

๋ณ€์ˆ˜์˜ ํƒ€์ž…์ด ๋ณ€ํ™”๋œ ๊ฒƒ์„ ํ™•์ธํ•  ์ˆ˜ ์žˆ๋‹ค.

 

ํ˜น์€ ๋ฐ์ดํ„ฐ ํƒ€์ž… ํ™•์ธ ํ•จ์ˆ˜๋ฅผ ํ†ตํ•ด ํ™•์ธํ•  ์ˆ˜๋„ ์žˆ๋‹ค.

 

  • is.numeric(x)
  • is.integer(x)
  • is.double(x)
  • is.character(x)
  • is.logical(x)
  • is.complex(x)
  • is.null(x)
  • is.na(x)
  • is.infinite(x)
  • is.finite(x)

์š”์ƒˆ ์‹œํ—˜๊ธฐ๊ฐ„์ด๋ผ๋˜๋ฐ,,,, ๋‚˜๋„ ์ด์ œ ์Šฌ์Šฌ ์‹œ์ž‘ํ•ด์•ผ์ง€

 

728x90