03. Filtering Vectors all(), any(), which(), subset()

When working with vectors, you'll often need to filter out values that meet a criteria. In R, this process is known as filtering, and the platform provides several functions to help you extract subsets of data.

The `any()` Function

The `any()` function returns `TRUE` or `FALSE`, depending on whether all arguments match that criteria. The `TRUE` and `FALSE` are of type `logical`.

``````> x <- 1:100
> any(x > 101)
FALSE
> any(x == 2)
TRUE
> any(x <= 50)
TRUE``````

The `all()` Function

On the flip side, we can use the `all()` function to test if all values meet a certain criteria.

``````> x < 1:100
> all(x > 40)
FALSE
> all(x > 0)
TRUE``````

Comparison Operations

With vectors, we may run comparison operations to return vector containing logical values. For example:

``````> x <- c(1,2,3,4,5,6)
> x > 3
[1] FALSE FALSE FALSE  TRUE  TRUE  TRUE``````

As you can see, we are returned a logical vector containing TRUE and FALSE values, depending on how that positional element was evaluated.

How is this useful? We can use these resulting logical vectors to pull out subvectors. Let's say we only want to pull out odd values - we can write:

``````> x <- c(12,423,52,21,324)
> x[x %% 2 == 1]
[1] 423  21``````

The `x %% 2 == 1` returns a logical vectors. All positions where `TRUE` is held are then printed.

We can further use this feature to replace values that meet a certain criteria:

``````> x <- c(1,2,3,4,5,6)
> x[x*x>20] = 1337
> x
[1]   1    2    3    4 1337 1337``````

Pulling out subvectors with `subset()`

In the methods mentioned above, `NA` values are included in the subvector, no matter the condition.

``````> x <- c(1,2,3,NA,5,6)
> x[x>2]
[1] 3 NA 5 6 ``````

In the case when you need to exclude the `NA`, you may use the `subset()` function.

``````> subset(x, x>2)
[1] 3 5 6``````

Pulling out indicies with `which()`

If you need to pull out not the actual values but just the indicies in which the values of a certain condition reside, then use the `which()` function. This will return all the indicies that match a certain criteria.

``````> z <- c(1,2,3,4,5,6)
> which(z > 3)
[1] 4 5 6 ``````

Learn to be a Pythonista!

Learn Python in One Day

Have you always wanted to learn computer programming but are afraid it'll be too difficult for you? Or you're familiar with some programming but are interested in learning Python fast? Then this book is for you. You no longer have to waste your time and money learning Python from lengthy books, expensive online courses or complicated Python tutorials.

\$ Check price
(185+ reviews)

Take your Linux skills to the next level!

Linux for Beginners

Linux for Beginners doesn't make any assumptions about your background or knowledge of Linux. You need no prior knowledge to benefit from this book. You will be guided step by step using a logical and systematic approach. As new concepts, commands, or jargon are encountered they are explained in plain language, making it easy for anyone to understand.

\$ Check price
(101+ reviews)