DATA MANIPULATION IN R
![Select columns in R with dplyr](/images/featured/select-dplyr-r.png)
Select columns with dplyr
dplyr
![Filter rows in R with dplyr](/images/featured/filter-dplyr-r.png)
Filter rows with dplyr
dplyr
![Order rows in R with dplyr](/images/featured/arrange-dplyr-r.png)
Order rows with the arrange() function from dplyr
dplyr
![Rename columns in R with dplyr](/images/featured/rename-dplyr-r.png)
Rename columns with the rename() function from dplyr
dplyr
![Create and modify columns in R with dplyr](/images/featured/mutate-dplyr-r.png)
Create and modify columns with the mutate() function from dplyr
dplyr
![The summarise() function from dplyr](/images/featured/summarise-dplyr-r.png)
Create statistical summaries with the summarise() function from dplyr
dplyr
![The table() and prop.table() functions in R](/images/featured/contingency-table-r.png)
Tables with table() and prop.table()
Data transformation
![Remove leading and trailing whitespaces in R with trimws()](/images/featured/trimws-r.png)
Remove leading and trailing whitespaces with trimws()
String manipulation
![Lowercase and uppercase in R with tolower(), toupper() and chartr()](/images/featured/tolower-toupper-r.png)
Lowercase and uppercase with tolower() and toupper()
String manipulation
![The substring() and substr() functions in R](/images/featured/substring-r.png)
Extract and replace substrings with substring() and substr()
String manipulation
![rbind() and cbind() functions in R](/images/featured/rbind-cbind-r.png)
rbind() and cbind() functions
Data transformation
![The strsplit() function in R](/images/featured/strsplit-r.png)
Split strings with strsplit()
String manipulation
¿What is DATA MANIPULATION?
Data manipulation, also known as data wrangling, refers to the process of transforming and cleaning raw data into a structured format suitable for analysis. This process involves various operations such as filtering, sorting, aggregating, merging, reshaping, and transforming data to make it more organized, understandable, and ready for analysis. R provides several functions to perform these tasks, but dplyr
is one of the most popular and widely used R packages for data manipulation.
-
Base R
Data manipulation in base R involves using the core functions and methods provided by R's base package for handling, transforming, and manipulating data structures such as vectors, matrices, arrays, data frames, and lists. -
dplyr
dplyr
is an R package designed for efficient and user-friendly data manipulation. It provides a set of functions that streamline data wrangling tasks by offering a consistent grammar for manipulating data frames and data tables.