DATA MANIPULATION IN R
![Web scraping in R](/images/featured/web-scraping-r.png)
Web scraping with rvest
Import and export data
![Pattern matching in R](/images/featured/grepl-grep-r.png)
Pattern matching with grepl() and grep()
String manipulation
![Pattern matching and replacement in R](/images/featured/gsub-sub-r.png)
Pattern matching and replacement with gsub() and sub()
String manipulation
![colSums, rowSums, colMeans and rowMeans functions in R](/images/featured/row-column-sums-means-r.png)
Row and column sums and means
Data transformation
![Row and column names in R](/images/featured/rownames-colnames-r.png)
Row and column names
Data transformation
![Concatenate strings in R with paste and paste0 functions](/images/featured/paste-r.png)
Concatenate strings with paste and paste0
String manipulation
![The nchar function in R](/images/featured/nchar-r.png)
Count the number of characters with nchar
String manipulation
![Read SQL databases in R](/images/featured/sql-r.png)
Read SQL databases
Import and export data
![Learn how to use the lapply function in R programming language](/images/featured/lapply-r.png)
lapply function
apply family
![Read Excel file sheets into R](/images/featured/read-excel-r.png)
Read Excel files
Import and export data
![Join data frames in R with the merge function](/images/featured/merge-r.png)
Merge data frames
Data transformation
![Learn how to use the aggregate function in R](/images/featured/aggregate-r.png)
Aggregate
Data transformation
¿What is DATA MANIPULATION?
Data manipulation, also known as data wrangling, refers to the process of transforming and cleaning raw data into a structured format suitable for analysis. This process involves various operations such as filtering, sorting, aggregating, merging, reshaping, and transforming data to make it more organized, understandable, and ready for analysis. R provides several functions to perform these tasks, but dplyr
is one of the most popular and widely used R packages for data manipulation.
-
Base R
Data manipulation in base R involves using the core functions and methods provided by R's base package for handling, transforming, and manipulating data structures such as vectors, matrices, arrays, data frames, and lists. -
dplyr
dplyr
is an R package designed for efficient and user-friendly data manipulation. It provides a set of functions that streamline data wrangling tasks by offering a consistent grammar for manipulating data frames and data tables.