* Cantinho Satkeys

Refresh History
  • FELISCUNHA: Votos de um santo domingo para todo o auditório  4tj97u<z
    03 de Novembro de 2024, 10:49
  • j.s.: bom fim de semana  43e5r6 49E09B4F
    02 de Novembro de 2024, 08:37
  • j.s.: ghyt74 a todos  4tj97u<z
    02 de Novembro de 2024, 08:36
  • FELISCUNHA: ghyt74   49E09B4F  e bom feriado   4tj97u<z
    01 de Novembro de 2024, 10:39
  • JPratas: try65hytr Pessoal  h7ft6l k7y8j0
    01 de Novembro de 2024, 03:51
  • j.s.: try65hytr a todos  4tj97u<z
    30 de Outubro de 2024, 21:00
  • JPratas: dgtgtr Pessoal  4tj97u<z k7y8j0
    28 de Outubro de 2024, 17:35
  • FELISCUNHA: Votos de um santo domingo para todo o auditório  k8h9m
    27 de Outubro de 2024, 11:21
  • j.s.: bom fim de semana   49E09B4F 49E09B4F
    26 de Outubro de 2024, 17:06
  • j.s.: dgtgtr a todos  4tj97u<z
    26 de Outubro de 2024, 17:06
  • FELISCUNHA: ghyt74   49E09B4F  e bom fim de semana
    26 de Outubro de 2024, 11:49
  • JPratas: try65hytr Pessoal  101yd91 k7y8j0
    25 de Outubro de 2024, 03:53
  • JPratas: dgtgtr A Todos  4tj97u<z 2dgh8i k7y8j0
    23 de Outubro de 2024, 16:31
  • FELISCUNHA: ghyt74  pessoal   49E09B4F
    23 de Outubro de 2024, 10:59
  • j.s.: dgtgtr a todos  4tj97u<z
    22 de Outubro de 2024, 18:16
  • j.s.: dgtgtr a todos  4tj97u<z
    20 de Outubro de 2024, 15:04
  • FELISCUNHA: Votos de um santo domingo para todo o auditório  101041
    20 de Outubro de 2024, 11:37
  • axlpoa: hi
    19 de Outubro de 2024, 22:24
  • FELISCUNHA: ghyt74   49E09B4F  e bom fim de semana  4tj97u<z
    19 de Outubro de 2024, 11:31
  • j.s.: ghyt74 a todos  4tj97u<z
    18 de Outubro de 2024, 09:33

Autor Tópico: Data Manipulation With Dplyr in R  (Lida 88 vezes)

0 Membros e 1 Visitante estão a ver este tópico.

Online mitsumi

  • Moderador Global
  • ***
  • Mensagens: 115627
  • Karma: +0/-0
Data Manipulation With Dplyr in R
« em: 06 de Dezembro de 2020, 16:29 »

Data Manipulation With Dplyr in R
Duration: 3h2m | .MP4 1280x720, 30 fps(r) | AAC, 44100 Hz, 2ch | 1.48 GB
Genre: eLearning | Language: English
A straightforward tutorial in data wrangling with one of the most powerful R packages - dplyr.

What you'll learn
Filter data frames using various conditions
Select and remove data frame columns (variables)
Sort data frames by column values
Create new variables from the existing ones
Compute summary statistics for our data frame
Other useful operations (count data fame rows, select top rows, select rows at random etc.)
Chaining dplyr commands to write powerful data manipulation code
Joining data frames (five joining types)
Combining dplyr with ggDescription2 to create meningful charts

Requirements
Basic R programming knowledge

Description
Data manipulation is a vital data analysis skill - actually, it is the foundation of data analysis. This course is about the most effective data manipulation tool in R - dplyr!

As a data analyst, you will spend a vast amount of your time preparing or processing your data. The goal of data preparation is to convert your raw data into a high quality data source, suitable for analysis. More often than not, this process involves a lot of work. The dplyr package contains the tools that can make this work much easier.

dplyr has a few important advantages over other data data manipulation tools or functions:

it's much faster (25-30 times faster)

its code is easier to write and understand

it can use chaining to build sequences of commands, thus making the code even cleaner and faster to execute

For these reasons, dplyr quickly began the most popular data manipulation tool among R data scientists. When you finish this course, you will be able to

It is a short course, but it is focused on the most essential commands and functions of the dplyr package, those commands that you will likely use most often.

So let's see what you are going to learn in this course.

The first section covers the five core dplyr commands. These commands are: filter, select, mutate, arrange and summarise. You will need this commands practically every time when you work with dplyr. They are used to subset data frames, compute new variables, sort data frames, compute statistical indicators and so on. Here's a few real life scenarios of their utilization:

you need to extract from your respondents data set the male subjects with an income greater than $30,000

you need to compute each respondent's income per family member, knowing the total income and the number of family members

you have a data set with 27 variables, but you only need 6 for your analysis (so you want to remove the extra variables)

you have to sort your employees data set by salary

you need to compute the average satisfaction towards a product, knowing each individual customer satisfaction etc.

The second section approaches other important dplyr commands and functions. In this section you'll learn:

how to count the observation in a certain group

how to extract a random sample from your data frame

how to extract the top entries from your data frame, based on a given variable

how to visualize the structure of your data set

how to use the set operations in dplyr (if you have used these operations in base R, you'll see that dplyr takes them to a whole new level).

In the third section you'll start to take advantage of the true power of dplyr. Here we'll talk about chaining - creating sequences of dplyr commands that accomplish multiple tasks with one click only.

The fourth section is about joining data frames with dplyr. This is a very important topic, because many times your data will be found in several data frames. So you will need to join these data frames into only one, suitable for your analyses. We are going to look at five join types available in dplyr: inner_join, semi_join, left_join, anti_join and full_join. We are going to examine the output of each join type using a simple example.

In the fifth section we'll learn how to combine the dplyr and ggDescription2 (using chaining) commands to build expressive charts and graphs. For example, if you want to represent the income distribution for the subjects with a higher education only, or the relationship between income and education level for the female subjects only, in this section you will                                                                                                                                                                                                         learn exactly how to do it.

Every command is illustrated with video, both the syntax and the output being explained in detail. At the end of the course, a big number of practical exercises are proposed. By doing these exercises you'll actually apply in practice what you have learned.

Join this course right now and acquire a critical data analysis ability - data manipulation!

Who this course is for:
People who want to become R analysts
Students and statisticians who want to learn R
People who want to learn the fundamentals of data manipulation using R

Download link:
Só visivel para registados e com resposta ao tópico.

Only visible to registered and with a reply to the topic.

Links are Interchangeable - No Password - Single Extraction