Data visualization: Whatsapp group on the pandemic of Covid-19

visualization of data of a Whatsapp’s group during the Covid-19 pandemic

Fagner Morais Dias
3 min readNov 6, 2020
Photo by Christian Wiediger on Unsplash

Introduction

Communication is a strong key to maintain health and good relation with people. in the 21 century, there are many ways to say anything to someone even in long distances. Skype, Whatsapp, Telegram, or a simple fone call can turn many miles away to a few inches apart. The Whatsapp, for example, the user can create a group to talk to more than one person at a time or make fone call or even a video chat to talk to with a friend.Whatsapp is used in the whole world who more than 1.5 bi users every day the app to communicate with other people. In this article, we going to analyze a group made on Whatsapp and take some information like the most words used and time of more use.

Database

The source of the data used to analyze was simply export the conversation of a group on Whatsapp. The group is made of students from the program of MSc and Ph.D. at the university.

Pre-processing data

To the preparation of the data came, to identify some patterns like date and time we use regex to identify these patterns in the text.

anonymization of data

To maintain the identity of who sends the message, we made a function that creates a dictionary that maintains the author’s name as key and the string author_x, x is an indice to differ from the other authors, as value to the key.

Getting data

Some statistics

The graphic below represents the number of messages sends on each day of the week and weekend.

The graphic below represents the number of messages in the years 2019 and 2020, and it is possible to see the difference between them. The pandemic of Covid-19 put the number of messages down. The suspension of classes may be the main factor in these quantities.

The graphic below represents the time of the users most send messages to the group.

Conclusion

It is possible to see that the year 2019 is very different from 2020. In 2019, without the pandemic, the quantities of messages occur like expected. Instead, in 2020 the number of messages is lower mainly because de the pandemic and the suspension of classes.

--

--