Joint Research Programme, BTO 2022.037 | June 2022

BTO-VO project - Data visualization (2022)

This research is part of the Joint Research Programme of KWR, the water utilities and Vewin. All rights reserved by KWR. No part of this publication may be reproduced, stored in an automatic database, or transmitted in any form or by any means, be it electronic, mechanical, by photocopying, recording, or otherwise, without the prior written permission of KWR. This report is distributed to BTO-participants, and it is public after 1 year. July 2022.

KWR_new - white.png

PO Box 1072

3430 BB Nieuwegein

The Netherlands

T    +31 (0)30 60 69 511

F    +31 (0)30 60 61 165

E    [email protected]

I    www.kwrwater.nl


Management summary

Good visualization can considerably improve the interpretability of data and the efficiency of communication. With the latest developments in graphic techniques, we are now able to visualize and dashboard many kinds of seemingly messy (high-dimensional) data in 2D/3D plots, networks, etc. More practically, many visualizations can be presented in an interactive way, in which graphs and data points can be manipulated and customized by users, giving great ease to check data, identify patterns, and present findings.

Importance:

Data visualization and dashboarding are the graphical display of abstract information for two purposes: sense-making (e.g., data analysis) and communication [1-3]. In other words, data visualization is the representation of data or information in a graph, chart, or other visual formats. It communicates the relationship of the data with images. This is important because it allows trends and patterns to be more easily seen [1]. With the rise of (more available) data upon us, we need to be able to interpret increasingly larger and more heterogeneous batches of data. Multiple tools make it easier to visualize and present data. Data visualization is not only important for data scientists and data analysts, but also for many roles in different fields, e.g., marketing, communication, tech, design, etc. Besides, interactive visualization (hereafter referred to as visualization) enables the exploration of data via the manipulation of data embedded in the graph, with the color, brightness, size, shape, and position of visual objects representing aspects of the dataset being analyzed. The use of interactive visualizations is becoming increasingly popular in scientific research and business intelligence, and is now a common part of most analytics suites (e.g., Plotly, Tableau, PowerBI), thanks to its ease of use and added value, which allows a more effective exploration of the data.

Method:

In this report, we first discuss why good data visualization is important for KWR and Dutch water utilities. Then we summarize different types of graphs suitable for data science projects. Next, we list potential tools and software to make visualization and provide a list of top 5 of recommended tools. Last but not least, we also provide a user guide for quickly visualizing data based on Python.

Results:

The following concrete results have been produced in this project:

Application and future directions

In this report, we have addressed data visualization and its use in our research activities. We expect readers can follow the guideline and use interactive data visualization to boost their research efficiency and improve internal/external communications. Following this project, we suggest the following direction and activities:

This report:

This research is published in the report: Data visualization (BTO-402045-304). It becomes public after one year.



Table of Contents

1. Why informative visualization is needed