.. only:: html .. note:: :class: sphx-glr-download-link-note Click :ref:`here <sphx_glr_download_auto_examples_univariate_summaries_plot_univariate_url_summary.py>` to download the full example code
.. rst-class:: sphx-glr-example-title
Example of univariate eda summary for an url variable
The URL summary computes the following:
- Countplot for the invididual unique urls
- Countplot for the domains of the urls
- Countplot for the domain suffixes of the urls
- Countplot for the file types of the urls
import pandas as pd
import plotly
import intedact
Here we take a look at the source URL's for countries GDPR violations recordings.
data = pd.read_csv(
"https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2020/2020-04-21/gdpr_violations.tsv",
sep="\t",
)
fig = intedact.url_summary(data, "source", fig_width=700)
plotly.io.show(fig)
.. rst-class:: sphx-glr-timing **Total running time of the script:** ( 0 minutes 3.239 seconds)
.. only :: html .. container:: sphx-glr-footer :class: sphx-glr-footer-example .. container:: sphx-glr-download sphx-glr-download-python :download:`Download Python source code: plot_univariate_url_summary.py <plot_univariate_url_summary.py>` .. container:: sphx-glr-download sphx-glr-download-jupyter :download:`Download Jupyter notebook: plot_univariate_url_summary.ipynb <plot_univariate_url_summary.ipynb>`
.. only:: html .. rst-class:: sphx-glr-signature `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_