Skip to content

Latest commit

 

History

History
106 lines (50 loc) · 2.28 KB

plot_univariate_url_summary.rst

File metadata and controls

106 lines (50 loc) · 2.28 KB
.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        Click :ref:`here <sphx_glr_download_auto_examples_univariate_summaries_plot_univariate_url_summary.py>`
        to download the full example code

.. rst-class:: sphx-glr-example-title

Univariate URL Summary

Example of univariate eda summary for an url variable

The URL summary computes the following:

  • Countplot for the invididual unique urls
  • Countplot for the domains of the urls
  • Countplot for the domain suffixes of the urls
  • Countplot for the file types of the urls
import pandas as pd
import plotly

import intedact

Here we take a look at the source URL's for countries GDPR violations recordings.

data = pd.read_csv(
    "https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2020/2020-04-21/gdpr_violations.tsv",
    sep="\t",
)

fig = intedact.url_summary(data, "source", fig_width=700)
plotly.io.show(fig)
.. rst-class:: sphx-glr-timing

   **Total running time of the script:** ( 0 minutes  3.239 seconds)


.. only :: html

 .. container:: sphx-glr-footer
    :class: sphx-glr-footer-example



  .. container:: sphx-glr-download sphx-glr-download-python

     :download:`Download Python source code: plot_univariate_url_summary.py <plot_univariate_url_summary.py>`



  .. container:: sphx-glr-download sphx-glr-download-jupyter

     :download:`Download Jupyter notebook: plot_univariate_url_summary.ipynb <plot_univariate_url_summary.ipynb>`


.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_