## Empirical Cumulative Distribution Function (ECDF)

In statistics, an **empirical cumulative distribution function (ECDF)** is the distribution function associated with the empirical measure of a sample. This cumulative distribution function is a step function that jumps up by 1/n at each of the n data points. Its value at any specified value of the measured variable is the fraction of observations of the measured variable that are less than or equal to the specified value.
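As a small illustration of the definition above, the ECDF at a value `t` can be computed directly as the fraction of observations less than or equal to `t`. The sketch below assumes NumPy is available; the helper name `ecdf_at` and the sample values are hypothetical and chosen only for illustration.

```python
import numpy as np


def ecdf_at(sample, t):
    """Return the fraction of observations in `sample` that are <= t."""
    sample = np.asarray(sample)
    # Comparing gives a boolean array; its mean is the fraction of True values.
    return np.mean(sample <= t)


# Hypothetical sample of n = 5 measurements (note the repeated value 3.5)
sample = [2.0, 3.5, 3.5, 5.0, 7.0]

print(ecdf_at(sample, 1.0))  # below every observation
print(ecdf_at(sample, 3.5))  # three of the five observations are <= 3.5
print(ecdf_at(sample, 7.0))  # at or above every observation
```

When a value is repeated in the sample, as 3.5 is here, the jumps of 1/n at the tied points coincide, so the step at that value has height 2/n.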

The **empirical cumulative distribution function (ECDF)** is an estimate of the cumulative distribution function that generated the points in the sample. It converges with probability 1 to that underlying distribution, according to the Glivenko–Cantelli theorem. A number of results exist to quantify the rate of convergence of the empirical distribution function to the underlying cumulative distribution function.
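The convergence described above can be observed numerically. The following is a minimal sketch, not a proof: it assumes the underlying distribution is Uniform(0, 1), whose CDF is F(t) = t, and measures the largest gap between the ECDF and F for increasing sample sizes. The function name `sup_distance_to_uniform`, the seed, and the sample sizes are illustrative choices.

```python
import numpy as np


def sup_distance_to_uniform(sample):
    """Largest gap between the ECDF of `sample` and the Uniform(0, 1) CDF F(t) = t."""
    x = np.sort(sample)
    n = len(x)
    # The ECDF only changes at the sorted data points, so the largest gap
    # occurs at one of them: compare F with the ECDF value just after
    # each jump (i/n) and just before it ((i-1)/n).
    gap_after_jump = np.arange(1, n + 1) / n - x
    gap_before_jump = x - np.arange(0, n) / n
    return max(gap_after_jump.max(), gap_before_jump.max())


rng = np.random.default_rng(0)  # fixed seed, chosen arbitrarily
for n in (10, 100, 1_000, 10_000):
    sample = rng.uniform(size=n)
    print(n, sup_distance_to_uniform(sample))
```

The printed distance should tend to shrink as n grows, although for any single random draw the decrease need not be strictly monotone.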

In [1]:
import numpy as np
import matplotlib.pyplot as plt


def ecdf(data):
    """Compute ECDF for a one-dimensional array of measurements."""
    # Number of data points: n
    n = len(data)

    # x-data for the ECDF: x
    x = np.sort(data)

    # y-data for the ECDF: y
    y = np.arange(1, n+1) / n

    return x, y

In [None]:
# Compute ECDF: x_component, y_component
# `data` is a placeholder for your own one-dimensional array or DataFrame column
x_component, y_component = ecdf(data)

# Generate plot
_ = plt.plot(x_component, y_component, marker='.', linestyle='none')

# Label the axes
_ = plt.xlabel('x_label_name')
_ = plt.ylabel('ECDF')

# Display the plot
plt.show()


In [None]:
# Plot the ECDF
_ = plt.plot(x_component, y_component, '.')
_ = plt.xlabel('x_label_name')
_ = plt.ylabel('ECDF')

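# Percentile levels to mark (assumed values; adjust as needed)
percentiles = np.array([2.5, 25, 50, 75, 97.5])
ptiles_vers = np.percentile(x_component, percentiles)

# Overlay percentiles as red diamonds
_ = plt.plot(ptiles_vers, percentiles / 100, marker='D', color='red',
             linestyle='none')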

# Show the plot
plt.show()
