<a href="https://colab.research.google.com/github/DaviAlbini/data-science-projects/blob/main/03-avancados/Clustering_on_birds.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# Bird Clustering: Exploratory Analysis with the Birds Dataset (OpenML ID 41464)
Cluster analysis is an exploratory technique, so it has no predictive power for observations outside the sample. If new observations are added to the sample, the clustering must be rerun, because the new observations can change the composition of the groups. Likewise, if the variables in the analysis change, the clustering must be rerun, because adding or removing a variable can change the groups.

We will look at two methods for obtaining clusters:

 • **Agglomerative Hierarchical Method**:
The number of clusters is defined over the course of the analysis (step by step)

 • **Non-Hierarchical *K-means* Method**:
The number of clusters to form is defined a priori

### **Agglomerative Hierarchical Method**

Hierarchical cluster analysis depends on two choices:

1.   Dissimilarity measure (*distance*)

> The distance between observations, computed from the chosen variables. It therefore indicates how different the observations are from one another.

2.   Linkage method for the observations

> Specifies how distance is measured once clusters have already been formed.

---

**Linkage method**:

Indicates which distance to use once clusters already exist during the agglomerative stages.

*   Nearest neighbor (*single linkage*)

> Favors the smallest distances; recommended when the observations are distinct.

*   Furthest neighbor (*complete linkage*)

> Favors the largest distances; recommended when the observations are similar.

*   Between groups (*average linkage*)

> Groups are merged based on the average distance over all pairs of observations, taking one observation from each of the two groups being compared.
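
To make the difference between the three linkage rules concrete, here is a minimal sketch using SciPy's `linkage` and `fcluster`. The toy array `X_toy` and the dictionaries `final_merge` and `two_cluster_labels` are illustrative names and values, not part of this notebook's dataset.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage

# Toy data (illustrative values only): two tight groups on a line
X_toy = np.array([[0.0], [0.5], [1.0], [10.0], [10.5], [11.0]])

final_merge = {}         # distance at which the last two clusters are joined
two_cluster_labels = {}  # labels obtained when the tree is cut into 2 clusters

for method in ("single", "complete", "average"):
    # Each row of Z describes one merge: [cluster_i, cluster_j, distance, size]
    Z = linkage(X_toy, method=method, metric="euclidean")
    final_merge[method] = float(Z[-1, 2])
    two_cluster_labels[method] = fcluster(Z, t=2, criterion="maxclust")
    print(method, final_merge[method], two_cluster_labels[method])
```

All three rules recover the same two groups here, but they report a different distance for the final merge: single linkage uses the closest pair across the groups, complete linkage the farthest pair, and average linkage the mean over all cross-group pairs.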

### **Non-Hierarchical K-means Method**

The number `K` of clusters is **chosen a priori** and serves as the basis for identifying the cluster centers: observations are **arbitrarily** allocated to the `K` clusters in order to compute the initial centroids.

In the following steps, each observation is compared by its proximity to the centroids of the other clusters. If an observation is reallocated because it is closer to another cluster's centroid, the centroids of both clusters are recalculated. This is an **iterative process**.

In summary:
> The algorithm repeats reallocations until the allocation stabilizes. If we decide that the appropriate solution has 3 clusters, the model will then search for the best allocation of the observations to those 3 clusters.

The K-means procedure stops when no observation can be reallocated on the grounds of being closer to the centroid of another cluster.

This indicates that:

> The sum of squared distances from each observation to the center of its assigned cluster has been minimized (for the given starting allocation; the result is not guaranteed to be the global minimum).
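
The iterative procedure described above can be sketched in a few lines of NumPy. This is a simplified illustration, not the implementation used later in the notebook: the function name `kmeans_sketch` is hypothetical, the initial allocation is a deterministic round-robin (one possible "arbitrary" choice), and centroids are updated once per pass over all observations rather than after each individual reallocation.

```python
import numpy as np

def kmeans_sketch(X, k, max_iter=100):
    """Batch K-means starting from an arbitrary (round-robin) allocation."""
    # Arbitrary initial allocation: observation i goes to cluster i mod k
    labels = np.arange(len(X)) % k
    centroids = np.array([X[labels == j].mean(axis=0) for j in range(k)])

    for _ in range(max_iter):
        # Distance from every observation to every centroid
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        new_labels = dists.argmin(axis=1)
        # Stop when no observation changes cluster
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
        # Recompute centroids; an empty cluster keeps its previous centroid
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:
                centroids[j] = members.mean(axis=0)

    # Within-cluster sum of squares (WCSS) of the final allocation
    wcss = float(((X - centroids[labels]) ** 2).sum())
    return labels, centroids, wcss

# Toy data (illustrative values only): two well-separated groups
X_toy = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
                  [10.0, 10.0], [10.0, 11.0], [11.0, 10.0]])
labels, centroids, wcss = kmeans_sketch(X_toy, k=2)
print(labels, wcss)
```

Because the result depends on the starting allocation, library implementations usually run several initializations and keep the one with the lowest WCSS.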

---

**Techniques for identifying the number of clusters in K-means**

1.   Elbow method

> The total within-cluster sum of squares (WCSS) is computed for several values of K (the number of clusters). In the plot, we look for the bend (the "elbow"), that is, the point beyond which the decrease in WCSS is no longer substantial even as the number of clusters grows.

2.   Silhouette method

> For each observation, we compute: (a) its average distance to the other observations in its own cluster; (b) its average distance to the observations in the nearest cluster it is not allocated to. The silhouette coefficient of the observation is `(b - a) / max(a, b)`. The average silhouette coefficient over all observations is then computed. The procedure is repeated for several values of K.
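
Both criteria can be computed in the same loop over candidate values of K. The sketch below assumes scikit-learn's `KMeans` (whose `inertia_` attribute is the WCSS) and `silhouette_score`; the synthetic data `X_demo` with three known centers is only an illustration, not the birds dataset.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

# Synthetic data with three well-separated groups (illustrative only)
X_demo, _ = make_blobs(
    n_samples=300,
    centers=[[0, 0], [10, 0], [0, 10]],
    cluster_std=0.5,
    random_state=42,
)

wcss = {}        # elbow method: within-cluster sum of squares per K
silhouette = {}  # silhouette method: mean silhouette coefficient per K

for k in range(2, 8):
    km = KMeans(n_clusters=k, n_init=10, random_state=42).fit(X_demo)
    wcss[k] = km.inertia_
    silhouette[k] = silhouette_score(X_demo, km.labels_)

# The silhouette criterion picks the K with the highest mean coefficient
best_k = max(silhouette, key=silhouette.get)
print("K chosen by silhouette:", best_k)
```

The elbow is read visually from a plot of `wcss` against K, whereas the silhouette gives a single number per K that can be compared directly.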

### **Important Considerations about Clustering**

*   Cluster analysis is quite **sensitive** to the presence of `outliers`.
*   When **there are categorical variables**, `Correspondence Analysis` can be applied, **avoiding arbitrary weighting**.
*   The `output` of the **hierarchical method** can be used as `input` to the **non-hierarchical method** for an initial estimate of the number of clusters.
*   The **non-hierarchical k-means method** can be applied to larger samples.
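
One way to feed the hierarchical output into K-means is sketched below. The rule used to read the number of clusters from the tree (cutting at the largest gap between consecutive merge distances) is a heuristic assumed here for illustration; the data `X_demo` is synthetic.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Synthetic data with three well-separated groups (illustrative only)
X_demo, _ = make_blobs(
    n_samples=150,
    centers=[[0, 0], [8, 0], [0, 8]],
    cluster_std=0.5,
    random_state=0,
)

# Step 1: hierarchical clustering; Z[:, 2] holds the merge distances
Z = linkage(X_demo, method="average")

# Step 2 (heuristic, an assumption): cut the tree just before the largest
# jump in merge distance. After merge i, n - (i + 1) clusters remain.
gaps = np.diff(Z[:, 2])
n_clusters = len(X_demo) - (int(gaps.argmax()) + 1)

# Step 3: use that count as K for the non-hierarchical method
km = KMeans(n_clusters=n_clusters, n_init=10, random_state=0).fit(X_demo)
print("K suggested by the dendrogram:", n_clusters)
```

In practice the cut is often chosen by inspecting the dendrogram rather than by an automatic rule.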

## Dataset Overview
- **Source**: OpenML (ID 41464)  
- **Content**: 645 audio recordings, 19 bird species  
- **Characteristics**: a recording may contain multiple species, which makes this a multilabel scenario

## Project Goal
Achieve an effective clustering of the audio instances in order to:
- Identify acoustic patterns shared across species
- Map clusters to biological characteristics
- Support interpretable visualizations

## Clustering

In [33]:
# !pip install openml



In [1]:
# Data import and manipulation
import os
import numpy as np
import pandas as pd
import openml

# Visualization
import plotly.express as px
import plotly.graph_objects as go

# Preprocessing
from scipy.stats import zscore
from sklearn.preprocessing import OneHotEncoder
from sklearn.compose import ColumnTransformer
from sklearn.pipeline import Pipeline

# Utilities
from sklearn.utils import check_random_state

RANDOM_STATE = 42
rs = check_random_state(RANDOM_STATE)

# Show more columns in the console
pd.set_option("display.max_columns", 300)

In [2]:
oml_ds = openml.datasets.get_dataset(41464)
X, y, categorical_indicator, attribute_names = oml_ds.get_data(dataset_format="dataframe", target=None)
df = X.copy()

df.describe()

Unnamed: 0,audio.ssd1,audio.ssd2,audio.ssd3,audio.ssd4,audio.ssd5,audio.ssd6,audio.ssd7,audio.ssd8,audio.ssd9,audio.ssd10,audio.ssd11,audio.ssd12,audio.ssd13,audio.ssd14,audio.ssd15,audio.ssd16,audio.ssd17,audio.ssd18,audio.ssd19,audio.ssd20,audio.ssd21,audio.ssd22,audio.ssd25,audio.ssd26,audio.ssd27,audio.ssd28,audio.ssd29,audio.ssd30,audio.ssd31,audio.ssd32,audio.ssd33,audio.ssd34,audio.ssd35,audio.ssd36,audio.ssd37,audio.ssd38,audio.ssd39,audio.ssd40,audio.ssd41,audio.ssd42,audio.ssd43,audio.ssd44,audio.ssd45,audio.ssd46,audio.ssd49,audio.ssd50,audio.ssd51,audio.ssd52,audio.ssd53,audio.ssd54,audio.ssd55,audio.ssd56,audio.ssd57,audio.ssd58,audio.ssd59,audio.ssd60,audio.ssd61,audio.ssd62,audio.ssd63,audio.ssd64,audio.ssd65,audio.ssd66,audio.ssd67,audio.ssd68,audio.ssd69,audio.ssd70,audio.ssd73,audio.ssd74,audio.ssd75,audio.ssd76,audio.ssd77,audio.ssd78,audio.ssd79,audio.ssd80,audio.ssd81,audio.ssd82,audio.ssd83,audio.ssd84,audio.ssd85,audio.ssd86,audio.ssd87,audio.ssd88,audio.ssd89,audio.ssd90,audio.ssd91,audio.ssd92,audio.ssd93,audio.ssd94,audio.ssd97,audio.ssd98,audio.ssd99,audio.ssd100,audio.ssd101,audio.ssd102,audio.ssd103,audio.ssd104,audio.ssd105,audio.ssd106,audio.ssd107,audio.ssd108,audio.ssd109,audio.ssd110,audio.ssd111,audio.ssd112,audio.ssd113,audio.ssd114,audio.ssd115,audio.ssd116,audio.ssd117,audio.ssd118,audio.ssd121,audio.ssd122,audio.ssd123,audio.ssd124,audio.ssd125,audio.ssd126,audio.ssd127,audio.ssd128,audio.ssd129,audio.ssd130,audio.ssd131,audio.ssd132,audio.ssd133,audio.ssd134,audio.ssd135,audio.ssd136,audio.ssd137,audio.ssd138,audio.ssd139,audio.ssd140,audio.ssd141,audio.ssd145,audio.ssd146,audio.ssd147,audio.ssd148,audio.ssd149,audio.ssd150,audio.ssd151,audio.ssd152,audio.ssd153,audio.ssd154,audio.ssd155,audio.ssd156,audio.ssd157,audio.ssd158,audio.ssd159,audio.ssd160,audio.ssd161,audio.ssd162,audio.ssd163,audio.ssd164,audio.ssd165,audio.ssd166,cluster1,cluster2,cluster3,cluster4,cluster5,cluster6,cluster7,cluster8,cluster9,cluster10,cluster11
,cluster12,cluster13,cluster14,cluster15,cluster16,cluster17,cluster18,cluster19,cluster20,cluster21,cluster22,cluster23,cluster24,cluster25,cluster26,cluster27,cluster28,cluster29,cluster30,cluster31,cluster32,cluster33,cluster34,cluster35,cluster36,cluster37,cluster38,cluster39,cluster40,cluster41,cluster42,cluster43,cluster44,cluster45,cluster46,cluster47,cluster48,cluster49,cluster50,cluster51,cluster52,cluster53,cluster54,cluster55,cluster56,cluster57,cluster59,cluster60,cluster61,cluster62,cluster63,cluster64,cluster65,cluster66,cluster67,cluster68,cluster69,cluster70,cluster71,cluster72,cluster73,cluster74,cluster75,cluster76,cluster78,cluster79,cluster80,cluster81,cluster82,cluster83,cluster84,cluster85,cluster86,cluster87,cluster88,cluster89,cluster90,cluster91,cluster92,cluster93,cluster94,cluster95,cluster96,cluster97,cluster98,cluster99,cluster100,segments,mean_rect_width,std_rect_width,mean_rect_height,std_rect_height,mean_rect_volume,std_rect_volume
count,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0,645.0
mean,0.069505,0.066703,0.101182,0.116583,0.146894,0.137654,0.127785,0.128071,0.110028,0.109471,0.115032,0.124897,0.128643,0.166325,0.192398,0.241229,0.252173,0.210303,0.13956,0.099319,0.063479,0.00298,0.008022,0.004106,0.004948,0.004621,0.004235,0.003825,0.003539,0.002839,0.002293,0.001933,0.001799,0.001863,0.002111,0.003239,0.004972,0.011285,0.011891,0.008508,0.006143,0.002787,0.001206,8.5e-05,2.481686,1.694949,1.206429,0.945732,0.615668,0.650598,0.695584,0.645904,0.689184,0.656969,0.641358,0.662676,0.768669,0.929552,1.023504,1.076343,1.179811,1.437652,1.641202,1.573013,1.699791,10.812497,13.856688,8.112186,5.725459,4.778493,4.12091,4.340454,4.735921,5.023133,5.062888,5.228757,5.405003,5.705724,6.398067,8.163169,8.411986,8.401185,8.976028,10.576132,13.1988,12.910464,13.755529,161.50527,0.053773,0.058858,0.093574,0.110348,0.142754,0.133569,0.123751,0.124911,0.106832,0.106932,0.112701,0.122567,0.125554,0.161843,0.185152,0.227469,0.237199,0.19723,0.128152,0.09228,0.058588,0.001922,0.001982,0.004461,0.005927,0.009207,0.02219,0.020992,0.018439,0.024557,0.020181,0.02412,0.031112,0.037739,0.03996,0.065391,0.078646,0.109379,0.125717,0.104262,0.066966,0.047094,0.027129,0.364249,0.252299,0.31854,0.332242,0.357392,0.340233,0.325682,0.307493,0.269699,0.260601,0.264731,0.278747,0.303455,0.395551,0.479478,0.604683,0.63027,0.575919,0.433615,0.30697,0.218622,0.047495,0.004267,0.005067,0.007299,0.009723,0.002468,0.002962,0.005875,0.005972,0.004609,0.008379,0.006379,0.001746,0.003769,0.006567,0.003592,0.001506,0.003991,0.004033,0.00214,0.003913,0.007841,0.004464,0.003791,0.006511,0.004828,0.008871,0.000361,0.003426,0.004077,0.004486,0.008613,0.001882,0.006809,0.005706,0.004065,0.004226,0.002479,0.005263,0.004714,0.001658,0.008382,0.006096,0.003607,0.002874,0.016064,0.002494,0.008565,0.003641,0.004217,0.009179,0.005323,0.002881,0.006453,0.001758,0.004748,0.004466,0.003114,0.006018,0.007571,0.008383,0.006285,0.003483,0.008338,0.001287,0.002281,0.003636,0.003053,0.004589,0.000978,0.010
664,0.003765,0.007364,0.005599,0.005945,0.006483,0.001343,0.001289,0.004453,0.004449,0.00638,0.005959,0.006232,0.007479,0.00383,0.001466,0.001962,0.00223,0.00177,0.004415,0.001464,0.004798,0.004506,0.001982,0.00979,0.000633,0.006628,0.004127,0.010017,3.35969,21.143283,33.109169,17.947731,21.430477,1088.481438,3260.646997
std,0.108367,0.12278,0.120896,0.122229,0.133079,0.126236,0.117149,0.115889,0.105418,0.104175,0.106435,0.113951,0.111596,0.129193,0.1279,0.137825,0.132716,0.121892,0.088102,0.062019,0.041778,0.005321,0.020179,0.012178,0.009205,0.006501,0.005583,0.004913,0.004909,0.003803,0.003257,0.002921,0.002709,0.002723,0.003245,0.007836,0.013171,0.044962,0.042464,0.023727,0.022137,0.008669,0.002741,0.000295,1.549656,1.028227,0.852163,0.741946,0.701394,0.727848,0.858939,0.94194,0.945445,1.015327,1.037672,1.072599,1.138531,1.478959,1.526977,1.496852,1.610024,1.719327,2.085051,2.026469,2.072607,5.326613,17.913886,9.46899,6.103819,5.832839,8.192654,7.428984,10.454235,13.106505,12.772178,15.375872,16.56887,14.909437,12.78201,18.94366,17.760132,16.628455,17.854527,17.161927,24.371751,23.459764,23.375351,131.758459,0.100247,0.119479,0.119764,0.122624,0.133775,0.126543,0.117104,0.115904,0.10511,0.104233,0.106219,0.113855,0.111182,0.128505,0.126262,0.131039,0.126568,0.116889,0.080065,0.055497,0.03849,0.003716,0.007242,0.031183,0.016717,0.021634,0.044332,0.041913,0.034977,0.044073,0.036961,0.040335,0.048773,0.054713,0.052251,0.072913,0.074229,0.084034,0.08806,0.076674,0.049,0.034661,0.022608,0.443722,0.332476,0.292448,0.263905,0.294574,0.265952,0.271854,0.252059,0.218409,0.225968,0.234643,0.231937,0.255017,0.351361,0.394787,0.508534,0.540728,0.504017,0.430825,0.317252,0.224128,0.08951,0.039661,0.032341,0.055772,0.066918,0.031785,0.025669,0.046958,0.04412,0.03163,0.051498,0.03927,0.014524,0.048602,0.049419,0.027256,0.016797,0.029982,0.049194,0.023892,0.027042,0.068464,0.047124,0.026459,0.043856,0.034946,0.071712,0.005676,0.030214,0.035494,0.031108,0.069841,0.023081,0.05345,0.050836,0.028479,0.029969,0.02343,0.049995,0.045848,0.022589,0.059931,0.031014,0.028362,0.02866,0.097144,0.026619,0.061906,0.031097,0.044236,0.066906,0.046209,0.028149,0.049195,0.014014,0.033227,0.038208,0.019844,0.039513,0.048185,0.056536,0.058777,0.04549,0.058654,0.014337,0.017931,0.029489,0.023044,0.034923,0.008417,0.
05094,0.030467,0.049002,0.052182,0.040139,0.052171,0.013659,0.012387,0.033402,0.049957,0.051982,0.039499,0.049756,0.071296,0.030062,0.02214,0.01649,0.016019,0.015348,0.030091,0.022428,0.032749,0.056804,0.0399,0.062702,0.008743,0.047186,0.033719,0.064506,5.282479,30.7451,69.215009,24.1954,42.871603,2279.150816,9981.115731
min,0.001333,0.002663,0.004912,0.007605,0.015742,0.01749,0.018587,0.021672,0.018647,0.021898,0.025766,0.029112,0.034355,0.056682,0.078951,0.114065,0.127964,0.104238,0.070256,0.052076,0.032272,0.001066,2e-06,1.6e-05,5.6e-05,8.7e-05,0.000187,0.000205,0.000208,0.000202,0.000162,0.000167,0.000178,0.000186,0.000205,0.000307,0.000411,0.0006,0.000593,0.000361,0.000158,0.000101,5.4e-05,0.0,-0.640381,0.017073,-0.064888,-0.094397,-0.249983,-0.18515,-0.107631,-0.159261,-0.0585,-0.174324,-0.127165,-0.133824,-0.179716,-0.300027,-0.274589,-0.182511,-0.164429,-0.358369,-0.15683,-0.113682,-0.138356,0.76558,2.024711,2.415155,2.383785,2.3768,2.313059,2.376767,2.429804,2.349139,2.38721,2.452752,2.39036,2.501573,2.362951,2.412029,2.486594,2.436761,2.534111,2.525251,2.487457,2.279941,2.224151,1.0,0.001066,0.001066,0.001066,0.002899,0.012284,0.014102,0.015506,0.018929,0.016221,0.019756,0.024329,0.027716,0.032054,0.054576,0.078014,0.112931,0.127597,0.102609,0.070032,0.051517,0.031722,0.001066,0.001066,0.001066,0.001066,0.001066,0.001066,0.001066,0.001066,0.001066,0.001066,0.001066,0.001066,0.001066,0.001066,0.002512,0.012533,0.024454,0.041286,0.031005,0.025088,0.014308,0.0081,0.016404,0.029325,0.0507,0.053382,0.079978,0.068167,0.075154,0.071576,0.066427,0.069071,0.070174,0.07138,0.075725,0.113669,0.140804,0.187205,0.20404,0.15852,0.106175,0.083365,0.055137,0.001066,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
25%,0.008115,0.010873,0.021457,0.030335,0.050271,0.046586,0.043559,0.044909,0.038148,0.040101,0.043021,0.047776,0.052754,0.075556,0.0995,0.134864,0.152782,0.12532,0.083015,0.060971,0.038629,0.001079,0.000192,0.000181,0.000504,0.000723,0.001035,0.000842,0.000727,0.000634,0.000466,0.000392,0.000363,0.000353,0.000379,0.000499,0.000688,0.000938,0.000916,0.000577,0.000236,0.000139,7.5e-05,0.0,1.324302,1.017003,0.52796,0.399703,0.224258,0.231156,0.278307,0.263875,0.321818,0.304052,0.288006,0.263518,0.269048,0.225042,0.191631,0.148131,0.120156,0.180281,0.190821,0.191204,0.275492,6.574179,4.933579,3.915233,3.019468,2.888752,2.805059,2.834673,2.850186,2.845568,2.882997,2.873168,2.886413,2.864404,2.897101,2.894185,2.926755,2.973334,2.941273,3.019114,2.989326,2.982854,3.004211,55.026163,0.001497,0.005446,0.014118,0.023886,0.044799,0.042483,0.039392,0.042459,0.035803,0.037643,0.041447,0.046049,0.051264,0.073592,0.098623,0.133733,0.150182,0.120696,0.080808,0.060063,0.037829,0.001066,0.001066,0.001066,0.001066,0.001066,0.001066,0.001066,0.001066,0.001066,0.001066,0.001164,0.002817,0.004471,0.006991,0.020066,0.03267,0.055578,0.070734,0.058215,0.039647,0.029152,0.015942,0.099086,0.0838,0.131947,0.157826,0.180006,0.157963,0.147645,0.143126,0.118437,0.11489,0.11396,0.116543,0.123055,0.154744,0.194487,0.251394,0.26387,0.217356,0.137703,0.101694,0.069778,0.003279,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
50%,0.024863,0.021824,0.048259,0.066101,0.099202,0.090245,0.082214,0.08346,0.06989,0.06696,0.071288,0.077309,0.084653,0.117883,0.14846,0.19917,0.207671,0.164748,0.102401,0.073209,0.046036,0.001131,0.001055,0.000539,0.001589,0.002016,0.002502,0.002149,0.001847,0.001442,0.001067,0.000855,0.000792,0.00083,0.000837,0.00134,0.001841,0.002113,0.001889,0.001558,0.000528,0.000232,0.000124,0.0,2.140626,1.578748,1.102934,0.83084,0.525882,0.540367,0.5546,0.482597,0.523873,0.484406,0.451735,0.402069,0.404154,0.350297,0.304098,0.335819,0.326426,0.537757,0.552085,0.414591,0.6343,10.337257,8.60828,5.815074,4.130804,3.449494,3.091071,3.098347,3.12371,3.103391,3.109132,3.102663,3.114556,3.085868,3.131871,3.151033,3.207651,3.376993,3.362933,3.876168,3.978494,3.642651,3.823747,124.87068,0.011243,0.014869,0.038328,0.05711,0.092875,0.084035,0.078694,0.080924,0.067374,0.064141,0.069915,0.074028,0.081088,0.112024,0.137407,0.182419,0.188181,0.147642,0.096929,0.070295,0.043977,0.001066,0.001066,0.001066,0.001066,0.001066,0.002101,0.002026,0.001738,0.004637,0.003429,0.005328,0.009567,0.013488,0.017334,0.033901,0.049598,0.076683,0.090704,0.074752,0.048001,0.034899,0.019391,0.211866,0.137413,0.217162,0.243847,0.277745,0.262613,0.242328,0.230475,0.192154,0.180781,0.190265,0.204571,0.222233,0.300069,0.361271,0.42732,0.444129,0.390506,0.246247,0.15775,0.109615,0.006306,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
75%,0.0902,0.062301,0.13751,0.165304,0.192939,0.181517,0.173548,0.174532,0.144143,0.141049,0.15658,0.16169,0.166213,0.212774,0.226455,0.289375,0.298901,0.253674,0.15799,0.11147,0.069782,0.001849,0.005858,0.002167,0.005289,0.006202,0.005192,0.004528,0.004154,0.003372,0.002696,0.002156,0.00203,0.002068,0.002451,0.002948,0.004335,0.006432,0.006027,0.005376,0.002821,0.001376,0.000882,1.2e-05,3.156459,2.159223,1.622889,1.329832,0.834588,0.85852,0.846304,0.71885,0.765851,0.665911,0.614955,0.571155,0.606286,0.87974,1.524942,1.731928,1.959488,2.45095,2.673518,2.522951,2.63835,14.339357,15.886984,8.773662,6.285933,4.975915,3.602162,3.651798,3.671865,3.534329,3.578847,3.464912,3.522792,3.527113,3.830756,5.205407,7.036598,7.593037,8.458435,11.627016,13.799152,13.252876,13.925949,234.218752,0.068355,0.053354,0.130104,0.161868,0.190887,0.18028,0.170846,0.171373,0.138867,0.13817,0.155089,0.158569,0.160463,0.208332,0.220381,0.265663,0.27197,0.227458,0.141529,0.102527,0.062234,0.001066,0.001066,0.001066,0.001066,0.002366,0.021162,0.021387,0.019753,0.028691,0.022076,0.028769,0.040317,0.048477,0.054275,0.081897,0.09806,0.129987,0.144993,0.122752,0.072326,0.049165,0.026902,0.449071,0.291457,0.407857,0.44992,0.453703,0.415347,0.409683,0.380064,0.330207,0.32246,0.328913,0.362893,0.384591,0.495999,0.632655,0.761354,0.78274,0.763453,0.597647,0.398916,0.297778,0.042811,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,5.0,35.7,40.305087,31.1,24.413111,1221.0,1747.619769
max,0.850176,1.31813,0.916178,0.58258,0.638798,0.632144,0.605935,0.623863,0.571527,0.559225,0.54723,0.573955,0.560946,0.635504,0.695642,0.764285,0.75277,0.661413,0.524977,0.395318,0.290303,0.051685,0.212005,0.129201,0.095815,0.063333,0.058634,0.042378,0.042529,0.037668,0.032571,0.031652,0.030512,0.030676,0.030884,0.124198,0.164183,0.823855,0.762962,0.272247,0.308453,0.108189,0.027849,0.003047,13.97391,7.58327,6.616355,7.025934,10.402664,9.012228,9.885484,13.384801,13.183875,15.354201,17.166357,14.290042,11.11217,16.542148,14.537073,11.181005,12.649268,10.308189,14.706293,15.074796,12.44755,22.538899,253.910804,99.700562,92.494341,104.176651,169.703827,131.20129,160.130606,250.527176,237.370084,303.931159,353.028025,277.380876,197.122905,336.227388,282.679468,198.813514,233.881224,159.307312,285.428711,294.970121,182.38957,509.001961,0.817301,1.271474,0.90024,0.575741,0.645724,0.632129,0.606792,0.622007,0.57401,0.562715,0.545762,0.56928,0.559283,0.636216,0.696282,0.766066,0.740567,0.662218,0.509995,0.383508,0.280644,0.042047,0.108625,0.669469,0.20491,0.140552,0.289288,0.288992,0.259206,0.305248,0.237749,0.243144,0.295869,0.297283,0.292528,0.37525,0.442918,0.491473,0.522825,0.467469,0.330796,0.245118,0.146619,6.599882,3.767024,2.498426,2.1892,4.450945,2.939461,3.608109,3.096063,1.894935,2.110909,2.651515,2.379555,2.92575,3.273974,3.167144,3.593892,3.9291,4.778648,3.431706,2.083267,1.446398,0.610884,0.588235,0.5,1.0,1.0,0.5,0.5,1.0,0.5,0.5,0.6,0.5,0.166667,1.0,0.666667,0.5,0.333333,0.5,1.0,0.5,0.333333,1.0,1.0,0.4,0.5,0.666667,1.0,0.117647,0.352941,0.5,0.4,1.0,0.5,1.0,1.0,0.5,0.5,0.5,1.0,1.0,0.4,1.0,0.263158,0.4,0.4,1.0,0.4,1.0,0.5,1.0,1.0,1.0,0.5,1.0,0.166667,0.5,0.5,0.25,0.5,0.5,1.0,0.75,1.0,1.0,0.25,0.333333,0.5,0.333333,0.6,0.111111,0.5,0.5,0.75,1.0,0.5,1.0,0.2,0.166667,0.5,1.0,1.0,0.5,1.0,1.0,0.5,0.5,0.25,0.166667,0.2,0.5,0.5,0.5,1.0,1.0,1.0,0.166667,1.0,0.4,1.0,36.0,203.0,596.180941,187.5,251.984335,18638.333333,103512.089769


In [3]:
# Numeric columns
num_cols = []
num_cols += [c for c in df.columns if c.startswith("audio.ssd")]
num_cols += [c for c in df.columns if c.startswith("cluster")]
num_cols += [c for c in df.columns if c.startswith("mean_")]
num_cols += [c for c in df.columns if c.startswith("std_")]

# Categorical columns
categ_cols = ["segments", "location"]
df[categ_cols] = df[categ_cols].astype(str)

**Initial data treatment:**

Before starting, it is important to check the units of measurement of the variables. If they are on different scales, the variables should be standardized before running the cluster analysis.

> Standardization techniques include `ZScore`/`Scaler`; both rescale each variable to mean = 0 and standard deviation = 1.
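
The two standardization routes can be compared on a small example. This sketch assumes that "Scaler" refers to scikit-learn's `StandardScaler`; the `demo` DataFrame and its column names are made up for illustration. One detail worth knowing: `scipy.stats.zscore` and `StandardScaler` both divide by the population standard deviation (`ddof=0`) by default, while passing `ddof=1` to `zscore` divides by the sample standard deviation.

```python
import numpy as np
import pandas as pd
from scipy.stats import zscore
from sklearn.preprocessing import StandardScaler

# Two variables on very different scales (illustrative values only)
demo = pd.DataFrame({
    "width": [1.0, 2.0, 3.0, 4.0],
    "volume": [100.0, 300.0, 500.0, 700.0],
})

z_pop = demo.apply(zscore)              # default ddof=0 (population std)
z_sample = demo.apply(zscore, ddof=1)   # sample std
scaled = StandardScaler().fit_transform(demo)  # NumPy array, ddof=0

# zscore with default settings matches StandardScaler
print(np.allclose(z_pop.to_numpy(), scaled))
# Each variant has unit standard deviation under its own ddof
print(z_pop.std(ddof=0).round(6).tolist(), z_sample.std(ddof=1).round(6).tolist())
```

For clustering, the choice of `ddof` only rescales every standardized column by the same constant factor, so it does not change which observations are closest to each other.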



In [4]:
# Apply the z-score procedure (ddof=1 uses the sample standard deviation)
df[num_cols] = df[num_cols].apply(zscore, ddof=1)

# Convert categorical columns into indicator (dummy) columns with `get_dummies`
df_encoded = pd.get_dummies(df, columns=categ_cols, drop_first=False)

In [93]:
df_encoded

Unnamed: 0,audio.ssd1,audio.ssd2,audio.ssd3,audio.ssd4,audio.ssd5,audio.ssd6,audio.ssd7,audio.ssd8,audio.ssd9,audio.ssd10,audio.ssd11,audio.ssd12,audio.ssd13,audio.ssd14,audio.ssd15,audio.ssd16,audio.ssd17,audio.ssd18,audio.ssd19,audio.ssd20,audio.ssd21,audio.ssd22,audio.ssd25,audio.ssd26,audio.ssd27,audio.ssd28,audio.ssd29,audio.ssd30,audio.ssd31,audio.ssd32,audio.ssd33,audio.ssd34,audio.ssd35,audio.ssd36,audio.ssd37,audio.ssd38,audio.ssd39,audio.ssd40,audio.ssd41,audio.ssd42,audio.ssd43,audio.ssd44,audio.ssd45,audio.ssd46,audio.ssd49,audio.ssd50,audio.ssd51,audio.ssd52,audio.ssd53,audio.ssd54,audio.ssd55,audio.ssd56,audio.ssd57,audio.ssd58,audio.ssd59,audio.ssd60,audio.ssd61,audio.ssd62,audio.ssd63,audio.ssd64,audio.ssd65,audio.ssd66,audio.ssd67,audio.ssd68,audio.ssd69,audio.ssd70,audio.ssd73,audio.ssd74,audio.ssd75,audio.ssd76,audio.ssd77,audio.ssd78,audio.ssd79,audio.ssd80,audio.ssd81,audio.ssd82,audio.ssd83,audio.ssd84,audio.ssd85,audio.ssd86,audio.ssd87,audio.ssd88,audio.ssd89,audio.ssd90,audio.ssd91,audio.ssd92,audio.ssd93,audio.ssd94,audio.ssd97,audio.ssd98,audio.ssd99,audio.ssd100,audio.ssd101,audio.ssd102,audio.ssd103,audio.ssd104,audio.ssd105,audio.ssd106,audio.ssd107,audio.ssd108,audio.ssd109,audio.ssd110,audio.ssd111,audio.ssd112,audio.ssd113,audio.ssd114,audio.ssd115,audio.ssd116,audio.ssd117,audio.ssd118,audio.ssd121,audio.ssd122,audio.ssd123,audio.ssd124,audio.ssd125,audio.ssd126,audio.ssd127,audio.ssd128,audio.ssd129,audio.ssd130,audio.ssd131,audio.ssd132,audio.ssd133,audio.ssd134,audio.ssd135,audio.ssd136,audio.ssd137,audio.ssd138,audio.ssd139,audio.ssd140,audio.ssd141,audio.ssd145,audio.ssd146,audio.ssd147,audio.ssd148,audio.ssd149,audio.ssd150,audio.ssd151,audio.ssd152,audio.ssd153,audio.ssd154,audio.ssd155,audio.ssd156,audio.ssd157,audio.ssd158,audio.ssd159,audio.ssd160,audio.ssd161,audio.ssd162,audio.ssd163,...,cluster12,cluster13,cluster14,cluster15,cluster16,cluster17,cluster18,cluster19,cluster20,cluster21,cluster22,cluster23,cluster24,clust
er25,cluster26,cluster27,cluster28,cluster29,cluster30,cluster31,cluster32,cluster33,cluster34,cluster35,cluster36,cluster37,cluster38,cluster39,cluster40,cluster41,cluster42,cluster43,cluster44,cluster45,cluster46,cluster47,cluster48,cluster49,cluster50,cluster51,cluster52,cluster53,cluster54,cluster55,cluster56,cluster57,cluster59,cluster60,cluster61,cluster62,cluster63,cluster64,cluster65,cluster66,cluster67,cluster68,cluster69,cluster70,cluster71,cluster72,cluster73,cluster74,cluster75,cluster76,cluster78,cluster79,cluster80,cluster81,cluster82,cluster83,cluster84,cluster85,cluster86,cluster87,cluster88,cluster89,cluster90,cluster91,cluster92,cluster93,cluster94,cluster95,cluster96,cluster97,cluster98,cluster99,cluster100,mean_rect_width,std_rect_width,mean_rect_height,std_rect_height,mean_rect_volume,std_rect_volume,hasSegments,Brown.Creeper,Pacific.Wren,Pacific.slope.Flycatcher,Red.breasted.Nuthatch,Dark.eyed.Junco,Olive.sided.Flycatcher,Hermit.Thrush,Chestnut.backed.Chickadee,Varied.Thrush,Hermit.Warbler,Swainson.s.Thrush,Hammond.s.Flycatcher,Western.Tanager,Black.headed.Grosbeak,Golden.Crowned.Kinglet,Warbling.Vireo,MacGillivray.s.Warbler,Stellar.s.Jay,Common.Nighthawk,segments_0,segments_1,segments_10,segments_11,segments_12,segments_13,segments_14,segments_15,segments_16,segments_17,segments_18,segments_19,segments_2,segments_20,segments_21,segments_23,segments_24,segments_3,segments_36,segments_4,segments_5,segments_6,segments_7,segments_8,segments_9,location_1,location_10,location_11,location_13,location_15,location_16,location_17,location_2,location_4,location_5,location_7,location_8
0,-0.488936,-0.218091,-0.095540,0.143470,0.177156,0.310669,0.373549,0.468729,0.497583,0.476236,0.465066,0.337596,0.385230,0.413819,0.613032,0.710882,0.564552,0.445294,0.383684,0.517700,0.111989,-0.331619,-0.373093,-0.245675,-0.166137,-0.021787,-0.000693,0.038219,0.044428,0.124804,0.292491,0.128366,0.121869,-0.031718,0.069924,-0.154259,-0.093461,-0.121783,-0.118655,-0.164950,-0.057003,0.495600,-0.118059,-0.287158,-0.111597,-0.661833,-0.541884,-0.723564,-0.737741,-0.422742,-0.662208,-0.443700,-0.434404,-0.363630,-0.407698,-0.453658,-0.381992,-0.525665,-0.225699,0.037483,0.449017,1.098585,1.123206,1.173671,1.078810,-0.963147,-0.231278,-0.441642,-0.405905,-0.332804,-0.192671,-0.150645,-0.197905,-0.186471,-0.183849,-0.154236,-0.155374,-0.208777,-0.290822,-0.281632,-0.195177,-0.168440,-0.044833,0.712280,0.486596,0.374576,0.427814,-0.897911,-0.462265,-0.223575,-0.102120,0.161502,0.184553,0.299890,0.400048,0.485329,0.487874,0.485743,0.471895,0.334440,0.376985,0.442266,0.662585,0.793755,0.579788,0.496999,0.376480,0.304851,0.096662,-0.230453,-0.126429,-0.108881,-0.290768,-0.376295,-0.157487,-0.104290,-0.081709,0.544964,0.169576,0.485992,0.027961,0.206561,0.476551,0.358645,0.533067,0.491255,0.650187,0.530253,0.515248,0.376780,0.188650,-0.480967,-0.203513,-0.004776,0.116989,-0.013898,0.248158,0.098039,0.133398,0.205566,0.141393,0.127650,0.041912,0.043860,-0.101168,0.251990,0.167955,0.193747,0.549537,0.864644,...,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,2.699825,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,0.948953,-0.063519,2.432506,-0.114872,2.328550,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.156912,3.741219,-0.157122,-0.148273,-0.106927,-0.076574,-0.142151,-0.089784,-0.127231,-0.123312,3.205614,6.476481,-0.116221,-0.2093
49,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,2.836897,-0.150877,-0.125249,0.974024,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.154778,-0.180478,1.191197,1.176365,-0.131434,-0.150168,1,False,False,False,False,False,False,False,False,False,False,False,True,True,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False
1,-0.580486,-0.250197,-0.092860,0.054252,0.190705,0.312221,0.321999,0.445112,0.490582,0.520345,0.437584,0.303871,0.379934,0.393841,0.488904,0.478144,0.312088,0.267669,0.139127,0.104778,-0.021862,-0.326169,-0.391677,-0.257335,-0.207635,-0.084703,0.013637,0.178860,0.087206,0.264970,0.198547,0.141375,0.085325,0.031825,-0.014823,-0.131673,-0.193756,-0.194911,-0.235859,-0.298385,-0.249263,-0.266657,-0.376342,-0.287158,0.160381,-0.480922,-0.647627,-0.664848,-0.666892,-0.757898,-0.599224,-0.528812,-0.434447,-0.469212,-0.386488,-0.263857,-0.471708,-0.456536,-0.598190,-0.598417,-0.682010,-0.734036,-0.522345,0.606501,-0.429699,-1.073408,-0.157413,-0.374504,-0.386824,-0.341622,-0.167773,-0.238108,-0.191168,-0.167383,-0.173393,-0.161252,-0.158150,-0.183634,-0.264018,-0.284005,-0.298160,-0.333086,-0.350460,-0.448500,-0.380493,0.577722,-0.376696,-0.964161,-0.524421,-0.242281,-0.059519,0.067295,0.235885,0.331887,0.341134,0.477513,0.495684,0.517000,0.421292,0.286370,0.388237,0.439768,0.550642,0.597722,0.442841,0.395783,0.267093,0.228162,0.072812,-0.230453,-0.126429,-0.108881,-0.290768,-0.376295,-0.091801,-0.227258,-0.122735,-0.259141,0.164652,0.648530,0.345184,0.381820,0.093518,0.527230,0.424269,0.594523,0.490216,0.510912,0.452147,0.188153,0.271984,-0.672841,-0.237477,-0.099432,-0.059059,0.057873,0.202759,0.071800,0.276615,0.464287,0.235393,0.125152,0.134661,0.106514,-0.075115,-0.057701,-0.288847,-0.344241,-0.422780,-0.437393,...,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.076574,-0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131
417,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,-0.150877,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.687696,-0.478352,-0.741783,-0.499876,-0.477582,-0.326682,0,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False
2,-0.577773,-0.398934,-0.439389,-0.415455,-0.326508,-0.366434,-0.365811,-0.340025,-0.325864,-0.336580,-0.379096,-0.365899,-0.356846,-0.318513,-0.209680,-0.211340,-0.264203,-0.462831,-0.452323,-0.393569,-0.420113,-0.354361,-0.391330,-0.306932,-0.381881,-0.382517,-0.349092,-0.398157,-0.432243,-0.373269,-0.382618,-0.369037,-0.381996,-0.366695,-0.392640,-0.229804,-0.167866,-0.133081,-0.151600,-0.324600,-0.264080,-0.300802,-0.405891,-0.287158,0.207620,0.239037,-0.313866,-0.079493,-0.267734,-0.148366,-0.154543,-0.451879,-0.333718,-0.158040,-0.216655,-0.366572,-0.007530,0.250152,0.432037,1.599476,2.040963,-0.473454,-0.767502,-0.586042,-0.576882,-0.248531,-0.095668,-0.006223,-0.354701,-0.240925,-0.166404,-0.152929,-0.150945,-0.164758,-0.168177,-0.123832,-0.153926,-0.203013,-0.034435,-0.043049,-0.045966,0.795715,1.388839,-0.374029,-0.429979,-0.385099,-0.445650,-0.448203,-0.525768,-0.396819,-0.451590,-0.427317,-0.317012,-0.380638,-0.371738,-0.318567,-0.326008,-0.330563,-0.381824,-0.367877,-0.348049,-0.307891,-0.220321,-0.214279,-0.229992,-0.390726,-0.362236,-0.321560,-0.351008,-0.230453,-0.126429,-0.108881,-0.290768,-0.376295,-0.466539,-0.356931,-0.311028,-0.438504,-0.500689,-0.427577,-0.502535,-0.541774,-0.461416,-0.302735,-0.288471,-0.286255,-0.673131,-0.292356,-0.368724,-0.272708,-0.308304,-0.659883,-0.366442,-0.399150,-0.447813,-0.374140,-0.338113,-0.372003,-0.377867,-0.420927,-0.355280,-0.436935,-0.463436,-0.137658,-0.130796,-0.154895,0.371055,0.435770,-0.563299,-0.653234,...,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.07657
4,-0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131417,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,12.507669,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,0.092916,-0.437488,0.415462,-0.466889,-0.181858,-0.315346,1,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False
3,-0.354900,0.496591,0.994622,1.277306,1.591906,1.677287,1.606881,1.748087,1.904081,1.889294,1.738364,1.576752,1.494487,1.558938,1.609276,1.568311,1.332629,0.984026,0.839597,0.514959,0.284594,-0.276739,-0.353221,0.096080,0.387019,0.865040,1.099135,1.070133,0.991659,1.061519,1.295177,1.332678,1.134766,0.961462,0.617235,0.146131,-0.041681,-0.152053,-0.211816,-0.265933,-0.202779,-0.257198,-0.335849,-0.249837,-0.811386,-1.033782,-1.158545,-1.074494,-0.869768,-0.689148,-0.389217,-0.681278,-0.549500,-0.566580,-0.594445,-0.455367,-0.482656,-0.591050,-0.612270,-0.412722,-0.685715,-0.597597,1.285456,1.336471,0.942987,0.669163,-0.551454,-0.523958,-0.475380,-0.325021,-0.150773,-0.196234,-0.147606,-0.173316,-0.169204,-0.150923,-0.149274,-0.210754,-0.265004,-0.285458,-0.315477,-0.182659,-0.312163,-0.365989,1.735814,1.761590,1.022719,0.551870,-0.319528,0.504134,1.072480,1.313263,1.621388,1.638100,1.552275,1.759010,1.895877,1.885100,1.782633,1.591710,1.517966,1.596252,1.682848,1.752887,1.516678,1.115844,1.045402,0.673050,0.378082,-0.230453,-0.126429,-0.045867,0.391885,0.134281,0.574191,1.838653,1.235768,1.337906,1.350778,1.883902,1.456819,1.768663,1.721056,1.411835,1.968623,1.742622,1.752928,1.244756,1.371568,0.966604,0.563212,-0.481379,0.460230,0.701335,0.878010,1.083677,1.121158,1.475416,1.010403,1.225629,1.342747,1.155213,1.000334,0.755585,0.437711,0.278367,0.457324,0.022750,0.012643,0.702330,...,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.076574,-0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131417,-0.116221,-0.209349,-0.123
562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,-0.150877,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.687696,-0.478352,-0.741783,-0.499876,-0.477582,-0.326682,0,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False
4,-0.044150,1.302656,1.681609,1.293301,1.503515,1.555880,1.531711,1.696400,1.774292,1.754617,1.624989,1.444818,1.411034,1.456385,1.543499,1.476869,1.173620,0.841383,0.661463,0.366569,0.147870,-0.318275,-0.268628,0.975349,0.992102,0.652140,0.936668,0.990958,0.837046,1.351054,1.361184,1.057103,0.951308,0.875881,0.628637,0.178288,-0.054891,-0.161194,-0.214594,-0.275163,-0.232684,-0.282230,-0.371600,-0.283765,-0.859971,-0.865651,-1.106104,-1.222970,-0.950158,-0.696162,-0.592749,-0.635082,-0.730028,-0.496734,-0.563842,-0.500924,-0.632015,-0.562170,-0.565509,-0.657754,-0.808185,-0.663710,-0.712224,-0.713290,-0.356576,-0.609978,-0.512657,-0.487373,-0.468686,-0.381148,-0.147973,-0.170318,-0.189413,-0.177084,-0.193547,-0.147727,-0.167480,-0.186594,-0.265284,-0.275440,-0.308071,-0.312379,-0.337707,-0.412067,-0.435609,-0.435172,-0.322597,-0.680037,0.009907,1.212459,1.712981,1.363425,1.519702,1.594352,1.550021,1.688788,1.797408,1.765368,1.632095,1.451443,1.458100,1.462623,1.595791,1.668241,1.362484,0.985207,0.850110,0.524808,0.267379,-0.230453,-0.126429,0.069804,0.808344,1.284303,0.446496,1.066939,2.165051,1.338995,1.934034,1.453162,1.245371,1.454444,1.646723,0.433021,1.803068,1.389740,1.313169,1.102740,0.823858,0.466188,0.314138,-0.127840,1.401650,1.200889,0.667047,1.019992,1.089156,0.945283,1.060582,1.152156,1.250189,0.938621,0.917165,0.827906,0.446483,0.328903,0.100657,-0.146326,-0.198349,-0.346857,...,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.076574,-0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131417,-0.116221,-0.2093
49,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,-0.150877,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.687696,-0.478352,-0.741783,-0.499876,-0.477582,-0.326682,0,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
640,-0.032643,-0.496856,-0.755802,-0.838038,-0.893556,-0.872175,-0.847191,-0.829975,-0.784401,-0.756106,-0.748928,-0.736102,-0.742441,-0.757594,-0.765996,-0.807212,-0.803874,-0.738035,-0.653668,-0.601344,-0.587045,-0.358120,-0.065498,-0.331483,-0.520822,-0.670793,-0.683877,-0.701014,-0.644912,-0.648340,-0.631601,-0.569299,-0.549581,-0.574220,-0.552580,-0.357540,-0.330875,-0.233478,-0.261716,-0.337118,-0.268010,-0.305416,-0.412458,-0.287158,-0.449648,1.457508,0.991808,1.316449,0.413457,0.511917,0.372923,0.325226,-0.136646,-0.083789,-0.013346,-0.050565,-0.253004,-0.409326,-0.460376,-0.496668,-0.669381,-0.730215,-0.666493,-0.755160,-0.662256,0.350352,-0.417514,0.975747,0.288662,0.518689,-0.095805,0.029309,0.002271,-0.038498,-0.164694,-0.149436,-0.134374,-0.169486,-0.234973,-0.273838,-0.304745,-0.314716,-0.346388,-0.433639,-0.414406,-0.432152,-0.469739,0.199191,-0.177061,-0.478507,-0.741426,-0.827134,-0.885772,-0.864553,-0.831613,-0.829428,-0.774520,-0.748300,-0.750667,-0.740765,-0.728100,-0.739205,-0.726255,-0.750187,-0.727710,-0.663156,-0.585106,-0.547788,-0.519829,-0.230453,-0.126429,-0.108881,-0.290768,-0.376295,-0.476486,-0.475414,-0.496690,-0.533007,-0.517166,-0.571570,-0.579175,-0.595673,-0.734959,-0.718807,-0.676203,-0.634365,-0.670712,-0.695255,-0.598824,-0.445034,-0.374874,0.209322,-0.542963,-0.837781,-0.854516,-0.872762,-0.785708,-0.699281,-0.687401,-0.890774,-0.772553,-0.717399,-0.786571,-0.720302,-0.708126,-0.765589,-0.733655,-0.739742,-0.746233,-0.701158,...,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.076574,-
0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131417,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,-0.150877,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.687696,-0.478352,-0.741783,-0.499876,-0.477582,-0.326682,0,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False
641,-0.295971,-0.458243,-0.663160,-0.749123,-0.434091,-0.794147,-0.774753,-0.701887,-0.726469,-0.700027,-0.698926,-0.713048,-0.713462,-0.717832,-0.754284,-0.774402,-0.748899,-0.739659,-0.665280,-0.596684,-0.574311,-0.359248,-0.279134,-0.323190,-0.490731,-0.626798,-0.332075,-0.658476,-0.603560,-0.575760,-0.592611,-0.541571,-0.534078,-0.555855,-0.549806,-0.350394,-0.333228,-0.232032,-0.262470,-0.337961,-0.266655,-0.305877,-0.415376,-0.287158,0.069027,0.040510,0.340230,0.484406,-0.005042,0.202508,0.120854,0.025558,-0.036150,0.034680,0.015896,0.203818,-0.280653,-0.422380,-0.506270,-0.687345,-0.637503,-0.726639,-0.757281,-0.672471,-0.670086,0.516821,-0.081649,-0.276823,-0.060729,-0.032994,-0.129840,-0.148085,-0.122755,-0.123502,-0.141752,-0.111785,-0.108373,-0.102005,-0.258972,-0.285016,-0.308342,-0.339276,-0.331042,-0.434024,-0.412119,-0.425383,-0.463683,0.239137,-0.333334,-0.453440,-0.656376,-0.752474,-0.442335,-0.800298,-0.775833,-0.703626,-0.721271,-0.697366,-0.695244,-0.715522,-0.698113,-0.690335,-0.714763,-0.710489,-0.670579,-0.661787,-0.594749,-0.551770,-0.505046,-0.230453,-0.126429,-0.108881,-0.290768,-0.376295,-0.476486,-0.475414,-0.496690,-0.524453,-0.517166,-0.571570,-0.589509,-0.596971,-0.720280,-0.653085,-0.569978,-0.676777,-0.588166,-0.622662,-0.781639,-0.430839,-0.364170,0.049347,-0.553713,-0.719668,-0.786583,-0.328203,-0.800410,-0.701415,-0.632014,-0.741765,-0.669237,-0.644884,-0.679447,-0.772678,-0.720774,-0.735765,-0.772913,-0.726642,-0.729577,-0.714008,...,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.076574
,-0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131417,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,-0.150877,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.687696,-0.478352,-0.741783,-0.499876,-0.477582,-0.326682,0,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False
642,1.204729,-0.097053,0.296666,0.375182,0.345996,0.318274,0.433815,0.400906,0.248470,0.312972,0.485040,0.453453,0.227429,0.042745,-0.144598,-0.223065,-0.351863,-0.449959,-0.439009,-0.430139,-0.424015,-0.352106,1.252152,-0.190249,0.003221,0.144195,0.119500,0.077908,0.168077,0.166880,0.079121,0.123574,0.181668,0.066719,-0.112205,-0.219085,-0.261556,-0.210191,-0.251896,-0.330417,-0.263718,-0.303570,-0.408445,-0.287158,-0.576243,-0.528224,-0.937113,-0.806367,-0.515699,-0.700092,-0.578825,-0.263657,-0.248403,-0.334370,-0.279442,-0.304881,-0.396158,-0.405264,-0.540731,-0.645300,-0.557081,-0.827594,-0.614951,-0.795008,-0.716873,-0.745263,-0.417116,-0.385553,-0.442887,-0.355915,-0.137340,-0.216969,-0.159751,-0.161662,-0.137678,-0.145805,-0.137680,-0.187300,-0.270897,-0.283354,-0.307578,-0.329583,-0.317644,-0.427492,-0.410158,-0.428444,-0.461749,-0.817222,0.993542,-0.107103,0.337494,0.407801,0.344306,0.346309,0.463137,0.402727,0.255507,0.315413,0.488455,0.435771,0.218543,0.049176,-0.093434,-0.142896,-0.254216,-0.360227,-0.348435,-0.340876,-0.345370,-0.230453,-0.126429,-0.108881,-0.290768,-0.116015,-0.019935,0.013262,-0.199899,0.321447,-0.062882,0.004626,0.425310,0.414590,0.089805,0.246456,-0.169892,-0.189924,-0.101496,-0.518507,-0.311439,-0.232260,-0.245007,1.771633,-0.096459,0.210093,0.173904,0.197862,0.137300,0.473427,0.235791,0.232914,0.151425,0.317129,0.145974,-0.044072,-0.287985,-0.394234,-0.497108,-0.551890,-0.692755,-0.637406,...,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.076574,-0.142151,-0.089784,-0.127231,-0.123312,-
0.132475,-0.131417,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,-0.150877,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.687696,-0.478352,-0.741783,-0.499876,-0.477582,-0.326682,0,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False
643,-0.047749,-0.443411,-0.651249,-0.736785,-0.772124,-0.765684,-0.753541,-0.717600,-0.663871,-0.648786,-0.661325,-0.673759,-0.680029,-0.649120,-0.668256,-0.690426,-0.699192,-0.673879,-0.601263,-0.552020,-0.547765,-0.353985,-0.106629,-0.316867,-0.469548,-0.627721,-0.609540,-0.653591,-0.596226,-0.601794,-0.552086,-0.522400,-0.536662,-0.566139,-0.537788,-0.345800,-0.323206,-0.231165,-0.258655,-0.335811,-0.266700,-0.303455,-0.411364,-0.287158,-0.490268,0.317015,0.523858,0.119737,0.170138,0.329695,-0.026011,0.056974,-0.081340,-0.106391,-0.311208,-0.206780,-0.411217,-0.444994,-0.451655,-0.659020,-0.679315,-0.892427,-0.671012,-0.700478,-0.758795,0.646726,-0.438916,-0.071292,0.011428,-0.189308,-0.115756,-0.050284,-0.152359,-0.125629,-0.147992,-0.129174,-0.163394,-0.191809,-0.275560,-0.266301,-0.309565,-0.333129,-0.339812,-0.456935,-0.425062,-0.428675,-0.453649,0.579628,-0.195705,-0.444668,-0.668241,-0.728955,-0.768284,-0.755357,-0.750309,-0.723245,-0.656367,-0.637808,-0.651635,-0.667101,-0.671814,-0.626991,-0.633440,-0.620844,-0.624137,-0.582207,-0.528402,-0.510038,-0.478130,-0.230453,-0.126429,-0.108881,-0.290768,-0.376295,-0.476486,-0.475414,-0.496690,-0.533007,-0.517166,-0.564107,-0.549425,-0.505183,-0.645105,-0.577831,-0.441470,-0.686499,-0.604893,-0.535384,-0.471376,-0.492465,-0.529600,0.073101,-0.483260,-0.581814,-0.806840,-0.694831,-0.729288,-0.694293,-0.706472,-0.647154,-0.613468,-0.687783,-0.751811,-0.744629,-0.666701,-0.716512,-0.710097,-0.698071,-0.774151,-0.690043,...,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.076
574,-0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131417,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,-0.150877,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.687696,-0.478352,-0.741783,-0.499876,-0.477582,-0.326682,0,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,False,True,False,False,False,False,False


In [5]:
# Collect the indicator columns: the "segments_*" and "location_*" dummies plus "hasSegments".
indicator_prefixes = ("segments_", "hasSegments", "location_")
bool_cols = [c for c in df_encoded.columns if c.startswith(indicator_prefixes)]

# Cast them to float so that every feature is numeric for the clustering steps.
df_encoded[bool_cols] = df_encoded[bool_cols].astype(float)

# Confirm the resulting dtypes.
print(df_encoded.dtypes)

audio.ssd1    float64
audio.ssd2    float64
audio.ssd3    float64
audio.ssd4    float64
audio.ssd5    float64
               ...   
location_2    float64
location_4    float64
location_5    float64
location_7    float64
location_8    float64
Length: 314, dtype: object
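
The truncated `dtypes` listing above shows only the first and last columns, so it cannot confirm on its own that no boolean column was missed. The sketch below shows one way to check this programmatically. It runs on a small hypothetical frame (`toy`) whose column names only imitate those in `df_encoded`, and the values are made up for illustration.

```python
import pandas as pd

# Hypothetical miniature of df_encoded: one audio feature plus indicator columns.
toy = pd.DataFrame({
    "audio.ssd1": [-0.49, -0.58],
    "hasSegments": [1, 0],
    "segments_0": [False, True],
    "location_1": [True, True],
})

# Same prefix-based selection as in the cell above.
indicator_prefixes = ("segments_", "hasSegments", "location_")
bool_cols = [c for c in toy.columns if c.startswith(indicator_prefixes)]

# Cast the indicator columns to float (True -> 1.0, False -> 0.0).
toy = toy.astype({c: float for c in bool_cols})

# Any column listed here is still non-numeric (bool counts as non-numeric).
non_numeric = toy.select_dtypes(exclude="number").columns.tolist()
print(non_numeric)
```

An empty list means every column is numeric. Running the same `select_dtypes` call on `df_encoded` would cover all 314 columns rather than the handful visible in the printout.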


In [6]:
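# Drop the 19 species columns so that df_birds keeps only the audio, cluster, segment and location features.
df_birds = df_encoded.drop(columns=[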
    "Brown.Creeper",
    "Pacific.Wren",
    "Pacific.slope.Flycatcher",
    "Red.breasted.Nuthatch",
    "Dark.eyed.Junco",
    "Olive.sided.Flycatcher",
    "Hermit.Thrush",
    "Chestnut.backed.Chickadee",
    "Varied.Thrush",
    "Hermit.Warbler",
    "Swainson.s.Thrush",
    "Hammond.s.Flycatcher",
    "Western.Tanager",
    "Black.headed.Grosbeak",
    "Golden.Crowned.Kinglet",
    "Warbling.Vireo",
    "MacGillivray.s.Warbler",
    "Stellar.s.Jay",
    "Common.Nighthawk"])

# Alternative (disabled): keep every column, including the species columns.
# df_birds = df_encoded.copy()
df_birds

Unnamed: 0,audio.ssd1,audio.ssd2,audio.ssd3,audio.ssd4,audio.ssd5,audio.ssd6,audio.ssd7,audio.ssd8,audio.ssd9,audio.ssd10,audio.ssd11,audio.ssd12,audio.ssd13,audio.ssd14,audio.ssd15,audio.ssd16,audio.ssd17,audio.ssd18,audio.ssd19,audio.ssd20,audio.ssd21,audio.ssd22,audio.ssd25,audio.ssd26,audio.ssd27,audio.ssd28,audio.ssd29,audio.ssd30,audio.ssd31,audio.ssd32,audio.ssd33,audio.ssd34,audio.ssd35,audio.ssd36,audio.ssd37,audio.ssd38,audio.ssd39,audio.ssd40,audio.ssd41,audio.ssd42,audio.ssd43,audio.ssd44,audio.ssd45,audio.ssd46,audio.ssd49,audio.ssd50,audio.ssd51,audio.ssd52,audio.ssd53,audio.ssd54,audio.ssd55,audio.ssd56,audio.ssd57,audio.ssd58,audio.ssd59,audio.ssd60,audio.ssd61,audio.ssd62,audio.ssd63,audio.ssd64,audio.ssd65,audio.ssd66,audio.ssd67,audio.ssd68,audio.ssd69,audio.ssd70,audio.ssd73,audio.ssd74,audio.ssd75,audio.ssd76,audio.ssd77,audio.ssd78,audio.ssd79,audio.ssd80,audio.ssd81,audio.ssd82,audio.ssd83,audio.ssd84,audio.ssd85,audio.ssd86,audio.ssd87,audio.ssd88,audio.ssd89,audio.ssd90,audio.ssd91,audio.ssd92,audio.ssd93,audio.ssd94,audio.ssd97,audio.ssd98,audio.ssd99,audio.ssd100,audio.ssd101,audio.ssd102,audio.ssd103,audio.ssd104,audio.ssd105,audio.ssd106,audio.ssd107,audio.ssd108,audio.ssd109,audio.ssd110,audio.ssd111,audio.ssd112,audio.ssd113,audio.ssd114,audio.ssd115,audio.ssd116,audio.ssd117,audio.ssd118,audio.ssd121,audio.ssd122,audio.ssd123,audio.ssd124,audio.ssd125,audio.ssd126,audio.ssd127,audio.ssd128,audio.ssd129,audio.ssd130,audio.ssd131,audio.ssd132,audio.ssd133,audio.ssd134,audio.ssd135,audio.ssd136,audio.ssd137,audio.ssd138,audio.ssd139,audio.ssd140,audio.ssd141,audio.ssd145,audio.ssd146,audio.ssd147,audio.ssd148,audio.ssd149,audio.ssd150,audio.ssd151,audio.ssd152,audio.ssd153,audio.ssd154,audio.ssd155,audio.ssd156,audio.ssd157,audio.ssd158,audio.ssd159,audio.ssd160,audio.ssd161,audio.ssd162,audio.ssd163,audio.ssd164,audio.ssd165,audio.ssd166,cluster1,cluster2,cluster3,cluster4,cluster5,cluster6,cluster7,cluster8,cluster9,cluster10,cluster11
,cluster12,cluster13,cluster14,cluster15,cluster16,cluster17,cluster18,cluster19,cluster20,cluster21,cluster22,cluster23,cluster24,cluster25,cluster26,cluster27,cluster28,cluster29,cluster30,cluster31,cluster32,cluster33,cluster34,cluster35,cluster36,cluster37,cluster38,cluster39,cluster40,cluster41,cluster42,cluster43,cluster44,cluster45,cluster46,cluster47,cluster48,cluster49,cluster50,cluster51,cluster52,cluster53,cluster54,cluster55,cluster56,cluster57,cluster59,cluster60,cluster61,cluster62,cluster63,cluster64,cluster65,cluster66,cluster67,cluster68,cluster69,cluster70,cluster71,cluster72,cluster73,cluster74,cluster75,cluster76,cluster78,cluster79,cluster80,cluster81,cluster82,cluster83,cluster84,cluster85,cluster86,cluster87,cluster88,cluster89,cluster90,cluster91,cluster92,cluster93,cluster94,cluster95,cluster96,cluster97,cluster98,cluster99,cluster100,mean_rect_width,std_rect_width,mean_rect_height,std_rect_height,mean_rect_volume,std_rect_volume,hasSegments,segments_0,segments_1,segments_10,segments_11,segments_12,segments_13,segments_14,segments_15,segments_16,segments_17,segments_18,segments_19,segments_2,segments_20,segments_21,segments_23,segments_24,segments_3,segments_36,segments_4,segments_5,segments_6,segments_7,segments_8,segments_9,location_1,location_10,location_11,location_13,location_15,location_16,location_17,location_2,location_4,location_5,location_7,location_8
0,-0.488936,-0.218091,-0.095540,0.143470,0.177156,0.310669,0.373549,0.468729,0.497583,0.476236,0.465066,0.337596,0.385230,0.413819,0.613032,0.710882,0.564552,0.445294,0.383684,0.517700,0.111989,-0.331619,-0.373093,-0.245675,-0.166137,-0.021787,-0.000693,0.038219,0.044428,0.124804,0.292491,0.128366,0.121869,-0.031718,0.069924,-0.154259,-0.093461,-0.121783,-0.118655,-0.164950,-0.057003,0.495600,-0.118059,-0.287158,-0.111597,-0.661833,-0.541884,-0.723564,-0.737741,-0.422742,-0.662208,-0.443700,-0.434404,-0.363630,-0.407698,-0.453658,-0.381992,-0.525665,-0.225699,0.037483,0.449017,1.098585,1.123206,1.173671,1.078810,-0.963147,-0.231278,-0.441642,-0.405905,-0.332804,-0.192671,-0.150645,-0.197905,-0.186471,-0.183849,-0.154236,-0.155374,-0.208777,-0.290822,-0.281632,-0.195177,-0.168440,-0.044833,0.712280,0.486596,0.374576,0.427814,-0.897911,-0.462265,-0.223575,-0.102120,0.161502,0.184553,0.299890,0.400048,0.485329,0.487874,0.485743,0.471895,0.334440,0.376985,0.442266,0.662585,0.793755,0.579788,0.496999,0.376480,0.304851,0.096662,-0.230453,-0.126429,-0.108881,-0.290768,-0.376295,-0.157487,-0.104290,-0.081709,0.544964,0.169576,0.485992,0.027961,0.206561,0.476551,0.358645,0.533067,0.491255,0.650187,0.530253,0.515248,0.376780,0.188650,-0.480967,-0.203513,-0.004776,0.116989,-0.013898,0.248158,0.098039,0.133398,0.205566,0.141393,0.127650,0.041912,0.043860,-0.101168,0.251990,0.167955,0.193747,0.549537,0.864644,1.442230,0.400148,-0.447470,-0.107577,-0.156668,-0.13088,-0.145301,-0.07766,-0.115397,-0.125113,-0.135357,-0.145712,-0.162701,-0.162448,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,2.699825,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,0.948953,-0.063519,2.432506,-0.114872,2.328550,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0
.156912,3.741219,-0.157122,-0.148273,-0.106927,-0.076574,-0.142151,-0.089784,-0.127231,-0.123312,3.205614,6.476481,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,2.836897,-0.150877,-0.125249,0.974024,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.154778,-0.180478,1.191197,1.176365,-0.131434,-0.150168,1.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
1,-0.580486,-0.250197,-0.092860,0.054252,0.190705,0.312221,0.321999,0.445112,0.490582,0.520345,0.437584,0.303871,0.379934,0.393841,0.488904,0.478144,0.312088,0.267669,0.139127,0.104778,-0.021862,-0.326169,-0.391677,-0.257335,-0.207635,-0.084703,0.013637,0.178860,0.087206,0.264970,0.198547,0.141375,0.085325,0.031825,-0.014823,-0.131673,-0.193756,-0.194911,-0.235859,-0.298385,-0.249263,-0.266657,-0.376342,-0.287158,0.160381,-0.480922,-0.647627,-0.664848,-0.666892,-0.757898,-0.599224,-0.528812,-0.434447,-0.469212,-0.386488,-0.263857,-0.471708,-0.456536,-0.598190,-0.598417,-0.682010,-0.734036,-0.522345,0.606501,-0.429699,-1.073408,-0.157413,-0.374504,-0.386824,-0.341622,-0.167773,-0.238108,-0.191168,-0.167383,-0.173393,-0.161252,-0.158150,-0.183634,-0.264018,-0.284005,-0.298160,-0.333086,-0.350460,-0.448500,-0.380493,0.577722,-0.376696,-0.964161,-0.524421,-0.242281,-0.059519,0.067295,0.235885,0.331887,0.341134,0.477513,0.495684,0.517000,0.421292,0.286370,0.388237,0.439768,0.550642,0.597722,0.442841,0.395783,0.267093,0.228162,0.072812,-0.230453,-0.126429,-0.108881,-0.290768,-0.376295,-0.091801,-0.227258,-0.122735,-0.259141,0.164652,0.648530,0.345184,0.381820,0.093518,0.527230,0.424269,0.594523,0.490216,0.510912,0.452147,0.188153,0.271984,-0.672841,-0.237477,-0.099432,-0.059059,0.057873,0.202759,0.071800,0.276615,0.464287,0.235393,0.125152,0.134661,0.106514,-0.075115,-0.057701,-0.288847,-0.344241,-0.422780,-0.437393,0.054102,-0.385493,-0.438588,-0.107577,-0.156668,-0.13088,-0.145301,-0.07766,-0.115397,-0.125113,-0.135357,-0.145712,-0.162701,-0.162448,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0
.14291,-0.116874,-0.156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.076574,-0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131417,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,-0.150877,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.687696,-0.478352,-0.741783,-0.499876,-0.477582,-0.326682,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
2,-0.577773,-0.398934,-0.439389,-0.415455,-0.326508,-0.366434,-0.365811,-0.340025,-0.325864,-0.336580,-0.379096,-0.365899,-0.356846,-0.318513,-0.209680,-0.211340,-0.264203,-0.462831,-0.452323,-0.393569,-0.420113,-0.354361,-0.391330,-0.306932,-0.381881,-0.382517,-0.349092,-0.398157,-0.432243,-0.373269,-0.382618,-0.369037,-0.381996,-0.366695,-0.392640,-0.229804,-0.167866,-0.133081,-0.151600,-0.324600,-0.264080,-0.300802,-0.405891,-0.287158,0.207620,0.239037,-0.313866,-0.079493,-0.267734,-0.148366,-0.154543,-0.451879,-0.333718,-0.158040,-0.216655,-0.366572,-0.007530,0.250152,0.432037,1.599476,2.040963,-0.473454,-0.767502,-0.586042,-0.576882,-0.248531,-0.095668,-0.006223,-0.354701,-0.240925,-0.166404,-0.152929,-0.150945,-0.164758,-0.168177,-0.123832,-0.153926,-0.203013,-0.034435,-0.043049,-0.045966,0.795715,1.388839,-0.374029,-0.429979,-0.385099,-0.445650,-0.448203,-0.525768,-0.396819,-0.451590,-0.427317,-0.317012,-0.380638,-0.371738,-0.318567,-0.326008,-0.330563,-0.381824,-0.367877,-0.348049,-0.307891,-0.220321,-0.214279,-0.229992,-0.390726,-0.362236,-0.321560,-0.351008,-0.230453,-0.126429,-0.108881,-0.290768,-0.376295,-0.466539,-0.356931,-0.311028,-0.438504,-0.500689,-0.427577,-0.502535,-0.541774,-0.461416,-0.302735,-0.288471,-0.286255,-0.673131,-0.292356,-0.368724,-0.272708,-0.308304,-0.659883,-0.366442,-0.399150,-0.447813,-0.374140,-0.338113,-0.372003,-0.377867,-0.420927,-0.355280,-0.436935,-0.463436,-0.137658,-0.130796,-0.154895,0.371055,0.435770,-0.563299,-0.653234,-0.530595,-0.613510,-0.489756,-0.107577,-0.156668,-0.13088,-0.145301,-0.07766,-0.115397,-0.125113,-0.135357,-0.145712,-0.162701,12.569803,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.
095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.076574,-0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131417,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,12.507669,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,0.092916,-0.437488,0.415462,-0.466889,-0.181858,-0.315346,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
3,-0.354900,0.496591,0.994622,1.277306,1.591906,1.677287,1.606881,1.748087,1.904081,1.889294,1.738364,1.576752,1.494487,1.558938,1.609276,1.568311,1.332629,0.984026,0.839597,0.514959,0.284594,-0.276739,-0.353221,0.096080,0.387019,0.865040,1.099135,1.070133,0.991659,1.061519,1.295177,1.332678,1.134766,0.961462,0.617235,0.146131,-0.041681,-0.152053,-0.211816,-0.265933,-0.202779,-0.257198,-0.335849,-0.249837,-0.811386,-1.033782,-1.158545,-1.074494,-0.869768,-0.689148,-0.389217,-0.681278,-0.549500,-0.566580,-0.594445,-0.455367,-0.482656,-0.591050,-0.612270,-0.412722,-0.685715,-0.597597,1.285456,1.336471,0.942987,0.669163,-0.551454,-0.523958,-0.475380,-0.325021,-0.150773,-0.196234,-0.147606,-0.173316,-0.169204,-0.150923,-0.149274,-0.210754,-0.265004,-0.285458,-0.315477,-0.182659,-0.312163,-0.365989,1.735814,1.761590,1.022719,0.551870,-0.319528,0.504134,1.072480,1.313263,1.621388,1.638100,1.552275,1.759010,1.895877,1.885100,1.782633,1.591710,1.517966,1.596252,1.682848,1.752887,1.516678,1.115844,1.045402,0.673050,0.378082,-0.230453,-0.126429,-0.045867,0.391885,0.134281,0.574191,1.838653,1.235768,1.337906,1.350778,1.883902,1.456819,1.768663,1.721056,1.411835,1.968623,1.742622,1.752928,1.244756,1.371568,0.966604,0.563212,-0.481379,0.460230,0.701335,0.878010,1.083677,1.121158,1.475416,1.010403,1.225629,1.342747,1.155213,1.000334,0.755585,0.437711,0.278367,0.457324,0.022750,0.012643,0.702330,0.398022,0.217297,0.153067,-0.107577,-0.156668,-0.13088,-0.145301,-0.07766,-0.115397,-0.125113,-0.135357,-0.145712,-0.162701,-0.162448,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.156912,-0.15
2316,-0.157122,-0.148273,-0.106927,-0.076574,-0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131417,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,-0.150877,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.687696,-0.478352,-0.741783,-0.499876,-0.477582,-0.326682,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
4,-0.044150,1.302656,1.681609,1.293301,1.503515,1.555880,1.531711,1.696400,1.774292,1.754617,1.624989,1.444818,1.411034,1.456385,1.543499,1.476869,1.173620,0.841383,0.661463,0.366569,0.147870,-0.318275,-0.268628,0.975349,0.992102,0.652140,0.936668,0.990958,0.837046,1.351054,1.361184,1.057103,0.951308,0.875881,0.628637,0.178288,-0.054891,-0.161194,-0.214594,-0.275163,-0.232684,-0.282230,-0.371600,-0.283765,-0.859971,-0.865651,-1.106104,-1.222970,-0.950158,-0.696162,-0.592749,-0.635082,-0.730028,-0.496734,-0.563842,-0.500924,-0.632015,-0.562170,-0.565509,-0.657754,-0.808185,-0.663710,-0.712224,-0.713290,-0.356576,-0.609978,-0.512657,-0.487373,-0.468686,-0.381148,-0.147973,-0.170318,-0.189413,-0.177084,-0.193547,-0.147727,-0.167480,-0.186594,-0.265284,-0.275440,-0.308071,-0.312379,-0.337707,-0.412067,-0.435609,-0.435172,-0.322597,-0.680037,0.009907,1.212459,1.712981,1.363425,1.519702,1.594352,1.550021,1.688788,1.797408,1.765368,1.632095,1.451443,1.458100,1.462623,1.595791,1.668241,1.362484,0.985207,0.850110,0.524808,0.267379,-0.230453,-0.126429,0.069804,0.808344,1.284303,0.446496,1.066939,2.165051,1.338995,1.934034,1.453162,1.245371,1.454444,1.646723,0.433021,1.803068,1.389740,1.313169,1.102740,0.823858,0.466188,0.314138,-0.127840,1.401650,1.200889,0.667047,1.019992,1.089156,0.945283,1.060582,1.152156,1.250189,0.938621,0.917165,0.827906,0.446483,0.328903,0.100657,-0.146326,-0.198349,-0.346857,-0.427988,-0.353855,-0.374651,-0.107577,-0.156668,-0.13088,-0.145301,-0.07766,-0.115397,-0.125113,-0.135357,-0.145712,-0.162701,-0.162448,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.
156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.076574,-0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131417,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,-0.150877,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.687696,-0.478352,-0.741783,-0.499876,-0.477582,-0.326682,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
640,-0.032643,-0.496856,-0.755802,-0.838038,-0.893556,-0.872175,-0.847191,-0.829975,-0.784401,-0.756106,-0.748928,-0.736102,-0.742441,-0.757594,-0.765996,-0.807212,-0.803874,-0.738035,-0.653668,-0.601344,-0.587045,-0.358120,-0.065498,-0.331483,-0.520822,-0.670793,-0.683877,-0.701014,-0.644912,-0.648340,-0.631601,-0.569299,-0.549581,-0.574220,-0.552580,-0.357540,-0.330875,-0.233478,-0.261716,-0.337118,-0.268010,-0.305416,-0.412458,-0.287158,-0.449648,1.457508,0.991808,1.316449,0.413457,0.511917,0.372923,0.325226,-0.136646,-0.083789,-0.013346,-0.050565,-0.253004,-0.409326,-0.460376,-0.496668,-0.669381,-0.730215,-0.666493,-0.755160,-0.662256,0.350352,-0.417514,0.975747,0.288662,0.518689,-0.095805,0.029309,0.002271,-0.038498,-0.164694,-0.149436,-0.134374,-0.169486,-0.234973,-0.273838,-0.304745,-0.314716,-0.346388,-0.433639,-0.414406,-0.432152,-0.469739,0.199191,-0.177061,-0.478507,-0.741426,-0.827134,-0.885772,-0.864553,-0.831613,-0.829428,-0.774520,-0.748300,-0.750667,-0.740765,-0.728100,-0.739205,-0.726255,-0.750187,-0.727710,-0.663156,-0.585106,-0.547788,-0.519829,-0.230453,-0.126429,-0.108881,-0.290768,-0.376295,-0.476486,-0.475414,-0.496690,-0.533007,-0.517166,-0.571570,-0.579175,-0.595673,-0.734959,-0.718807,-0.676203,-0.634365,-0.670712,-0.695255,-0.598824,-0.445034,-0.374874,0.209322,-0.542963,-0.837781,-0.854516,-0.872762,-0.785708,-0.699281,-0.687401,-0.890774,-0.772553,-0.717399,-0.786571,-0.720302,-0.708126,-0.765589,-0.733655,-0.739742,-0.746233,-0.701158,-0.653702,-0.676639,-0.501330,-0.107577,-0.156668,-0.13088,-0.145301,-0.07766,-0.115397,-0.125113,-0.135357,-0.145712,-0.162701,-0.162448,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095
335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.076574,-0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131417,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,-0.150877,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.687696,-0.478352,-0.741783,-0.499876,-0.477582,-0.326682,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0
641,-0.295971,-0.458243,-0.663160,-0.749123,-0.434091,-0.794147,-0.774753,-0.701887,-0.726469,-0.700027,-0.698926,-0.713048,-0.713462,-0.717832,-0.754284,-0.774402,-0.748899,-0.739659,-0.665280,-0.596684,-0.574311,-0.359248,-0.279134,-0.323190,-0.490731,-0.626798,-0.332075,-0.658476,-0.603560,-0.575760,-0.592611,-0.541571,-0.534078,-0.555855,-0.549806,-0.350394,-0.333228,-0.232032,-0.262470,-0.337961,-0.266655,-0.305877,-0.415376,-0.287158,0.069027,0.040510,0.340230,0.484406,-0.005042,0.202508,0.120854,0.025558,-0.036150,0.034680,0.015896,0.203818,-0.280653,-0.422380,-0.506270,-0.687345,-0.637503,-0.726639,-0.757281,-0.672471,-0.670086,0.516821,-0.081649,-0.276823,-0.060729,-0.032994,-0.129840,-0.148085,-0.122755,-0.123502,-0.141752,-0.111785,-0.108373,-0.102005,-0.258972,-0.285016,-0.308342,-0.339276,-0.331042,-0.434024,-0.412119,-0.425383,-0.463683,0.239137,-0.333334,-0.453440,-0.656376,-0.752474,-0.442335,-0.800298,-0.775833,-0.703626,-0.721271,-0.697366,-0.695244,-0.715522,-0.698113,-0.690335,-0.714763,-0.710489,-0.670579,-0.661787,-0.594749,-0.551770,-0.505046,-0.230453,-0.126429,-0.108881,-0.290768,-0.376295,-0.476486,-0.475414,-0.496690,-0.524453,-0.517166,-0.571570,-0.589509,-0.596971,-0.720280,-0.653085,-0.569978,-0.676777,-0.588166,-0.622662,-0.781639,-0.430839,-0.364170,0.049347,-0.553713,-0.719668,-0.786583,-0.328203,-0.800410,-0.701415,-0.632014,-0.741765,-0.669237,-0.644884,-0.679447,-0.772678,-0.720774,-0.735765,-0.772913,-0.726642,-0.729577,-0.714008,-0.658569,-0.667854,-0.512167,-0.107577,-0.156668,-0.13088,-0.145301,-0.07766,-0.115397,-0.125113,-0.135357,-0.145712,-0.162701,-0.162448,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.0
95335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.076574,-0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131417,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,-0.150877,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.687696,-0.478352,-0.741783,-0.499876,-0.477582,-0.326682,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0
642,1.204729,-0.097053,0.296666,0.375182,0.345996,0.318274,0.433815,0.400906,0.248470,0.312972,0.485040,0.453453,0.227429,0.042745,-0.144598,-0.223065,-0.351863,-0.449959,-0.439009,-0.430139,-0.424015,-0.352106,1.252152,-0.190249,0.003221,0.144195,0.119500,0.077908,0.168077,0.166880,0.079121,0.123574,0.181668,0.066719,-0.112205,-0.219085,-0.261556,-0.210191,-0.251896,-0.330417,-0.263718,-0.303570,-0.408445,-0.287158,-0.576243,-0.528224,-0.937113,-0.806367,-0.515699,-0.700092,-0.578825,-0.263657,-0.248403,-0.334370,-0.279442,-0.304881,-0.396158,-0.405264,-0.540731,-0.645300,-0.557081,-0.827594,-0.614951,-0.795008,-0.716873,-0.745263,-0.417116,-0.385553,-0.442887,-0.355915,-0.137340,-0.216969,-0.159751,-0.161662,-0.137678,-0.145805,-0.137680,-0.187300,-0.270897,-0.283354,-0.307578,-0.329583,-0.317644,-0.427492,-0.410158,-0.428444,-0.461749,-0.817222,0.993542,-0.107103,0.337494,0.407801,0.344306,0.346309,0.463137,0.402727,0.255507,0.315413,0.488455,0.435771,0.218543,0.049176,-0.093434,-0.142896,-0.254216,-0.360227,-0.348435,-0.340876,-0.345370,-0.230453,-0.126429,-0.108881,-0.290768,-0.116015,-0.019935,0.013262,-0.199899,0.321447,-0.062882,0.004626,0.425310,0.414590,0.089805,0.246456,-0.169892,-0.189924,-0.101496,-0.518507,-0.311439,-0.232260,-0.245007,1.771633,-0.096459,0.210093,0.173904,0.197862,0.137300,0.473427,0.235791,0.232914,0.151425,0.317129,0.145974,-0.044072,-0.287985,-0.394234,-0.497108,-0.551890,-0.692755,-0.637406,-0.633503,-0.643765,-0.490963,-0.107577,-0.156668,-0.13088,-0.145301,-0.07766,-0.115397,-0.125113,-0.135357,-0.145712,-0.162701,-0.162448,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-0.095335,-0.137191,-0.115187,-0.102354,-0.131
174,-0.125424,-0.14291,-0.116874,-0.156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.076574,-0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131417,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,-0.150877,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.687696,-0.478352,-0.741783,-0.499876,-0.477582,-0.326682,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0
643,-0.047749,-0.443411,-0.651249,-0.736785,-0.772124,-0.765684,-0.753541,-0.717600,-0.663871,-0.648786,-0.661325,-0.673759,-0.680029,-0.649120,-0.668256,-0.690426,-0.699192,-0.673879,-0.601263,-0.552020,-0.547765,-0.353985,-0.106629,-0.316867,-0.469548,-0.627721,-0.609540,-0.653591,-0.596226,-0.601794,-0.552086,-0.522400,-0.536662,-0.566139,-0.537788,-0.345800,-0.323206,-0.231165,-0.258655,-0.335811,-0.266700,-0.303455,-0.411364,-0.287158,-0.490268,0.317015,0.523858,0.119737,0.170138,0.329695,-0.026011,0.056974,-0.081340,-0.106391,-0.311208,-0.206780,-0.411217,-0.444994,-0.451655,-0.659020,-0.679315,-0.892427,-0.671012,-0.700478,-0.758795,0.646726,-0.438916,-0.071292,0.011428,-0.189308,-0.115756,-0.050284,-0.152359,-0.125629,-0.147992,-0.129174,-0.163394,-0.191809,-0.275560,-0.266301,-0.309565,-0.333129,-0.339812,-0.456935,-0.425062,-0.428675,-0.453649,0.579628,-0.195705,-0.444668,-0.668241,-0.728955,-0.768284,-0.755357,-0.750309,-0.723245,-0.656367,-0.637808,-0.651635,-0.667101,-0.671814,-0.626991,-0.633440,-0.620844,-0.624137,-0.582207,-0.528402,-0.510038,-0.478130,-0.230453,-0.126429,-0.108881,-0.290768,-0.376295,-0.476486,-0.475414,-0.496690,-0.533007,-0.517166,-0.564107,-0.549425,-0.505183,-0.645105,-0.577831,-0.441470,-0.686499,-0.604893,-0.535384,-0.471376,-0.492465,-0.529600,0.073101,-0.483260,-0.581814,-0.806840,-0.694831,-0.729288,-0.694293,-0.706472,-0.647154,-0.613468,-0.687783,-0.751811,-0.744629,-0.666701,-0.716512,-0.710097,-0.698071,-0.774151,-0.690043,-0.648753,-0.669608,-0.462083,-0.107577,-0.156668,-0.13088,-0.145301,-0.07766,-0.115397,-0.125113,-0.135357,-0.145712,-0.162701,-0.162448,-0.120213,-0.07754,-0.132885,-0.131791,-0.089665,-0.133099,-0.081973,-0.089585,-0.144702,-0.114531,-0.094726,-0.143284,-0.148466,-0.138155,-0.123708,-0.063519,-0.113399,-0.114872,-0.144217,-0.123327,-0.081518,-0.127397,-0.112248,-0.142726,-0.141006,-0.105813,-0.105268,-0.102824,-0.07341,-0.139868,-0.196545,-0.127181,-0.100272,-0.165365,-0.093673,-0.13835,-0.117098,-
0.095335,-0.137191,-0.115187,-0.102354,-0.131174,-0.125424,-0.14291,-0.116874,-0.156912,-0.152316,-0.157122,-0.148273,-0.106927,-0.076574,-0.142151,-0.089784,-0.127231,-0.123312,-0.132475,-0.131417,-0.116221,-0.209349,-0.123562,-0.150287,-0.107288,-0.148099,-0.124262,-0.098347,-0.104055,-0.133316,-0.089047,-0.122727,-0.150877,-0.125249,-0.104898,-0.127421,-0.06621,-0.118993,-0.139217,-0.115295,-0.146709,-0.065294,-0.1465,-0.079323,-0.049675,-0.156136,-0.072413,-0.140476,-0.122407,-0.155292,-0.687696,-0.478352,-0.741783,-0.499876,-0.477582,-0.326682,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0


In [13]:
birds_features = df_birds.columns
birds_features

Index(['audio.ssd1', 'audio.ssd2', 'audio.ssd3', 'audio.ssd4', 'audio.ssd5',
       'audio.ssd6', 'audio.ssd7', 'audio.ssd8', 'audio.ssd9', 'audio.ssd10',
       ...
       'location_11', 'location_13', 'location_15', 'location_16',
       'location_17', 'location_2', 'location_4', 'location_5', 'location_7',
       'location_8'],
      dtype='object', length=295)

In [30]:
# --- 1. Select column groups by name prefix
audio_cols = [col for col in df_birds.columns if col.startswith("audio.ssd")]
cluster_cols = [col for col in df_birds.columns if col.startswith("cluster")]
# "hasSegments" is appended as a column name (a string), not as the Series itself
mean_std_cols = [col for col in df_birds.columns if col.startswith("mean_")] + [col for col in df_birds.columns if col.startswith("std_")] + ["hasSegments"]
segments_cols = [col for col in df_birds.columns if col.startswith("segments_")]
location_cols = [col for col in df_birds.columns if col.startswith("location_")]

# --- 2. Compute the correlation matrix over all columns
# To restrict it to two groups instead:
# corr = df_birds[location_cols + audio_cols].corr()
corr = df_birds.corr()

# Extract only the desired block (rows = location, columns = audio)
corr_block = corr.loc[location_cols, audio_cols]

# --- 3. Plot the heatmap (full matrix; pass corr_block to show only the block)
fig = px.imshow(
    # corr_block,
    corr,
    text_auto=".2f",
    aspect="auto",
    color_continuous_scale="RdBu_r",
    title="Correlation",
    zmin=-1,   # fixed lower bound
    zmax=1     # fixed upper bound
)

fig.update_layout(
    # xaxis_title="Other variables",
    # yaxis_title="audio.ssd variables",
    width=1500,
    height=900
)

fig.show()
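
The rectangular block between two column groups can also be obtained without the full matrix. The sketch below uses a small synthetic frame (`demo`, a hypothetical stand-in for `df_birds`, with made-up values) and correlates only the columns of interest before slicing the location-by-audio block.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for df_birds: two audio columns and two location dummies
rng = np.random.default_rng(0)
n = 200
loc = rng.integers(0, 2, size=n)
demo = pd.DataFrame({
    "audio.ssd1": rng.normal(size=n) + 2.0 * loc,  # shifted where location_2 == 1
    "audio.ssd2": rng.normal(size=n),              # unrelated to location
    "location_2": loc.astype(float),
    "location_4": 1.0 - loc,
})

audio_cols = [c for c in demo.columns if c.startswith("audio.ssd")]
location_cols = [c for c in demo.columns if c.startswith("location_")]

# Correlate only the two groups, then keep rows = location, columns = audio
corr_block = demo[location_cols + audio_cols].corr().loc[location_cols, audio_cols]
print(corr_block.round(2))
```

With 295 columns, restricting the input this way keeps the heatmap readable and avoids computing correlations that are never displayed.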


After any kind of clustering, it is worth assessing which variables did or did not contribute to the resulting groups. For each variable in the analysis, check whether the **variability within groups is smaller than the variability between groups**.

An **F-test** from the analysis of variance is applied to each variable:

> ***F = variability between groups / variability within groups***

This shows which variables contributed most to the formation of at least one of the clusters:

> Those with the largest F statistics (*considered together with their significance*)
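
A minimal sketch of this per-variable F-test, assuming cluster labels are already available. The data are synthetic, the helper `anova_by_cluster` is a hypothetical name, and `scipy.stats.f_oneway` is one possible way to run a one-way ANOVA; it is not necessarily what the notebook uses.

```python
import numpy as np
import pandas as pd
from scipy.stats import f_oneway

# Synthetic example: 3 clusters of 50 observations each (made-up data)
rng = np.random.default_rng(42)
labels = np.repeat([0, 1, 2], 50)
demo = pd.DataFrame({
    "separating": rng.normal(loc=labels * 3.0, scale=1.0),  # mean differs by cluster
    "noise": rng.normal(size=labels.size),                  # same distribution everywhere
})

def anova_by_cluster(df, labels):
    """Run a one-way ANOVA per column, grouping the rows by cluster label."""
    rows = []
    for col in df.columns:
        groups = [df.loc[labels == k, col].to_numpy() for k in np.unique(labels)]
        f_stat, p_value = f_oneway(*groups)
        rows.append({"variable": col, "F": f_stat, "p_value": p_value})
    # Largest F first: these variables separate the clusters best
    return pd.DataFrame(rows).sort_values("F", ascending=False).reset_index(drop=True)

anova_table = anova_by_cluster(demo, labels)
print(anova_table)
```

The same helper could be applied to the standardized feature frame together with the labels produced by either clustering method, which makes the two rankings directly comparable.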

## Main Technologies
- **Python**: `pandas`, `numpy`, `scikit-learn`, `matplotlib` / `seaborn`, *t-SNE* / *UMAP*
- **Reading .arff files**: `scipy.io.arff` or `liac-arff`
- (Optional) **OpenML**: use the API to download the dataset directly (`openml.datasets.get_dataset(41464)`)
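
As a hedged illustration of the `.arff` route, the sketch below parses a tiny in-memory ARFF document with `scipy.io.arff.loadarff` and converts it to a DataFrame. The relation name, attributes, and values are invented for the example; with the real file, a path would be passed in place of the `StringIO` object.

```python
import io

import pandas as pd
from scipy.io import arff

# Made-up ARFF content, only to show the parsing steps
arff_text = """@relation birds_demo
@attribute audio.ssd1 numeric
@attribute hasSegments numeric
@attribute location {A,B}
@data
0.12,1,A
0.56,0,B
0.33,1,B
"""

# loadarff accepts a file name or a file-like object
data, meta = arff.loadarff(io.StringIO(arff_text))
demo = pd.DataFrame(data)

# Nominal attributes are returned as bytes, so decode them to str
demo["location"] = demo["location"].str.decode("utf-8")
print(demo)
```

Decoding the nominal columns right after loading avoids byte-string categories such as `b'A'` leaking into later one-hot encoding.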

## Expected Conclusions
- Well-separated, interpretable cluster visualizations
- The relationship between clusters and species (or the lack of one)
- Insights into shared acoustic characteristics
- Pointers to next steps (e.g. multilabel classification, extraction of more complex features, generalization to new data)