### Exploratory analysis and data treatment of a single flight file

In this notebook, one of the flight files of a given aircraft is explored in order to identify the size of the data, the relevant attributes, and other aspects considered during the exploratory analysis phase. Possible data treatments are also presented.

In [2]:
import pandas as pd
import os

In [3]:
d1 = pd.read_parquet(os.path.join('06120018', 'TCRF_ARCHIVE_06120018_20220523142237.parquet'))
d1.columns

Index(['recording_time', 'aircraftSerNum-1', 'amscChBasHealthStatus-1b',
       'amscChBasHealthStatus-1a', 'amscChBasHealthStatus-2b',
       'amscHprsovDrivF-1a', 'amscHprsovDrivF-1b', 'amscHprsovDrivF-2b',
       'amscPrsovDrivF-1a', 'amscPrsovDrivF-1b', 'amscPrsovDrivF-2b',
       'basBleedLowPressF-1a', 'basBleedLowPressF-2b', 'basBleedLowTempF-1a',
       'basBleedLowTempF-2b', 'basBleedOverPressF-1a', 'basBleedOverPressF-2b',
       'basBleedOverTempF-1a', 'basBleedOverTempF-2b',
       'bleedAcsBleedConfigStatus-1b', 'bleedAcsBleedConfigStatus-2b',
       'bleedFavTmCmd-1a', 'bleedFavTmCmd-1b', 'bleedFavTmCmd-2a',
       'bleedFavTmCmd-2b', 'bleedFavTmFbk-1a', 'bleedFavTmFbk-1b',
       'bleedFavTmFbk-2b', 'bleedHprsovCmdStatus-1a',
       'bleedHprsovCmdStatus-1b', 'bleedHprsovCmdStatus-2a',
       'bleedHprsovCmdStatus-2b', 'bleedHprsovOpPosStatus-1a',
       'bleedHprsovOpPosStatus-1b', 'bleedHprsovOpPosStatus-2a',
       'bleedHprsovOpPosStatus-2b', 'bleedMonPress-1a', 'ble

In [4]:
d1.shape

(75781, 93)

#### Identifying relevant columns according to TAPI

Important notes:

- "amscChBasHealthStatus-2b" was used in place of the TAPI-recommended column "[AMSC 2A]", since the latter is not in the table.

- "BldPCdiffPRESS" (a column recommended by TAPI) is not in the table.

- The columns 'bleedHprsovCmdStatus-1a', 'bleedHprsovCmdStatus-1b', 'bleedHprsovCmdStatus-2a' and 'bleedHprsovCmdStatus-2b' were interpreted as TAPI's 'BleedHprsovCmdSta_01', 'BleedHprsovCmdSta_02', 'BleedHprsovCmdSta_03' and 'BleedHprsovCmdSta_04'.

- 'BleedHprsovOpPosS' 01 to 04 were replaced by 'bleedHprsovOpPosStatus-1a', 'bleedHprsovOpPosStatus-1b', 'bleedHprsovOpPosStatus-2a' and 'bleedHprsovOpPosStatus-2b', since they are not in the table.

- "ITT1C", "ITT2C", "N11C", "N12C", "N21C" and "N22C" are not in the table.

The attributes identified in a flight file are listed below.

['amscChBasHealthStatus-1a','amscChBasHealthStatus-1b','amscChBasHealthStatus-2b','amscHprsovDrivF-1a','amscHprsovDrivF-1b','amscHprsovDrivF-2b','amscPrsovDrivF-1a','amscPrsovDrivF-1b','amscPrsovDrivF-2b','basBleedLowPressF-1a','basBleedLowPressF-2b','basBleedLowTempF-1a','basBleedLowTempF-2b','basBleedOverPressF-1a','basBleedOverPressF-2b','basBleedOverTempF-1a','basBleedOverTempF-2b','bleedFavTmCmd-1a','bleedFavTmCmd-1b','bleedFavTmCmd-2a','bleedFavTmCmd-2b','bleedFavTmFbk-1a','bleedFavTmFbk-1b','bleedFavTmFbk-2b','bleedHprsovCmdStatus-1a','bleedHprsovCmdStatus-1b','bleedHprsovCmdStatus-2a','bleedHprsovCmdStatus-2b','bleedHprsovOpPosStatus-1a','bleedHprsovOpPosStatus-1b','bleedHprsovOpPosStatus-2a','bleedHprsovOpPosStatus-2b','bleedMonPress-1a','bleedMonPress-1b','bleedMonPress-2a','bleedMonPress-2b','bleedOnStatus-1a','bleedOnStatus-1b','bleedOnStatus-2b','bleedOverpressCas-2a','bleedOverpressCas-2b','bleedPrecoolDiffPress-1a','bleedPrecoolDiffPress-1b','bleedPrecoolDiffPress-2a','bleedPrecoolDiffPress-2b','bleedPrsovClPosStatus-1a','bleedPrsovClPosStatus-2a','bleedPrsovFbk-1a','bleedPrsovFbk-1b','bleedPrsovFbk-2b','bleedPrsovOpPosStatus-1a','bleedPrsovOpPosStatus-1b','bleedPrsovOpPosStatus-2a','bleedPrsovOpPosStatus-2b','bleedPrsovTmCmd-1a','bleedPrsovTmCmd-1b','bleedPrsovTmCmd-2a','bleedPrsovTmCmd-2b','bleedSingleOperation-1a','bleedSingleOperation-2b','phaseOfFlight-1','sfyBasFaultWord1Bit13-1a','sfyBasFaultWord1Bit13-2b']
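The mapping notes above come down to checking a list of recommended names against the columns that actually exist in the file. The following is a minimal sketch of that check, not the procedure used in this notebook: `split_by_availability` is a hypothetical helper, and the toy frame stands in for `d1` (only its column names matter).

```python
import pandas as pd

def split_by_availability(columns, wanted):
    """Return (present, missing) lists, preserving the order of `wanted`."""
    available = set(columns)
    present = [name for name in wanted if name in available]
    missing = [name for name in wanted if name not in available]
    return present, missing

# Toy stand-in for d1 (assumption); only the column names are used.
toy = pd.DataFrame(columns=["recording_time", "amscChBasHealthStatus-2b", "phaseOfFlight-1"])
# A few names taken from the notes above.
wanted = ["amscChBasHealthStatus-2b", "BldPCdiffPRESS", "ITT1C", "phaseOfFlight-1"]

present, missing = split_by_availability(toy.columns, wanted)
print("present:", present)
print("missing:", missing)
```

Running the same check against `d1.columns` would make the "not in the table" notes reproducible for any other flight file.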





#### Data treatment of the dataframe

In [5]:
lCol = ['recording_time','amscChBasHealthStatus-1a','amscChBasHealthStatus-1b','amscChBasHealthStatus-2b','amscHprsovDrivF-1a','amscHprsovDrivF-1b','amscHprsovDrivF-2b','amscPrsovDrivF-1a','amscPrsovDrivF-1b','amscPrsovDrivF-2b','basBleedLowPressF-1a','basBleedLowPressF-2b','basBleedLowTempF-1a','basBleedLowTempF-2b','basBleedOverPressF-1a','basBleedOverPressF-2b','basBleedOverTempF-1a','basBleedOverTempF-2b','bleedFavTmCmd-1a','bleedFavTmCmd-1b','bleedFavTmCmd-2a','bleedFavTmCmd-2b','bleedFavTmFbk-1a','bleedFavTmFbk-1b','bleedFavTmFbk-2b','bleedHprsovCmdStatus-1a','bleedHprsovCmdStatus-1b','bleedHprsovCmdStatus-2a','bleedHprsovCmdStatus-2b','bleedHprsovOpPosStatus-1a','bleedHprsovOpPosStatus-1b','bleedHprsovOpPosStatus-2a','bleedHprsovOpPosStatus-2b','bleedMonPress-1a','bleedMonPress-1b','bleedMonPress-2a','bleedMonPress-2b','bleedOnStatus-1a','bleedOnStatus-1b','bleedOnStatus-2b','bleedOverpressCas-2a','bleedOverpressCas-2b','bleedPrecoolDiffPress-1a','bleedPrecoolDiffPress-1b','bleedPrecoolDiffPress-2a','bleedPrecoolDiffPress-2b','bleedPrsovClPosStatus-1a','bleedPrsovClPosStatus-2a','bleedPrsovFbk-1a','bleedPrsovFbk-1b','bleedPrsovFbk-2b','bleedPrsovOpPosStatus-1a','bleedPrsovOpPosStatus-1b','bleedPrsovOpPosStatus-2a','bleedPrsovOpPosStatus-2b','bleedPrsovTmCmd-1a','bleedPrsovTmCmd-1b','bleedPrsovTmCmd-2a','bleedPrsovTmCmd-2b','bleedSingleOperation-1a','bleedSingleOperation-2b','phaseOfFlight-1','sfyBasFaultWord1Bit13-1a','sfyBasFaultWord1Bit13-2b']

In [6]:
colunas_remover = d1.columns.difference(lCol)

In [7]:
caminho_da_pasta = "06120018"  

lista_arquivos = os.listdir(caminho_da_pasta)

In [8]:
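# Read at most the first 80 files, keep only the selected columns,
# and tag every row with the file (flight) it came from.
frames = []
for c, i in enumerate(lista_arquivos[:80], start=1):
    file = pd.read_parquet(os.path.join(caminho_da_pasta, i))
    file = file.drop(columns=colunas_remover)
    file["voo"] = i
    frames.append(file)
    print(c)

# Concatenate once at the end; each file keeps its own row index,
# so index values repeat across flights.
df = pd.concat(frames, axis=0)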

1
2
3
...
79
80
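As an alternative to reading every column and dropping the unwanted ones afterwards, `pd.read_parquet` accepts a `columns` argument, so only the selected columns are loaded. The sketch below is an assumption-laden variant, not the cell used in this notebook: `load_flights` and its `reader` parameter are hypothetical names, and `ignore_index=True` deliberately departs from the loop above by producing a unique row index.

```python
import os
import pandas as pd

def load_flights(paths, columns, reader=pd.read_parquet):
    """Read only `columns` from each file, tag rows with the file name, concatenate once."""
    frames = []
    for path in paths:
        frame = reader(path, columns=columns)   # column pruning happens at read time
        frame["voo"] = os.path.basename(path)   # same role as the "voo" column above
        frames.append(frame)
    # ignore_index=True gives a unique row index, unlike the loop above
    return pd.concat(frames, ignore_index=True)

# Hypothetical usage with the names from this notebook:
# paths = [os.path.join(caminho_da_pasta, f) for f in sorted(lista_arquivos)[:80]]
# df = load_flights(paths, lCol)
```

The `reader` parameter exists only so the function can be exercised without parquet files on disk; in normal use the default is enough.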


In [9]:
df.shape

(9951500, 65)

In [10]:
df

Unnamed: 0,recording_time,amscChBasHealthStatus-1b,amscChBasHealthStatus-1a,amscChBasHealthStatus-2b,amscHprsovDrivF-1a,amscHprsovDrivF-1b,amscHprsovDrivF-2b,amscPrsovDrivF-1a,amscPrsovDrivF-1b,amscPrsovDrivF-2b,...,bleedPrsovTmCmd-1a,bleedPrsovTmCmd-1b,bleedPrsovTmCmd-2a,bleedPrsovTmCmd-2b,bleedSingleOperation-1a,bleedSingleOperation-2b,phaseOfFlight-1,sfyBasFaultWord1Bit13-1a,sfyBasFaultWord1Bit13-2b,voo
0,0,,,,,,,,,,...,,,,,0.0,0.0,0.0,,,TCRF_ARCHIVE_06120018_20220523142237.parquet
1,50,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,,,,0.0,0.0,TCRF_ARCHIVE_06120018_20220523142237.parquet
2,100,,,,,,,,,,...,,,,,0.0,0.0,0.0,,,TCRF_ARCHIVE_06120018_20220523142237.parquet
3,150,,,,,,,,,,...,,,,,,,,,,TCRF_ARCHIVE_06120018_20220523142237.parquet
4,200,,,,,,,,,,...,,,,,0.0,0.0,0.0,,,TCRF_ARCHIVE_06120018_20220523142237.parquet
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
182876,9143800,,,,,,,,,,...,,,,,0.0,0.0,0.0,,,TCRF_ARCHIVE_06120018_20220603161315.parquet
182877,9143850,,,,,,,,,,...,,,,,,,,,,TCRF_ARCHIVE_06120018_20220603161315.parquet
182878,9143900,,,,,,,,,,...,,,,,0.0,0.0,0.0,,,TCRF_ARCHIVE_06120018_20220603161315.parquet
182879,9143950,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,,,,0.0,0.0,TCRF_ARCHIVE_06120018_20220603161315.parquet


In [11]:
shortColumns = []
for i in df.columns:
    values = df[i].unique()
    if len(values) <10:
        shortColumns.append(i)
        print(i + str(values))

amscChBasHealthStatus-1b[nan  0.]
amscChBasHealthStatus-1a[nan  0.]
amscChBasHealthStatus-2b[nan  0.  2.  3.]
amscHprsovDrivF-1a[nan  0.]
amscHprsovDrivF-1b[nan  0.]
amscHprsovDrivF-2b[nan  0.  1.]
amscPrsovDrivF-1a[nan  0.]
amscPrsovDrivF-1b[nan  0.]
amscPrsovDrivF-2b[nan  0.]
basBleedLowPressF-1a[nan  0.]
basBleedLowPressF-2b[nan  0.]
basBleedLowTempF-1a[nan  0.]
basBleedLowTempF-2b[nan  0.]
basBleedOverPressF-1a[nan  0.]
basBleedOverPressF-2b[nan  0.]
basBleedOverTempF-1a[nan  0.]
basBleedOverTempF-2b[nan  0.]
bleedHprsovCmdStatus-1a[ 0. nan  1.]
bleedHprsovCmdStatus-1b[ 0. nan  1.]
bleedHprsovCmdStatus-2a[ 0. nan  1.]
bleedHprsovCmdStatus-2b[ 0. nan  1.]
bleedHprsovOpPosStatus-1a[ 0. nan  1.]
bleedHprsovOpPosStatus-1b[ 0. nan  1.]
bleedHprsovOpPosStatus-2a[ 0. nan  1.]
bleedHprsovOpPosStatus-2b[ 0. nan  1.]
bleedOnStatus-1a[ 0. nan  1.]
bleedOnStatus-1b[ 0. nan  1.]
bleedOnStatus-2b[ 0. nan  1.]
bleedOverpressCas-2a[nan  0.]
bleedOverpressCas-2b[nan  0.]
bleedPrsovClPosStatus-1a[ 1

In [12]:
print(len(shortColumns))

41
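The low-cardinality scan above can also be written without an explicit loop. This is a sketch on a toy frame (an assumption, standing in for `df`), using `DataFrame.nunique(dropna=False)` so that NaN counts as a value, as it does with `unique()`. The threshold of 10 is the one used in the cell above; `max_unique` and `short_columns` are illustrative names.

```python
import numpy as np
import pandas as pd

# Toy stand-in for df (assumption): two sparse status columns and one continuous signal.
toy = pd.DataFrame({
    "flag": [np.nan, 0.0, np.nan, 1.0] * 3,
    "const": [np.nan, 0.0] * 6,
    "signal": np.linspace(0.0, 1.0, 12),
})

max_unique = 10  # same cut-off as the loop above
# Distinct values per column, counting NaN as one of them.
n_unique = toy.nunique(dropna=False)
short_columns = n_unique[n_unique < max_unique].index.tolist()
print(n_unique)
print(short_columns)
```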


In [13]:
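# List the row indices at which each value of each low-cardinality column occurs.
# Note: NaN never compares equal to itself, so the NaN entries below come back empty.
for i in shortColumns:
    values = df[i].unique()
    for v in values:
        indices = df[df[i] == v].index
        print(i,v,indices)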

amscChBasHealthStatus-1b nan Index([], dtype='int64')
amscChBasHealthStatus-1b 0.0 Index([     1,      5,      9,     13,     17,     21,     25,     29,     33,
           37,
       ...
       182843, 182847, 182851, 182855, 182859, 182863, 182867, 182871, 182875,
       182879],
      dtype='int64', length=2487481)
amscChBasHealthStatus-1a nan Index([], dtype='int64')
amscChBasHealthStatus-1a 0.0 Index([     1,      5,      9,     13,     17,     21,     25,     29,     33,
           37,
       ...
       182843, 182847, 182851, 182855, 182859, 182863, 182867, 182871, 182875,
       182879],
      dtype='int64', length=2487481)
amscChBasHealthStatus-2b nan Index([], dtype='int64')
amscChBasHealthStatus-2b 0.0 Index([     1,      5,      9,     13,     17,     21,     25,     29,     33,
           37,
       ...
       182843, 182847, 182851, 182855, 182859, 182863, 182867, 182871, 182875,
       182879],
      dtype='int64', length=2487428)
amscChBasHealthStatus-2b 2.0 Index([   1
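The empty `Index([])` reported for `nan` above follows from `NaN == NaN` being false, not from an absence of missing values. A small sketch of a lookup that treats NaN as matchable, assuming a hypothetical helper `indices_of` and a toy series:

```python
import numpy as np
import pandas as pd

def indices_of(series, value):
    """Index labels where `series` equals `value`, treating NaN as matchable."""
    mask = series.isna() if pd.isna(value) else series.eq(value)
    return series.index[mask].tolist()

# Toy series (assumption) with the same kind of sparsity as the status columns.
s = pd.Series([np.nan, 0.0, np.nan, 2.0])
for v in s.unique():
    print(v, indices_of(s, v))
```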

In [14]:
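# Forward fill all columns (DataFrame.ffill replaces the deprecated fillna(method='ffill')).
# Note: the fill runs over the concatenated frame, so it also crosses flight boundaries.
df = df.ffill()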

In [15]:
df

Unnamed: 0,recording_time,amscChBasHealthStatus-1b,amscChBasHealthStatus-1a,amscChBasHealthStatus-2b,amscHprsovDrivF-1a,amscHprsovDrivF-1b,amscHprsovDrivF-2b,amscPrsovDrivF-1a,amscPrsovDrivF-1b,amscPrsovDrivF-2b,...,bleedPrsovTmCmd-1a,bleedPrsovTmCmd-1b,bleedPrsovTmCmd-2a,bleedPrsovTmCmd-2b,bleedSingleOperation-1a,bleedSingleOperation-2b,phaseOfFlight-1,sfyBasFaultWord1Bit13-1a,sfyBasFaultWord1Bit13-2b,voo
0,0,,,,,,,,,,...,,,,,0.0,0.0,0.0,,,TCRF_ARCHIVE_06120018_20220523142237.parquet
1,50,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220523142237.parquet
2,100,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220523142237.parquet
3,150,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220523142237.parquet
4,200,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220523142237.parquet
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
182876,9143800,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220603161315.parquet
182877,9143850,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220603161315.parquet
182878,9143900,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220603161315.parquet
182879,9143950,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220603161315.parquet


In [16]:
# Change the NaN in the first row of all columns to 0
df.iloc[0] = df.iloc[0].fillna(0)
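Because the frame holds many flights stacked on top of each other, a plain forward fill carries the last values of one flight into the leading gaps of the next, and only the very first row of the whole frame is zero-filled. One possible variant, shown here as a sketch on a toy frame and not as the treatment applied in this notebook, fills within each `voo` group and then zeroes whatever is still missing at the start of each flight. It assumes the row index is unique (for example after `reset_index(drop=True)`); `value_cols` is an illustrative name.

```python
import numpy as np
import pandas as pd

# Toy stand-in for df (assumption): two flights, one sparse status column.
toy = pd.DataFrame({
    "voo": ["f1", "f1", "f1", "f2", "f2", "f2"],
    "status": [np.nan, 1.0, np.nan, np.nan, np.nan, 0.0],
})

value_cols = [c for c in toy.columns if c != "voo"]
# Forward fill inside each flight only; nothing leaks from f1 into f2.
toy[value_cols] = toy.groupby("voo")[value_cols].ffill()
# Leading gaps of every flight (not just the first row of the frame) become 0.
toy[value_cols] = toy[value_cols].fillna(0)
print(toy)
```

With a global forward fill, the first two rows of `f2` would have inherited `1.0` from `f1`; here they become `0.0`.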

In [17]:
shortColumns = []
for i in df.columns:
    values = df[i].unique()
    if len(values) <10:
        shortColumns.append(i)
        print(i + str(values))

amscChBasHealthStatus-1b[0.]
amscChBasHealthStatus-1a[0.]
amscChBasHealthStatus-2b[0. 2. 3.]
amscHprsovDrivF-1a[0.]
amscHprsovDrivF-1b[0.]
amscHprsovDrivF-2b[0. 1.]
amscPrsovDrivF-1a[0.]
amscPrsovDrivF-1b[0.]
amscPrsovDrivF-2b[0.]
basBleedLowPressF-1a[0.]
basBleedLowPressF-2b[0.]
basBleedLowTempF-1a[0.]
basBleedLowTempF-2b[0.]
basBleedOverPressF-1a[0.]
basBleedOverPressF-2b[0.]
basBleedOverTempF-1a[0.]
basBleedOverTempF-2b[0.]
bleedHprsovCmdStatus-1a[0. 1.]
bleedHprsovCmdStatus-1b[0. 1.]
bleedHprsovCmdStatus-2a[0. 1.]
bleedHprsovCmdStatus-2b[0. 1.]
bleedHprsovOpPosStatus-1a[0. 1.]
bleedHprsovOpPosStatus-1b[0. 1.]
bleedHprsovOpPosStatus-2a[0. 1.]
bleedHprsovOpPosStatus-2b[0. 1.]
bleedOnStatus-1a[0. 1.]
bleedOnStatus-1b[0. 1.]
bleedOnStatus-2b[0. 1.]
bleedOverpressCas-2a[0.]
bleedOverpressCas-2b[0.]
bleedPrsovClPosStatus-1a[1. 0.]
bleedPrsovClPosStatus-2a[1. 0.]
bleedPrsovOpPosStatus-1a[0. 1.]
bleedPrsovOpPosStatus-1b[0. 1.]
bleedPrsovOpPosStatus-2a[0. 1.]
bleedPrsovOpPosStatus-2b[0. 1.]

In [18]:
# Proportion of rows taken by each value, per column
for i in shortColumns:
    print(df[i].value_counts(normalize=True))



amscChBasHealthStatus-1b
0.0    1.0
Name: proportion, dtype: float64
amscChBasHealthStatus-1a
0.0    1.0
Name: proportion, dtype: float64
amscChBasHealthStatus-2b
0.0    0.999979
2.0    0.000017
3.0    0.000004
Name: proportion, dtype: float64
amscHprsovDrivF-1a
0.0    1.0
Name: proportion, dtype: float64
amscHprsovDrivF-1b
0.0    1.0
Name: proportion, dtype: float64
amscHprsovDrivF-2b
0.0    0.999996
1.0    0.000004
Name: proportion, dtype: float64
amscPrsovDrivF-1a
0.0    1.0
Name: proportion, dtype: float64
amscPrsovDrivF-1b
0.0    1.0
Name: proportion, dtype: float64
amscPrsovDrivF-2b
0.0    1.0
Name: proportion, dtype: float64
basBleedLowPressF-1a
0.0    1.0
Name: proportion, dtype: float64
basBleedLowPressF-2b
0.0    1.0
Name: proportion, dtype: float64
basBleedLowTempF-1a
0.0    1.0
Name: proportion, dtype: float64
basBleedLowTempF-2b
0.0    1.0
Name: proportion, dtype: float64
basBleedOverPressF-1a
0.0    1.0
Name: proportion, dtype: float64
basBleedOverPressF-2b
0.0    1.0
Nam

In [19]:
# Drop the columns in which a single value appears 100% of the time
for i in shortColumns:
    if df[i].nunique() == 1:
        df.drop(i, axis=1, inplace=True)
        print(i)

amscChBasHealthStatus-1b
amscChBasHealthStatus-1a
amscHprsovDrivF-1a
amscHprsovDrivF-1b
amscPrsovDrivF-1a
amscPrsovDrivF-1b
amscPrsovDrivF-2b
basBleedLowPressF-1a
basBleedLowPressF-2b
basBleedLowTempF-1a
basBleedLowTempF-2b
basBleedOverPressF-1a
basBleedOverPressF-2b
basBleedOverTempF-1a
basBleedOverTempF-2b
bleedOverpressCas-2a
bleedOverpressCas-2b
sfyBasFaultWord1Bit13-1a
sfyBasFaultWord1Bit13-2b
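The same constant-column removal can be done in one pass over the whole frame, since a column with a single distinct value is necessarily among the low-cardinality ones. A sketch on a toy frame (an assumption, standing in for `df`); `constant_cols` is an illustrative name:

```python
import pandas as pd

# Toy stand-in for df (assumption).
toy = pd.DataFrame({
    "always_zero": [0.0, 0.0, 0.0, 0.0],
    "flag": [0.0, 1.0, 0.0, 0.0],
    "voo": ["f1", "f1", "f2", "f2"],
})

# Columns with exactly one distinct (non-NaN) value carry no information.
constant_cols = toy.columns[toy.nunique() == 1].tolist()
toy = toy.drop(columns=constant_cols)
print(constant_cols)
print(toy.columns.tolist())
```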


In [20]:
shortColumns = []
for i in df.columns:
    values = df[i].unique()
    if len(values) <10:
        shortColumns.append(i)
        print(i + str(values))

amscChBasHealthStatus-2b[0. 2. 3.]
amscHprsovDrivF-2b[0. 1.]
bleedHprsovCmdStatus-1a[0. 1.]
bleedHprsovCmdStatus-1b[0. 1.]
bleedHprsovCmdStatus-2a[0. 1.]
bleedHprsovCmdStatus-2b[0. 1.]
bleedHprsovOpPosStatus-1a[0. 1.]
bleedHprsovOpPosStatus-1b[0. 1.]
bleedHprsovOpPosStatus-2a[0. 1.]
bleedHprsovOpPosStatus-2b[0. 1.]
bleedOnStatus-1a[0. 1.]
bleedOnStatus-1b[0. 1.]
bleedOnStatus-2b[0. 1.]
bleedPrsovClPosStatus-1a[1. 0.]
bleedPrsovClPosStatus-2a[1. 0.]
bleedPrsovOpPosStatus-1a[0. 1.]
bleedPrsovOpPosStatus-1b[0. 1.]
bleedPrsovOpPosStatus-2a[0. 1.]
bleedPrsovOpPosStatus-2b[0. 1.]
bleedSingleOperation-1a[0. 1.]
bleedSingleOperation-2b[0. 1.]
phaseOfFlight-1[0. 1. 2.]


In [41]:

for i in shortColumns:
    print(df[i].value_counts(normalize=True))

amscChBasHealthStatus-2b
0.0    0.999979
2.0    0.000017
3.0    0.000004
Name: proportion, dtype: float64
amscHprsovDrivF-2b
0.0    0.999996
1.0    0.000004
Name: proportion, dtype: float64
bleedHprsovCmdStatus-1a
0.0    0.851792
1.0    0.148208
Name: proportion, dtype: float64
bleedHprsovCmdStatus-1b
0.0    0.799408
1.0    0.200592
Name: proportion, dtype: float64
bleedHprsovCmdStatus-2a
0.0    0.860638
1.0    0.139362
Name: proportion, dtype: float64
bleedHprsovCmdStatus-2b
0.0    0.860638
1.0    0.139362
Name: proportion, dtype: float64
bleedHprsovOpPosStatus-1a
0.0    0.649764
1.0    0.350236
Name: proportion, dtype: float64
bleedHprsovOpPosStatus-1b
0.0    0.649749
1.0    0.350251
Name: proportion, dtype: float64
bleedHprsovOpPosStatus-2a
0.0    0.676166
1.0    0.323834
Name: proportion, dtype: float64
bleedHprsovOpPosStatus-2b
0.0    0.676169
1.0    0.323831
Name: proportion, dtype: float64
bleedOnStatus-1a
1.0    0.935883
0.0    0.064117
Name: proportion, dtype: float64
bleedOnS

In [42]:
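# Proportion of the most frequent value in each column
# (value_counts sorts in descending order, so iloc[0] is the largest share)
df_proportion = []

for i in shortColumns:
    value = df[i].value_counts(normalize=True)
    df_proportion.append(value.iloc[0])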

In [46]:
shortColumns

['amscChBasHealthStatus-2b',
 'amscHprsovDrivF-2b',
 'bleedHprsovCmdStatus-1a',
 'bleedHprsovCmdStatus-1b',
 'bleedHprsovCmdStatus-2a',
 'bleedHprsovCmdStatus-2b',
 'bleedHprsovOpPosStatus-1a',
 'bleedHprsovOpPosStatus-1b',
 'bleedHprsovOpPosStatus-2a',
 'bleedHprsovOpPosStatus-2b',
 'bleedOnStatus-1a',
 'bleedOnStatus-1b',
 'bleedOnStatus-2b',
 'bleedPrsovClPosStatus-1a',
 'bleedPrsovClPosStatus-2a',
 'bleedPrsovOpPosStatus-1a',
 'bleedPrsovOpPosStatus-1b',
 'bleedPrsovOpPosStatus-2a',
 'bleedPrsovOpPosStatus-2b',
 'bleedSingleOperation-1a',
 'bleedSingleOperation-2b',
 'phaseOfFlight-1']

In [43]:
df_proportion

[0.9999786966788926,
 0.9999959805054515,
 0.8517923931065668,
 0.7994078279656334,
 0.8606378937848566,
 0.8606382957343114,
 0.6497638546952721,
 0.6497485806159875,
 0.6761664070743104,
 0.6761692207204943,
 0.9358828317339095,
 0.935882630759182,
 0.9090075867959604,
 0.9358810229613627,
 0.9200406973823042,
 0.9358810229613627,
 0.9358820278349997,
 0.9090112043410541,
 0.9090108023915993,
 0.973102245892579,
 0.9730799376978345,
 0.3955110284881676]
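Keeping the column names next to these proportions makes it easier to pick out near-constant columns. The sketch below builds a named Series on a toy frame and applies a cut-off; the frame, the name `dominant_share` and the `threshold` of 0.97 are all assumptions for illustration, since the notebook does not state a threshold.

```python
import pandas as pd

# Toy stand-in for df (assumption): one near-constant flag and one balanced flag.
toy = pd.DataFrame({
    "rare_fault": [0.0] * 99 + [1.0],
    "valve_open": [0.0] * 60 + [1.0] * 40,
})

# Share of the most frequent value in each column, keyed by column name.
dominant_share = pd.Series({
    col: toy[col].value_counts(normalize=True).iloc[0] for col in toy.columns
})

threshold = 0.97  # hypothetical cut-off, not taken from the notebook
near_constant = dominant_share[dominant_share > threshold].index.tolist()
print(dominant_share)
print(near_constant)
```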

In [52]:
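# These four columns are the ones whose most frequent value covers
# more than 97% of the rows (see df_proportion).
df_teste = df.drop(['amscChBasHealthStatus-2b', 'amscHprsovDrivF-2b',
                    'bleedSingleOperation-1a', 'bleedSingleOperation-2b'], axis='columns')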

In [53]:
df_teste.columns

Index(['recording_time', 'bleedFavTmCmd-1a', 'bleedFavTmCmd-1b',
       'bleedFavTmCmd-2a', 'bleedFavTmCmd-2b', 'bleedFavTmFbk-1a',
       'bleedFavTmFbk-1b', 'bleedFavTmFbk-2b', 'bleedHprsovCmdStatus-1a',
       'bleedHprsovCmdStatus-1b', 'bleedHprsovCmdStatus-2a',
       'bleedHprsovCmdStatus-2b', 'bleedHprsovOpPosStatus-1a',
       'bleedHprsovOpPosStatus-1b', 'bleedHprsovOpPosStatus-2a',
       'bleedHprsovOpPosStatus-2b', 'bleedMonPress-1a', 'bleedMonPress-1b',
       'bleedMonPress-2a', 'bleedMonPress-2b', 'bleedOnStatus-1a',
       'bleedOnStatus-1b', 'bleedOnStatus-2b', 'bleedPrecoolDiffPress-1a',
       'bleedPrecoolDiffPress-1b', 'bleedPrecoolDiffPress-2a',
       'bleedPrecoolDiffPress-2b', 'bleedPrsovClPosStatus-1a',
       'bleedPrsovClPosStatus-2a', 'bleedPrsovFbk-1a', 'bleedPrsovFbk-1b',
       'bleedPrsovFbk-2b', 'bleedPrsovOpPosStatus-1a',
       'bleedPrsovOpPosStatus-1b', 'bleedPrsovOpPosStatus-2a',
       'bleedPrsovOpPosStatus-2b', 'bleedPrsovTmCmd-1a', 'bleedPrsov

In [55]:
df_teste

Unnamed: 0,recording_time,bleedFavTmCmd-1a,bleedFavTmCmd-1b,bleedFavTmCmd-2a,bleedFavTmCmd-2b,bleedFavTmFbk-1a,bleedFavTmFbk-1b,bleedFavTmFbk-2b,bleedHprsovCmdStatus-1a,bleedHprsovCmdStatus-1b,...,bleedPrsovOpPosStatus-1a,bleedPrsovOpPosStatus-1b,bleedPrsovOpPosStatus-2a,bleedPrsovOpPosStatus-2b,bleedPrsovTmCmd-1a,bleedPrsovTmCmd-1b,bleedPrsovTmCmd-2a,bleedPrsovTmCmd-2b,phaseOfFlight-1,voo
0,0,0.0,0.0,0.0,0.0,0.000,0.0,0.000,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220523142237.parquet
1,50,0.0,0.0,0.0,0.0,0.375,1.5,0.125,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220523142237.parquet
2,100,0.0,0.0,0.0,0.0,0.375,1.5,0.125,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220523142237.parquet
3,150,0.0,0.0,0.0,0.0,0.375,1.5,0.125,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220523142237.parquet
4,200,0.0,0.0,0.0,0.0,0.375,1.5,0.125,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220523142237.parquet
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
182876,9143800,0.0,0.0,0.0,0.0,0.250,1.5,0.125,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220603161315.parquet
182877,9143850,0.0,0.0,0.0,0.0,0.250,1.5,0.125,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220603161315.parquet
182878,9143900,0.0,0.0,0.0,0.0,0.250,1.5,0.125,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220603161315.parquet
182879,9143950,0.0,0.0,0.0,0.0,0.250,1.5,0.125,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,TCRF_ARCHIVE_06120018_20220603161315.parquet
