# The following code is responsible for converting MIDI files into text notation.

The whole code is based on the following model: https://keunwoochoi.wordpress.com/2016/02/23/lstmetallica/ 

# I found a way to use the original Python library!

The original code was developed with Python 2 and the python-midi library, which is not available for Python 3.

I found a Python 3 compatible version of the original library.


Reference here: https://github.com/jameswenzel/mydy/blob/master/src/Containers.py
https://github.com/jameswenzel/Fractal-Midi/blob/master/script.py
https://github.com/vishnubob/python-midi

## Opening MIDI file
The next cell opens and reads a dummy MIDI file that I wrote.

I'm also setting the  ```resolution ``` parameter to 480. This parameter is equivalent to **PPQ** in MIDI files.

**PPQ** (*Pulses per Quarter Note*) is a fixed value that sets the number of pulses (ticks) contained in a quarter note. It works like a "sampling" frequency.

Each quarter note, whatever the original speed of the song, contains 480 pulses. These pulses are then converted into actual playback time using the **Tempo** information of the MIDI file (a quarter note at 100 BPM lasts longer than a quarter note at 130 BPM).
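
To make the relation between ticks, PPQ and tempo concrete, here is a minimal sketch in plain Python (it does not use `mydy`). The helper names `ticks_to_seconds` and `tempo_bytes_to_bpm` are hypothetical, introduced only for illustration. The second helper decodes the three data bytes of a MIDI Set Tempo event, which store the number of microseconds per quarter note; the sample bytes are the ones that appear in the printed pattern of this notebook.

```python
def ticks_to_seconds(ticks, ppq, bpm):
    """Convert a tick count to seconds, assuming a constant tempo."""
    seconds_per_quarter = 60.0 / bpm
    return ticks / ppq * seconds_per_quarter


def tempo_bytes_to_bpm(data):
    """Decode the 3 data bytes of a Set Tempo event (microseconds per quarter note)."""
    micros_per_quarter = (data[0] << 16) | (data[1] << 8) | data[2]
    return 60_000_000 / micros_per_quarter


# One quarter note (480 ticks at PPQ = 480) at two different tempos
print(ticks_to_seconds(480, ppq=480, bpm=100))
print(ticks_to_seconds(480, ppq=480, bpm=130))

# Tempo bytes taken from the SetTempoEvent in the printed pattern
print(round(tempo_bytes_to_bpm([10, 197, 90]), 2))
```

The tick count of a quarter note is the same in both calls; only the duration in seconds changes with the tempo.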


In [28]:
from mydy import Events, FileIO, Containers, Constants
test=FileIO.read_midifile('bellofigo forse meglio.mid') # returns a Pattern with the MIDI file information (resolution etc.), see https://github.com/jameswenzel/mydy/blob/master/src/FileIO.py

test.resolution=480 # sets the number of ticks per quarter note, so every quarter note will have 480 ticks
print(test) # it seems that changing the BPM doesn't influence the ticks
# resolution is the same as PPQ



mydy.Pattern(format=1, resolution=480, tracks=\
[mydy.Track(relative: True\
  [mydy.SetTempoEvent(tick=0.0, data=[10, 197, 90]),
   mydy.TimeSignatureEvent(tick=0.0, data=[4, 2, 24, 8]),
   mydy.NoteOnEvent(tick=0.0, channel=0, data=[36, 100]),
   mydy.NoteOffEvent(tick=0.0, channel=0, data=[36, 64]),
   mydy.NoteOnEvent(tick=0.0, channel=0, data=[42, 84]),
   mydy.NoteOffEvent(tick=30.00091555528428, channel=0, data=[42, 64]),
   mydy.NoteOnEvent(tick=210.00640888698996, channel=0, data=[42, 72]),
   mydy.NoteOffEvent(tick=30.00091555528428, channel=0, data=[42, 64]),
   mydy.NoteOnEvent(tick=210.00640888698996, channel=0, data=[38, 100]),
   mydy.NoteOffEvent(tick=0.0, channel=0, data=[38, 64]),
   mydy.NoteOnEvent(tick=120.00366222113712, channel=0, data=[36, 95]),
   mydy.NoteOffEvent(tick=0.0, channel=0, data=[36, 64]),
   mydy.NoteOnEvent(tick=0.0, channel=0, data=[42, 74]),
   mydy.NoteOffEvent(tick=30.00091555528428, channel=0, data=[42, 64]),
   mydy.NoteOnEvent(tick=10.005188

## Reading notes

The following cell accesses the loaded MIDI file and reads the  ```Track```. Using the  ```mydy``` library, MIDI files are structured in the following way:

 ```Pattern -> Track -> MIDI_Events ```

This means that whenever I open a MIDI file I get a  ```Pattern ```, which contains one or more  ```Track ``` objects, each of which contains ```MIDI_Events ```.

This structure is used because a single MIDI file can contain multiple instruments (guitar, drums, bass etc.), so each  ```Track ``` corresponds to an instrument, and the  ```MIDI_Events ``` are the notes played by that instrument.

In our case we can assume we are working with single-```Track ``` MIDI files (we're interested only in drums).

**NB**: I'm using  ```track.make_ticks_abs()``` to convert the time (expressed in ticks) from a relative value into an absolute one.<br>The standard MIDI representation stores each event with the time elapsed since the previous event. With this command each note is instead represented with the time elapsed since the beginning of the song.
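
The relative-to-absolute conversion is just a running sum of the delta times. The following sketch shows the idea in plain Python on a short list of made-up delta ticks (rounded versions of the first values in the printed pattern). It only illustrates the arithmetic and is not how `mydy` implements `make_ticks_abs()`.

```python
from itertools import accumulate

# Delta times: each value is the number of ticks elapsed since the previous event
relative_ticks = [0, 0, 0, 30, 210, 30, 210]

# Absolute times: each value is the number of ticks elapsed since the start of the track
absolute_ticks = list(accumulate(relative_ticks))

print(absolute_ticks)  # [0, 0, 0, 30, 240, 270, 480]
```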

In [51]:

track = test[1] # selecting the track to process (index 1 here; in a single-track file it would be at index 0)
track_abs = track.make_ticks_abs() # converting time from a relative to an absolute measure


filtered_list=track_abs.filter(lambda e: isinstance(e, Events.NoteOnEvent)) # selects only NoteOn events, discarding the NoteOff ones

for element in filtered_list:
    
    print(element.tick)

#Just a bunch of test printings 
#print(type(filtered_list))
#print(filtered_list)
#print(len(filtered_list))
#print(filtered_list[2].data[1])
#print(test.resolution)


0.0
600.0
720.0
1200.0
1560.0
1920.0
2520.0
2640.0
3120.0
3480.0
3840.0
4440.0
4560.0
5040.0
5400.0
5760.0
6360.0
6480.0
6960.0
7320.0


# Automatic MIDI extraction from multiple files

In [48]:
#function to round the length up to the next bar https://stackoverflow.com/questions/3407012/c-rounding-up-to-the-nearest-multiple-of-a-number
def roundup(numToRound, multiple):
    if multiple == 0:
        return numToRound
    remainder = numToRound % multiple
    if remainder==0:
        return numToRound
    return numToRound + multiple - remainder
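
One detail worth checking when this function is used to compute the start offset of the next file: `roundup` returns its input unchanged when the input is already a multiple. If the last note-on of a loop falls exactly on a bar line, the next loop would start on that same tick instead of one bar later. Whether this matters for the dataset used here is an assumption. The sketch below compares `roundup` with a hypothetical alternative, `next_bar_start`, that always moves to the first bar line strictly after the given tick.

```python
def roundup(num_to_round, multiple):
    """Round up to the nearest multiple (same logic as the function above)."""
    if multiple == 0:
        return num_to_round
    remainder = num_to_round % multiple
    if remainder == 0:
        return num_to_round
    return num_to_round + multiple - remainder


def next_bar_start(last_tick, bar_ticks):
    """Hypothetical helper: first bar boundary strictly after last_tick."""
    return (int(last_tick) // bar_ticks + 1) * bar_ticks


bar = 480 * 4  # ticks in one 4/4 bar at PPQ = 480

# Last note inside the first bar: both give the start of the second bar
print(roundup(1680, bar), next_bar_start(1680, bar))  # 1920 1920

# Last note exactly on a bar line: the two helpers disagree
print(roundup(1920, bar), next_bar_start(1920, bar))  # 1920 3840
```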

    

In [49]:
import os
# TODO: fix the problem with the ticks. Right now I'm just adding the last tick to the next track, which makes the new track start right from there
# instead of waiting for the new bar. It has to be quantised so that it starts from the new bar.
pattern = Containers.Pattern(fmt=0)
max_time = 0
approx_bar = 480*4
pattern.resolution= 480
for filename in os.listdir("Dataset"):
    print(os.path.join("Dataset",filename))
    path= os.path.join("Dataset",filename)
    
    test=FileIO.read_midifile(path)
    test.resolution = 480
    test=test[0].make_ticks_abs()
    test=test.filter(lambda e: isinstance(e, Events.NoteOnEvent)) # selects only NoteOn events, discarding the NoteOff ones
    print("printo la singola track")
    print(test)
    for note in test: # probable bug here: with the current data the maximum tick value should be 61320
        note.tick = note.tick + max_time
    print('max time before')
    print(max_time)
    max_time = test[-1].tick
    #insert here round up on max_time
    max_time=roundup(max_time, approx_bar)
    print('max time after')
    print(max_time)
    
    pattern.append(test)
    
print(pattern)



Dataset\loop 1.mid
printo la singola track
mydy.Track(relative: False\
  [mydy.NoteOnEvent(tick=0.0, channel=0, data=[36, 100]),
   mydy.NoteOnEvent(tick=0.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=240.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=480.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=720.0, channel=0, data=[36, 100]),
   mydy.NoteOnEvent(tick=720.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=750.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=780.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=810.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=840.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=960.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=1200.0, channel=0, data=[38, 100]),
   mydy.NoteOnEvent(tick=1200.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=1440.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=1680.0, channel=0, data=[36, 100]),
   mydy.NoteOnEvent(tick=1680

In [50]:
filtered_list = []
for track in pattern:
    print("printo una track")
    print(track)
    filtered_list.append(track)
    

printo una track
mydy.Track(relative: False\
  [mydy.NoteOnEvent(tick=0.0, channel=0, data=[36, 100]),
   mydy.NoteOnEvent(tick=0.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=240.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=480.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=720.0, channel=0, data=[36, 100]),
   mydy.NoteOnEvent(tick=720.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=750.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=780.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=810.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=840.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=960.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=1200.0, channel=0, data=[38, 100]),
   mydy.NoteOnEvent(tick=1200.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=1440.0, channel=0, data=[42, 100]),
   mydy.NoteOnEvent(tick=1680.0, channel=0, data=[36, 100]),
   mydy.NoteOnEvent(tick=1680.0, channel=0, data=[42, 1

In [51]:
for element in filtered_list:
    for note in element:
        print(note.tick)

0.0
0.0
240.0
480.0
720.0
720.0
750.0
780.0
810.0
840.0
960.0
1200.0
1200.0
1440.0
1680.0
1680.0
1920.0
1920.0
2040.0
2160.0
2160.0
2400.0
2400.0
2640.0
2880.0
3120.0
3120.0
3360.0
3600.0
3600.0
3630.0
3660.0
3690.0
3720.0
3840.0
3840.0
4080.0
4320.0
4560.0
4560.0
4590.0
4620.0
4650.0
4680.0
4800.0
5040.0
5040.0
5280.0
5520.0
5520.0
5760.0
5760.0
5880.0
6000.0
6000.0
6240.0
6240.0
6480.0
6600.0
6720.0
6960.0
6960.0
7200.0
7440.0
7440.0
7470.0
7500.0
7530.0
7560.0
7680.0
7680.0
7920.0
8160.0
8400.0
8400.0
8430.0
8460.0
8490.0
8520.0
8640.0
8880.0
8880.0
9120.0
9360.0
9360.0
9600.0
9600.0
9720.0
9840.0
9840.0
10080.0
10080.0
10320.0
10560.0
10800.0
10800.0
11040.0
11280.0
11280.0
11310.0
11340.0
11370.0
11400.0
11520.0
11520.0
11760.0
12000.0
12240.0
12240.0
12270.0
12300.0
12330.0
12360.0
12480.0
12720.0
12720.0
12960.0
13200.0
13200.0
13440.0
13440.0
13560.0
13680.0
13680.0
13920.0
13920.0
14160.0
14280.0
14400.0
14640.0
14640.0
14880.0
15120.0
15120.0
15150.0
15180.0
15210.0
15240.0
1

## Creating couples (Pitch, time)
Now I'm processing the previous raw data (as you can see from the printouts, it's quite a mess) in order to obtain couples of ```(pitch, time)``` for each note.


In [52]:
notes = []
for element in filtered_list:
    for note in element:
        # test printings
        print(note.tick)
        print(note.data[0])

        notes.append((note.data[0], note.tick)) # storing couples of (note_pitch, tick_time)
        



0.0
36
0.0
42
240.0
42
480.0
42
720.0
36
720.0
42
750.0
42
780.0
42
810.0
42
840.0
42
960.0
42
1200.0
38
1200.0
42
1440.0
42
1680.0
36
1680.0
42
1920.0
36
1920.0
42
2040.0
42
2160.0
38
2160.0
42
2400.0
36
2400.0
42
2640.0
42
2880.0
42
3120.0
38
3120.0
42
3360.0
42
3600.0
36
3600.0
42
3630.0
42
3660.0
42
3690.0
42
3720.0
42
3840.0
36
3840.0
42
4080.0
42
4320.0
42
4560.0
36
4560.0
42
4590.0
42
4620.0
42
4650.0
42
4680.0
42
4800.0
42
5040.0
38
5040.0
42
5280.0
42
5520.0
36
5520.0
42
5760.0
36
5760.0
42
5880.0
42
6000.0
38
6000.0
42
6240.0
36
6240.0
42
6480.0
42
6600.0
42
6720.0
42
6960.0
38
6960.0
42
7200.0
42
7440.0
36
7440.0
42
7470.0
42
7500.0
42
7530.0
42
7560.0
42
7680.0
36
7680.0
42
7920.0
42
8160.0
42
8400.0
36
8400.0
42
8430.0
42
8460.0
42
8490.0
42
8520.0
42
8640.0
42
8880.0
38
8880.0
42
9120.0
42
9360.0
36
9360.0
42
9600.0
36
9600.0
42
9720.0
42
9840.0
38
9840.0
42
10080.0
36
10080.0
42
10320.0
42
10560.0
42
10800.0
38
10800.0
42
11040.0
42
11280.0
36
11280.0
42
11310.0
42
11340

86400.0
42
86640.0
38
86640.0
42
86880.0
42
87120.0
36
87120.0
42
87360.0
36
87360.0
42
87600.0
38
87600.0
42
87840.0
38
87840.0
42
88080.0
42
88110.0
42
88140.0
42
88170.0
42
88200.0
42
88230.0
42
88260.0
42
88290.0
42
88320.0
36
88320.0
42
88560.0
42
88800.0
42
89040.0
38
89040.0
42
89280.0
42
89520.0
36
89520.0
42
89760.0
36
89760.0
42
90000.0
42
90240.0
42
90360.0
42
90480.0
38
90480.0
42
90720.0
42
90960.0
36
90960.0
42
91200.0
36
91200.0
42
91440.0
38
91440.0
42
91680.0
38
91680.0
42
91920.0
42
92160.0
36
92160.0
42
92400.0
42
92520.0
38
92640.0
42
92760.0
36
92880.0
36
92880.0
42
93120.0
42
93360.0
36
93360.0
42
93600.0
38
93600.0
42
93720.0
36
93840.0
42
94080.0
36
94080.0
42
94320.0
42
94440.0
38
94560.0
42
94680.0
36
94800.0
36
94800.0
42
95040.0
42
95280.0
36
95280.0
42
95400.0
38
95520.0
42
95640.0
36
95760.0
42
95790.0
42
95820.0
42
95850.0
42
95880.0
42
95910.0
42
95940.0
42
95970.0
42
96000.0
36
96000.0
42
96240.0
42
96360.0
38
96480.0
42
96600.0
36
96720.0
36
96720.0
42

In [52]:
notes = []
couple = ()

for element in filtered_list:
    
    ### test printings
    print(element.tick)
    print(element.data[0])
    ###
    
    notes.append(tuple((element.data[0],element.tick))) #storing couples of (note_pitch,tick_time)


print('printing tuples')
print(notes)


0.0
60
600.0
60
720.0
60
1200.0
60
1560.0
60
1920.0
60
2520.0
60
2640.0
60
3120.0
60
3480.0
60
3840.0
60
4440.0
60
4560.0
60
5040.0
60
5400.0
60
5760.0
60
6360.0
60
6480.0
60
6960.0
60
7320.0
60
printing tuples
[(60, 0.0), (60, 600.0), (60, 720.0), (60, 1200.0), (60, 1560.0), (60, 1920.0), (60, 2520.0), (60, 2640.0), (60, 3120.0), (60, 3480.0), (60, 3840.0), (60, 4440.0), (60, 4560.0), (60, 5040.0), (60, 5400.0), (60, 5760.0), (60, 6360.0), (60, 6480.0), (60, 6960.0), (60, 7320.0)]


In [7]:
#RANDOM TEST ON THE LIBRARY, USELESS
test= Constants.C_3
print(test)

36


## Trouble and make it double
Now we're getting to the tough part.

The following cell is a *utility* that converts our MIDI into a text file (which we will use to train our network; see the reference model here: https://keunwoochoi.wordpress.com/2016/02/23/lstmetallica/)

It creates 2 classes: ```Note``` and ```Note_List```.

**1)** *Note*:    Creates a simple ```Note``` object composed of ```pitch```, ```c_tick``` and ```idx```. ```pitch``` and ```c_tick``` are the already mentioned pitch and time, while ```idx``` is the index of the note on the quantisation grid, counted from 0. <br>**For example**: on a 16th-note grid, if I have 2 whole notes, the second note will have ```idx=16```. With the settings used below (```event_per_bar = 32```) the grid is made of 32nd notes, so the same note would have ```idx=32```.

**2)** *Note_List*:  Creates an empty list to which we will add our *Note* objects. It also contains support methods used to create and manage this list.

**List of methods**: I'll add some comments to the code to make them more readable.
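
As a quick illustration of how ```idx``` relates to ticks, the sketch below applies the same rounding rule used by the ```quantise``` method in the next cell to a few tick values taken from the printouts above. The function names `quantise_tick` and `grid_index` are hypothetical and exist only in this example.

```python
PPQ = 480
EVENTS_PER_BAR = 32
STEP = PPQ / (EVENTS_PER_BAR / 4)  # 60 ticks per grid step (a 32nd note)


def quantise_tick(tick, step):
    """Snap a tick to the nearest multiple of step (ties are rounded up)."""
    return ((tick + step / 2) // step) * step


def grid_index(tick, step):
    """Index of the quantised tick on the grid, counted from 0."""
    return int(quantise_tick(tick, step) // step)


for tick in [720.0, 750.0, 780.0, 810.0, 840.0, 1920.0]:
    print(tick, quantise_tick(tick, STEP), grid_index(tick, STEP))
```

A tick of 750 lies halfway between two grid steps and is moved to 780, while 1920 (the start of the second bar) gets index 32.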


In [53]:
from mydy import Events, FileIO, Containers, Constants
import pdb

# Utilities for converting MIDI notes into text. The network is trained on a fixed text structure
# (0b000000000 for no note, 0b100000000 for kick, etc.), so the MIDI notes have to be converted into this format.

# The original script used for the MIDI-to-text translation has been lost and must be re-implemented
PPQ = 480 # Pulses per quarter note. Used in sequencers. Standard value
event_per_bar = 32 # to quantise.
min_ppq = PPQ / (event_per_bar/4)

# ignore: 39 hand clap, 54 tambourine, 56 Cowbell, 58 Vibraslap, 60-81

# The dictionary below maps similar drum sounds onto a single pitch, reducing the number of notes used.
# For example, an electric snare and a side stick are both mapped to a standard snare

drum_conversion = {35:36, # acoustic bass drum -> bass drum (36)
                    37:38, 40:38, # 37:side stick, 38: acou snare, 40: electric snare
                    43:41, # 41 low floor tom, 43 high floor tom
                    47:45, # 45 low tom, 47 low-mid tom
                    50:48, # 50 high tom, 48 hi mid tom
                    44:42, # 42 closed HH, 44 pedal HH
                    57:49, # 57 Crash 2, 49 Crash 1
                    59:51, 53:51, 55:51, # 59 Ride 2, 51 Ride 1, 53 Ride bell, 55 Splash
                    52:49 # 52: China cymbal
                    }

# Pitches kept in the encoding; every note whose pitch is not in this list is discarded.
# This ignores notes that are not in my dataset (for example shakers etc.)
                # k, sn,cHH,oHH,LFtom,ltm,htm,Rde,Crash
allowed_pitch = [36, 38, 42, 46, 41, 45, 48, 51, 49] # 46: open HH
cymbals_pitch = [] # crash (49) and ride (51) would go here; left empty because the "lazy-off" handling of cymbals is commented out
# pitch_to_midipitch = {36:midi.C_2, # kick # for general MIDI Drum map
# 						38:midi.D_2, # Snare
# 						39:midi.Eb_2, # hand clap (it's alive by mistake..)
# 						41:midi.F_2, # Low floor tom
# 						42:midi.Gb_2, # Close HH
# 						45:midi.A_2, # Low tom
# 						46:midi.Bb_2, # Open HH
# 						48:midi.C_3,  # Hi Mid Tom
# 						49:midi.Db_3, # Crash
# 						51:midi.Eb_3 # Ride
# 						}

#mapping midi values into notes
pitch_to_midipitch = {36:Constants.C_3,  # for logic 'SoCal' drum mapping
                        38:Constants.D_3, 
                        39:Constants.Eb_3,
                        41:Constants.F_3,
                        42:Constants.Gb_3,
                        45:Constants.A_3,
                        46:Constants.Bb_3,
                        48:Constants.C_4,
                        49:Constants.Db_4,
                        51:Constants.Eb_4
                        }
# a single note is made of a pitch (numeric MIDI pitch) and a tick (the MIDI way of keeping time)
class Note:
    def __init__(self, pitch, c_tick):
        self.pitch = pitch
        self.c_tick = c_tick # cumulated_tick of a midi note

    def add_index(self, idx):
        '''index --> 16-th note-based index starts from 0'''
        self.idx = idx

class Note_List():
    def __init__(self):
        ''''''
        self.notes = []
        self.quantised = False
        self.max_idx = None

    def add_note(self, note):
        '''note: instance of Note class'''
        self.notes.append(note)

    def quantise(self, minimum_ppq):
        '''
        e.g. if minimum_ppq=120, quantise by 16-th note.
        
        '''
        if not self.quantised:
            for note in self.notes:
                note.c_tick = ((note.c_tick+minimum_ppq/2)//minimum_ppq)* minimum_ppq # quantise
                # Here the index is calculated. The index is an absolute position on the quantisation grid.
                # For example, an index of 34 means that the note appears 34 grid steps after the start.
                # It is computed by dividing the cumulated tick of the note by the number of ticks in one grid step
                note.add_index(note.c_tick/minimum_ppq)
            # NB: the quantisation iterates over all the notes, so first all the notes are added, then they are quantised

            # After the loop, `note` still refers to the last element of the iteration,
            # so max_idx is the index of the last added note
            self.max_idx = note.idx

            # Here it checks whether the ending is a full musical bar. For example, if the file ends with a single kick, that note is added,
            # but the kick will (probably) be at the beginning of the last musical bar, so it has to be padded until the end.
            # It's like adding a rest to the piece, so that all bars are complete and there are no truncated ones at the end
            if (self.max_idx + 1) % event_per_bar != 0:
                self.max_idx += event_per_bar - ((self.max_idx + 1) % event_per_bar) # make sure it has a FULL bar at the end.
            self.quantised = True

        return

    def simplify_drums(self):
        ''' use only allowed pitch - and converted not allowed pitch to the similar in a sense of drums!
        '''
        #Here forces conversion into the pitches in drum_conversion
        for note in self.notes:
            if note.pitch in drum_conversion: # ignore those not included in the key
                note.pitch = drum_conversion[note.pitch]
        #https://stackoverflow.com/questions/30670310/what-do-brackets-in-a-for-loop-in-python-mean
        #The following one is a list comprehension. Basically generates a new list from an existing one using a given condition on the elements
        self.notes = [note for note in self.notes if note.pitch in allowed_pitch]	

        return

    def return_as_text(self):
        ''''''
        length = int(self.max_idx + 1) # of events in the track.
        #print(type(length))
        event_track = []
        # The following loop creates an N by 9 matrix by appending N times a vector of nine zeros.
        # That is, N grid steps are created and each is initialised to all zeros (9 zeros, one per allowed pitch)

        for note_idx in range(length):  # range replaces the Python 2 xrange
            event_track.append(['0']*len(allowed_pitch))

        num_bars = length/event_per_bar# + ceil(len(event_texts_temp) % _event_per_bar)

        for note in self.notes:
            pitch_here = note.pitch
            # The following line returns the position of the given pitch inside allowed_pitch,
            # i.e. it maps a MIDI pitch onto its bit position in my reduced vocabulary
            # of notes
            note_add_pitch_index = allowed_pitch.index(pitch_here) # 0-8
            #print(type(note.idx))  
            #print(type(note_add_pitch_index))
            event_track[int(note.idx)][note_add_pitch_index] = '1'
            # print note.idx, note.c_tick, note_add_pitch_index, ''.join(event_track[note.idx])
            # pdb.set_trace()

        event_text_temp = ['0b'+''.join(e) for e in event_track] # encoding to binary

        event_text = []
        # event_text.append('SONG_BEGIN')
        # event_text.append('BAR')
        print(num_bars)
        print(type(num_bars))        
        for bar_idx in range(int(num_bars)):
            event_from = bar_idx * event_per_bar
            event_to = event_from + event_per_bar
            event_text = event_text + event_text_temp[event_from:event_to]
            event_text.append('BAR')

        # event_text.append('SONG_END')

        return ' '.join(event_text)
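
To see the text encoding in isolation, here is a compact sketch of what ```return_as_text``` produces, written as a standalone function that does not depend on the classes above. The function name `encode_bars` and its input format (a list of `(pitch, idx)` pairs, already reduced to the allowed pitches) are assumptions made for this example. It pads the grid to a whole number of bars and emits one 9-bit word per grid step, with a `BAR` token after every 32 steps.

```python
ALLOWED_PITCH = [36, 38, 42, 46, 41, 45, 48, 51, 49]
EVENTS_PER_BAR = 32


def encode_bars(notes):
    """Encode (pitch, idx) pairs as space-separated 9-bit words with BAR markers."""
    notes = list(notes)
    if not notes:
        return ''
    max_idx = max(idx for _, idx in notes)
    # Pad the length up to a whole number of bars
    length = (max_idx // EVENTS_PER_BAR + 1) * EVENTS_PER_BAR
    grid = [['0'] * len(ALLOWED_PITCH) for _ in range(length)]
    for pitch, idx in notes:
        # Each allowed pitch owns one bit position inside the word
        grid[idx][ALLOWED_PITCH.index(pitch)] = '1'
    words = ['0b' + ''.join(step) for step in grid]
    out = []
    for start in range(0, length, EVENTS_PER_BAR):
        out.extend(words[start:start + EVENTS_PER_BAR])
        out.append('BAR')
    return ' '.join(out)


# Kick + closed hi-hat on step 0, hi-hat on step 4, snare on step 20
text = encode_bars([(36, 0), (42, 0), (42, 4), (38, 20)])
tokens = text.split(' ')
print(tokens[0], tokens[4], tokens[20], tokens[-1])
```

A pitch that is not in `ALLOWED_PITCH` raises a `ValueError` here, which is why ```simplify_drums``` has to run before the conversion to text.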


## Creating Note_List
As simple as that: I'm converting my starting list of ```(pitch,time)``` couples into ```Note```  objects.

Why? Because you should recycle code too, not only plastic. LUL



In [54]:

##NB: add the idx to the single notes!
note_list = Note_List()
for note in notes:
    pitch = note[0]
    tick = note[1]
    idx = int(tick / min_ppq)
    new_note = Note(pitch,tick)
    new_note.add_index(idx)
    note_list.add_note(new_note)


In [55]:
for note in note_list.notes:
    print(note.idx)

0
0
4
8
12
12
12
13
13
14
16
20
20
24
28
28
32
32
34
36
36
40
40
44
48
52
52
56
60
60
60
61
61
62
64
64
68
72
76
76
76
77
77
78
80
84
84
88
92
92
96
96
98
100
100
104
104
108
110
112
116
116
120
124
124
124
125
125
126
128
128
132
136
140
140
140
141
141
142
144
148
148
152
156
156
160
160
162
164
164
168
168
172
176
180
180
184
188
188
188
189
189
190
192
192
196
200
204
204
204
205
205
206
208
212
212
216
220
220
224
224
226
228
228
232
232
236
238
240
244
244
248
252
252
252
253
253
254
256
256
258
260
262
262
264
266
268
268
269
270
272
274
276
276
277
278
280
282
284
284
285
286
288
288
290
292
294
294
296
298
300
300
301
302
304
306
308
308
309
310
312
313
314
316
316
317
318
320
320
322
324
326
326
328
330
332
332
333
334
336
338
340
340
341
342
344
346
348
348
349
350
352
352
354
356
358
358
360
362
364
364
365
366
368
370
372
372
373
374
376
377
378
380
380
381
382
384
384
386
388
390
390
392
394
396
396
397
398
400
402
404
404
405
406
408
410
412
412
413
414
416
416
418
420
4

In [56]:

print('printing ticks before quantization')
for note in note_list.notes:
    print(note.c_tick)
    
note_list.quantise(min_ppq)

print('printing ticks after quantization')
for note in note_list.notes:
    print(note.c_tick)

printing ticks before quantization
0.0
0.0
240.0
480.0
720.0
720.0
750.0
780.0
810.0
840.0
960.0
1200.0
1200.0
1440.0
1680.0
1680.0
1920.0
1920.0
2040.0
2160.0
2160.0
2400.0
2400.0
2640.0
2880.0
3120.0
3120.0
3360.0
3600.0
3600.0
3630.0
3660.0
3690.0
3720.0
3840.0
3840.0
4080.0
4320.0
4560.0
4560.0
4590.0
4620.0
4650.0
4680.0
4800.0
5040.0
5040.0
5280.0
5520.0
5520.0
5760.0
5760.0
5880.0
6000.0
6000.0
6240.0
6240.0
6480.0
6600.0
6720.0
6960.0
6960.0
7200.0
7440.0
7440.0
7470.0
7500.0
7530.0
7560.0
7680.0
7680.0
7920.0
8160.0
8400.0
8400.0
8430.0
8460.0
8490.0
8520.0
8640.0
8880.0
8880.0
9120.0
9360.0
9360.0
9600.0
9600.0
9720.0
9840.0
9840.0
10080.0
10080.0
10320.0
10560.0
10800.0
10800.0
11040.0
11280.0
11280.0
11310.0
11340.0
11370.0
11400.0
11520.0
11520.0
11760.0
12000.0
12240.0
12240.0
12270.0
12300.0
12330.0
12360.0
12480.0
12720.0
12720.0
12960.0
13200.0
13200.0
13440.0
13440.0
13560.0
13680.0
13680.0
13920.0
13920.0
14160.0
14280.0
14400.0
14640.0
14640.0
14880.0
15120.0
15120.

88320.0
88320.0
88560.0
88800.0
89040.0
89040.0
89280.0
89520.0
89520.0
89760.0
89760.0
90000.0
90240.0
90360.0
90480.0
90480.0
90720.0
90960.0
90960.0
91200.0
91200.0
91440.0
91440.0
91680.0
91680.0
91920.0
92160.0
92160.0
92400.0
92520.0
92640.0
92760.0
92880.0
92880.0
93120.0
93360.0
93360.0
93600.0
93600.0
93720.0
93840.0
94080.0
94080.0
94320.0
94440.0
94560.0
94680.0
94800.0
94800.0
95040.0
95280.0
95280.0
95400.0
95520.0
95640.0
95760.0
95820.0
95820.0
95880.0
95880.0
95940.0
95940.0
96000.0
96000.0
96000.0
96240.0
96360.0
96480.0
96600.0
96720.0
96720.0
96960.0
97200.0
97200.0
97440.0
97440.0
97560.0
97680.0
97920.0
97920.0
98040.0
98160.0
98280.0
98400.0
98520.0
98640.0
98640.0
98880.0
99120.0
99120.0
99240.0
99360.0
99480.0
99600.0
99840.0
99840.0
100080.0
100200.0
100320.0
100440.0
100560.0
100560.0
100800.0
101040.0
101040.0
101280.0
101280.0
101400.0
101520.0
101760.0
101760.0
102000.0
102120.0
102240.0
102360.0
102480.0
102480.0
102720.0
102960.0
102960.0
103080.0
103200.

In [57]:
note_list.simplify_drums()

In [58]:
txt = note_list.return_as_text()

60.0
<class 'float'>


In [59]:
print(txt)

0b101000000 0b000000000 0b000000000 0b000000000 0b001000000 0b000000000 0b000000000 0b000000000 0b001000000 0b000000000 0b000000000 0b000000000 0b101000000 0b001000000 0b001000000 0b000000000 0b001000000 0b000000000 0b000000000 0b000000000 0b011000000 0b000000000 0b000000000 0b000000000 0b001000000 0b000000000 0b000000000 0b000000000 0b101000000 0b000000000 0b000000000 0b000000000 BAR 0b101000000 0b000000000 0b001000000 0b000000000 0b011000000 0b000000000 0b000000000 0b000000000 0b101000000 0b000000000 0b000000000 0b000000000 0b001000000 0b000000000 0b000000000 0b000000000 0b001000000 0b000000000 0b000000000 0b000000000 0b011000000 0b000000000 0b000000000 0b000000000 0b001000000 0b000000000 0b000000000 0b000000000 0b101000000 0b001000000 0b001000000 0b000000000 BAR 0b101000000 0b000000000 0b000000000 0b000000000 0b001000000 0b000000000 0b000000000 0b000000000 0b001000000 0b000000000 0b000000000 0b000000000 0b101000000 0b001000000 0b001000000 0b000000000 0b001000000 0b000000000 0b000000

In [60]:
text_file = open("sample.txt", "w")
text_file.write(txt)
text_file.close()

## Testing Txt to MIDI conversion in the following cells

The following cell receives the content of a single line of the txt file, with the 'BAR' markers removed. ```encoded_drums``` is an array where each entry is a note in .txt format, so something like ```0b011010100```.

The list ```allowed_pitch``` has the following structure: ```allowed_pitch = [36, 38, 42, 46, 41, 45, 48, 51, 49]```, where each entry corresponds to a note as a MIDI number (kick, snare etc.; you can find the exact notation in the cell that defines the ```Note_List``` class).

The following function iterates over the txt notes and, for each of them, iterates over the digits of the binary number and checks whether each one is equal to one.
If so, it assigns the equivalent note value taken from ```allowed_pitch```. In other words, it does a one-to-one mapping between each digit of the binary txt note and the corresponding note in ```allowed_pitch```.
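
The mapping for a single word can be sketched as follows. The function name `decode_word` is hypothetical; it only shows the bit-to-pitch lookup described above, without creating ```Note``` objects.

```python
ALLOWED_PITCH = [36, 38, 42, 46, 41, 45, 48, 51, 49]


def decode_word(word):
    """Return the MIDI pitches whose bit is set in a word such as '0b101000000'."""
    bits = word[2:]  # drop the '0b' prefix
    return [pitch for pitch, bit in zip(ALLOWED_PITCH, bits) if bit == '1']


print(decode_word('0b101000000'))  # [36, 42] -> kick + closed hi-hat
print(decode_word('0b011000000'))  # [38, 42] -> snare + closed hi-hat
print(decode_word('0b000000000'))  # [] -> no note on this step
```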

In [61]:
# Function that converts txt to notes. Each note is represented as a number (in the MIDI scale)

# encoded_drums holds one whole line of the file (so the various 0b001011100 words)
def text_to_notes(encoded_drums, note_list=None):
    ''' 
    0b000000000 0b100000000 ...  -> corresponding notes.
    '''
    if note_list is None:
        note_list = Note_List()
    # enumerate returns (index, value) pairs, see https://www.programiz.com/python-programming/methods/built-in/enumerate
    for word_idx, word in enumerate(encoded_drums):
        c_tick_here = word_idx*min_ppq 

        for pitch_idx, pitch in enumerate(allowed_pitch):

            if word[pitch_idx+2] == '1':
                new_note = Note(pitch, c_tick_here)
                note_list.add_note(new_note)
    return note_list

The following function uses ```text_to_notes``` to fully convert the .txt file into MIDI.

It receives the file name as input and writes a .mid file with the same base name.
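
Since MIDI events are written with delta times, the function has to turn the absolute ticks of the notes back into relative ones, which is the inverse of the conversion done when reading the file. The sketch below shows that step alone in plain Python. The name `to_relative` is hypothetical, and the input is assumed to be sorted in ascending order.

```python
def to_relative(absolute_ticks):
    """Convert ascending absolute ticks into delta times between consecutive events."""
    previous = 0
    deltas = []
    for tick in absolute_ticks:
        deltas.append(tick - previous)
        previous = tick
    return deltas


# Two notes on tick 0, then one note every 240 ticks, with two notes sharing tick 720
print(to_relative([0, 0, 240, 480, 720, 720]))  # [0, 0, 240, 240, 240, 0]
```

Notes that share the same absolute tick get a delta of 0, so they are played together.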

In [62]:
import os

def conv_text_to_midi(filename):
    if os.path.exists(filename[:-4]+'.mid'):
        return
    with open(filename, 'r') as f:
        # These extra readlines are currently useless (one readline is enough); to be re-checked against the output of the NN.
        #f.readline() # title
        #f.readline() # seed sentence
        # reads one whole line from the file
        sentence = f.readline()
    # splits the line into elements at every space
    encoded_drums = sentence.split(' ')

    #find the first BAR

    first_bar_idx = encoded_drums.index('BAR') 

    #encoded_drums = encoded_drums[first_bar_idx:]
    try:
        encoded_drums = [ele for ele in encoded_drums if ele not in ['BAR', 'SONG_BEGIN', 'SONG_END', '']]
    except:
        pdb.set_trace()

    # prepare output
    note_list = Note_List()
    pattern = Containers.Pattern(fmt=0) #Don't know why there's an assertion in the code for fmt=0 if Pattern.len < 1
    track = Containers.Track()
    #??
    PPQ = 480
    min_ppq = PPQ / (event_per_bar/4)
    track.resolution = PPQ # ???? too slow. why??
    pattern.resolution = PPQ
    # track.resolution = 192
    pattern.append(track)

    velocity = 84
    duration = min_ppq*9/10  # it is easier to set new ticks if duration is shorter than _min_ppq

    note_list = text_to_notes(encoded_drums, note_list=note_list)

    max_c_tick = 0 
    not_yet_offed = [] # set of midi.pitch object
    print('entering for note_idx cycle')
    for note_idx, note in enumerate(note_list.notes[:-1]):
        # add onset
        tick_here = note.c_tick - max_c_tick
        pitch_here = pitch_to_midipitch[note.pitch]
        # if pitch_here in cymbals_pitch: # "Lazy-off" for cymbals 
        # 	off = midi.NoteOffEvent(tick=0, pitch=pitch_here)
        # 	track.append(off)

        on = Events.NoteOnEvent(tick=tick_here, velocity=velocity, pitch=pitch_here)
        track.append(on)
        max_c_tick = max(max_c_tick, note.c_tick)
        # add offset for something not cymbal

        # if note_list.notes[note_idx+1].c_tick == note.c_tick:
        # 	if pitch_here not in cymbals_pitch:
        # 	# 	not_yet_offed.append(pitch_here)

        # else:
        # check out some note that not off-ed.
        
        # this loop never seems to be entered (not_yet_offed is always empty)
        for off_idx, waiting_pitch in enumerate(not_yet_offed):
            print(off_idx)
            if off_idx == 0:
                off = Events.NoteOffEvent(tick=duration, pitch=waiting_pitch)
                max_c_tick = max_c_tick + duration
            else:
                print('appending end note')
                off = Events.NoteOffEvent(tick=0, pitch=waiting_pitch)
            track.append(off)
            not_yet_offed = [] # set of midi.pitch object 

    # finalise
    if note_list.notes == []:
        print('No notes in %s' % filename)
        return
    # The loop above skips the last note, so it is added here together with its note-off
    note = note_list.notes[-1]
    tick_here = note.c_tick - max_c_tick
    pitch_here = pitch_to_midipitch[note.pitch]
    on = Events.NoteOnEvent(tick=tick_here, velocity=velocity, pitch=pitch_here)
    off = Events.NoteOffEvent(tick=duration, pitch=pitch_here)
    track.append(on)
    track.append(off)

    # end of track event
    eot = Events.EndOfTrackEvent(tick=1)
    track.append(eot)
    # print pattern
    #print(pattern)
    FileIO.write_midifile(filename[:-4]+'.mid', pattern)


In [63]:
conv_text_to_midi("sample.txt")


entering for note_idx cycle


In [31]:
pattern = Containers.Pattern(fmt=0)
print(pattern)

mydy.Pattern(format=0, resolution=220, tracks=\
[])



