# Computing GC Content
### Problem
The GC-content of a DNA string is given by the percentage of symbols in the string that are 'C' or 'G'. For example, the GC-content of "AGCTATAG" is 37.5%. Note that the reverse complement of any DNA string has the same GC-content.
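The definition above can be checked directly on the example string. The following is a minimal sketch; the helper names `gc_content` and `reverse_complement` are illustrative and not part of the problem statement, and the input is assumed to contain only the symbols A, C, G and T.

```python
def gc_content(dna):
    """Return the percentage of symbols in dna that are 'C' or 'G'."""
    if not dna:
        raise ValueError("GC-content is undefined for an empty string")
    return (dna.count("C") + dna.count("G")) / len(dna) * 100.0


def reverse_complement(dna):
    """Return the reverse complement of a DNA string over A, C, G, T."""
    complement = {"A": "T", "T": "A", "C": "G", "G": "C"}
    return "".join(complement[base] for base in reversed(dna))


print(gc_content("AGCTATAG"))                      # 37.5
print(gc_content(reverse_complement("AGCTATAG")))  # 37.5
```

Complementing swaps every 'C' with a 'G' and vice versa, so the combined count of the two symbols is unchanged, which is why both calls print the same value.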

DNA strings must be labeled when they are consolidated into a database. A commonly used method of string labeling is called FASTA format. In this format, the string is introduced by a line that begins with '>', followed by some labeling information. Subsequent lines contain the string itself, which may span several lines; the next line that begins with '>' marks the label of the following string.
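One way to read this layout is to start a new record at each '>' line and append every following line to that record. The sketch below assumes that approach; `parse_fasta` and the labels `seq1` and `seq2` are hypothetical names used only for illustration.

```python
def parse_fasta(lines):
    """Map each label (the header text after '>') to its full sequence."""
    records = {}
    label = None
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A header line starts a new record
            label = line[1:]
            records[label] = ""
        elif label is not None:
            # A sequence line extends the current record
            records[label] += line
    return records


# Hypothetical two-record input; the first sequence spans two lines
example = [">seq1", "ACGT", "GG", ">seq2", "TTA"]
print(parse_fasta(example))  # {'seq1': 'ACGTGG', 'seq2': 'TTA'}
```

Joining the lines of a record before counting matters here, because the GC-content is defined over the whole string rather than over a single line of it.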

In Rosalind's implementation, a string in FASTA format will be labeled by the ID "Rosalind_xxxx", where "xxxx" denotes a four-digit code between 0000 and 9999.
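If labels need to be validated, the ID convention can be expressed as a regular expression. This is only a sketch: the pattern is an assumption derived from the description above, and `is_rosalind_id` is a hypothetical helper.

```python
import re

# Assumed pattern: the literal prefix "Rosalind_" followed by exactly four digits
ROSALIND_ID = re.compile(r"Rosalind_[0-9]{4}")


def is_rosalind_id(label):
    """Return True if label has the form Rosalind_xxxx with a four-digit code."""
    return ROSALIND_ID.fullmatch(label) is not None


print(is_rosalind_id("Rosalind_0808"))  # True
print(is_rosalind_id("Rosalind_808"))   # False
```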

##### Given: 
At most 10 DNA strings in FASTA format (of length at most 1 kbp each).

##### Return: 
The ID of the string having the highest GC-content, followed by the GC-content of that string. Rosalind allows for a default error of 0.001 in all decimal answers unless otherwise stated; please see the note on absolute error below.
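The error allowance can be illustrated with a small comparison. The sketch below assumes that "error of 0.001" means the absolute difference between the submitted and the reference value must not exceed 0.001; `within_tolerance` is a hypothetical helper, not part of Rosalind.

```python
def within_tolerance(answer, expected, tol=0.001):
    """Return True if answer differs from expected by at most tol (absolute error)."""
    return abs(answer - expected) <= tol


print(within_tolerance(60.92, 60.919540))  # True
print(within_tolerance(60.9, 60.919540))   # False
```

Under this reading, rounding the result to a few decimal places is acceptable as long as the rounded value stays within 0.001 of the exact one.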
### Sample Dataset
(Ignore the leading '#' on the header lines; it is placed there only for formatting purposes.)
#>Rosalind_6404
CCTGCGGAAGATCGGCACTAGAATAGCCAGAACCGTTTCTCTGAGGCTTCCGGCCTTCCC
TCCCACTAATAATTCTGAGG
#>Rosalind_5959
CCATCGGTAGCGCATCCTTAGTCCAATTAAGTCCCTATCCAGGCGCTCCGCCGAAGGTCT
ATATCCATTTGTCAGCAGACACGC
#>Rosalind_0808
CCACCCTCGTGGTATGGCTAGGCATTCAGGAACCGGAGAACGCTTCAGACCAGCCCGGAC
TGGGAACCTGCGGGCAGTAGGTGGAAT

### Expected Output
Rosalind_0808

60.919540

In [1]:
# The dataset is placed in a text file in the working directory
def compGcCont(s):
    ID = []
    seqs = []
    gcContent = []

    # Read the file line by line. A sequence may span several lines,
    # so sequence lines are joined until the next header is reached.
    with open("./" + s, "r") as file:
        for line in file:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                # Header line: drop the leading '>' and start a new record
                ID.append(line[1:])
                seqs.append("")
            elif seqs:
                seqs[-1] += line

    # Calculate the GC percentage of each sequence. Newlines were
    # stripped above, so they do not inflate the sequence length.
    for seq in seqs:
        cCount = seq.count('C')
        gCount = seq.count('G')
        gcContent.append(float(cCount + gCount) / len(seq) * 100.0)

    # Print the ID and GC content of the DNA string with the maximum GC content
    best = max(gcContent)
    print(ID[gcContent.index(best)])
    print(round(best, 6))


# Test
compGcCont("gcContent.txt")


Rosalind_0808
60.91954
