# Financial Analysis of Top German Companies

In this notebook, we load and analyze key financial metrics for several major German companies.
We transform the data, compute summary statistics, group by business sector, and plot trends in
revenue, net income, return on assets (ROA), and return on equity (ROE).

In [1]:
%useLatestDescriptors
%use dataframe, kandy

In [2]:
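// Read data from a CSV file into a DataFrame, convert the column names to camelCase,
// and restore the upper-case names of the ROA and ROE columns
val dataFrame = DataFrame.read("top_12_german_companies.csv")
    .renameToCamelCase().rename("rOA(%)", "rOE(%)").into("ROA", "ROE")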

In [3]:
dataFrame.head()

company,period,revenue,netIncome,liabilities,assets,equity,ROA,ROE,debtToEquity,percentageDebtToEquity
Volkswagen AG,12/31/2017,9750496618,516889818400000,21354201295,54861302788,33507101493,942.175.618,1.542.627.668,637.303.746,"0,00%"
Siemens AG,12/31/2017,19716237464,1276840007000000,45009303223,75268101508,30258798286,1.696.389.282,4.219.731.382,1.487.478.214,"283,68%"
Allianz SE,12/31/2017,19458831198,1600107100000000,48538978480,69583711255,21044732775,2.299.542.624,7.603.361.452,2.306.466.848,"329,65%"
BMW AG,12/31/2017,18808147150,960184349600000,35382107627,67327482638,31945375011,142.614.028,3.005.706.927,1.107.581.539,"0,00%"
BASF SE,12/31/2017,16895580815,1797081911000000,28309420014,68036567115,39727147101,2.641.347.127,4.523.561.449,71.259.635,"634,80%"


## Data Preparation: Formatting and Categorization

In this step, we prepare and clean the data for analysis.

- Custom Date Format: We define a custom month/day/year format (for example, 12/31/2017) to parse the "period" column into `LocalDate`; the month is not zero-padded.
- Business Sectors: We create an `enum` to classify companies into sectors such as Automotive, Banking, IT, and others.
- Data Transformation:
    - Convert the "period" column to `LocalDate` using the custom format.
    - Parse the "percentageDebtToEquity" column by removing the percentage sign, replacing the decimal comma with a dot, and converting it to a `Double`.
    - Parse the "ROA" and "ROE" columns by removing the dot separators and converting them to `Double`.
    - Sort the data by "company" and "period".
    - Add a new column, "sector", which assigns each company to a business sector based on its name.


This step ensures the dataset is well-structured and categorized for further analysis.
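The string formats described above can be tried out in isolation. The following is a minimal sketch in plain Kotlin, not the notebook's own code: the helper names `parsePercent`, `parseDotGrouped`, and `parsePeriod` are hypothetical, and `java.time` is used in place of `kotlinx.datetime` so that the snippet depends only on the JVM standard library. The sample strings are taken from the first rows of the table above.

```kotlin
import java.time.LocalDate
import java.time.format.DateTimeFormatter

// Hypothetical helper: "283,68%" -> 283.68 (the comma is the decimal separator)
fun parsePercent(text: String): Double =
    text.removeSuffix("%").replace(',', '.').toDouble()

// Hypothetical helper: drops every dot before converting,
// i.e. the dots are treated as grouping separators
fun parseDotGrouped(text: String): Double =
    text.replace(".", "").toDouble()

// java.time pattern (an assumption standing in for the kotlinx.datetime format):
// month without zero-padding, two-digit day, four-digit year
val periodFormat: DateTimeFormatter = DateTimeFormatter.ofPattern("M/dd/yyyy")

// Hypothetical helper: "12/31/2017" -> LocalDate for 31 December 2017
fun parsePeriod(text: String): LocalDate = LocalDate.parse(text, periodFormat)

fun main() {
    println(parsePercent("283,68%"))
    println(parseDotGrouped("1.696.389.282"))
    println(parsePeriod("12/31/2017"))
}
```

Note that removing all dots turns a value such as "1.696.389.282" into a number in the billions, so the resulting ROA and ROE figures are only meaningful relative to each other unless the source scaling is known.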

In [4]:
import kotlinx.datetime.format.Padding
import kotlinx.datetime.format.char

// Define a custom date format without zero-padding for the month,
// separating month/day/year with slashes
val format = LocalDate.Format {
    monthNumber(Padding.NONE)
    char('/')
    dayOfMonth()
    char('/')
    year()
}

// Enum of Business Sectors
enum class BusinessSector(val simpleName: String) {
    AUTOMOTIVE("Automotive"),
    BANKING("Banking"),
    INDUSTRIAL_TECH("Industrial"),
    INSURANCE_FINANCE("Insurance"),
    TELECOMMUNICATIONS("Telecom"),
    IT_SOFTWARE("IT"),
    PHARMA_CHEMICAL("Pharma"),
    OTHER("Other")
}

// Create a new DataFrame:
// 1. parse "period" into LocalDate using the custom format
// 2. parse "percentageDebtToEquity" into Double (strip "%", decimal comma -> dot)
// 3. parse "ROA" and "ROE" into Double (strip the dot separators)
// 4. sort by "company" and "period"
// 5. add a "sector" column derived from the company name
val companiesDf = dataFrame
    .convert { period }.with { LocalDate.parse(it, format) }
    .convert { percentageDebtToEquity }.with { it.removeSuffix("%").replace(',', '.').toDouble() }
    .convert { ROA and ROE }.with { it.replace(".", "").toDouble() }
    .sortBy { company and period }
    .add("sector") {
        when (company) {
            "Volkswagen AG", "BMW AG", "Daimler AG", "Porsche AG" -> BusinessSector.AUTOMOTIVE
            "Siemens AG", "BASF SE" -> BusinessSector.INDUSTRIAL_TECH
            "Allianz SE" -> BusinessSector.INSURANCE_FINANCE
            "Deutsche Bank AG" -> BusinessSector.BANKING
            "Deutsche Telekom AG" -> BusinessSector.TELECOMMUNICATIONS
            "SAP SE" -> BusinessSector.IT_SOFTWARE
            "Bayer AG", "Merck KGaA" -> BusinessSector.PHARMA_CHEMICAL
            else -> BusinessSector.OTHER
        }
    }

### Aggregating Financial Data

These steps group the data first by company and then by sector, and calculate summary statistics (mean, median, standard deviation, minimum, maximum) for the financial columns such as revenue, net income, and the ratios.
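For reference, the five statistics can be written out by hand for a single column of values. The sketch below is plain Kotlin rather than the DataFrame API; `Summary` and `summarize` are hypothetical names, and the standard deviation uses the sample form (n - 1 in the denominator), which is an assumption and may not match the library's default. The input numbers are illustrative and not taken from the dataset.

```kotlin
import kotlin.math.sqrt

// Hypothetical container for the five statistics computed per column
data class Summary(
    val mean: Double,
    val median: Double,
    val std: Double,
    val min: Double,
    val max: Double
)

// Hypothetical helper: summarize one numeric column
fun summarize(values: List<Double>): Summary {
    require(values.size >= 2) { "need at least two values for a sample standard deviation" }
    val sorted = values.sorted()
    val n = sorted.size
    val mean = sorted.sum() / n
    // Median: the middle element, or the average of the two middle elements
    val median = if (n % 2 == 1) sorted[n / 2] else (sorted[n / 2 - 1] + sorted[n / 2]) / 2
    // Sample standard deviation (assumption: n - 1 denominator)
    val std = sqrt(sorted.sumOf { (it - mean) * (it - mean) } / (n - 1))
    return Summary(mean, median, std, sorted.first(), sorted.last())
}

fun main() {
    // Illustrative values only
    println(summarize(listOf(2.0, 4.0, 4.0, 6.0)))
}
```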

In [5]:
// Group by "company" and aggregate key financial columns
companiesDf.groupBy { company }.aggregate {
    val financeColumns = it.select { revenue and netIncome and liabilities and assets and equity and ROA and ROE and debtToEquity and percentageDebtToEquity }
    financeColumns.mean() into "mean"
    financeColumns.median() into "median"
    financeColumns.std() into "std"
    financeColumns.min() into "min"
    financeColumns.max() into "max"
}

company,mean.revenue,mean.netIncome,mean.liabilities,mean.assets,mean.equity,mean.ROA,mean.ROE,mean.percentageDebtToEquity,median.revenue,median.netIncome,median.liabilities,median.assets,median.equity,median.ROA,median.ROE,median.debtToEquity,median.percentageDebtToEquity,std.revenue,std.netIncome,std.liabilities,std.assets,std.equity,std.ROA,std.ROE,std.percentageDebtToEquity,min.revenue,min.netIncome,min.liabilities,min.assets,min.equity,min.ROA,min.ROE,min.debtToEquity,min.percentageDebtToEquity,max.revenue,max.netIncome,max.liabilities,max.assets,max.equity,max.ROA,max.ROE,max.debtToEquity,max.percentageDebtToEquity
Allianz SE,13015427603000000,1314983974006250,29927069416812500,56971548896812500,27044479479843750,2086997152625000,3437586364281250,451397500,13380302290,1240886942000000,30751494230,56314634789,27995269353,1511668947500000,3325694133500000,2.356.716.444,369450000,4031726568538354,590182030528996,11841968487721144,15082579450801540,8964325169429451,1886302018904862,2043975922447980,497898219,5318114486,373428560200000,10477167725,30329911272,11830459986,174587839000000,611965731000000,1.031.419.625,0,19814938566,2402780363000000,49692687138,93604660446,47947280377,7368255218000000,7893783146000000,983.597.322,2293350000
BASF SE,13206929984593750,1322951430006250,28948414331687500,63253275177031250,34304860845375000,2095734476375000,3667010068812500,552036250,12383084935,1230840732500000,29078100833,59504915680,36459467650,1885690844000000,3229014762000000,428.846.942,401815000,3953606287298842,599272294436623,12414635870741161,17108154038669016,11372819844337757,1688294717024260,2188317238825261,628148363,6644620233,504888024600000,10848294146,30031637051,14342060597,8183968000000,314982347000000,1.087.470.983,0,19846387618,2884007106000000,49603599758,95477772055,49441756560,9156051924000000,9827624651000000,979.650.312,2617560000
BMW AG,12793626967750000,1264378095221875,31641220583968750,62541000370843750,30899779786968750,1970748924500000,3279635339500000,420747813,12797309067,1108858547500000,33479741513,60857899135,31338099384,1991201713000000,2626961321000000,2.697.180.169,311305000,4594622507890679,602113584885902,11501509839387596,17802412071598537,12847726142863003,1488654231552701,2331079366967284,510167476,5647276212,430192893600000,10865997336,31832927467,10560668658,142614028000000,125152581000000,1.037.570.923,0,19909637251,2732548048000000,49769403556,95122657900,48801871894,7406783982000000,8011078402000000,995.319.063,1814590000
Bayer AG,12280150040531250,1244583134568750,32207322936625000,60858790576218750,28651467639718750,2074341115531250,3907549735156250,411859375,12667494670,1129327250500000,32258852641,60203666215,29501507476,1742836396000000,3481261635500000,3.501.645.798,306700000,3722908864690987,492735419593162,11640892417013212,18011890733431713,11375096650473440,1184037640651491,2464055972901899,450237144,5728072323,524552249300000,10046371516,27849556029,10017618918,139484054000000,47691775000000,1.034.227.784,0,19647577142,2670535587000000,49223539015,91140421051,48315458259,4910670497000000,9607657241000000,987.443.008,1818110000
Daimler AG,12982945622281250,1262169775665625,33110683422812500,61025135448218750,27914452025312500,2057640070781250,3599838843531250,384516875,13278094462,1162724252000000,34617753396,58368560660,27907274004,1847166935000000,3296400653000000,2.191.321.951,334975000,4328582507932568,579119236436015,10107899623321072,14627955672994179,12080334988462170,1226544584317596,2378725754217035,419087621,5152922484,440969634700000,10787737144,31178454239,10659050238,55749485000000,124523492000000,1.000.804.404,0,19604996789,2711256442000000,48085311015,86840936189,46368261566,4273603847000000,9878142355000000,931.230.088,1948440000
Deutsche Bank AG,12552273759250000,1254325661253125,31804324927843750,64484873493968750,32680548566156250,2042935983156250,3542614134843750,398719688,11868501230,1101588195500000,33874325159,64508527196,32899597047,1688790297000000,2806814710500000,300.121.019,293870000,4429631424406467,576058631887190,12815920189273525,17260497429425790,11041972482317745,1230001154483563,2106292761582215,486555376,5220250485,375695945900000,10546244209,28137106192,11728609038,107264457000000,229299295000000,1.037.275.695,0,19868905162,2752792674000000,49199711543,92083567435,49507409035,4955368641000000,8550802927000000,981.504.776,2008380000
Deutsche Telekom AG,12948836356781250,1242393628968750,29721302574218750,62968327384843750,33247024810656250,1907479731343750,3626682563343750,346870937,14031055010,1175249519000000,30653814592,62361230856,35293691238,1734117630000000,2968449223500000,316.023.962,340680000,4267095201540025,522294949722901,11114012390578995,15930592724460455,12022516129146206,1262589468122650,2258582990930430,369846773,5149849693,378951538100000,11494162751,32355610354,11183252715,178292287000000,429978286000000,1.052.519.909,0,19614896783,2834016899000000,47810647914,96220600130,49229460503,6805417841000000,9041825831000000,986.923.036,1547480000
Merck KGaA,13429626794312500,1394800429671875,34212478452031250,62611554099312500,28399075647281250,2301233277562500,3953880877625000,458833438,14463857538,1431693442000000,34840616728,61671584787,28674185880,2304004777000000,3730883412500000,2.700.385.141,393650000,4312918107535037,673799451793135,10310890849580265,14436585303052738,11015177395550402,1412484406025840,2504604793688944,428230486,5897253747,389688114000000,12733925598,36970270841,11945417506,977949000000,110168192000000,1.050.906.517,0,19217681417,2691544192000000,49354599469,95054854077,49110493703,6055903707000000,9237712947000000,977.852.725,1758200000
Porsche AG,11801739996500000,1128405275881250,31933699799437500,62902797570750000,30969097771437500,1812543408500000,3138381255406250,326929375,11821233243,1115737038000000,33611865718,63838811515,29516759934,1492638445000000,2786218764000000,2.299.471.864,307740000,3643118507061031,377613515217368,10551496082676825,19091868224734550,12568676517076225,1532298279541574,2250832283751501,365118629,5536870879,298030924100000,11368621660,21773242219,10404620558,17386781000000,226901839000000,1.068.975.741,0,17963578177,2139702451000000,49106113898,95043652065,49918115168,8532052149000000,8202899676000000,948.145.009,1634060000
SAP SE,12416678253562500,1100020966293750,32416812456343750,61494676430937500,29077863974656250,1803926622593750,3279309301281250,238142187,12908509779,936733652350000,35197257099,56011046255,26407085362,1494215797500000,2698095641000000,3.390.562.837,0,4286554454753746,485685880782450,12073688294924797,19813718642814350,12615671347131495,1235422951830178,2326900921602219,377507489,5282345417,529257365400000,10102113588,23437806831,10018751320,96093011000000,10929103000000,1.182.708.533,0,19774628627,2276360916000000,49206362475,96574017987,49752983577,4844445609000000,8305270255000000,991.512.245,1819360000


In [6]:
// Group by "sector" and aggregate key financial columns
companiesDf.groupBy { sector }.aggregate {
    val financeColumns = it.select { revenue and netIncome and liabilities and assets and equity and ROA and ROE and debtToEquity and percentageDebtToEquity }
    financeColumns.mean() into "mean"
    financeColumns.median() into "median"
    financeColumns.std() into "std"
    financeColumns.min() into "min"
    financeColumns.max() into "max"
}

sector,mean.revenue,mean.netIncome,mean.liabilities,mean.assets,mean.equity,mean.ROA,mean.ROE,mean.percentageDebtToEquity,median.revenue,median.netIncome,median.liabilities,median.assets,median.equity,median.ROA,median.ROE,median.debtToEquity,median.percentageDebtToEquity,std.revenue,std.netIncome,std.liabilities,std.assets,std.equity,std.ROA,std.ROE,std.percentageDebtToEquity,min.revenue,min.netIncome,min.liabilities,min.assets,min.equity,min.ROA,min.ROE,min.debtToEquity,min.percentageDebtToEquity,max.revenue,max.netIncome,max.liabilities,max.assets,max.equity,max.ROA,max.ROE,max.debtToEquity,max.percentageDebtToEquity
INSURANCE_FINANCE,13015427603000000,1314983974006250,29927069416812500,56971548896812500,27044479479843750,2086997152625000,3437586364281250,451397500,13380302290,1240886942000000,30751494230,56314634789,27995269353,1511668947500000,3325694133500000,2.356.716.444,369450000,4031726568538354,590182030528996,11841968487721144,15082579450801540,8964325169429451,1886302018904862,2043975922447980,497898219,5318114486,373428560200000,10477167725,30329911272,11830459986,174587839000000,611965731000000,1.031.419.625,0,19814938566,2402780363000000,49692687138,93604660446,47947280377,7368255218000000,7893783146000000,983.597.322,2293350000
INDUSTRIAL_TECH,12934714693750000,1223305504875000,29649972651265625,61284478201109375,31634505549875000,2090295909765625,3587689393734375,455869062,12274566346,1171189842000000,29454204370,59053475202,33939244833,1863036066500000,3394789346000000,3.948.375.437,373720000,4085478693957651,556763437220554,11716214952053246,16741825253907963,11508005629635681,1494884669783042,2166034306212876,547277337,5703877749,459390813500000,10137976567,29444881442,10556038645,8183968000000,45603762000000,1.005.532.267,0,19846387618,2884007106000000,49603599758,95650487131,49441756560,9156051924000000,9827624651000000,994.554.924,2617560000
AUTOMOTIVE,12501654488507812,1247645639892969,31070048020976562,61259037225367190,30188989204414062,2118081428039063,3235733994031250,416289219,12586633987,1149659115500000,32950291088,60822980273,30063127704,1878812649000000,2786218764000000,240.707.213,341475000,4046388553197570,517499635250430,10919839740967953,17518169422712760,12459491566511446,1540731600206889,2239985575500741,441890333,5152922484,298030924100000,10690275300,21773242219,10404620558,17386781000000,124523492000000,1.000.804.404,0,19909637251,2732548048000000,49769403556,95156833814,49918115168,8672214379000000,9878142355000000,995.319.063,1958370000
PHARMA_CHEMICAL,12854888417421875,1319691782120313,33209900694328125,61735172337765625,28525271643500000,2187787196546875,3930715306390625,435346406,13018516306,1178237040000000,32993119668,61395602419,28795214254,2012013515500000,3681807322000000,2.952.563.918,333980000,4038390085582360,590421621153112,10955103627573359,16216439910641758,11108094353332819,1297937669004892,2464727270115643,436512966,5728072323,389688114000000,10046371516,27849556029,10017618918,977949000000,47691775000000,1.034.227.784,0,19647577142,2691544192000000,49354599469,95054854077,49110493703,6055903707000000,9607657241000000,987.443.008,1818110000
BANKING,12552273759250000,1254325661253125,31804324927843750,64484873493968750,32680548566156250,2042935983156250,3542614134843750,398719688,11868501230,1101588195500000,33874325159,64508527196,32899597047,1688790297000000,2806814710500000,300.121.019,293870000,4429631424406467,576058631887190,12815920189273525,17260497429425790,11041972482317745,1230001154483563,2106292761582215,486555376,5220250485,375695945900000,10546244209,28137106192,11728609038,107264457000000,229299295000000,1.037.275.695,0,19868905162,2752792674000000,49199711543,92083567435,49507409035,4955368641000000,8550802927000000,981.504.776,2008380000
TELECOMMUNICATIONS,12948836356781250,1242393628968750,29721302574218750,62968327384843750,33247024810656250,1907479731343750,3626682563343750,346870937,14031055010,1175249519000000,30653814592,62361230856,35293691238,1734117630000000,2968449223500000,316.023.962,340680000,4267095201540025,522294949722901,11114012390578995,15930592724460455,12022516129146206,1262589468122650,2258582990930430,369846773,5149849693,378951538100000,11494162751,32355610354,11183252715,178292287000000,429978286000000,1.052.519.909,0,19614896783,2834016899000000,47810647914,96220600130,49229460503,6805417841000000,9041825831000000,986.923.036,1547480000
IT_SOFTWARE,12416678253562500,1100020966293750,32416812456343750,61494676430937500,29077863974656250,1803926622593750,3279309301281250,238142187,12908509779,936733652350000,35197257099,56011046255,26407085362,1494215797500000,2698095641000000,3.390.562.837,0,4286554454753746,485685880782450,12073688294924797,19813718642814350,12615671347131495,1235422951830178,2326900921602219,377507489,5282345417,529257365400000,10102113588,23437806831,10018751320,96093011000000,10929103000000,1.182.708.533,0,19774628627,2276360916000000,49206362475,96574017987,49752983577,4844445609000000,8305270255000000,991.512.245,1819360000


In [7]:
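// Per-sector averages and totals for revenue and net income, plus average ROA and ROE,
// sorted by sector
companiesDf.groupBy { sector }.aggregate {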
    revenue.mean() into "Avg revenue"
    revenue.sum() into "Total revenue"
    netIncome.mean() into "Avg Net Income"
    netIncome.sum() into "Sum Net Income"
    ROA.mean() into "Avg ROA"
    ROE.mean() into "Avg ROE"
}.sortBy { sector }

sector,Avg revenue,Total revenue,Avg Net Income,Sum Net Income,Avg ROA,Avg ROE
AUTOMOTIVE,12501654488507812,1600211774529,1247645639892969,159698641906300000,2118081428039063,3235733994031250
BANKING,12552273759250000,401672760296,1254325661253125,40138421160100000,2042935983156250,3542614134843750
INDUSTRIAL_TECH,12934714693750000,827821740400,1223305504875000,78291552312000000,2090295909765625,3587689393734375
INSURANCE_FINANCE,13015427603000000,416493683296,1314983974006250,42079487168200005,2086997152625000,3437586364281250
TELECOMMUNICATIONS,12948836356781250,414362763417,1242393628968750,39756596126999985,1907479731343750,3626682563343750
IT_SOFTWARE,12416678253562500,397333704114,1100020966293750,35200670921400000,1803926622593750,3279309301281250
PHARMA_CHEMICAL,12854888417421875,822712858715,1319691782120313,84460274055700000,2187787196546875,3930715306390625


In [8]:
// Group by "period" and "sector" then compute total revenue and net income
val timeSerDf = companiesDf.groupBy { period and sector }.aggregate {
    revenue.sum() into "totalRevenue"
    netIncome.sum() into "totalNetIncome"
}

// List of business sectors
val listOfSectors = listOf(
    BusinessSector.AUTOMOTIVE,
    BusinessSector.BANKING,
    BusinessSector.INSURANCE_FINANCE,
    BusinessSector.INDUSTRIAL_TECH,
    BusinessSector.TELECOMMUNICATIONS,
    BusinessSector.IT_SOFTWARE,
    BusinessSector.PHARMA_CHEMICAL
)

// Matching colors for each sector
val listOfSectorColors = listOf(
    Color.hex("#ffaf00"),
    Color.hex("#f46920"),
    Color.hex("#f53255"),
    Color.hex("#f857c1"),
    Color.hex("#29bdfd"),
    Color.hex("#00cbbf"),
    Color.hex("#01c159")
)

## Visualizing Revenue and Net Income by Sector

1. Revenue by Sector:
    - A line chart shows total revenue over time, grouped by business sector.
    - Points highlight specific values, and each sector is color-coded using a predefined palette.
    - The chart includes a legend for sector identification.
2. Net Income by Sector:
    - A similar line chart displays total net income over time for each sector.
    - Points and color-coding are used to enhance clarity, with a legend indicating the sectors.

These visualizations help analyze trends and compare financial performance across sectors over time.
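Each line in these charts is drawn from one total per (period, sector) pair. As a rough illustration of that aggregation outside the DataFrame API, the sketch below sums revenue per pair with the Kotlin standard library. The `Row` type and the `totalRevenue` function are hypothetical, and the figures are made up for the example.

```kotlin
// Hypothetical row type; the field names mirror the notebook's columns
data class Row(val period: String, val sector: String, val revenue: Double)

// Hypothetical helper: sum revenue for every (period, sector) pair
fun totalRevenue(rows: List<Row>): Map<Pair<String, String>, Double> =
    rows.groupBy { it.period to it.sector }
        .mapValues { (_, group) -> group.sumOf { it.revenue } }

fun main() {
    // Illustrative figures only, not taken from the dataset
    val rows = listOf(
        Row("2017-12-31", "AUTOMOTIVE", 1.0),
        Row("2017-12-31", "AUTOMOTIVE", 2.0),
        Row("2017-12-31", "BANKING", 5.0),
        Row("2018-03-31", "AUTOMOTIVE", 4.0)
    )
    // One entry per (period, sector) pair; each entry becomes one point on a sector's line
    totalRevenue(rows).forEach { (key, total) -> println("$key -> $total") }
}
```

Sectors that contain several companies, such as Automotive, therefore show larger totals than single-company sectors, which is worth keeping in mind when comparing the lines.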

In [9]:
// Plot total revenue by period and sector
timeSerDf.plot {
    // Map the x-axis to the "period" column
    x(period) { axis.name = "Date" }
    // Map the y-axis to the aggregated "totalRevenue"
    y(totalRevenue) { axis.name = "Revenue" }

    // Draw a line chart
    line {
        // Color lines by the "sector" column
        color(sector) {
            // Use a categorical color scale with predefined colors and sectors
            scale = categorical(range = listOfSectorColors, domain = listOfSectors)
            // Configure and label the legend
            legend {
                name = "Sector"
                this.breaksLabeled(
                    BusinessSector.AUTOMOTIVE to BusinessSector.AUTOMOTIVE.simpleName,
                    BusinessSector.BANKING to BusinessSector.BANKING.simpleName,
                    BusinessSector.INSURANCE_FINANCE to BusinessSector.INSURANCE_FINANCE.simpleName,
                    BusinessSector.INDUSTRIAL_TECH to BusinessSector.INDUSTRIAL_TECH.simpleName,
                    BusinessSector.TELECOMMUNICATIONS to BusinessSector.TELECOMMUNICATIONS.simpleName,
                    BusinessSector.IT_SOFTWARE to BusinessSector.IT_SOFTWARE.simpleName,
                    BusinessSector.PHARMA_CHEMICAL to BusinessSector.PHARMA_CHEMICAL.simpleName
                )
            }
        }
    }
    // Add points on top of the line chart
    points {
        size = 3.0
        color(sector) { scale = categorical(range = listOfSectorColors, domain = listOfSectors) }
    }

    // Adjust the layout and overall plot appearance
    layout {
        title = "Revenue by Sector"
        size = 875 to 500
    }
}

In [10]:
// Plot total net income by period and sector
timeSerDf.plot {
    // Map the x-axis to the "period" column
    x(period) { axis.name = "Date" }
    // Map the y-axis to the aggregated "totalNetIncome"
    y(totalNetIncome) { axis.name = "Net Income" }

    // Draw a line chart
    line {
        // Color lines by the "sector" column
        color(sector) {
            // Use the same categorical color scale and sector list
            scale = categorical(range = listOfSectorColors, domain = listOfSectors)
            // Configure and label the legend
            legend {
                name = "Sector"
                this.breaksLabeled(
                    BusinessSector.AUTOMOTIVE to BusinessSector.AUTOMOTIVE.simpleName,
                    BusinessSector.BANKING to BusinessSector.BANKING.simpleName,
                    BusinessSector.INSURANCE_FINANCE to BusinessSector.INSURANCE_FINANCE.simpleName,
                    BusinessSector.INDUSTRIAL_TECH to BusinessSector.INDUSTRIAL_TECH.simpleName,
                    BusinessSector.TELECOMMUNICATIONS to BusinessSector.TELECOMMUNICATIONS.simpleName,
                    BusinessSector.IT_SOFTWARE to BusinessSector.IT_SOFTWARE.simpleName,
                    BusinessSector.PHARMA_CHEMICAL to BusinessSector.PHARMA_CHEMICAL.simpleName
                )
            }
        }

    }

    // Add points on top of the line chart
    points {
        size = 3.0
        color(sector) { scale = categorical(range = listOfSectorColors, domain = listOfSectors) }
    }

    // Adjust the layout and overall plot appearance
    layout {
        title = "Net Income by Sector"
        size = 875 to 500
    }
}

## ROA and ROE Analysis by Sector

1. Computing Averages and Standard Deviations:
    - Group the data by sector and calculate the mean and standard deviation for Return on Assets (ROA) and Return on Equity (ROE).
    - This creates a summarized dataset for sector-level performance comparison.
2. Visualizing ROA by Sector:
    - A bar chart displays the average ROA for each sector.
    - Error bars represent one standard deviation, showing the variability within each sector.
3. Visualizing ROE by Sector:
    - A similar bar chart illustrates the average ROE across sectors.
    - Error bars provide insight into the standard deviation of ROE within each sector.

These charts help compare sector-level profitability metrics and assess consistency within sectors.
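The error bars span one standard deviation on either side of the mean. A minimal sketch of that calculation in plain Kotlin follows; `errorBarBounds` is a hypothetical helper name and the input values are illustrative.

```kotlin
// Hypothetical helper: pair each mean with its standard deviation
// and return the (lower, upper) bounds of the error bar
fun errorBarBounds(means: List<Double>, stds: List<Double>): List<Pair<Double, Double>> {
    require(means.size == stds.size) { "means and stds must have the same length" }
    return means.zip(stds).map { (mean, std) -> (mean - std) to (mean + std) }
}

fun main() {
    // Illustrative values only
    val bounds = errorBarBounds(listOf(10.0, 5.0), listOf(2.0, 6.0))
    bounds.forEach { (lower, upper) -> println("$lower .. $upper") }
}
```

When the standard deviation exceeds the mean, the lower bound falls below zero, so a y-axis that starts at zero would clip that part of the bar.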

In [11]:
// Group data by sector to compute the mean and standard deviation of ROA and ROE
val roeAndRoaDf = companiesDf.groupBy { sector }.aggregate {
    ROA.mean() into "Avg ROA"
    ROA.std() into "Std ROA"
    ROE.mean() into "Avg ROE"
    ROE.std() into "Std ROE"
}

roeAndRoaDf

sector,Avg ROA,Std ROA,Avg ROE,Std ROE
INSURANCE_FINANCE,2086997152625000,1886302018904862,3437586364281250,2043975922447980
INDUSTRIAL_TECH,2090295909765625,1494884669783042,3587689393734375,2166034306212876
AUTOMOTIVE,2118081428039063,1540731600206889,3235733994031250,2239985575500741
PHARMA_CHEMICAL,2187787196546875,1297937669004892,3930715306390625,2464727270115643
BANKING,2042935983156250,1230001154483563,3542614134843750,2106292761582215
TELECOMMUNICATIONS,1907479731343750,1262589468122650,3626682563343750,2258582990930430
IT_SOFTWARE,1803926622593750,1235422951830178,3279309301281250,2326900921602219


In [12]:
// Plot average ROA by sector with error bars representing one standard deviation
roeAndRoaDf.plot {
    // Set the x-axis to the sector names
    x(sector.map { it.simpleName }) { axis.name = "Sector of Business" }

    bars {
        // Use the "Avg ROA" column for the bar heights
        y(`Avg ROA`) { scale = continuous(min = .0, max = 4.5e+9) }
        // Fill bars with a chosen color
        fillColor = Color.hex("#ffaf00")
    }
    lineRanges {
        // Calculate the min and max for the error bars (Std ROA)
        yMin(`Avg ROA`.toList().zip(`Std ROA`.toList()).map { it.first - it.second })
        yMax(`Avg ROA`.toList().zip(`Std ROA`.toList()).map { it.first + it.second })
        // Color the line of the ranges
        borderLine.color = Color.GREY
    }

    // Adjust layout options such as title and overall size
    layout {
        title = "Average ROA By Sector With Standard Deviation"
        size = 875 to 500
    }
}

In [13]:
// Plot average ROE by sector with error bars representing one standard deviation
roeAndRoaDf.plot {
    // Set the x-axis to the sector names
    x(sector.map { it.simpleName }) { axis.name = "Sector of Business" }

    bars {
        // Use the "Avg ROE" column for the bar heights
        y(`Avg ROE`)
        // Fill bars with a chosen color
        fillColor = Color.hex("#ffaf00")
    }
    lineRanges {
        // Calculate the min and max for the error bars (Std ROE)
        yMin(`Avg ROE`.toList().zip(`Std ROE`.toList()).map { it.first - it.second })
        yMax(`Avg ROE`.toList().zip(`Std ROE`.toList()).map { it.first + it.second })
        // Color the line of the ranges
        borderLine.color = Color.GREY
    }

    // Adjust layout options such as title and overall size
    layout {
        title = "Average ROE By Sector With Standard Deviation"
        size = 875 to 500
    }
}