The idea of this project is to show the main features of data.table which I believe will cover most of the data manipulation operations.
data.table
is known because of it's speed but I would also argue it's syntax is extremely clear once understood.
I personally found it was simpler to learn DT than R-Base for most data manipulation operations but I would still recommend beginners to
learn R-Base first and then dive in other libraries.
I've been using DT for around 5 years and I use it exclusively for my work. I'm not an expert on the package in every sense so I'll be talking from my experience from a user standpoint. Feel free to make github issues or PR in order to make improvements to the code and add other relevant points.
I've downloaded financial data of multiple companies in the nasdaq exchange, the file has 450 MB that is small for today standards.
Link to download: https://1drv.ms/u/s!AgbJRsdS66y6eillJ211VlOhbHU?e=eyYtWO
Select the prices_nasdaq.csv
and right click to download it.