The trial round was not limited in time and unfortunately the task description has been lost :()
- A. Decision Stump: given x and y arrays, build a decision stump by finding the split c and the two values a, b such that RMS is optimal.
- B. Function fit: given x and y arrays, fit
(a*sin(x) + b*log(x))**2 + c * x**2
to minimise MAE on y - C. Estimate the number of unique terms: given a very long input that contains repeating entries, estimate the number of unique values with a certaint precision (
~5%
). The difficulty here was that the memory was execution time were strongly constrained. Basicset
or import ofpandas
were crossing the memory limit. - D. Restaurant reviews: for training there was a data set of paired comparisons between restaurants. Distances and average reviews. For each pair there was the decision of the user given- which one to choose. The task was to be able to generate user ratings for individual restaurants.