Decisions taken

Docker should be installed to run the complete service, it runs MongoDB and RabbitMQ services, fundamental to the application's functionality

Installation

git clone https://github.com/ColoBonoris/excel-api
cd excel-api
yarn install
cp .env.example .env # Here you'll also find the api keys for both endpoints

Note: first run (and not further ones) could fail connecting the worker or the app with any of the services, I ran into this problem and the sollution was only running the worker or app once again

Run app

yarn services:start # Runs RabbitMQ and MongoDB containers
yarn worker # Starts the worker process
yarn dev # Starts the server, should be done in other terminal since the previous one will be busy with the worker

Useful commands

yarn test # Runs all tests
yarn services:stop # Stops both containers

Note: for testing the app, you can use the swagger docs and the files large_test.xlsx / test_data with the mapping inside of test_mapping.json as good examples.

Potential changes

Interrupted jobs are not being taken in consideration, for it we should simply use RabbitMQ's ACK functionality, but avoiding filling up the queue
Testing could be way more extensive
Primitives allowed are a considerably short subset of TypeScript's primitive datatypes
We could implement more workers for improving performance

Admitted types for each mapping field

Primitives: String, Number, Array <Number>
For this first version, these are the only types accepted, for enhancing, you should only modify /src/utils/parseMapping.ts or replace it with another mapping function
Mapping function can be way more modular and maybe more efficient, we are using streams for having at least some efficiency when mapping big documents
For adapting to the specifications, Array <Number> fields are ordered ascendent when mapping

Interacting with the API

Uploading an Excel File

Endpoint

POST /api/upload

Headers

Header	Description	Required
`x-api-key`	API Key for authentication	✅ Yes
`mapping`	JSON object defining the data mapping	✅ Yes

Body (multipart/form-data)

Field	Type	Description	Required
`file`	File (.xlsx)	The Excel file to upload	✅ Yes

Mapping Specification

The mapping header must be a valid JSON object where each key represents a column, and the value specifies the expected type.

Allowed Data Types

Type	Description
`"String"`	Converts the value to a trimmed string.
`"Number"`	Converts the value to a number (errors if conversion fails).
`"Array<Number>"`	Parses an array of numbers (errors if any element is not numeric).

Normalization Rules

Case insensitive ("string" is the same as "String").
Whitespace is ignored (" Array < Number > " is valid).
Invalid types cause an error.

Example Requests

✅ Valid Mapping

{
  "name": "String",
  "age": "Number",
  "scores": "Array<Number>"
}

Request Example (multipart/form-data):

curl -X POST "http://localhost:3000/api/upload" \
  -H "x-api-key: YOUR_API_KEY" \
  -H "mapping: {\"name\":\"String\", \"age\":\"Number\", \"scores\":\"Array<Number>\"}" \
  -F "file=@test_data.xlsx;type=application/vnd.openxmlformats-officedocument.spreadsheetml.sheet"

Expected Behavior

✅ Valid Input

Excel Content:

name	age	scores
John	25	[1,2,3]
Alice	30	[4,5,6]

Processed Output:

{
  "name": "John",
  "age": 25,
  "scores": [1, 2, 3]
}

❌ Invalid Mapping Example

{
  "isActive": "Boolean"
}

Response:

{
  "error": "Unknown mapping type for key 'isActive': Boolean"
}

Checking Job Status

Endpoint

GET /api/status/{jobId}

You can optionally include two query parameters for pagination:

resultPage (number, optional): The 1-based page to retrieve for the result data. Default is 1.
errorPage (number, optional): The 1-based page to retrieve for the error data. Default is 1.

Response

Field	Type	Description
`status`	`"pending" \| "processing" \| "done"`	The current job state
`result`	`Object`	The processed data (only if `status="done"`)
`errors`	`Array`	Errors found during processing (only if `status="done"`)

Example Response (Job Done)

{
  "status": "done",
  "result": [{ "name": "John", "age": 25, "scores": [1, 2, 3] }],
  "errors": []
}

Example Response (With Errors)

{
  "status": "done",
  "result": [{ "name": "Alice", "age": 30, "scores": [4, 5, 6] }],
  "errors": [{ "col": 2, "row": 3 }],
  "resultPage": 0,
  "errorPage": 0
}

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.yarn		.yarn
src		src
.env.example		.env.example
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.yarnrc.yml		.yarnrc.yml
NOTES.md		NOTES.md
README.md		README.md
docker-compose.yml		docker-compose.yml
jest.config.js		jest.config.js
package-lock.json		package-lock.json
package.json		package.json
prettierc.json		prettierc.json
tsconfig.json		tsconfig.json
yarn.lock		yarn.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Decisions taken

Installation

Run app

Useful commands

Potential changes

Admitted types for each mapping field

Interacting with the API

Uploading an Excel File

Endpoint

Headers

Body (multipart/form-data)

Mapping Specification

Allowed Data Types

Normalization Rules

Example Requests

✅ Valid Mapping

Expected Behavior

✅ Valid Input

❌ Invalid Mapping Example

Checking Job Status

Endpoint

Response

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

Decisions taken

Installation

Run app

Useful commands

Potential changes

Admitted types for each mapping field

Interacting with the API

Uploading an Excel File

Endpoint

Headers

Body (multipart/form-data)

Mapping Specification

Allowed Data Types

Normalization Rules

Example Requests

✅ Valid Mapping

Expected Behavior

✅ Valid Input

❌ Invalid Mapping Example

Checking Job Status

Endpoint

Response

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors 1

Languages

Packages