Clinisync is a modular, extensible ETL platform designed to securely clean and transform healthcare data from spreadsheet files into standardized, de-identified CSVs.
Built with Node.js and Express, it provides both API and CLI interfaces and supports multi-sheet Excel processing with full privacy compliance, unit tests, CI/CD integration, and documentation powered by Docusaurus.
npm install
npm run dev
curl -F "excel=@path/to/input.xlsx" http://localhost:3000/upload
Returns a JSON response with a list of generated cleaned CSV files.
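The same upload can be scripted from Node.js instead of curl. The following is a minimal sketch, assuming Node.js 18 or later (for the global `fetch`, `FormData`, and `Blob`) and a server listening on `http://localhost:3000`. The helper names `buildUploadForm` and `uploadWorkbook` are hypothetical and not part of Clinisync. The only detail taken from the curl command is the form field name `excel`; the shape of the JSON response is not specified here, so it is returned unmodified.

```javascript
// Hypothetical client helpers; not part of the Clinisync codebase.

// Builds the multipart form body. The field name "excel" mirrors the curl example.
function buildUploadForm(fileName, bytes) {
  const form = new FormData();
  form.append("excel", new Blob([bytes]), fileName);
  return form;
}

// Posts the workbook bytes to the upload endpoint and returns the parsed JSON.
// The default URL is an assumption based on the curl example.
async function uploadWorkbook(fileName, bytes, url = "http://localhost:3000/upload") {
  const response = await fetch(url, {
    method: "POST",
    body: buildUploadForm(fileName, bytes),
  });
  if (!response.ok) {
    throw new Error(`Upload failed with HTTP ${response.status}`);
  }
  return response.json();
}

// Example usage (requires the dev server to be running):
//   const bytes = await require("node:fs/promises").readFile("path/to/input.xlsx");
//   console.log(await uploadWorkbook("input.xlsx", bytes));
```

Keeping the form construction separate from the network call makes the request body easy to inspect without a running server.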
- NHI → anonymized ID-001, ID-002, etc.
- DOB → Age (accurate, format-tolerant)
- Removes: Address, Contact, unnamed columns
- Applies rules consistently across all sheets
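The rules above can be illustrated with a short sketch. This is not the project's actual `idMapper` or transform module: every function name below is hypothetical, the accepted DOB layouts (`YYYY-MM-DD` and day-first `DD/MM/YYYY`) are assumptions, and unnamed columns are assumed to arrive with empty or `__EMPTY`-prefixed headers, as some spreadsheet parsers produce.

```javascript
// Illustrative sketch of the cleaning rules; names are hypothetical.

// Columns dropped outright (assumed exact header names).
const REMOVED_COLUMNS = new Set(["Address", "Contact"]);

// Returns a function that maps each distinct NHI to a stable sequential ID.
// Sharing one mapper across sheets keeps IDs consistent between them.
function createIdMapper() {
  const seen = new Map();
  return (nhi) => {
    if (!seen.has(nhi)) {
      seen.set(nhi, `ID-${String(seen.size + 1).padStart(3, "0")}`);
    }
    return seen.get(nhi);
  };
}

// Builds a UTC date, rejecting impossible dates such as 31 February.
function utcDate(year, month, day) {
  const date = new Date(Date.UTC(year, month - 1, day));
  const valid =
    date.getUTCFullYear() === year &&
    date.getUTCMonth() === month - 1 &&
    date.getUTCDate() === day;
  return valid ? date : null;
}

// Parses a DOB string in one of the assumed layouts, or returns null.
function parseDob(value) {
  const text = String(value).trim();
  let m = text.match(/^(\d{4})-(\d{1,2})-(\d{1,2})$/); // YYYY-MM-DD
  if (m) return utcDate(+m[1], +m[2], +m[3]);
  m = text.match(/^(\d{1,2})[\/.-](\d{1,2})[\/.-](\d{4})$/); // DD/MM/YYYY
  if (m) return utcDate(+m[3], +m[2], +m[1]);
  return null;
}

// Whole years between dob and today, subtracting one before the birthday.
function ageInYears(dob, today = new Date()) {
  const age = today.getUTCFullYear() - dob.getUTCFullYear();
  const beforeBirthday =
    today.getUTCMonth() < dob.getUTCMonth() ||
    (today.getUTCMonth() === dob.getUTCMonth() &&
      today.getUTCDate() < dob.getUTCDate());
  return beforeBirthday ? age - 1 : age;
}

// Applies all rules to one row (a plain object keyed by column header).
function cleanRow(row, mapId, today = new Date()) {
  const out = {};
  for (const [key, value] of Object.entries(row)) {
    const unnamed = key.trim() === "" || key.startsWith("__EMPTY");
    if (REMOVED_COLUMNS.has(key) || unnamed) continue;
    if (key === "NHI") {
      out.ID = mapId(value);
    } else if (key === "DOB") {
      const dob = parseDob(value);
      out.Age = dob ? ageInYears(dob, today) : null;
    } else {
      out[key] = value;
    }
  }
  return out;
}
```

Passing the same mapper returned by `createIdMapper()` to every `cleanRow` call is what would keep a patient's anonymized ID identical on every sheet of a workbook.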
csv_procedure/
├── src/
│ ├── etl/ # ETL modules: extract, transform, load, idMapper
│ └── routes/ # Express upload route
├── __tests__/ # Jest test suite
├── docs/ # Docusaurus site (see /docs/README.md)
├── VERSION # Current release version
├── CHANGELOG.md # Project changelog
└── README.md
npm test
Includes full unit and integration coverage with Jest.
Full docs available at:
https://devilsdev.github.io/csv_procedure/
To run locally:
cd docs
npm install
npm run start
To deploy:
./deploy-gh-pages.sh
See CHANGELOG.md
Licensed under the MIT License.
(c) Ali Kahwaji, 2025