alroychiang/README.md

Hi! 👋 I’m Alroy Chiang

Fresh Applied Physics graduate from Nanyang Technological University 🇸🇬


I am a fresh graduate with a strong interest in coding, looking for opportunities to apply my knowledge and skills in a professional setting! I have worked with EDP Renewables and Smart Nation Translational Laboratories on audio processing and data cleaning projects, which are described in more detail below.

  • 👀 I’m interested in becoming a better programmer!
  • 👨‍💻 I have some experience with SQL, Java, and C.
  • 🚧 I have a strong foundation in Python for data cleaning and data validation!
  • 🦕 I’m looking to collaborate on projects where I can learn and put my skills to use.
  • 📫 Reach me at: ACHIANG004@e.ntu.edu.sg

Tools:

  • Database Management: SQL, MySQL
  • Audio Pre-Processing: Python, JupyterLab, soundfile, noisereduce, cv2, converter, pydub, youtube_dl, numpy, pandas, shutil, subprocess, openpyxl, moviepy, tqdm, ffmpeg
  • Data Visualization: VS Code, Python, Seaborn, Matplotlib
  • Data Cleaning: VS Code, Python, numpy, pandas, json, os, glob, csv, shutil, re, sys, datetime

Projects:

  • Automated the conversion of audio file formats and the download of large audio and video datasets from an online repository using Python, JupyterLab and Audacity.
  • Performed data cleaning by normalizing audio waveforms and reducing background noise in audio datasets intended for future machine learning projects.
  • Developed an automated process using ffmpeg to accurately crop over 400 video files, using the unique timestamps listed for each file in an external Excel file.
  • Utilized Python’s json, pandas, numpy, re, csv, shutil, os, datetime and glob libraries to develop a data pre-processing pipeline handling data quality, data transformation, metadata generation and data publication.
  • Employed the pipeline to identify missing values & outliers, transform logarithmic & time series data, perform data aggregation, execute time gap analysis and construct a data catalogue.
  • Produced and maintained user, design and test procedure documentation to support the pipeline development process.
  • Applied the matplotlib library to validate historical energy data, successfully identifying data dropouts and permanent step changes in our data collection system.
  • Used the matplotlib library to conduct correlation analysis on time series data.
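
As an illustration of the ffmpeg cropping step listed above, here is a minimal sketch of how a crop command could be built for one video from its start and end timestamps. It is not the code from the actual project: the function names `to_seconds` and `build_crop_command`, the `_cropped` file suffix and the example paths are all hypothetical, and the timestamps are assumed to have already been read from the Excel file as `HH:MM:SS` strings. Only the standard ffmpeg options `-y`, `-ss`, `-t`, `-i` and `-c copy` are used.

```python
from pathlib import Path


def to_seconds(timestamp: str) -> float:
    """Convert 'HH:MM:SS' or 'MM:SS' (fractional seconds allowed) to seconds."""
    seconds = 0.0
    for part in timestamp.strip().split(":"):
        seconds = seconds * 60 + float(part)
    return seconds


def build_crop_command(source: str, start: str, end: str, out_dir: str) -> list[str]:
    """Build an ffmpeg argument list that crops `source` to [start, end].

    Hypothetical helper: the output naming scheme is an assumption.
    """
    duration = to_seconds(end) - to_seconds(start)
    if duration <= 0:
        raise ValueError(f"end ({end}) must be later than start ({start})")

    src = Path(source)
    target = Path(out_dir) / f"{src.stem}_cropped{src.suffix}"
    return [
        "ffmpeg",
        "-y",                      # overwrite the output file if it exists
        "-ss", start,              # seek to the start timestamp in the input
        "-t", f"{duration:.3f}",   # read only this many seconds of input
        "-i", str(src),
        "-c", "copy",              # copy streams without re-encoding
        str(target),
    ]


if __name__ == "__main__":
    # Hypothetical row, as it might look after being read from the Excel sheet.
    command = build_crop_command("raw/talk.mp4", "00:00:10", "00:00:25", "clips")
    print(" ".join(command))
    # To actually run it: subprocess.run(command, check=True)
```

Because `-c copy` avoids re-encoding, it is fast across hundreds of files, but cuts snap to keyframes; dropping `-c copy` trades speed for frame-accurate cuts.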

Database Management Course:

  • Designed Entity Relationship (ER) diagrams from data relationship sentences, gaining a better understanding of attributes, unique attributes, entity instances and partial keys.
  • Designed relational schemas from their respective ER diagrams and data relationship sentences, gaining a better understanding of tables, primary keys, foreign keys and many-to-many relationships.
  • Utilized SQL to extract, sort and identify data of interest from a real-world dataset (the Dognition database), writing queries and subqueries with clauses such as GROUP BY, FULL JOIN, DISTINCT and HAVING.
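
The kind of query described above can be sketched with Python's built-in `sqlite3` module. The `dogs` and `tests` tables, their columns and the sample rows below are made up for illustration and are not the real Dognition schema; the course itself used MySQL, so this is only an assumption-laden stand-in showing GROUP BY, DISTINCT and HAVING together.

```python
import sqlite3


def build_demo_db() -> sqlite3.Connection:
    """Create a tiny in-memory database with hypothetical tables and rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE dogs (dog_id INTEGER PRIMARY KEY, breed TEXT);
        CREATE TABLE tests (dog_id INTEGER REFERENCES dogs(dog_id), test_name TEXT);
        INSERT INTO dogs VALUES (1, 'Beagle'), (2, 'Beagle'), (3, 'Poodle'), (4, 'Pug');
        INSERT INTO tests VALUES
            (1, 'Yawn'), (1, 'Eye Contact'), (2, 'Yawn'), (3, 'Yawn');
        """
    )
    return conn


def breeds_with_min_tests(conn: sqlite3.Connection, min_tests: int) -> list[tuple]:
    """Return (breed, dog_count, test_count) for breeds with enough completed tests."""
    query = """
        SELECT d.breed,
               COUNT(DISTINCT d.dog_id) AS dog_count,   -- each dog counted once
               COUNT(t.test_name)       AS test_count   -- every completed test
        FROM dogs AS d
        JOIN tests AS t ON t.dog_id = d.dog_id
        GROUP BY d.breed
        HAVING COUNT(t.test_name) >= ?                  -- filter on the aggregate
        ORDER BY test_count DESC, d.breed
    """
    return conn.execute(query, (min_tests,)).fetchall()


if __name__ == "__main__":
    connection = build_demo_db()
    for row in breeds_with_min_tests(connection, 1):
        print(row)
```

HAVING is used rather than WHERE because the filter applies to the per-breed aggregate, which only exists after GROUP BY has run.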

Popular repositories

  1. audio-processing-SNTL-Internship

    This repository downloads audio set

    Jupyter Notebook

  2. alroychiang

    Config files for my GitHub profile.

  3. data-pre-processing-pipeline

    Data Pre-Processing Pipeline

    Python

  4. Managing-Big-Data-with-MySQL

    Jupyter Notebook

  5. data-validation

    Python