title: Who Am I?
permalink: /about-me/
layout: page

To my future coworkers, managers, and friends:

My name is Ani and I am a Data Engineer and Computational Linguist. I think I’d be a great addition to your team!

I have extensive experience, both in NLP and in data engineering, deploying scalable solutions for large datasets. I have worked with streaming technologies like Flink and Kafka, and have focused on writing production code in Go, Python, and Scala to process millions of user and automation events without data loss or scaling bottlenecks. To deploy these applications, I have built pipelines using Jenkins, Airflow, Docker, and Kubernetes.

Just as important as the tools is the skill set required to wield them, and here too I have experience that I think could help the team. I have worked on several classification challenges, including writing tooling to facilitate OCR, machine translation, feature transformation, and token extraction. In addition, my education in linguistics has given me the opportunity to use some advanced mathematical tools in natural language processing, like the Viterbi algorithm and CKY parsing.

Finally, I am a privacy-first engineer. It is my obligation and duty, not only to you but to the customers whose data is being used, to protect that data from being extracted. I do this by focusing on building trustless systems. In other words:

  • I will fight to reduce the amount of data collected unless it has critical business/KPI value
  • If sensitive data needs to be used, I use tools like k-anonymity and ℓ-diversity to make sure that collected data cannot be used in malicious ways

At my core, I love the creativity and team effort required to extract meaning out of data, whether it be structured, semi-structured, or completely unstructured. Send me a message if you think I’d be a good fit for your team, or if you’re interested in discussing user privacy, the state of NLP, or anything linguistics-related.