cv | Divyanshu Raj

General Information

Name	Divyanshu Raj
Roles	Student, Software Developer, Machine Learning Engineer
Email	draj5@asu.edu
URL	https://divraz.github.io
About Me	Dedicated and versatile Software Development Engineer with a strong foundation in Java and Python, specializing in cloud-native architectures. Experienced in Amazon Web Services (AWS) and Google Cloud Platform (GCP) for scalable solutions. Skilled in microservices, data pipelines, and machine learning. Committed to continuous learning and passionate about creating efficient, innovative software solutions.

Education

2022 - 2024
Master of Science, Computer Science

Arizona State University, Tempe, USA
- Thesis Track of Intersection of Robotics and NLP
- Language-Conditioned Change-point Detection to Identify Sub-Tasks in Robotics Domains
2013 - 2017

Bachelor of Technology, Information Technology

Indian Institute of Information Technology, Allahabad, India

Experience

May 2023 - Aug 2023
Software Development Engineer Intern

Amazon, Tempe, USA
- Designed the architecture for Brand Customer Reviews (BCR) Auto Reply that establishes contact between a seller and the buyer of the product with critical reviews, automatically in an event-driven fashion with an AWS CICD pipeline.
- Reduced average latency from 200 ms to 2 ms on Opensearch queries used by BCR Auto Reply architecture.
- Handled the Migration of existing BCR architecture with detailed documentation and roll-back plan for all regions.
- Implementation of BCR Auto Reply architecture with S3 Bucket, AWS SQS Queue, AWS Serverless Lambda, AWS Dynamo database, and Elastic search with complete code coverage through unit tests and Integration tests in JAVA.
- Projected to benefit 1000+ customers over a sum of $100k on a monthly basis as they used a paid third-party tool to automate this before. The total cost for this feature on Amazon's end is $200 monthly with the optimized architecture.
Jul 2017 - aug 2022
Software Development Engineer

Streamoid Technologies, Bangalore, India
- Designed, and developed a Data Ingestion pipeline using API first approach. FastAPI docker images deployed on Google Cloud Run for autoscaling, Google Datastore as a decoupling layer, and time series support. 90% reduction in operations.
- Enabled 20% faster data ingestion by clients (~20M entries daily) with an 80% increase in frequency, and 90% reduction in MySQL database CRUD functionality using delta updates based on time series data.
- Prepared system architecture and implemented a real-time scalable Data Processing pipeline, an event-driven, push-based architecture with a priority queue. Used RabbitMQ clusters, Redis, Google Kubernetes Engine, KEDA, AWS Lambda, and Google Cloud Run for a scalable processing pipeline. The pipeline is 20% faster than previous and has achieved a 40% cost reduction to $0.0001 per product .
- Semi-Supervised Text Classification, NER, and Text Generation with Transformers in Fashion space. Fine-tuned BERT with MLM and GPT2 with CLM with 15 Million fashion sentences and released the language models for downstream tasks.
- Improved and implemented a Transformers-based Text Classification pipeline that used a fine-tuned BERT model as a base layer with 72 classes and achieved 98% model accuracy, and 95% real-world accuracy.

Research Experience

jan 2023 - present
Research Assistant

Logos Labs, Ariona State University, Tempe, USA
- “Language-Conditioned Change-Point Detection to Identify Sub-Tasks in Robotics Domain” published at Articulate Robots workshop at RSS 2023. Continuing Thesis on this work.
- Training an Inverse semantic model for plan execution with Robotic-Human Interaction using T5. Designed an experiment with mini-grid, reinforcement learning, and BART to achieve this.
aug 2022 - dec 2022
Research Assistant

CIDSE Labs, Ariona State University, Tempe, USA
- Strategy to prune Text Datasets to achieve SOTA using Transformer embeddings (Identifying Quality data points) Used SNLI and GLUE datasets.
- Pruned them using 4 strategies (random, easy, hard, and equal ratio) and evaluate the model's accuracy. Uses, context vector, Kmeans clustering, RoBERTa mode

Teaching Experience

aug 2023 - present
Teaching Assistant

Ariona State University, Tempe, USA
- Data Structures and Algorithms with Dr. Nakul Gopalan
jan 2023 - may 2023
Teaching Assistant

Ariona State University, Tempe, USA
- Perception in Robotics with Dr. Nakul Gopalan

Activities and Achievements

2020 - present
- Writer for “Towards Data Science”, "Towards Dev" and “Analytics Vidhya” on medium. Several articles have been published with them.
2014
- All India Rank 193 in ACM ICPC 2014

Skills

● System Design ● Software Development ● Backend Development ● AWS ● Microservices Architecture ● Load Testing ● Data Ingestion ● Database Management ● Event-Driven Architecture ● Machine Learning ● Deep Learning ● Quantum Machine Learning ● Cloud Platforms ● Data Analysis ● Version Control ● Cloud Computing ● Teaching and Training ● Research ● Technical Writing ● Competitive Coding ● Git ● AWS CICD Pipeline ● AWS Opensearch ● AWS S3 Bucket ● AWS SQS Queue ● AWS Serverless Lambda ● AWS Dynamo Database ● AWS SNS Topic ● FastAPI ● Docker ● Google Cloud Run ● Google Datastore ● MySQL ● RabbitMQ ● Redis ● Google Kubernetes Engine ● Google Logging ● MongoDB ● Solr ● Robotics ● NLP ● Transformers ● BERT ● GPT ● T5 ● JAVA ● Python ● C++ ● LaTeX ● Communication ● Problem-Solving ● Teamwork ● Leadership ● Adaptability ● Time Management ● Critical Thinking ● Creativity ● Research Skills ● Writing Skills

Table of contents

General Information

Education

Master of Science, Computer Science

Bachelor of Technology, Information Technology

Experience

Software Development Engineer Intern

Software Development Engineer

Research Experience

Research Assistant

Research Assistant

Teaching Experience

Teaching Assistant

Teaching Assistant

Activities and Achievements

Skills