Hi,
I'm Yudheksha GK

I'm a

MSCS graduate from Indiana University Bloomington.
Expertise in Data Science & ML, Software Development, and Data Analytics.

profile image
about image

About Me

Hey there! Ever had the chance to work with some seriously cool industry experts, professors, and software dev wizards? I have, and let me tell you, it’s been a wild ride! I’ve picked up some awesome tricks along the way and always put my heart into creating top notch, clean code. Right now, I’m diving headfirst into a master’s in computer science at Indiana University Bloomington, all in the name of mastering the art of making tech work better for everyone.But hey, it’s not all work and no play! When I’m not wrestling with code, you’ll catch me with a book in hand, sketching out random doodles, or binge watching my favorite shows. I’m also on a never ending quest for epic tunes, currently jamming to some classic rock, and I’m all about discovering new cuisines and indulging in some anime goodness. Let’s chat sometime! Seriously, hit me up; I’d love to connect and see where the conversation takes us!

Experience

November, 2024 – Present

Data Analyst - Indiana University - Kelley School of Business Indianapolis

  • Developed a Python/pandas ETL pipeline to process 1,000+ S&P Global transcripts by classifying content, extracting metadata, and integrating results into PostgreSQL, reducing manual processing by 50% and supporting financial research and analysis.
  • Created a Tableau dashboard with KPI cards, a line chart tracking year-over-year net income, and a waterfall chart breaking down the income statement, enabling stakeholders to analyze financial performance and cut analysis time by 20%.
  • Cleaned and categorized 6,000+ CEO records in Excel by using Pivot Tables, XLOOKUP, and VBA macros, increasing verification speed by 40%.

January, 2022 - July, 2022

Undergraduate Researcher - University of Auckland, New Zealand

  • Collaborated with a team of 3 to develop a DDoS detection system using the CIC-DDoS2019 dataset (80K+ rows, 80+ features) in Python, leveraging pandas, NumPy, scikit-learn, Matplotlib, and Seaborn for data preprocessing, EDA, and feature engineering for predictive modeling.
  • Applied supervised ML models, including Random Forest and SVM (RBF kernel), in scikit-learn and performed hyperparameter tuning with GridSearchCV, improving model accuracies from approximately 85% to 90%.
  • Integrated optimized models into a soft-voting ensemble model, achieving 93% accuracy on large-scale network traffic data.

Education

2023 - 2025

Master of Science in Computer Science - Indiana University Bloomington, USA

CGPA: 3.5/4

Coursework: Applied Algorithms, Data Mining, Software Engineering, Applied Machine Learning, Applied Database Technologies, Security for Networked Systems, Computer Networks, Database Design, System and Protocol Security and Information Assurance, Data Visualization.

2019 - 2023

Bachelor of Technology in Computer Science - Vellore Institute of Technology, India

CGPA: 8.52/10

Coursework: Data Structures and Algorithms, Operating Systems, Database Management Systems, Computer Architecture and Organization, Discrete Mathematics and Graph Theory, Theory of Computation, Machine Learning, Artificial Intelligence, Software Engineering, Network and Communication, Internet and Web Programming, Java Programming, Parallel and Distributed Computing, Internet of Things, Applications of Differential and Difference Equations.

Tech Stack

Programming Languages

  • Python
  • Java
  • C++
  • JavaScript
  • TypeScript
  • R
  • SQL
  • C

Web Technologies

  • HTML
  • CSS
  • React.js
  • Next.js
  • Node.js
  • Express.js
  • Firebase
  • REST APIs
  • Render

Databases

  • MySQL
  • PostgreSQL
  • MongoDB
  • Firestore

Data Visualization

  • Tableau
  • Excel
  • Power BI

Cloud Platforms

  • AWS
  • GCP

Tools

  • Jupyter Notebook
  • Google Colab
  • Git
  • GitHub
  • Docker
  • Kubernetes
  • Jenkins (CI/CD)
  • Postman

Machine Learning and Deep Learning

  • TensorFlow
  • Keras
  • PyTorch
  • Scikit-learn
  • pandas
  • NumPy
  • Matplotlib
  • Seaborn
  • SciPy
  • Regression
  • Classification Models
  • Deep Learning (CNN, RNN, Autoencoders)

Generative AI

  • Large Language Models
  • Transformers
  • Tokenization
  • Embeddings
  • Prompt Engineering
  • RAG (Retrieval-Augmented Generation)
  • Vector Databases

Projects

PhishDetect

Phishing URL Detector

AvenueEstate

MERN Real Estate Marketplace

Netflix Dashboard

Netflix Content Analysis Dashboard

Airbnb NYC Data Analysis

Airbnb NYC Data Analysis

Flu Shot Compliance Analysis

Flu Shot Compliance Analysis

AI Summarizer Application

AI Summarizer Application

Publications

A study of AES and RSA algorithms based on GPUs

A study of AES and RSA algorithms based on GPUs

A Machine Learning based Approach to Early Stage Diabetes Prediction

A Machine Learning based Approach to Early Stage Diabetes Prediction

An Intelligent Flood Automation System Using IoT and Machine Learning

An Intelligent Flood Automation System Using IoT and Machine Learning