Turning Complex Data into Clear, Actionable Insights

As a Data Science master's candidate with a distinguished academic background in Statistics, I'm driven by a natural curiosity for the stories hidden within data. I excel at building and refining machine learning models that turn complex information into clear, compelling insights. I am looking to join an innovative team where I can apply my skills to build products that solve real-world problems.

Portrait of Harneet Kaur

About Me

Hello! I'm Harneet, a data science enthusiast currently pursuing my Master's at VIT University. My journey from a statistics undergraduate to a data science professional is fueled by a passion for solving puzzles and uncovering hidden patterns in data. I believe that every dataset holds a story, and I enjoy the process of bringing that story to light through careful analysis and robust modeling.

Beyond the world of data, I lead a disciplined and balanced life. I enjoy long walks, which help me think and stay active, and I have a deep appreciation for music. This balance between analytical thinking and creative pursuits allows me to approach problems with a unique and holistic perspective. I'm excited to bring this blend of technical skill and creative problem-solving to a challenging new role.

My Projects

Here are some of the projects I've worked on that showcase my skills in machine learning and data analysis.

NLP-Powered Mental Health Classification

Engineered an end-to-end NLP pipeline to classify mental health status from a dataset of over 53,000 posts, establishing a robust baseline for future deep learning models.

NLP Text Classification Python
View on GitHub →

Hybrid Product Recommendation System

Developed a hybrid recommendation engine using collaborative filtering and advanced NLP, leading to a 480% increase in product discovery for users.

Recommender Systems NLP Generative AI
View on GitHub →

Technical Skills

A snapshot of the languages, frameworks, and tools I use to bring data-driven projects to life.

Languages & Programming

  • Python: Pandas, NumPy, Scikit-learn, Matplotlib, Seaborn, Plotly
  • R
  • SQL

Data Analysis & Statistics

  • Exploratory Data Analysis (EDA)
  • Hypothesis Testing & A/B Testing
  • Statistical Inference & Modeling
  • Probability Distributions & Theory

Classical Machine Learning

  • Regression: Linear, Logistic, Regularization (Ridge, Lasso)
  • Classification: SVM, Decision Trees, Random Forest, KNN
  • Clustering: K-Means
  • Techniques: Feature Engineering (TF-IDF, Word2Vec), Hyperparameter Tuning (GridSearchCV), Handling Class Imbalance (SMOTE)

Deep Learning & NLP

  • Frameworks: TensorFlow, Keras
  • Architectures: LSTMs, Artificial Neural Networks (ANNs)
  • Tasks: Text Classification, Sentiment Analysis, NER, Topic Modeling
  • Applications: Recommender Systems, Generative AI & LLMs

Tools, Systems & Big Data

  • Development: Git & GitHub, Jupyter Notebook, Google Colab
  • Database Systems: Relational database design (Normalization, ER Diagrams)
  • Big Data (In Progress): Apache Spark, Hadoop Ecosystem (HDFS, MapReduce)

Education

My academic journey has provided me with a strong theoretical and practical foundation in statistics and data science.

M.Sc. in Data Science

VIT University, Vellore (2024 – Present)

An intensive program focused on mastering the complete data science lifecycle, from data collection to predictive modeling.

Current CGPA: 9.46 (Sem 1: 9.56, Sem 2: 9.35)

B.Sc. (Hons.) in Statistics

BJB Autonomous College, Bhubaneswar (2020 – 2023)

Graduated with distinction, building a strong foundational knowledge of statistical theory and quantitative analysis. My consistent performance earned me the second-highest rank in the department.

Final CGPA: 9.3/10.0

Higher Secondary Education (CBSE)

Dalmia Vidya Mandir (2018 – 2020)

Established a strong record of academic excellence, graduating among the top students in my batch with a focus on Physics, Chemistry, and Mathematics.

Class 12: 93.4%   |   Class 10: 95.2%

Achievements

Key academic and competitive examination results that highlight my dedication and proficiency.

IIT JAM (Statistics)

AIR 607

CUET PG (Statistics)

AIR 348

Department Rank

2nd in B.Sc.

Certifications & Learning

A commitment to continuous learning and staying updated with the latest industry trends.

Ongoing

100 Days of Machine Learning

YouTube

Ongoing

100DaysOfCode in Python

Udemy

Completed

Education on Sustainable Development

NPTEL (100/100 Score)

Completed

GenAI Cybersecurity

Udemy

Completed

Data Analytics using Excel

Udemy

Let's Connect

I'm currently seeking new opportunities and would love to hear from you. Feel free to reach out via email or connect with me on social media.

harneetkaur4464@gmail.com

+91 8018724774