Turning Complex Data into Clear, Actionable Insights
As a Data Science master's candidate with a distinguished academic background in Statistics, I'm driven by a natural curiosity for the stories hidden within data. I excel at building and refining machine learning models that turn complex information into clear, compelling insights. I am looking to join an innovative team where I can apply my skills to build products that solve real-world problems.

About Me
Hello! I'm Harneet, a data science enthusiast currently pursuing my Master's at VIT University. My journey from a statistics undergraduate to a data science professional is fueled by a passion for solving puzzles and uncovering hidden patterns in data. I believe that every dataset holds a story, and I enjoy the process of bringing that story to light through careful analysis and robust modeling.
Beyond the world of data, I lead a disciplined and balanced life. I enjoy long walks, which help me think and stay active, and I have a deep appreciation for music. This balance between analytical thinking and creative pursuits allows me to approach problems with a unique and holistic perspective. I'm excited to bring this blend of technical skill and creative problem-solving to a challenging new role.
My Projects
Here are some of the projects I've worked on that showcase my skills in machine learning and data analysis.
NLP-Powered Mental Health Classification
Engineered an end-to-end NLP pipeline to classify mental health status from a dataset of over 53,000 posts, establishing a robust baseline for future deep learning models.
Hybrid Product Recommendation System
Developed a hybrid recommendation engine using collaborative filtering and advanced NLP, leading to a 480% increase in product discovery for users.
Technical Skills
A snapshot of the languages, frameworks, and tools I use to bring data-driven projects to life.
Languages & Programming
- Python: Pandas, NumPy, Scikit-learn, Matplotlib, Seaborn, Plotly
- R
- SQL
Data Analysis & Statistics
- Exploratory Data Analysis (EDA)
- Hypothesis Testing & A/B Testing
- Statistical Inference & Modeling
- Probability Distributions & Theory
Classical Machine Learning
- Regression: Linear, Logistic, Regularization (Ridge, Lasso)
- Classification: SVM, Decision Trees, Random Forest, KNN
- Clustering: K-Means
- Techniques: Feature Engineering (TF-IDF, Word2Vec), Hyperparameter Tuning (GridSearchCV), Handling Class Imbalance (SMOTE)
Deep Learning & NLP
- Frameworks: TensorFlow, Keras
- Architectures: LSTMs, Artificial Neural Networks (ANNs)
- Tasks: Text Classification, Sentiment Analysis, NER, Topic Modeling
- Applications: Recommender Systems, Generative AI & LLMs
Tools, Systems & Big Data
- Development: Git & GitHub, Jupyter Notebook, Google Colab
- Database Systems: Relational database design (Normalization, ER Diagrams)
- Big Data (In Progress): Apache Spark, Hadoop Ecosystem (HDFS, MapReduce)
Education
My academic journey has provided me with a strong theoretical and practical foundation in statistics and data science.
M.Sc. in Data Science
VIT University, Vellore (2024 – Present)
An intensive program focused on mastering the complete data science lifecycle, from data collection to predictive modeling.
Current CGPA: 9.46 (Sem 1: 9.56, Sem 2: 9.35)
B.Sc. (Hons.) in Statistics
BJB Autonomous College, Bhubaneswar (2020 – 2023)
Graduated with distinction, building a strong foundational knowledge of statistical theory and quantitative analysis. My consistent performance earned me the second-highest rank in the department.
Final CGPA: 9.3/10.0
Higher Secondary Education (CBSE)
Dalmia Vidya Mandir (2018 – 2020)
Established a strong record of academic excellence, graduating among the top students in my batch with a focus on Physics, Chemistry, and Mathematics.
Class 12: 93.4% | Class 10: 95.2%
Achievements
Key academic and competitive examination results that highlight my dedication and proficiency.
IIT JAM (Statistics)
AIR 607
CUET PG (Statistics)
AIR 348
Department Rank
2nd in B.Sc.
Certifications & Learning
A commitment to continuous learning and staying updated with the latest industry trends.
100 Days of Machine Learning
YouTube
100DaysOfCode in Python
Udemy
Education on Sustainable Development
NPTEL (100/100 Score)
GenAI Cybersecurity
Udemy
Data Analytics using Excel
Udemy
Let's Connect
I'm currently seeking new opportunities and would love to hear from you. Feel free to reach out via email or connect with me on social media.