Available for Opportunities

Hi, I'm Neha Malviya

|

Building intelligent applications that transform raw data into business decisions. Specializing in AI, Machine Learning, NLP, and Data Analytics — from medical diagnosis to customer intelligence.

4+ ML Projects
98.2% Best Accuracy
5+ Technologies
portfolio.py
import pandas as pd
from sklearn.svm import SVC
from textblob import TextBlob
 
# Train breast cancer classifier
clf = SVC(kernel='rbf')
clf.fit(X_train, y_train)
 
# Result
>>> Accuracy: 98.2%
Scroll to explore

About Me

I'm a passionate AI & Data Analytics developer with a background in Data Science and Business Analytics — currently pursuing my BBA at Rider University with a 3.8 GPA and building intelligent applications that turn raw data into real business value.

My work spans end-to-end machine learning pipelines, NLP applications, and interactive data dashboards that tackle real-world problems — from predicting cancer diagnoses with 98.2% accuracy to automating executive KPI reports using Generative AI, to building live Google Analytics dashboards in Power BI.

I'm passionate about the intersection of Generative AI and business intelligence. I also bring hands-on experience as a Research Assistant on a National Science Foundation-funded climate project and as a Tutor teaching Excel and Business Data Analytics to fellow students.

🎓 Rider University — BBA, Business/Data Analytics  ·  GPA 3.8  ·  2024–2026
🎓 Thakur College of Science & Commerce — BS, Data Science  ·  2022–2024
📧 malviyan@rider.edu
📍 Lawrence, New Jersey, USA
💼 Open to Data Analyst, Data Scientist & Business Analyst roles — NJ & NY

Featured Projects

End-to-end applications combining data science, machine learning, and web development

01
GenAI Python Streamlit

GenAI KPI Meeting Assistant

An intelligent business analytics tool that transforms raw sales data into executive-ready KPI reports — complete with trend analysis, AI-generated insights, and automated email delivery.

🎯 Problem

Business teams waste hours manually preparing sales reports and meeting summaries from scattered data.

💡 Solution

Streamlit dashboard that auto-detects columns, generates trend charts, and emails PDF reports with one click.

📊 Outcome

Meeting prep time cut from hours to minutes. Automated week-over-week insights delivered to stakeholders.

Python Streamlit Plotly Pandas FPDF Seaborn

Key Features

  • Smart auto-detection of sales, region, category & date columns
  • Dynamic trend charts: daily / weekly / monthly / yearly granularity
  • AI-generated insights & anomaly flags (e.g. high discounts, low margin)
  • Automated PDF report generation & email delivery with charts
  • Regional × category sales heatmap
02
NLP ML Streamlit

Universal Customer Review Analyzer

A powerful NLP pipeline that automatically clusters any customer review dataset and performs sentiment analysis — works for products, restaurants, services, or social media.

🎯 Problem

Companies receive thousands of unstructured reviews making it impossible to manually extract trends and sentiment.

💡 Solution

Universal NLP pipeline: spaCy lemmatization → TF-IDF vectorization → K-Means clustering → TextBlob sentiment scoring.

📊 Outcome

Processes up to 3,000 reviews per run. Groups into up to 10 topic clusters with per-cluster sentiment breakdown.

Python spaCy TextBlob TF-IDF K-Means Streamlit

Key Features

  • spaCy lemmatization with stopword removal for clean text
  • Configurable clustering: 3–10 topic groups (slider control)
  • Positive / Neutral / Negative sentiment per cluster
  • Stacked bar sentiment chart per cluster
  • Email report with chart attachment
03
Analytics Viz Streamlit

ML Olympics Analysis WebApp

Interactive Streamlit webapp exploring 120+ years of Olympic history — from medal tallies to athlete age distributions and gender participation trends.

🎯 Problem

Over 135,000 Olympic athlete records spanning 51 games are complex and hard to explore without interactive tools.

💡 Solution

Dynamic filtering webapp for medal tallies, heatmaps, athlete distributions, and men-vs-women trends.

📊 Outcome

Four analysis views: Medal Tally, Overall Stats, Country-wise, and Athlete-wise — all interactive.

Python Streamlit Plotly Seaborn Pandas SciPy

Key Features

  • Dynamic medal tally by year and country
  • Sport × year event growth heatmap
  • Age distribution for gold / silver / bronze medalists
  • Height vs. weight scatter by sport
  • Men vs. women participation trend over 120 years
04
ML Healthcare AI Flask

Breast Cancer Prediction WebApp

A Flask-based medical AI tool using Support Vector Machine to classify breast tumors as benign or malignant — achieving 98.2% accuracy on the Wisconsin dataset.

🎯 Problem

False-positive cancer screenings lead to costly, unnecessary surgeries. Doctors need reliable ML decision support.

💡 Solution

Flask web app with SVM classifier trained on 569 patient records. Both ANN and SVM evaluated — SVM won.

📊 Outcome

98.2% accuracy. Real-time predictions with confidence scores reducing unnecessary surgical interventions.

Python Flask Scikit-learn SVM NumPy Pandas

Key Features

  • SVM classifier with 98.2% test accuracy
  • ANN vs SVM model comparison
  • 30-feature clinical data input interface
  • Real-time prediction with probability confidence score
  • Wisconsin Breast Cancer dataset: 357 benign / 212 malignant
05
Power BI Google Analytics DAX

Google Trend Dashboard in Power BI

A live, interactive Google Analytics dashboard built in Power BI using API integration, advanced DAX measures, and intuitive visualizations — achieving 95%+ data accuracy.

🎯 Problem

Marketing and business teams struggle to monitor real-time web traffic, user behavior, and engagement metrics in one place.

💡 Solution

Power BI dashboard connected via Google Analytics API with complex DAX measures for engagement rates, session trends, and user conversion patterns.

📊 Outcome

95%+ data accuracy. Live KPIs including traffic sources, bounce rates, and geographic performance — updated in real time.

Power BI Google Analytics API DAX API Integration Data Visualization

Key Features

  • Live API integration: Google Analytics → Power BI (real-time refresh)
  • Complex DAX measures: engagement rates, session trends, conversion patterns
  • KPI visuals: traffic sources, bounce rates, geographic performance
  • 95%+ data accuracy with minimal manual intervention
  • Stakeholder-ready data storytelling and narrative reporting

Experience

Resident Assistant

Jan 2026 – Present
Rider University  ·  Part-time  ·  Lawrence, NJ (On-site)
  • Serve as a primary student leader and project manager for a diverse residential community at Hill Hall
  • Manage facility infrastructure issues, coordinating timely repairs with facilities management
  • Develop and distribute digital newsletters to enhance resident communication and engagement
  • Provide 24-hour coverage for building security and conflict resolution, ensuring a safe living environment
Leadership Project Management Data Management Communication

Research Assistant

Oct 2025 – Present
Rider University  ·  Essex County, NJ (Hybrid)  ·  NSF-Funded Project
  • Prepare and analyze tree-ring samples to study historical climate patterns (dendrochronology)
  • Apply time-series statistics for long-term climate research and data interpretation
  • Utilize stereomicroscopes and software tools including R for data collection and analysis
  • Collaborate with a research team on a National Science Foundation (NSF)-funded project
R Programming Time-Series Statistics Dendrochronology Research

Student Office Assistant

Jul 2025 – Present
Rider University  ·  Lawrence, NJ (On-site)
  • Manage sensitive student data and support administrative functions for the department
  • Improve scheduling efficiency and enhance reporting accuracy, streamlining office operations
  • Develop strong organizational and analytical skills contributing to overall office productivity
Data Management Reporting Scheduling Administration

Tutor — Excel & Business Data Analytics

Oct 2025 – Jan 2026
Rider University  ·  Remote  ·  4 months
  • Instructed students in CIS 185, focusing on Microsoft Excel for business applications
  • Taught BDA 201 — Introductory Business Data Analytics concepts and tools
  • Developed engaging lesson plans to enhance student understanding of data analysis tools
  • Fostered a collaborative learning environment encouraging participation and critical thinking
Tutoring Microsoft Excel Business Analytics Teaching

Student Worker

Oct 2024 – Aug 2025
Compass Group  ·  Part-time  ·  New Jersey (On-site)  ·  11 months
  • Supported day-to-day campus operations as part of the Compass Group food service team
  • Developed professional work ethic, time management, and teamwork skills in a fast-paced environment
Operations Customer Service Teamwork

DataFest 2025 — Data Analytics Competition

2025
Competitive Data Science Event
  • Participated in an intensive data analytics competition with real-world dataset challenge
  • Applied exploratory data analysis and statistical modeling under competition conditions
  • Delivered data-driven recommendations and visualizations to a panel of industry judges
Data Analysis Statistics Visualization Presentation

Skills

🐍

Programming

Python95%
SQL80%
HTML / CSS / JavaScript75%
🤖

Machine Learning

Scikit-learn90%
SVM / Classification88%
K-Means Clustering85%
Neural Networks (ANN)75%
💬

NLP & AI

spaCy / NLTK85%
TF-IDF / TextBlob88%
Sentiment Analysis90%
Generative AI (Gemini)78%
📊

Data & Visualization

Pandas / NumPy93%
Plotly / Matplotlib90%
Power BI82%
Google Analytics78%
🌐

Web & Tools

Streamlit92%
Flask85%
Git / GitHub80%
Jupyter / VS Code90%

All Technologies

Python SQL Pandas NumPy Scikit-learn SVM KMeans ANN spaCy TextBlob TF-IDF NLTK Flask Streamlit Plotly Matplotlib Seaborn Power BI Google Analytics FPDF Gemini AI Git Jupyter SciPy Excel HTML / CSS JavaScript

Licenses & Certifications

Industry-recognized credentials validating my data analytics and BI expertise

Tableau Business Intelligence Analyst Specialization

Tableau Learning Partner
📅 Issued: January 2026 🔑 ID: 181R7L9ADK1Y

Completed a rigorous 7-course journey covering the full BI process — Requirements Elicitation, ETL, Data Preprocessing, Exploratory Data Analysis, Spatial & Advanced Visualization, and Stakeholder Communication through data storytelling.

ETL Tableau Public Exploratory Data Analysis Spatial Visualization Data Storytelling Business Intelligence
✓ Verified

Data Ecosystem

Tableau Learning Partner
📅 Issued: July 2025 🔑 ID: BMA2QAY7MXCG

Comprehensive certification covering the data ecosystem: databases, data warehousing, Tableau software, data management principles, and the full lifecycle from raw data to actionable business insights.

Databases Tableau Software Data Management Data Warehousing
✓ Verified

Learned in Class

Key skills and concepts developed through coursework at Rider University

📊

Data Analytics & Mining

  • Exploratory Data Analysis (EDA)
  • Feature engineering & selection
  • Data cleaning and preprocessing
  • Descriptive and inferential statistics
  • Association rule mining
🤖

Machine Learning

  • Supervised & unsupervised learning
  • Decision trees, Random Forests, SVM
  • K-Means & hierarchical clustering
  • Model evaluation: precision, recall, F1
  • Cross-validation & hyperparameter tuning
🧠

Artificial Intelligence

  • Neural networks & deep learning basics
  • Natural Language Processing (NLP)
  • Generative AI concepts & applications
  • Computer vision fundamentals
  • AI ethics and responsible AI
📈

Statistics for Business

  • Probability and distributions
  • Hypothesis testing & significance
  • Regression: linear & logistic
  • Time series analysis & forecasting
  • A/B testing and experimental design
💼

Business Analytics & BI

  • Business Intelligence tools (Power BI)
  • KPI design & performance dashboards
  • Google Analytics & web analytics
  • Data storytelling & visualization
  • Decision analysis & business modeling
🗄️

Database Management

  • Relational database design with SQL
  • Database normalization principles
  • Query optimization techniques
  • NoSQL fundamentals
  • ETL processes and data pipelines

Videos & Demos

Watch the projects in action — real demos, real data

KPI Meeting Assistant — Super Sales Demo

Full walkthrough of the GenAI KPI dashboard with Super Store sales data, trend analysis, AI-generated insights, and automated PDF report delivery.

GenAI Streamlit KPI Analytics

Customer Review Analyzer — Sentiment Analysis Demo

Demonstration of the NLP pipeline: text cleaning, K-Means clustering, sentiment tagging, and per-cluster visualization.

NLP Sentiment Analysis Streamlit

Customer Analytics — Interactive Features Demo

Customer-first analytics demo showcasing real-time interactive insights and data-driven recommendations.

Analytics Visualization Streamlit

Sentiment Analysis — Extended Feature Walkthrough

Extended demo showing advanced clustering configurations and email report generation workflow.

NLP Machine Learning AI

Resume

📄

Download My Resume

A complete overview of my education, projects, skills, and experience in one document.

⬇ Download Resume (PDF)

Add your resume PDF to the portfolio_website folder and rename it Neha_Malviya_Resume.pdf

Quick Snapshot

🎓
Education

Rider University
Data Analytics & AI

💻
Core Stack

Python · ML · NLP
Analytics · BI Tools

🏆
Top Achievement

98.2% ML Accuracy
Medical AI Project

🚀
Projects Built

4+ Production Apps
GenAI · ML · NLP

🎯
Looking For

Data Analyst · ML Engineer
AI / BI Roles

🌟
Competed In

DataFest 2025
Data Analytics Competition

Get In Touch

I'm actively looking for data analytics, AI, and ML opportunities.
Let's talk — I'd love to connect!