Available for Opportunities

Hi, I'm Neha Malviya

|

Building intelligent applications that transform raw data into business decisions. Specializing in AI, Machine Learning, NLP, and Data Analytics — from medical diagnosis to customer intelligence.

4+ ML Projects
98.2% Best Accuracy
5+ Technologies
portfolio.py
import pandas as pd
from sklearn.svm import SVC
from textblob import TextBlob
 
# Train breast cancer classifier
clf = SVC(kernel='rbf')
clf.fit(X_train, y_train)
 
# Result
>>> Accuracy: 98.2%
Scroll to explore

About Me

I'm a passionate AI & Data Analytics developer with a background in Data Science and Business Analytics — currently pursuing my BBA at Rider University with a 3.8 GPA and building intelligent applications that turn raw data into real business value.

My work spans end-to-end machine learning pipelines, NLP applications, and interactive data dashboards that tackle real-world problems — from predicting cancer diagnoses with 98.2% accuracy to automating executive KPI reports using Generative AI, to building live Google Analytics dashboards in Power BI.

I'm passionate about the intersection of Generative AI and business intelligence. I also bring hands-on experience as a Research Assistant on a National Science Foundation-funded climate project and as a Tutor teaching Excel and Business Data Analytics to fellow students.

🎓 Rider University — BBA, Business/Data Analytics  ·  GPA 3.8  ·  2024–2026
🎓 Thakur College of Science & Commerce — BS, Data Science  ·  2022–2024
📧 malviyan@rider.edu
📍 Lawrence, New Jersey, USA
💼 Open to Data Analyst, Data Scientist & Business Analyst roles — NJ & NY

Featured Projects

End-to-end applications combining data science, machine learning, and web development

01
GenAI Python Streamlit

GenAI KPI Meeting Assistant

An intelligent business analytics tool that transforms raw sales data into executive-ready KPI reports — complete with trend analysis, AI-generated insights, and automated email delivery.

🎯 Problem

Business teams waste hours manually preparing sales reports and meeting summaries from scattered data.

💡 Solution

Streamlit dashboard that auto-detects columns, generates trend charts, and emails PDF reports with one click.

📊 Outcome

Meeting prep time cut from hours to minutes. Automated week-over-week insights delivered to stakeholders.

Python Streamlit Plotly Pandas FPDF Seaborn

Key Features

  • Smart auto-detection of sales, region, category & date columns
  • Dynamic trend charts: daily / weekly / monthly / yearly granularity
  • AI-generated insights & anomaly flags (e.g. high discounts, low margin)
  • Automated PDF report generation & email delivery with charts
  • Regional × category sales heatmap
02
NLP ML Streamlit

Universal Customer Review Analyzer

A powerful NLP pipeline that automatically clusters any customer review dataset and performs sentiment analysis — works for products, restaurants, services, or social media.

🎯 Problem

Companies receive thousands of unstructured reviews making it impossible to manually extract trends and sentiment.

💡 Solution

Universal NLP pipeline: spaCy lemmatization → TF-IDF vectorization → K-Means clustering → TextBlob sentiment scoring.

📊 Outcome

Processes up to 3,000 reviews per run. Groups into up to 10 topic clusters with per-cluster sentiment breakdown.

Python spaCy TextBlob TF-IDF K-Means Streamlit

Key Features

  • spaCy lemmatization with stopword removal for clean text
  • Configurable clustering: 3–10 topic groups (slider control)
  • Positive / Neutral / Negative sentiment per cluster
  • Stacked bar sentiment chart per cluster
  • Email report with chart attachment
03
Analytics Viz Streamlit

ML Olympics Analysis WebApp

Interactive Streamlit webapp exploring 120+ years of Olympic history — from medal tallies to athlete age distributions and gender participation trends.

🎯 Problem

Over 135,000 Olympic athlete records spanning 51 games are complex and hard to explore without interactive tools.

💡 Solution

Dynamic filtering webapp for medal tallies, heatmaps, athlete distributions, and men-vs-women trends.

📊 Outcome

Four analysis views: Medal Tally, Overall Stats, Country-wise, and Athlete-wise — all interactive.

Python Streamlit Plotly Seaborn Pandas SciPy

Key Features

  • Dynamic medal tally by year and country
  • Sport × year event growth heatmap
  • Age distribution for gold / silver / bronze medalists
  • Height vs. weight scatter by sport
  • Men vs. women participation trend over 120 years
04
ML Healthcare AI Flask

Breast Cancer Prediction WebApp

A Flask-based medical AI tool using Support Vector Machine to classify breast tumors as benign or malignant — achieving 98.2% accuracy on the Wisconsin dataset.

🎯 Problem

False-positive cancer screenings lead to costly, unnecessary surgeries. Doctors need reliable ML decision support.

💡 Solution

Flask web app with SVM classifier trained on 569 patient records. Both ANN and SVM evaluated — SVM won.

📊 Outcome

98.2% accuracy. Real-time predictions with confidence scores reducing unnecessary surgical interventions.

Python Flask Scikit-learn SVM NumPy Pandas

Key Features

  • SVM classifier with 98.2% test accuracy
  • ANN vs SVM model comparison
  • 30-feature clinical data input interface
  • Real-time prediction with probability confidence score
  • Wisconsin Breast Cancer dataset: 357 benign / 212 malignant
05
React PostgreSQL Netlify

Rider EventHub — Campus Event Platform

A full-stack event discovery platform for Rider University — students browse, filter, and submit campus events, with an admin dashboard for moderation and an organizer portal for tracking submissions.

🎯 Problem

Campus events are scattered across emails, Instagram, and flyers — students miss events they'd love, and departments see low attendance.

💡 Solution

Centralized React web app with category filtering, @rider.edu email verification, admin approval workflow, and organizer status tracking.

📊 Outcome

Live at rider-eventhub.netlify.app. One hub for all campus events — searchable, filterable, mobile-friendly, and admin-controlled.

React PostgreSQL Netlify Functions CSS SHA-256 Auth

Key Features

  • Browse & filter events by category, date, and keyword search
  • Submit events with @rider.edu email verification
  • Admin dashboard: approve, reject, and feature events
  • Organizer portal to track submission status without login
  • Serverless backend with PostgreSQL via Netlify Functions
06
GenAI Claude API Streamlit

AI Chat Bot

A conversational AI chatbot demo built with Streamlit and powered by the Claude API — showcasing how generative AI can be integrated into a clean, interactive web interface.

🎯 Goal

Explore how Claude AI can be embedded into a Streamlit app to create a responsive, intelligent chat experience.

💡 Approach

Built a lightweight chat interface in Python using the Anthropic SDK, with conversation history, styled message bubbles, and real-time AI responses.

📊 Result

A working demo that proves how quickly a functional AI chat interface can be built end-to-end using modern generative AI tools.

Python Streamlit Claude API Anthropic SDK Pandas

Key Features

  • Interactive chat UI with message history using Streamlit
  • Integrated with Anthropic's Claude API via the Python SDK
  • Real-time AI responses with clean, styled message bubbles
  • Lightweight demo — runs locally with minimal setup
07
Power BI Google Analytics DAX

Google Trend Dashboard in Power BI

A live, interactive Google Analytics dashboard built in Power BI using API integration, advanced DAX measures, and intuitive visualizations — achieving 95%+ data accuracy.

🎯 Problem

Marketing and business teams struggle to monitor real-time web traffic, user behavior, and engagement metrics in one place.

💡 Solution

Power BI dashboard connected via Google Analytics API with complex DAX measures for engagement rates, session trends, and user conversion patterns.

📊 Outcome

95%+ data accuracy. Live KPIs including traffic sources, bounce rates, and geographic performance — updated in real time.

Power BI Google Analytics API DAX API Integration Data Visualization

Key Features

  • Live API integration: Google Analytics → Power BI (real-time refresh)
  • Complex DAX measures: engagement rates, session trends, conversion patterns
  • KPI visuals: traffic sources, bounce rates, geographic performance
  • 95%+ data accuracy with minimal manual intervention
  • Stakeholder-ready data storytelling and narrative reporting
08
Python Streamlit Pandas

Olympics Data Explorer

An interactive Streamlit web app for exploring Olympics history — medal tallies, country-wise performance, athlete statistics, and participation trends spanning decades of data.

🎯 Problem

Olympics data is vast and scattered — hard to explore trends, compare countries, and surface athlete-level insights in one place.

💡 Solution

Built an interactive Streamlit dashboard with dynamic filtering by year and country, visual heatmaps, and athlete-level breakdowns.

📊 Outcome

Rich visual storytelling of Olympics history — from overall participation trends to individual athlete performance across sports and events.

Python Streamlit Pandas Matplotlib Seaborn Plotly

Key Features

  • Medal tally by country and year with dynamic filtering
  • Overall trends in sports, events, and athlete participation
  • Country-wise performance breakdown and heatmaps
  • Athlete-wise statistics and performance trends

Experience

Business Analyst Intern 🆕 New

Mar 2026 – Apr 2026
CPA4Tax and Accounting Services PC  ·  Kendall Park, NJ (On-site)  ·  20 hrs/week
  • Manage and administer workflow processes within Canopy practice management software for a CPA firm
  • Create, maintain, and optimize automations within Canopy to improve operational efficiency across the firm
  • Assign and coordinate tasks to team members for workflow management within Canopy
  • Redact confidential and sensitive client information in compliance with firm data security policies
  • Gain hands-on experience in workflow analytics, automation management, and operational optimization
Canopy Software Workflow Automation Business Analytics Data Security CPA Operations

Resident Assistant

Jan 2026 – Present
Rider University  ·  Part-time  ·  Lawrence, NJ (On-site)
  • Serve as a primary student leader and project manager for a diverse residential community at Hill Hall
  • Manage facility infrastructure issues, coordinating timely repairs with facilities management
  • Develop and distribute digital newsletters to enhance resident communication and engagement
  • Provide 24-hour coverage for building security and conflict resolution, ensuring a safe living environment
Leadership Project Management Data Management Communication

Research Assistant

Oct 2025 – Present
Rider University  ·  Essex County, NJ (Hybrid)  ·  NSF-Funded Project
  • Prepare and analyze tree-ring samples to study historical climate patterns (dendrochronology)
  • Apply time-series statistics for long-term climate research and data interpretation
  • Utilize stereomicroscopes and software tools including R for data collection and analysis
  • Collaborate with a research team on a National Science Foundation (NSF)-funded project
R Programming Time-Series Statistics Dendrochronology Research

Student Office Assistant

Jul 2025 – Present
Rider University  ·  Lawrence, NJ (On-site)
  • Manage sensitive student data and support administrative functions for the department
  • Improve scheduling efficiency and enhance reporting accuracy, streamlining office operations
  • Develop strong organizational and analytical skills contributing to overall office productivity
Data Management Reporting Scheduling Administration

Tutor — Excel & Business Data Analytics

Oct 2025 – Jan 2026
Rider University  ·  Remote  ·  4 months
  • Instructed students in CIS 185, focusing on Microsoft Excel for business applications
  • Taught BDA 201 — Introductory Business Data Analytics concepts and tools
  • Developed engaging lesson plans to enhance student understanding of data analysis tools
  • Fostered a collaborative learning environment encouraging participation and critical thinking
Tutoring Microsoft Excel Business Analytics Teaching

Student Worker

Oct 2024 – Aug 2025
Compass Group  ·  Part-time  ·  New Jersey (On-site)  ·  11 months
  • Supported day-to-day campus operations as part of the Compass Group food service team
  • Developed professional work ethic, time management, and teamwork skills in a fast-paced environment
Operations Customer Service Teamwork

DataFest 2025 — Data Analytics Competition

2025
Competitive Data Science Event
  • Participated in an intensive data analytics competition with real-world dataset challenge
  • Applied exploratory data analysis and statistical modeling under competition conditions
  • Delivered data-driven recommendations and visualizations to a panel of industry judges
Data Analysis Statistics Visualization Presentation

Skills

🐍

Programming

Python SQL HTML / CSS JavaScript
🤖

Machine Learning

Scikit-learn SVM K-Means Clustering Neural Networks Random Forest
💬

NLP & AI

spaCy NLTK TF-IDF TextBlob Sentiment Analysis Generative AI Claude API
📊

Data & Visualization

Pandas NumPy Plotly Matplotlib Seaborn Power BI Tableau Google Analytics
🌐

Web & Tools

Streamlit Flask React PostgreSQL Git Jupyter VS Code Excel

All Technologies

Python SQL Pandas NumPy Scikit-learn SVM KMeans ANN spaCy TextBlob TF-IDF NLTK Flask Streamlit Plotly Matplotlib Seaborn Power BI Google Analytics FPDF Gemini AI Git Jupyter SciPy Excel HTML / CSS JavaScript

Licenses & Certifications

Industry-recognized credentials validating my data analytics and BI expertise

Tableau Business Intelligence Analyst Specialization

Tableau Learning Partner & IBM & Microsoft · Coursera
📅 Issued: January 2026 🔑 ID: 181R7L9ADK1Y

Completed a rigorous 12-course specialization covering the full BI pipeline — from data cleaning and SQL to advanced Tableau visualization, business analysis, and data storytelling.

Introduction to Business Analytics99.37%
Data Analysis with SQL: Inform a Business Decision100%
Data Ecosystem98.22%
Communicating Data Insights with Tableau96.50%
Business Analysis Process93.71%
Preparing Data for Analysis with Microsoft Excel93.05%
Introduction to Tableau92.15%
Introduction to Data Analytics92.50%
Advanced Data Visualization with Tableau91.87%
Data Analysis with Tableau90.16%
Data Visualization with Tableau88.79%
Data Cleaning in Excel: Techniques to Clean Messy Data83.33%
Tableau SQL Excel Business Analysis Data Visualization Data Storytelling ETL Business Intelligence
✓ Verified

Data Ecosystem

Tableau Learning Partner · Coursera
📅 Issued: July 2025 🔑 ID: BMA2QAY7MXCG

Certification covering the data ecosystem: databases, data warehousing, Tableau software, data management principles, and the full lifecycle from raw data to actionable business insights. Achieved a grade of 98.22%.

Databases Tableau Software Data Management Data Warehousing
✓ Verified

Learned in Class

Key skills and concepts developed through coursework at Rider University

📊

Data Analytics & Mining

  • Exploratory Data Analysis (EDA)
  • Feature engineering & selection
  • Data cleaning and preprocessing
  • Descriptive and inferential statistics
  • Association rule mining
🤖

Machine Learning

  • Supervised & unsupervised learning
  • Decision trees, Random Forests, SVM
  • K-Means & hierarchical clustering
  • Model evaluation: precision, recall, F1
  • Cross-validation & hyperparameter tuning
🧠

Artificial Intelligence

  • Neural networks & deep learning basics
  • Natural Language Processing (NLP)
  • Generative AI concepts & applications
  • Computer vision fundamentals
  • AI ethics and responsible AI
📈

Statistics for Business

  • Probability and distributions
  • Hypothesis testing & significance
  • Regression: linear & logistic
  • Time series analysis & forecasting
  • A/B testing and experimental design
💼

Business Analytics & BI

  • Business Intelligence tools (Power BI)
  • KPI design & performance dashboards
  • Google Analytics & web analytics
  • Data storytelling & visualization
  • Decision analysis & business modeling
🗄️

Database Management

  • Relational database design with SQL
  • Database normalization principles
  • Query optimization techniques
  • NoSQL fundamentals
  • ETL processes and data pipelines

Videos & Demos

Watch the projects in action — real demos, real data

KPI Meeting Assistant — Super Sales Demo

Full walkthrough of the GenAI KPI dashboard with Super Store sales data, trend analysis, AI-generated insights, and automated PDF report delivery.

GenAI Streamlit KPI Analytics

Customer Review Analyzer — Sentiment Analysis Demo

Demonstration of the NLP pipeline: text cleaning, K-Means clustering, sentiment tagging, and per-cluster visualization.

NLP Sentiment Analysis Streamlit

Customer Analytics — Interactive Features Demo

Customer-first analytics demo showcasing real-time interactive insights and data-driven recommendations.

Analytics Visualization Streamlit

Sentiment Analysis — Extended Feature Walkthrough

Extended demo showing advanced clustering configurations and email report generation workflow.

NLP Machine Learning AI

Rider EventHub — Live Platform Demo

Full walkthrough of the Rider University campus event hub — browsing, filtering, submitting events, and the admin moderation dashboard.

React PostgreSQL Netlify

AI Chat Bot — Claude API Demo

A demo of a conversational AI chatbot built with Streamlit and the Claude API, featuring a clean chat interface and real-time AI responses.

GenAI Claude API Streamlit

Google Analytics — Web Traffic Analysis

Walkthrough of Google Analytics for tracking website traffic, user behavior, and key performance metrics to drive data-informed decisions.

Google Analytics Web Analytics Data Analysis

Olympics Data Explorer — Full App Demo

Interactive walkthrough of the Olympics Data Explorer: medal tallies, country heatmaps, athlete stats, and participation trends built with Streamlit and Python.

Python Streamlit Data Analysis

Resume

📄

Download My Resume

A complete overview of my education, projects, skills, and experience in one document.

⬇ Download Resume (PDF)

Can't download? Email me and I'll send it right over.

Quick Snapshot

🎓
Education

Rider University
Data Analytics & AI

💻
Core Stack

Python · ML · NLP
Analytics · BI Tools

🏆
Top Achievement

98.2% ML Accuracy
Medical AI Project

🚀
Projects Built

4+ Production Apps
GenAI · ML · NLP

🎯
Looking For

Data Analyst · ML Engineer
AI / BI Roles

🌟
Competed In

DataFest 2025
Data Analytics Competition

Get In Touch

I'm actively looking for data analytics, AI, and ML opportunities.
Let's talk — I'd love to connect!