Harshil Sanghvi
Crafting innovative solutions at the intersection of software engineering, data science, and artificial intelligence.

About Me
Passionate about transforming complex challenges into elegant solutions through the power of artificial intelligence and modern software development.
Crafting the Future with AI & Code
I'm Harshil Sanghvi, a passionate technologist with a Master's in Computer Science from Stony Brook University. My journey spans from academic excellence as a Gold Medalist during my undergrad to real-world impact as a Software Engineer at Oracle.
With 4+ research publications and expertise in artificial intelligence, backend engineering, and data science, I bridge the gap between cutting-edge research and practical solutions that drive business value.
When I'm not coding or researching, you'll find me binging a good thriller or out on a hiking trail with friends.
Gold Medalist
Top academic performer with distinction in Machine Learning
Industry Experience
Software Engineer at Oracle | 3x Oracle Cloud Certified (Data Science, Generative AI, AI Foundations) | Former AI Intern at TalentNow
Research Publications
4+ published papers in AI/ML conferences and journals
Full-Stack Developer
Backend-focused (Python, Java) with React familiarity
Education
Master of Science in Computer Science
Stony Brook University
GPA: 3.93/4.0
Bachelor of Technology in Computer Science and Engineering
Nirma University
GPA: 9.43/10 • Gold Medal
Experience
Software Engineer
Oracle
Retail AI, Large Language Models & Data Science
AI Application Intern
TalentNow
LLM-powered systems & AI automation
My Journey
A timeline of my education and professional experiences, showcasing growth and impact
Software Engineer
Oracle
- Software Engineer in the Retail AI team, specializing in Large Language Models, Data Science, and enterprise-scale AI solutions
- Developing an automated, scalable RAG evaluation pipeline that re-tests pipelines whenever embedding or generation models change and uses comprehensive metrics to automatically select optimal models for production RAG systems
- Took ownership of a RAG-powered Slack bot that accelerates software development by helping developers quickly find solutions to previously encountered problems, significantly reducing wait times and improving team productivity
- Building robust AI infrastructure and evaluation frameworks to ensure consistent model performance and seamless integration across Oracle's retail AI ecosystem
AI Application Intern
TalentNow
- Engineered a normalized vector database architecture for candidate-skill matching, reducing space complexity by ~37% and similarity comparisons by 99.7%
- Developed an LLM-powered semantic skill matching system with intelligent thresholding and Redis-based fallback logic
- Decreased end-to-end matching latency from 60+ seconds to ~13 seconds through optimization
- Built a SQL-agent prototype using LangChain and OpenAI, achieving 50% faster query responses
Master of Science in Computer Science
Stony Brook University
- Specialized in Machine Learning, Data Science, and Software Engineering
- Conducted research in 3D diffusion models for medical imaging
- Teaching Assistant for Object-Oriented Programming courses
Data Engineer
Stony Brook University - Shrestha Lab
- Collaborated with Prof. Prerana Shrestha to enhance research productivity and data accuracy through automated data processing tools, improving overall experimental workflow efficiency
- Developed an end-to-end Python data processing pipeline for signaled active avoidance (SAA) training and testing experiments, automating extraction from .gslog and .ffii files
- Reduced manual data gathering time from hours to seconds by implementing efficient CSV file input processing for entry, exit, latency, and critical metrics analysis
- Expanded capabilities by developing Python scripts that generate visualizations of the experimental data, supporting data-driven decisions and conclusions
- Advanced scientific research methodologies by integrating technology-driven solutions in behavioral experiments and analysis
Software Engineer Intern
Interactive Brokers
- Developed and deployed a REST API for IBKR's Digital Account Management (DAM) 2.0, enabling efficient extraction of request IDs within a specified time frame and improving system efficiency
- Designed and implemented Batch Applications for the Automated Customer Enrollment System (ACES), automating Anti-Money Laundering (AML) task assignments and generating files for Consolidated Audit Trail (CAT) and Tableau Reporting
- Conducted an independent research project for a Self-Service Reporting Dashboard, identifying key performance insights and enhancing data-driven decision-making for the Client Accounts team
- Consolidated 10+ legacy scripts into a unified Java/Spring Boot codebase, improving processing efficiency and reducing maintenance overhead
Bachelor of Technology in Computer Science and Engineering
Nirma University
- Graduated with a Gold Medal, ranked #1 in the entire Computer Science & Engineering department
- Active in developing software solutions and research
- Multiple internships in AI/ML and software development
Data Science Intern
Johnson Controls-Hitachi
- Designed and implemented a modular Python system for streamlined outdoor unit data retrieval from the Air-CloudPro API, featuring configurable parameters and secure API communication, resulting in a 20% reduction in data retrieval time
- Developed a robust anomaly detection algorithm incorporating statistical methods (GESD, Z-Score, Box Plot, Tukey Fences) and ML models (Isolation Forest, AutoEncoder, LOF) for accurate outlier identification
- Achieved a 20% increase in detection accuracy compared to the existing rule-based system, resulting in enhanced system reliability and a 15% reduction in maintenance costs
- Conducted comprehensive algorithm evaluations using AUC-ROC, F1 Score, Precision, and Recall metrics to ensure optimal performance
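As a rough illustration of two of the statistical checks named above (Z-Score and Tukey Fences), here is a minimal sketch in plain Python. The function names, thresholds, and sample readings are hypothetical and chosen for illustration only; this is not the code from the internship.

```python
import statistics


def zscore_outliers(values, threshold=3.0):
    """Return values whose absolute z-score exceeds `threshold`."""
    mean = statistics.fmean(values)
    stdev = statistics.pstdev(values)  # population standard deviation
    if stdev == 0:
        return []  # all values identical, nothing can be an outlier
    return [v for v in values if abs(v - mean) / stdev > threshold]


def tukey_outliers(values, k=1.5):
    """Return values outside the Tukey fences [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical sensor readings with one obvious spike.
readings = [20.1, 19.8, 20.4, 20.0, 19.9, 20.2, 35.0]
print(tukey_outliers(readings))
print(zscore_outliers(readings, threshold=2.0))
```

On a sample this small, a single spike inflates the standard deviation enough that the common z-score cutoff of 3 flags nothing, while the quartile-based fences still catch it. That is one reason to combine several detectors rather than rely on one.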
Machine Learning Intern
Northern Trust Corporation
- Implemented DistilBERT for email classification, achieving a 17% increase in classification accuracy while leveraging Exploratory Data Analysis (EDA) to clean and preprocess the dataset
- Developed a FastAPI-based dashboard to showcase email categorization and confidence scores for each category, resulting in a 25% reduction in time spent on email management tasks
- Created an embeddings-based cognitive search engine with similarity scoring, featuring a user-friendly dashboard for query input and tabular results sorted by similarity score
- Achieved 92.5% precision in retrieving accurate document, paragraph, and line numbers, resulting in a 20% increase in organizational productivity
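The idea behind embeddings-based search with similarity scoring can be sketched as follows. This is a minimal illustration that assumes cosine similarity as the score and uses tiny hand-written vectors in place of real model embeddings; the function names and document IDs are hypothetical, not taken from the project.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)


def rank_by_similarity(query_vec, doc_vecs):
    """Return (doc_id, score) pairs sorted by descending similarity."""
    scored = [
        (doc_id, cosine_similarity(query_vec, vec))
        for doc_id, vec in doc_vecs.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy 2-D "embeddings"; a real system would get these from a model.
docs = {"doc_a": [1.0, 0.0], "doc_b": [0.0, 1.0], "doc_c": [1.0, 1.0]}
results = rank_by_similarity([1.0, 0.0], docs)
for doc_id, score in results:
    print(f"{doc_id}\t{score:.3f}")
```

The sorted list maps directly onto a results table ordered by similarity score, with the best match first.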
Skills & Expertise
A comprehensive overview of my technical proficiencies across various domains
Machine Learning & AI
Advanced ML algorithms and AI systems
Data Science & Analytics
Statistical analysis and data insights
Programming Languages
Multi-language proficiency
Backend Development
Scalable server-side applications
Cloud & DevOps
Cloud infrastructure and deployment
Frontend Development
Modern web interfaces and UX
Want to see my Certifications?
Explore my collection of professional certifications and achievements from leading tech companies and educational institutions.
View Achievements
Featured Projects
A comprehensive showcase of real-world projects spanning machine learning, full-stack development, data science, and research automation that demonstrate technical expertise and problem-solving skills.
Advanced Aviation Delay Prediction
Comprehensive machine learning analysis of flight departure delays using a large DOT dataset, achieving 99.2% accuracy. Employed geospatial analysis, clustering, and advanced regression models.
Want to see more of my work or discuss a potential collaboration?
Let's Work Together
Certificates & Achievements
A comprehensive collection of professional certifications, academic achievements, and technical credentials spanning cloud technologies, programming languages, and core competencies.
Gold Medal Achievement
Awarded the Gold Medal for outstanding academic performance, with a 9.43/10 CGPA in the undergraduate program
Graduate Excellence
Maintaining an exceptional 3.93/4.0 GPA in the Master's program at Stony Brook University
4+ Research Publications
Published research papers in top-tier AI/ML conferences and journals with significant impact
Oracle Cloud Data Science Professional
Oracle Cloud Infrastructure 2025 Certified Data Science Professional - Advanced data science and machine learning on Oracle Cloud
Oracle Cloud Generative AI Professional
Oracle Cloud Infrastructure 2025 Certified Generative AI Professional - Expertise in generative AI technologies and applications
Oracle Cloud AI Foundations Associate
Oracle Cloud Infrastructure 2025 Certified AI Foundations Associate - Foundational knowledge in cloud AI technologies
Git & GitHub Essentials
Udemy certification in Git and GitHub fundamentals for version control and collaboration
Problem Solving Excellence
Stony Brook University certification in problem solving, critical thinking, and resourcefulness
Teamwork & Collaboration
Stony Brook University certification in teamwork, accountability, and cross-team collaboration
JavaScript Proficiency
HackerRank JavaScript (Basic) certification demonstrating programming fundamentals
Deep Learning Specialization
Coursera certifications in Neural Networks, Deep Learning, and Hyperparameter Tuning
SQL Database Skills
HackerRank SQL (Basic & Intermediate) certifications for database management and queries
Python Programming
Multiple Python certifications including HackerRank Python (Basic) and Coursera Python specializations
C/C++ Programming
HackerRank C (Basic & Intermediate) and Problem Solving certifications, plus Spoken Tutorial training
Ready to Achieve More Together?
These certifications and achievements represent my commitment to continuous learning and excellence. Let's collaborate and create something extraordinary together.
Let's Connect
Ready to discuss your next project, explore collaboration opportunities, or just connect? I'd love to hear from you.
Send a Message
Ready to Start Something Amazing?
Whether you're looking for a collaborator, need technical consultation, or want to discuss research opportunities, I'm here to help bring your ideas to life.
Start the Conversation