PROFESSIONAL SUMMARY
Seasoned Data Scientist and NLP Engineer with 4+ years of specialized expertise in Large Language Models (LLMs), Natural Language Processing, and Generative AI. Proven track record in architecting and deploying production-grade NLP solutions, fine-tuning transformer models, and building intelligent document processing systems. Adept at leveraging state-of-the-art language models for business-critical applications including conversational AI, intelligent search, and automated content analysis.
PROFESSIONAL EXPERIENCE
Maharashtra Knowledge Corporation Limited | Navi Mumbai
- Lead NLP architect for GenAI initiatives, fine-tuning and deploying LLMs (BERT, GPT, T5) and diffusion models for enterprise content generation, achieving 35% improvement in user engagement metrics
- Architected conversational AI chatbot using advanced transformer models and semantic matching algorithms to intelligently recommend 250+ courses based on natural language understanding of candidate profiles and educational backgrounds
- Designed and implemented enterprise RAG (Retrieval-Augmented Generation) search solution using vector databases and embedding models to enable intelligent querying of knowledge bases for investors and customers
- Developed custom prompt engineering frameworks and fine-tuning pipelines for domain-specific LLM applications, reducing inference latency by 40%
- Led NLP-driven business intelligence initiatives that directly supported customer acquisition and engagement strategies through sentiment analysis and intent classification
Maharashtra Knowledge Corporation Limited | Navi Mumbai
- Spearheaded AI-Powered Marksheet Verification System: Developed end-to-end NLP pipeline processing 100,000+ academic documents with 94% accuracy using transformer-based classification models
- Implemented intelligent OCR with named entity recognition (NER) for automatic extraction of student details, grades, and institutional information
- Built cross-referencing validation system using semantic similarity models to match extracted data against official university records
- Created automated anomaly detection framework identifying discrepancies and errors in marksheet data
- Designed natural language query interface for generating custom student result combinations and reports
- Engineered multi-modal document understanding pipeline combining computer vision and NLP for certificate authentication, including text extraction, layout analysis, and security feature detection
- Developed named entity recognition and information extraction systems for structured data extraction from unstructured educational documents
- Implemented face recognition systems with liveness detection using deep learning for enhanced authentication security
Maharashtra Knowledge Corporation Limited | Navi Mumbai
- Architected Autonomous Log Analyzer & Incident Predictor: Built offline, open-source AI system for intelligent log analysis and proactive issue prediction
- Developed log parsing and normalization engine handling both structured (JSON) and unstructured (text) log formats
- Implemented unsupervised anomaly detection using clustering algorithms and statistical methods for real-time issue identification
- Integrated local LLM (Llama, GPT4All) for log summarization and natural language explanation generation
- Created feedback loop mechanism enabling continuous model improvement through user interactions
- Built intuitive CLI interface with text/HTML output supporting end-to-end workflow: ingestion → detection → explanation → feedback
- Achieved successful deployment across multiple production environments with demonstrable reduction in incident response time
- Designed and trained transformer-based models for text classification, sentiment analysis, and business forecasting applications
- Built scalable NLP pipelines incorporating tokenization, embedding generation, feature engineering, and model validation
- Deployed production NLP models to AWS and Azure using containerization (Docker) and CI/CD practices, ensuring 99.9% uptime
Maharashtra Knowledge Corporation Limited | Pune
- Developed sentiment analysis models using BERT and distilBERT for analyzing user feedback across 50,000+ text samples, providing actionable insights for UX improvements
- Built automated text analytics workflows generating comprehensive NLP-driven reports on user engagement patterns and content trends
- Implemented multilingual text extraction systems for ID verification using Tesseract OCR and language detection models
- Applied topic modeling and semantic analysis to educational content, improving content recommendation accuracy by 28%
- Created behavioral analytics framework combining NLP insights with user retention strategies
KEY NLP & LLM PROJECTS
AI-Powered Marksheet Verification System
Intelligent Document Processing & Validation
- End-to-end NLP solution for automated academic document verification
- Technologies: BERT, LayoutLM, Tesseract OCR, Named Entity Recognition, Semantic Similarity
- Impact: 90% reduction in manual verification time, 94% accuracy in anomaly detection
Autonomous Log Analyzer & Incident Predictor
Offline LLM-Powered System Intelligence
- Open-source AI tool for proactive log analysis and issue prediction
- Technologies: Local LLMs (Llama/GPT4All), Anomaly Detection, Clustering, Pattern Recognition
- Impact: Fully offline solution enabling predictive maintenance and intelligent incident summarization
TECHNICAL SKILLS
NLP & LLM:
Transformers, BERT, GPT, T5, LLaMA, Prompt Engineering, Fine-tuning, RAG, Vector Databases, Semantic Search, NER, Sentiment Analysis
ML/DL Frameworks:
PyTorch, TensorFlow, HuggingFace, spaCy, NLTK, Scikit-learn, Keras, LangChain
Programming:
Python, SQL, JavaScript
Data Tools:
Pandas, NumPy, Matplotlib, Seaborn
Cloud & Deployment:
AWS (SageMaker, S3, Lambda), Azure ML, Docker, FastAPI, Flask, Git
Vector Databases:
Pinecone, ChromaDB
EDUCATION & CERTIFICATIONS
University of Mumbai | CGPI: 7.49
Copyright Registration (2023) - Programs for Sentiment Analysis and NLP Tasks
KEY ACHIEVEMENTS
4+ years specialized experience in NLP, LLMs, and language model fine-tuning across diverse applications
Successfully deployed production-grade LLM applications serving 100,000+ users with 99.9% uptime
Architected enterprise RAG systems and conversational AI solutions driving measurable business impact
Led end-to-end NLP projects from data collection to model deployment, achieving consistent >90% accuracy
Developed offline, open-source AI tools demonstrating expertise in local LLM deployment and optimization
Proven track record in cross-functional collaboration, translating complex NLP solutions into business value