I am a Data Analyst and AI Engineer with over 2 years of experience specializing in building ELT pipelines, machine learning models, and integrating large language models into workflows. My expertise lies in Snowflake, Python, and SQL, enabling me to deliver comprehensive data solutions from raw data ingestion to AI-powered insights. I excel at translating complex business problems into scalable technical architectures and communicating findings effectively to both technical and non-technical audiences.
Throughout my career, I have designed and deployed automated ELT workflows that significantly improve dataset readiness for visualization and business reporting. I have built and maintained machine learning models for various data types, leveraging classical statistical methods as well as modern LLM-based enrichment techniques to generate actionable insights.
My technical skills include advanced SQL scripting, Python programming with libraries such as Pandas and Scikit-learn, and experience with AI frameworks like GPT-4 and LangChain. I am proficient in data engineering tools including dbt, Airflow, and MLflow, and have implemented full MLOps pipelines to ensure reproducibility and scalability of ML services.
I have developed interactive dashboards using Power BI and React.js to present key performance indicators and trend analyses that support faster decision-making. Additionally, I have delivered NLP solutions such as text classification, sentiment analysis, and named entity recognition using both traditional NLP and LLM approaches.
In my current academic role as an ML Research Intern, I have engineered real-time behavioral monitoring agents and predictive models for burnout and anomaly detection. I have integrated AI-driven tone and sentiment analysis tools and built backend services with FastAPI to serve ML predictions and insights.
I am passionate about leveraging data and AI to solve real-world problems and continuously improving my skills through academic research and practical application. I thrive in collaborative environments where I can contribute to impactful projects and communicate complex concepts clearly to diverse stakeholders.
Designed and deployed automated ELT workflows integrating Snowflake, Python, and SQL, significantly improving dataset readiness for visualization and business reporting. Built and maintained ML models for structured and semi-structured datasets using Scikit-learn, statistical modeling, and LLM-based enrichment to derive actionable business insights. Developed intermediate-to-advanced SQL scripts using window functions, CTEs, analytical functions, joins, and subqueries to support analytics workflows and optimize query performance. Designed Power BI dashboards presenting KPIs, trend analyses, and executive summaries, enabling faster decision-making for business stakeholders. Delivered NLP solutions including text classification, sentiment scoring, and named entity extraction using both classical NLP techniques and modern LLMs. Applied Python (Pandas, NumPy) for data cleaning, feature engineering, and API automation, reducing preprocessing time and improving model reliability across projects. Conducted statistical analysis including probability distributions, variance analysis, and hypothesis testing to validate ML model assumptions and interpret outcomes. Participated in data quality assessments and implemented error-checking mechanisms to maintain high integrity across all data pipelines. Managed version control and collaborative development using Git/GitHub, ensuring reproducibility and traceability of all code changes.
Designed and built a real-time behavioral monitoring agent in Python, tracking keyboard timing, mouse dynamics, system activity, and bot-like interactions for risk analysis. Engineered daily and historical behavioral features (typing delay, mouse speed, CPU idle patterns, drift metrics) used as inputs for ML modeling and productivity scoring. Implemented burnout prediction and anomaly detection models using Random Forest Regression and Isolation Forest, including a 30-day trend analyzer identifying rising stress patterns. Built drift detection logic using statistical feature comparisons to measure significant shifts in user behavior over time. Integrated GPT-4 via LangChain for tone analysis, sentiment drift comparison, daily cognitive summaries, and an interactive AI support chatbot. Developed a FastAPI backend serving ML predictions, trend reports, and GenAI insights; built a React dashboard to visualize scores and anomaly alerts in real time. Implemented a full MLOps pipeline with DVC for data versioning, MLflow for experiment tracking, and Docker for containerized deployment across all services. Collaborated on academic research requirements, presenting findings and demo walkthroughs to faculty and peers in both technical and non-technical formats.
Jobicy
592 professionals pay to access exclusive and experimental features on Jobicy
Free
USD $0/month
For people just getting started
Plus
USD $8/month
Everything in Free, and: