Projects Portfolio
2023 Data Science Job Market Analysis
Tools: SQL, Data Analysis, PostgreSQL
- Dive into the data job market! Focusing on data scientist roles, this project explores top-paying jobs, in-demand skills, and where high demand meets high salary in data science.
Intelligent Rival for Nine Men's Morris
Tools: Python, Machine Learning, AI
- Created an adversarial-search agent for Nine Men's Morris using Minimax with Alpha-Beta pruning and a dynamically adjusted search depth, achieving an 88% win rate against human players.
- Competed in a class-wide AI competition organized by the professor, securing a top-3 position among 80+ participants.
Counter-Speech Generator against Online Hate Speech
Tools: PySpark, NLP, ML
- Built a deep learning model using Hugging Face T5 to generate counter-speech against online hate speech, with an accuracy of 85%.
- Evaluated the effectiveness of the generated counter-speech in mitigating negativity through sentiment analysis using TextBlob.
Stock Price Prediction with LSTM-RNN
Tools: PySpark, Databricks, AWS
- Designed a scalable real-time Spark pipeline on AWS infrastructure for data ingestion, pre-processing, and stock prediction.
- Used Databricks with PySpark to build an LSTM-RNN model, achieving a squared error of 5% on historical data.
- Collaborated with a team of 3 students to refine the model, improving its accuracy to 93% through hyperparameter tuning.
Databricks Formula 1 Racing Analysis
Tools: Databricks, Spark SQL, Azure Data Factory
- Created Databricks notebooks to ingest, transform, analyze, and report on Formula 1 racing data.
- Wrote Spark SQL queries to identify the dominant drivers and teams for visualization.
- Scheduled the pipeline using Azure Data Factory (ADF) for monitoring and alerts.
Impact of Airbnb on Housing Supply
Tools: MySQL, Data Analytics, Tableau
- Led a team of 5 to analyze Airbnb's impact on housing, completing the project on time.
- Used GeoPandas and ZCTA5 shapefiles to extract granular data at the zip code level.
- Created a scalable Tableau dashboard to assess risk for 1.5M Airbnb listings from state to zip code level.