Projects Portfolio

2023 Data Science Job Market Analysis

Tools: SQL, Data Analysis, PostgreSQL

- 📊 Dive into the data job market! Focusing on data scientist roles, this project explores 💰 top-paying jobs, 🔥 in-demand skills, and 📈 where high demand meets high salary in data science.

Code

Intelligent Rival for Nine Men's Morris

Tools: Python, Machine Learning, AI

- 🎮 Built an adversarial-search agent for Nine Men's Morris using minimax with alpha-beta pruning and a dynamically adjusted search depth, achieving an 🏆 88% win rate against human players.
- 🏆 Competed in a class-wide AI competition organized by the professor, securing a top-3 position among 80+ participants.
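The search technique named above can be illustrated with a minimal, generic sketch. It is not the project's actual agent: the toy game tree, the `children` and `evaluate` callbacks, and the `choose_depth` rule (with its `budget` and `max_depth` parameters) are hypothetical stand-ins chosen for illustration.

```python
import math


def alphabeta(node, depth, alpha, beta, maximizing, children, evaluate):
    """Minimax value of `node`, searched to `depth` plies with alpha-beta pruning."""
    kids = children(node)
    if depth == 0 or not kids:
        return evaluate(node)
    if maximizing:
        best = -math.inf
        for child in kids:
            value = alphabeta(child, depth - 1, alpha, beta, False, children, evaluate)
            best = max(best, value)
            alpha = max(alpha, best)
            if alpha >= beta:  # the minimizer already has a better option elsewhere
                break
        return best
    best = math.inf
    for child in kids:
        value = alphabeta(child, depth - 1, alpha, beta, True, children, evaluate)
        best = min(best, value)
        beta = min(beta, best)
        if alpha >= beta:  # the maximizer already has a better option elsewhere
            break
    return best


def choose_depth(branching, budget=10_000, max_depth=8):
    """Hypothetical depth rule: search deeper when fewer moves are available.

    Returns the largest depth (capped at max_depth) for which
    branching ** depth stays within the node budget.
    """
    depth = 1
    while depth < max_depth and branching ** (depth + 1) <= budget:
        depth += 1
    return depth


# Toy game tree (assumption): nested lists are positions, integers are leaf scores.
def children(node):
    return node if isinstance(node, list) else []


def evaluate(node):
    return node if isinstance(node, int) else 0


tree = [[3, 5], [2, 9], [0, 1]]
root_value = alphabeta(tree, 2, -math.inf, math.inf, True, children, evaluate)
```

In a real game, `children` would generate legal moves and `evaluate` would score a board position; a depth rule like `choose_depth` could then be called with the current number of legal moves before each search.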

Code

Counter-Speech Generator against Online Hate Speech

Tools: PySpark, NLP, ML

- 🧠 Built a deep learning model using Hugging Face T5 to generate counter-speech against online hate speech, with an accuracy of 85%.
- 😌 Evaluated the effectiveness of the generated counter-speech in mitigating negativity through sentiment analysis using TextBlob.

Code

Stock Price Prediction with LSTM-RNN

Tools: PySpark, Databricks, AWS

- ⚙️ Designed a scalable real-time Spark pipeline on AWS infrastructure for data ingestion, pre-processing, and stock prediction.
- 📊 Leveraged Databricks with PySpark to build an LSTM-RNN model, achieving a squared error of 5% on historical data.
- 🤝 Collaborated with a team of 3 students to refine the model, improving its accuracy to 93% through hyperparameter tuning.

Code

Databricks Formula 1 Racing Analysis

Tools: Databricks, Spark SQL, Azure Data Factory

- 🏎️ Created Databricks notebooks to ingest, transform, analyze, and report on Formula 1 racing data.
- 📊 Wrote Spark SQL queries to find the dominant drivers and teams for visualization.
- ⏰ Scheduled the pipeline using Azure Data Factory (ADF) for monitoring and alerts.

Code

Impact of Airbnb on Housing Supply

Tools: MySQL, Data Analytics, Tableau

- 👥 Led a team of 5 to analyze Airbnb's impact on housing, completing the project on time.
- 🗺️ Utilized GeoPandas and ZCTA5 shapefiles to extract granular data at the zip code level.
- 📊 Created a scalable Tableau dashboard to assess risk for 1.5M Airbnb listings from state to zip code level.