Welcome to my data science portfolio. I’ve been working on independent projects to build hands-on experience.
Recent Projects
Project II: Job Change of Data Scientists
Predicted whether a data scientist candidate will work for a company. Cleaned over 20,000 records extracted from Kaggle. Performed an exploratory data analysis (EDA) to draw insights from the data. Optimized Logistic Regression, K-Nearest Neighbours, Decision Tree Classifier and Random Forest Classifier models using GridSearchCV to select the best model. Link to GitHub repository
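The model selection step in Project II can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the synthetic data from make_classification stands in for the cleaned Kaggle records, and the parameter grids, the accuracy metric, and the 5-fold split are all assumptions chosen for the example.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the cleaned dataset (assumption, not the real data).
X, y = make_classification(n_samples=400, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# The four model families named in the project, each with a small
# hypothetical hyperparameter grid.
candidates = {
    "logistic_regression": (LogisticRegression(max_iter=1000), {"C": [0.1, 1.0, 10.0]}),
    "knn": (KNeighborsClassifier(), {"n_neighbors": [3, 5, 11]}),
    "decision_tree": (DecisionTreeClassifier(random_state=0), {"max_depth": [3, 5, None]}),
    "random_forest": (
        RandomForestClassifier(random_state=0),
        {"n_estimators": [50, 100], "max_depth": [5, None]},
    ),
}

# Run a cross-validated grid search per model family.
results = {}
for name, (estimator, grid) in candidates.items():
    search = GridSearchCV(estimator, grid, cv=5, scoring="accuracy")
    search.fit(X_train, y_train)
    results[name] = search

# Pick the family with the best mean cross-validation score, then
# check it once on the held-out test split.
best_name = max(results, key=lambda n: results[n].best_score_)
best_model = results[best_name].best_estimator_
test_accuracy = best_model.score(X_test, y_test)
print(best_name, results[best_name].best_params_, round(test_accuracy, 3))
```

Comparing models on the cross-validation score and touching the test split only once keeps the final accuracy estimate from being biased by the search itself.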
Project I: Apartment Price Estimator
Created a tool that estimates apartment prices in Mexico City to help construction companies estimate the potential price of their apartments. Scraped over 25,000 apartment descriptions from Inmuebles24 using Python and BeautifulSoup. Engineered features from the text of each apartment description to quantify the value of amenities such as a pool or garden. Optimized Linear, Ridge, and Gradient Boosting Regressor models using GridSearchCV to select the best model.
Figure: Price by Delegacion in millions of MXN
Figure: Correlation of each numeric variable
Link to GitHub repository
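The amenity features in Project I could be derived from the listing text along these lines. This is only a sketch under stated assumptions: the keyword patterns, the amenity_features helper, and the sample description are hypothetical and are not taken from the project's repository.

```python
import re

# Hypothetical Spanish/English keyword patterns, one per amenity.
AMENITY_PATTERNS = {
    "pool": r"\b(alberca|piscina)\b",
    "garden": r"\bjard[ií]n(es)?\b",
    "gym": r"\b(gimnasio|gym)\b",
}


def amenity_features(description):
    """Return a 0/1 flag per amenity based on keyword matches in the text."""
    text = description.lower()
    return {
        name: int(re.search(pattern, text) is not None)
        for name, pattern in AMENITY_PATTERNS.items()
    }


# Made-up listing text used only to exercise the helper.
example = "Departamento con Alberca, jardín y vigilancia 24 horas"
features = amenity_features(example)
print(features)
```

Binary flags like these can be added as columns next to the numeric variables, so a regression model can estimate how much each amenity contributes to price.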