{

"name": "julio",

"occupation": "data scientist and analyst",

"likes": ["drumming", "skating", "gaming"]

}

My Work

Customer Churn Prediction with ML
House Pricing Prediction with ML
  • Built a complete Machine Learning pipeline in Python (Pandas, NumPy, Scikit-learn).

  • Performed data preprocessing, feature engineering, and dataset balancing.

  • Compared multiple algorithms: Logistic Regression, SVM, KNN, and Random Forest.

  • Focused on the recall metric to reduce false negatives in churn prediction.

  • Adjusted Logistic Regression threshold, achieve higher values.

  • Developed a full regression pipeline in Python (Pandas, NumPy, Scikit-learn).

  • Conducted comprehensive EDA on numerical and categorical features.

  • Engineered and selected features relevant to housing attributes, location, and condition.

  • Tested multiple regression models: Linear, Ridge, Lasso, ElasticNet.

  • Packaged the final model into a reusable Scikit-learn pipeline for deployment.

  • Scraped top-rated movie data from TMDB using BeautifulSoup and Requests.

  • Extracted features such as title, genre(s), release date, runtime, certification, score, budget, and revenue.

  • Structured dataset and exported to CSV/Excel for reuse.

  • Conducted light exploratory analysis to explore relationships between genre, duration, certification and financial performance.

Web Scraping Top-Rated-Movies

All this work can be found on my GitHub link page.

About Me

I am an entry-level Data Scientist with strong skills in Data Analysis. I'm mostly focused on Python and SQL, with hands-on experience in data wrangling, exploratory data analysis, web scraping, and API integration.
I have got almost a year of experience and, I've completed the IBM Data Science Professional Certificate and developed end-to-end projects involving machine learning, data cleaning, and visualization.

I'm especially interested in working with real-world data - structuring it, extracting meaningful insights, and building solutions that are practical and scalable.
My goal is to work on projects related to data analysis, scraping, and machine learning model development, especially to gain hands-on experience and grow professionally.

Beyond technical skills, I'm a self-taught learner who places great emphasis on researching what I don't yet know, and I’m fully aware of how much I can still learn from more experienced professionals.

My Skills

Python
BeautifulSoup
Scikit-learn
Power BI
Microsoft
Selenium
SQL
seaborn
Jupyter
matplotlib
NumPy
pandas

Let's Connect

Interested in my projects or want to collaborate? Feel free to reach out using the form or contact me via LinkedIn/GitHub... or even Instagram, there is not lack of options.

© 2025 Quests4Data

By Júlio Mauro