Skip to content
#

tools-techniques

Here are 36 public repositories matching this topic...

This project predicts customer churn for a telecom company by analyzing user contracts, personal data, and service usage. It uses pandas for data manipulation and scikit-learn for model building, applying Logistic Regression, Decision Trees, and Gradient Boosting. The aim is to enable proactive customer retention supporting business decisions

  • Updated Oct 5, 2024
  • Jupyter Notebook

Automates the creation of a data science tutorial with machine learning using Serper API and OpenAI. Four agents (Researcher, Writer, Developer, Reviewer) collaborate to research, write, code, and review, resulting in a complete tutorial with code examples. Includes setup instructions for using API keys and environment configuration.

  • Updated Nov 5, 2024
  • Python

This project aims to detect negative movie reviews for the Film Junky Union community by analyzing IMDB data. It uses pandas for data manipulation and scikit-learn for building models, including Logistic Regression and Gradient Boosting. Applies tokenization and TF-IDF are applied to classify reviews as positive or negative

  • Updated Oct 5, 2024
  • Jupyter Notebook

This project analyzes a dataset on video game sales to uncover patterns that determine a game's success. The analysis covers user reviews, sales by platform and genre, and regional preferences. Python (pandas, matplotlib) is used for data manipulation and visualization, while various statistical methods explore correlations and trends.

  • Updated Oct 5, 2024
  • Jupyter Notebook

This project builds a classification model for Megaline's telecom clients to recommend updated plans based on their usage behavior. It utilizes machine learning algorithms like Decision Trees, Random Forests, and Logistic Regression to maximize accuracy. The goal is to enable plan recommendations, improving customer satisfaction and revenue

  • Updated Oct 5, 2024
  • Jupyter Notebook

This project developed a model to analyze and track the profitability of contracts at a law firm. It integrated data on revenue, attorney costs, contract expenses, billable hours, and indirect costs to evaluate individual contract performance. The model provided valuable insightslater evolved into a customized system still in use today

  • Updated Oct 20, 2024

This project aims to predict customer insurance claims by analyzing personal data and claim history. Using models like Decision Trees, Random Forests, and Logistic Regression, it evaluates customer risk factors and insurance claim frequency. Data preprocessing and feature engineering are employed, while accuracy and F1-score measure effectiveness

  • Updated Oct 5, 2024
  • Jupyter Notebook

This project analyzes taxi trip data in Chicago to identify patterns in passenger preferences and the impact of external factors like weather on ride frequency. SQL is used for data extraction, and pandas/scikit-learn are utilized for exploratory data analysis and hypothesis testing. The outcomes improve marketing strategies and user experience

  • Updated Oct 5, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the tools-techniques topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the tools-techniques topic, visit your repo's landing page and select "manage topics."

Learn more