Welcome to the Text Analysis and Visualization Project! This repository contains code for analyzing and visualizing text data using various powerful Python libraries.
This project demonstrates how to perform the following tasks:
- Data manipulation and analysis with Pandas
- Interactive visualizations with Plotly Express
- Generating word clouds with WordCloud
- Sentiment analysis with TextBlob
- Advanced natural language processing (NLP) with SpaCy
- Topic modeling with Scikit-Learn's CountVectorizer and LatentDirichletAllocation
To get started, clone this repository and install the necessary dependencies:
git clone https://github.com/dedemilano/text-analysis-visualization.git
cd text-analysis-visualization
pip install -r requirements.txt
- Pandas: For data manipulation and analysis
- Plotly Express: For creating interactive visualizations
- WordCloud: For generating word clouds
- Matplotlib: For static and animated visualizations
- TextBlob: For basic NLP tasks like sentiment analysis
- SpaCy: For advanced NLP tasks
- Scikit-Learn: For machine learning and topic modeling
Load and manipulate your text data using Pandas DataFrame:
Create interactive bar charts to visualize data distributions:
Generate a word cloud from text data:
Analyze the sentiment of text data:
Perform advanced NLP tasks like tokenization, lemmatization, and named entity recognition:
Discover latent topics in text data:
This project is licensed under the Deoth License - see the LICENSE file for details.
Thanks to the developers of these amazing libraries: Pandas, Plotly, WordCloud, Matplotlib, TextBlob, SpaCy, and Scikit-Learn!