This project performs Exploratory Data Analysis (EDA) on Netflix datasets using Python.
The analysis focuses on content distribution, genres, ratings, countries, movie durations, and Netflix growth trends over the years.
The project was built using Jupyter Notebook and various Python data analysis libraries.
- Python
- Pandas
- Matplotlib
- Seaborn
- Jupyter Notebook
- Git & GitHub
Netflix-Data-Analysis/
│
├── data/
│ └── netflix_titles.csv
│
├── output/
│ ├── movies_vs_tvshows.png
│ ├── content_over_years.png
│ ├── top_genres.png
│ ├── top_countries.png
│ ├── content_ratings.png
│ ├── movie_duration.png
│ ├── top_directors.png
│ └── top_actors.png
│
├── netflix_analysis.ipynb
├── README.md
├── requirements.txt
└── .gitignoreCompared the number of Movies and TV Shows available on Netflix.
Analyzed Netflix content growth and expansion trends over time.
Identified the most popular genres available on the platform.
Explored which countries contribute the highest amount of content.
Studied the distribution of ratings such as TV-MA, TV-14, PG-13, and more.
Analyzed the distribution of movie durations on Netflix.
Identified directors with the highest number of titles on Netflix.
Explored actors who appear most frequently in Netflix content.
- Movies dominate Netflix content compared to TV Shows.
- Netflix experienced rapid content growth after 2015.
- International Movies and Dramas are among the most popular genres.
- The United States contributes the highest amount of Netflix content.
- TV-MA and TV-14 are the most common content ratings.
- Indian actors and directors appear frequently on the platform.
Through this project, I improved my understanding of:
- Data Cleaning
- Exploratory Data Analysis (EDA)
- Data Visualization
- GroupBy & Aggregation
- Working with real-world datasets
- Git & GitHub workflow
Sujay Pandit







