Using pandas, matplotlib and seaborn libraries in data analytics in the different jupyter notebooks in the repository. This readme is a table of contents of the notebooks.
- summary statistics: mean, median, cumulative stats
- the .agg( ) method
- dropping duplicates
- counting categorical variables
- calculating after using .groupby()
- multiple grouped summaries
- pivot tables
- filling in missing values
- bar plots, line plots, scatter plots, histograms
- list of dictionaries to a dataframe
- dictionary of lists to a DataFrame
- reading and writing CSVs using pandas
- components of a dataframe
- sorting dataframe columns
- subsetting columns
- subsetting rows
- subsetting rows by categorical variables
- adding new columns to a DataFrame
- setting and removing indexes
- subsetting with .loc[ ]
- setting multi level indexes
- sorting by index values
- slicing and subsetting with .loc( ) and iloc( )
- slicing time series
- subsetting by rows / columns
- creating pivot tables
- subsetting pivot tables
- calculating in pivot tables
- inner joins
- relationships
- one to many merge
- left join
- right join
- outer join
- self join
- merging on indexes (single index, multi-index, left_on, right_on)
- merge_ordered( )
- foward filling technique
- merge_asof( )
- selecting with .query( )
- reshaping data with melt
- adding data to the axes object
- customizing plots (adding markers, setting linestyle, choosing color, adding axes labels, adding titles)
- subplots
- creating small multiples with plt.subplots( )
- shared y axis
- read data with time index
- plotting time series with different variables
- annotating time series data
- quantitative comparisons with barcharts
- stacked bar charts
- quantitative comparisons with histograms
- statistical plotting - errorbars
- boxplots
- quantitative comparisons with scatter plots
- plotting styles
- saving visualizations
- automating figures from data
- introduction
- scatter plots
- count plots
- adding a third variable using hue
- setting hue order
- specifying hue colors
- introduction to relational plots (enables you to visualize the relationship between two quantitative variables using either scatter plots or line plots.)and subplots
- scatterplot() vs. relplot()
- subplots in columns / rows / both
- wrapping cols
- ordering cols
- two factor subplots
- customizing Scatter Plots (size, point style, transparency)
- line plots
- countplots and barplots
- boxplots
- pointplots
- changing plot style and color
- changing the figure style
- changing the palette (Diverging, Sequential)
- changing the scale
- adding Titles and Labels