You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Python library that helps structure data in DataFrames and contains built-in data analysis functions.
importpandasaspd
Column | Feature | Attribute | Series | Field | Dimension
Row | Index | Record | Tuple | Observation | Sample
Pandas is an exploratory data analysis toolkit with a rich set of attributes and methods
Pandas provide a wide range of functions and methods
Widely used for data cleaning, data exploration, data manipulation, and data analysis tasks.
Toolkit for reading, writing, accessing, filtering, grouping, aggregating, merging, joining, combining, reshaping, cleaning, selecting data and performing statistical computation.
The financial term for multidimensional structured data sets is Panel
Supports various formats of data: csv, tsv, txt, xls, xlsx, json, etc.
Performance optimization ( Changing data types, storage type )
Integrates well with other important libraries like NumPy, Matplotlib, Seaborn, Scipy, etc.
Features of Pandas:
Time series support
Handling missing values
Grouped operations
Categorical data support
Merging and joining DataFrames
Statistical functions
Data visualization tools
Data Type or Data Structure
Description
pandas.Series()
1D array is an object that can hold any data type.
pandas.DataFrame()
2D table is like a data structure that can hold multiple types of data in columns.
Attribute
Meaning
df.index
The row index labels of DataFrame ( Default: RangeIndex
df.columns
The column index labels of DataFrame (axis = 1)
df.size
Number of columns in DataFrame
df.shape
A tuple of rows and columns ( nrows, ncols )
df.ndim
Number of dimensions in the DataFrame ( 1D, 2D, 3D )
df.values
Values of DataFrame
df.axes
List containing index and columns indices in a DataFrame