Besides static plots, we can also easily create beautiful interactive plots in pandas. Pandas provides easy and flexible syntax and methods for data visualization. In…
Easily Use Dask DataFrames in place of Pandas for Large Datasets
This article shows how convenient, easy and fast it is to use Dask DataFrames to read and store large datasets that Pandas struggles to…
How to Easily Speed up Pandas with Modin
This article shows how to easily speed up Pandas code by changing a single line of code with the Python Modin library, using a…
3 Convenient Methods to Read and Concatenate Multiple Data Files in Pandas
How to read and concatenate multiple data files, such as .csv files, in pandas with 3 convenient methods, using concrete real-world datasets. In most cases, we only need…
How to Easily Visualize Data with Pandas
Pandas provides easy and simple syntax to visualize data by creating the most widely used plots, but it seems the power of Pandas’s data visualization…
Interactive Plots Become Much Easier with HoloViews (II): Multiple Plots
HoloViews is a high-level plotting library for Python that creates overlays and layouts of multiple interactive plots with simple syntax and minimal effort. Part (I)…
Interactive Plots Become Much Easier with HoloViews (I): Basic Plots
HoloViews is a high-level plotting library for Python that creates interactive plots with simple syntax and minimal effort. For data visualization, there are many libraries…
A Convenient Python Package to Download Open Datasets
Four concrete examples show how to use a Python package to download open datasets from GitHub, Google Drive, Kaggle and other online…
2 Convenient Methods to Search and Download Datasets from Kaggle with Jupyter Notebook
Machine learning, modelling, and data analysis all need suitable datasets to test the algorithms. I still clearly remember that when I pursued my PhD in…
Modelselect: A Python Package That Helps Quickly Develop an Optimal Linear Regression Model
The Modelselect package helps find an optimal linear regression model by removing insignificant variables and solving multicollinearity problems. I developed a small package called modelselect, which…